precompile for diffusion #860

Merged
pathfinder-pf merged 1 commit into main from feat/precompile_for_diffusion
Mar 5, 2026

@pathfinder-pf (Collaborator) commented Mar 5, 2026

Motivation

Modifications

  • Pin flax to 0.12.4, because flax==0.12.5 has breaking changes
  • Add precompilation for the diffusion stage
  • Rename server parameters: --vae-decode-precompile-width-height to --precompile-width-heights, and --vae-decode-precompile-frame-paddings to --precompile-frame-paddings
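
As a rough illustration of what the renamed flags describe, the sketch below parses `WIDTH*HEIGHT` strings and pairs each size with every frame padding to list the shapes a server might warm up. The helper names (`parse_width_height`, `precompile_shapes`, `pick_frame_padding`) are hypothetical and are not taken from sgl_jax. The width-before-height order is assumed from the flag name, and rounding a request up to the next precompiled frame count is an assumption about how the paddings are used.

```python
from itertools import product


def parse_width_height(spec: str) -> tuple[int, int]:
    """Parse a 'W*H' string such as '480*832' into (width, height)."""
    width, height = spec.split("*")
    return int(width), int(height)


def precompile_shapes(width_heights, frame_paddings):
    """Enumerate every (width, height, num_frames) combination to warm up."""
    sizes = [parse_width_height(s) for s in width_heights]
    return [(w, h, f) for (w, h), f in product(sizes, sorted(frame_paddings))]


def pick_frame_padding(num_frames, frame_paddings):
    """Return the smallest precompiled frame count that fits, or None.

    Assumption: requests are padded up to the next precompiled bucket.
    """
    fitting = [f for f in sorted(frame_paddings) if f >= num_frames]
    return fitting[0] if fitting else None


# Values taken from the launch command in this PR.
shapes = precompile_shapes(["480*832"], [13, 41])
```

With the values from this PR, the sketch yields two shapes: one per frame padding for the single 480*832 size.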

Accuracy Tests

JAX_COMPILATION_CACHE_DIR=/tmp/jit_cache python3 -u -m sgl_jax.launch_server --multimodal --model-path=/models/Wan-AI/Wan2.1-T2V-1.3B-Diffusers --precompile-token-paddings=8192 --precompile-bs-paddings=64 --precompile-width-heights 480*832 --precompile-frame-paddings 13 41
curl http://localhost:30000/api/v1/videos/generation -H "Content-Type: application/json" -d '{"prompt": "A curious raccoon", "size": "480*832", "num_frames": 41, "num_inference_steps": 50}'
8a47ddcaf0474c78a4934cda30b9ee7e.mp4
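
For scripted testing, the same request as the curl command above can be built with Python's standard library. This is a minimal sketch: `build_generation_request` is a hypothetical helper, and the response format is not described in this PR, so the sketch only constructs the request and leaves sending and decoding to the caller.

```python
import json
import urllib.request


def build_generation_request(
    prompt,
    size="480*832",
    num_frames=41,
    num_inference_steps=50,
    base_url="http://localhost:30000",
):
    """Build a POST request mirroring the curl command in this PR."""
    payload = {
        "prompt": prompt,
        "size": size,
        "num_frames": num_frames,
        "num_inference_steps": num_inference_steps,
    }
    return urllib.request.Request(
        f"{base_url}/api/v1/videos/generation",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_generation_request("A curious raccoon")
# With a server running, send it with: urllib.request.urlopen(req)
```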

Benchmarking and Profiling

1st 58.729s
2nd 58.506s
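
The PR does not say how the two timings were taken or what exactly they cover. One way to collect comparable wall-clock numbers is sketched below with a hypothetical `time_calls` helper that times repeated calls of any function; the `time.sleep` call is only a stand-in for sending a generation request.

```python
import time


def time_calls(fn, repeats=2):
    """Call fn `repeats` times and return each wall-clock duration in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return durations


# Stand-in workload; replace the lambda with a real request to the server.
durations = time_calls(lambda: time.sleep(0.01))
for i, seconds in enumerate(durations, start=1):
    print(f"run {i}: {seconds:.3f}s")
```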

Checklist

  • Please use English, otherwise it will be closed.
  • The purpose of the PR, or link existing issues this PR will resolve.
  • The test plan, such as providing test command.
  • (Optional) The necessary documentation update.

@pathfinder-pf force-pushed the feat/precompile_for_diffusion branch from 864848d to 57f6ff6 on March 5, 2026 05:24
@pathfinder-pf merged commit 03f337a into main on Mar 5, 2026
19 checks passed
@pathfinder-pf deleted the feat/precompile_for_diffusion branch on March 5, 2026 06:40