
[Feat] support SP for FLUX.2-klein#1250

Open
RuixiangMa wants to merge 1 commit into vllm-project:main from RuixiangMa:spforflux2klein

Conversation

@RuixiangMa (Contributor) commented Feb 6, 2026

Purpose

Support sequence parallelism (SP), both Ulysses and Ring attention, for FLUX.2-klein.

Test Plan

Test Result

  • Target image:
  • tp = 1, on 4 × NVIDIA RTX 4090 (24 GB)

curl -s -X POST "http://localhost:8004/v1/images/edits" -F "image=@test.jpg" -F "prompt=Change the sky to orange sunset." -F "guidance_scale=1.0" -F "num_inference_steps=50" -F "n=1" -F "size=1024x1024" -F "output_format=png" | jq -r '.data[0].b64_json' | base64 --decode > output.png

| Configuration | Ulysses degree | Ring degree | Generation Time | Speedup |
| --- | --- | --- | --- | --- |
| Baseline | 1 | 1 | 25.503s | 1.00x |
| Ulysses | 4 | 1 | 13.173s | 1.94x |
| Ring | 1 | 4 | 16.866s | 1.51x |
| Hybrid (Ulysses + Ring) | 2 | 2 | 14.812s | 1.72x |
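As a sanity check, the speedup column is just the baseline time divided by each configuration's time, and every row's Ulysses and Ring degrees multiply to the 4-GPU SP world size. A quick script (values copied from the table above) confirming both:

```python
# Values copied verbatim from the benchmark table.
baseline = 25.503
configs = {
    "Ulysses":               (4, 1, 13.173),
    "Ring":                  (1, 4, 16.866),
    "Hybrid Ulysses + Ring": (2, 2, 14.812),
}

for name, (ulysses, ring, seconds) in configs.items():
    # Hybrid SP composes: total SP world size = ulysses_degree * ring_degree,
    # which matches the 4x RTX 4090 setup for every benchmarked row.
    assert ulysses * ring == 4
    print(f"{name}: {baseline / seconds:.2f}x")
# Ulysses: 1.94x / Ring: 1.51x / Hybrid Ulysses + Ring: 1.72x
```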

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f3436b8532


Comment on lines +709 to +716
if sp_size > 1:
    sp_world_size = get_sequence_parallel_world_size()
    sp_rank = get_sequence_parallel_rank()
    original_shape = hidden_states.shape
    hidden_states = torch.chunk(hidden_states, sp_world_size, dim=1)[sp_rank]
    get_forward_context().split_text_embed_in_sp = False
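For context, a torch-free, purely illustrative sketch of the round trip this snippet sets up: each SP rank keeps chunk `sp_rank` of the sequence dimension, runs the transformer blocks on its slice, and the per-rank outputs are reassembled by the later `all_gather` along `dim=1`:

```python
def shard(tokens, sp_world_size, sp_rank):
    """Even split along the sequence dim, as torch.chunk(..., dim=1)[sp_rank]
    yields when the sequence length is divisible by sp_world_size."""
    n = len(tokens) // sp_world_size
    return tokens[sp_rank * n:(sp_rank + 1) * n]

# 8 "tokens" across 4 SP ranks: each rank holds 2; concatenating the
# shards in rank order (what all_gather does) restores the sequence.
tokens = list(range(8))
shards = [shard(tokens, 4, r) for r in range(4)]
assert [t for s in shards for t in s] == tokens
```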


P2: Pad or validate SP splits for non-divisible seq_len

When sequence_parallel_size > 1, this code shards hidden_states with torch.chunk(...) without padding or validation. If the image token length is not divisible by the SP world size, torch.chunk yields uneven shapes across ranks, and the later get_sp_group().all_gather(output, dim=1) will fail because the group coordinator uses torch.distributed.all_gather_into_tensor, which requires equal-sized tensors. This makes SP mode crash for any input image size where the latent sequence length isn’t divisible by the SP degree; the existing SP auto-padding logic in diffusion/hooks/sequence_parallel.py is bypassed here.
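To make the failure mode concrete, here is a pure-Python sketch (no torch; helper names are illustrative, not from the codebase) of the shard sizes `torch.chunk` produces and the right-padding that would make them uniform before the split:

```python
import math

def chunk_sizes(seq_len: int, world_size: int) -> list[int]:
    """Shard lengths as torch.chunk(x, world_size, dim=1) produces them:
    every chunk is ceil(seq_len / world_size) long except a shorter tail,
    and there may even be fewer chunks than ranks."""
    step = math.ceil(seq_len / world_size)
    return [min(step, seq_len - i * step)
            for i in range(world_size) if i * step < seq_len]

def sp_pad_len(seq_len: int, world_size: int) -> int:
    """Tokens to right-pad (e.g. via torch.nn.functional.pad along the
    sequence dim) so the split is even; trim them after the all_gather."""
    return (-seq_len) % world_size

assert chunk_sizes(4096, 4) == [1024, 1024, 1024, 1024]  # divisible: fine
assert chunk_sizes(4098, 4) == [1025, 1025, 1025, 1023]  # uneven: all_gather_into_tensor fails
assert chunk_sizes(9, 4) == [3, 3, 3]  # only 3 chunks: rank 3 would IndexError
assert sp_pad_len(4098, 4) == 2        # pad 2 tokens, split evenly, trim after gather
```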


Signed-off-by: Lancer <maruixiang6688@gmail.com>

image_rotary_emb = self.pos_embed(img_ids)
text_rotary_emb = self.pos_embed(txt_ids)
if current_omni_platform.is_npu():
Collaborator

@gcanlin do we have better ways to handle this difference? this is so awkward

Collaborator

@wtomin PTAL
