[Feat] Support TeaCache for Flux2 Klein #1234
RuixiangMa wants to merge 2 commits into vllm-project:main
Conversation
Signed-off-by: Lancer <maruixiang6688@gmail.com>
Force-pushed from c52ff57 to 979fe87.
Pull request overview
This pull request adds TeaCache support for the Flux2 Klein diffusion model, which uses a dual-stream transformer architecture. TeaCache is an adaptive caching technique that speeds up inference by reusing cached transformer outputs on denoising steps whose timestep embeddings change little from the previous step.
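For orientation, this is a minimal sketch of the general TeaCache decision rule from the upstream TeaCache project, not this PR's exact hook code; `should_reuse_cache`, `state`, and `threshold` are illustrative names:

```python
import numpy as np
import torch

def should_reuse_cache(state: dict, t_emb: torch.Tensor,
                       coefficients: list[float], threshold: float) -> bool:
    """Return True when the cached transformer residual can be reused."""
    prev = state.get("prev_emb")
    state["prev_emb"] = t_emb
    if prev is None:  # first step: always compute fully
        state["accum"] = 0.0
        return False
    # Relative L1 distance between consecutive timestep embeddings,
    # rescaled by a model-specific polynomial fit (this PR reuses the
    # FLUX.1 coefficients for Flux2 Klein).
    rel_l1 = ((t_emb - prev).abs().mean() / prev.abs().mean()).item()
    state["accum"] += float(np.polyval(coefficients, rel_l1))
    if state["accum"] < threshold:
        return True  # close enough: skip the blocks, reuse the residual
    state["accum"] = 0.0  # drifted too far: recompute and reset
    return False
```

When the check returns True, the hook adds the cached residual to the input hidden states instead of running the transformer blocks.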
Changes:
- Added Flux2Klein extractor function to extract model-specific context for caching decisions
- Added Flux2Klein-specific logic in the hook to handle the model's dual-stream architecture with additional single_transformer_blocks
- Configured polynomial coefficients for Flux2Klein (reusing FLUX.1 coefficients)
- Registered Flux2KleinPipeline in the TeaCache backend with a custom enabler (see the registry sketch after this list)
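The registration likely follows a dispatch-by-pipeline-name pattern; a minimal sketch, assuming the registry maps class names to enabler callables. `enable_teacache` and `maybe_enable_teacache` are hypothetical stand-ins for the real wiring in backend.py:

```python
from typing import Any, Callable

# Maps pipeline class names to functions that install TeaCache on them.
CUSTOM_TEACACHE_ENABLERS: dict[str, Callable[..., None]] = {}

def enable_flux2_klein_teacache(pipeline: Any, teacache_config: Any) -> None:
    # Hypothetical installation call; the real hook lives in hook.py and
    # points at extract_flux2_klein_context plus the FLUX.1 coefficients.
    pipeline.transformer.enable_teacache(teacache_config)

CUSTOM_TEACACHE_ENABLERS["Flux2KleinPipeline"] = enable_flux2_klein_teacache

def maybe_enable_teacache(pipeline: Any, teacache_config: Any) -> None:
    # The backend dispatches on the concrete pipeline class name.
    enabler = CUSTOM_TEACACHE_ENABLERS.get(type(pipeline).__name__)
    if enabler is not None:
        enabler(pipeline, teacache_config)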
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| vllm_omni/diffusion/cache/teacache/hook.py | Added special handling for Flux2 models that have both transformer_blocks and single_transformer_blocks (sketched after this table) |
| vllm_omni/diffusion/cache/teacache/extractors.py | Implemented extract_flux2_klein_context function with preprocessing, transformer execution, and postprocessing logic |
| vllm_omni/diffusion/cache/teacache/config.py | Added Flux2Klein polynomial coefficients (borrowed from FLUX.1) |
| vllm_omni/diffusion/cache/teacache/backend.py | Added enable_flux2_klein_teacache function and registered it in CUSTOM_TEACACHE_ENABLERS |
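To illustrate the dual-stream handling noted for hook.py: a cached forward must cover both block lists so the reused residual reflects the full transformer. A minimal sketch, assuming diffusers-style Flux block signatures (the actual Flux2 Klein signatures may differ):

```python
import torch

def run_flux2_blocks(model, hidden_states, encoder_hidden_states, temb):
    # Dual-stream blocks keep separate image and text streams
    # (rotary embeddings and other per-block kwargs omitted for brevity).
    for block in model.transformer_blocks:
        encoder_hidden_states, hidden_states = block(
            hidden_states=hidden_states,
            encoder_hidden_states=encoder_hidden_states,
            temb=temb,
        )
    # Single-stream blocks run on the concatenated text+image sequence.
    num_text_tokens = encoder_hidden_states.shape[1]
    hidden_states = torch.cat([encoder_hidden_states, hidden_states], dim=1)
    for block in model.single_transformer_blocks:
        hidden_states = block(hidden_states=hidden_states, temb=temb)
    # Strip the text tokens again; the TeaCache hook can then cache the
    # delta between this output and the pre-block hidden states.
    return hidden_states[:, num_text_tokens:]
```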
Force-pushed from 1cb433d to ca90ff0.
Purpose
Add TeaCache support for the Flux2 Klein model, which uses a dual-stream transformer architecture.
Test Plan
Run an image-edit request through the OpenAI-compatible endpoint with TeaCache enabled and confirm an edited image is returned.
Test Result
Tested on 1 × NVIDIA RTX 4090 (24 GB):

```bash
curl -s -X POST "http://localhost:8004/v1/images/edits" \
  -F "image=@test.jpg" \
  -F "prompt=Change the sky to orange sunset." \
  -F "guidance_scale=1.0" \
  -F "num_inference_steps=50" \
  -F "n=1" \
  -F "size=1024x1024" \
  -F "output_format=png" \
  | jq -r '.data[0].b64_json' | base64 --decode > output.png
```