vLLM-Omni Model Support #808

### Motivation.

This section tracks the status of model integrations within the project, distinguishing models that are actively being adapted from those planned for future support. We welcome community contributions to help support these state-of-the-art (SOTA) models.

### Proposed Change.

**P0:**🙋

**Omni Pipeline**
- [x] [Qwen/Qwen3-Omni](https://huggingface.co/Qwen/Qwen3-Omni-30B-A3B-Instruct) #559 
- [x] [Qwen/Qwen2.5-Omni](https://huggingface.co/Qwen/Qwen2.5-Omni-7B) #12 
- [ ] [MiMo-Audio](https://github.com/XiaomiMiMo/MiMo-Audio) #151 #750 
- [ ] [HunyuanImage-3.0/Instruct/Instruct-Distil](https://github.com/Tencent-Hunyuan/HunyuanImage-3.0) #42 #794 
- [x] [Bagel](https://github.com/ByteDance-Seed/Bagel/tree/main) #203 #726 
- [ ] [zai-org/GLM-Image](https://huggingface.co/zai-org/GLM-Image) #792 #799 #763 #920 
- [ ] [naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B](https://huggingface.co/naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B) #743 #585
- [ ] [inclusionAI/Ming-flash-omni-Preview](https://huggingface.co/inclusionAI/Ming-flash-omni-Preview) #692 
- [ ] [stepfun-ai/Step-Audio-R1.1](https://huggingface.co/stepfun-ai/Step-Audio-R1.1) #862 
- [x] [Qwen/Qwen3-TTS](https://huggingface.co/Qwen/Qwen3-TTS-Tokenizer-12Hz) #895
- [ ] [openbmb/MiniCPM-o-4_5](https://huggingface.co/openbmb/MiniCPM-o-4_5) #1182 

**X2I**
- [x] [Z-Image/Z-Image-Turbo](https://github.com/Tongyi-MAI/Z-Image) #99 #149 
- [x] [Qwen-Image/Qwen-Image-2512](https://huggingface.co/Qwen/Qwen-Image-2512) #97 
- [x] [Qwen-Image-Edit](https://github.com/QwenLM/Qwen-Image/blob/main/Qwen-Image-Edit-2509.md) #187 #196  
- [x] [Qwen-Image-Edit-2509](https://huggingface.co/Qwen/Qwen-Image-Edit-2509) #330 
- [x] [Qwen-Image-Edit-2511](https://huggingface.co/Qwen/Qwen-Image-Edit-2511) #321 
- [x] [Qwen-Image-Layered](https://huggingface.co/Qwen/Qwen-Image-Layered) #381 
- [x] [Ovis-Image-7B](https://huggingface.co/AIDC-AI/Ovis-Image-7B) #224 #263 
- [ ] [FLUX.1-Kontext-dev](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev) #359 #561 
- [ ] [black-forest-labs/FLUX.1-schnell](https://huggingface.co/black-forest-labs/FLUX.1-schnell) #574 
- [ ] [FLUX.2[dev]](https://huggingface.co/black-forest-labs/FLUX.2-dev) #153 #302  
- [x] [FLUX.2-klein](https://huggingface.co/black-forest-labs/FLUX.2-klein-9B) #809 
- [ ] [Owen777/UltraFlux-v1](https://huggingface.co/Owen777/UltraFlux-v1) #327 #611 
- [x] [LongCat-Image](https://huggingface.co/meituan-longcat/LongCat-Image) #220 #291 
- [x] [LongCat-Image-Edit](https://huggingface.co/meituan-longcat/LongCat-Image) #338  #392 
- [ ] [stepfun-ai/NextStep-1.1](https://huggingface.co/stepfun-ai/NextStep-1.1) #470 #612

**X2V**
- [x] [Wan-AI/Wan2.2-T2V-A14B-Diffusers](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) #202
- [x] [Wan-AI/Wan2.2-I2V-A14B-Diffusers](https://huggingface.co/Wan-AI/Wan2.2-I2V-A14B-Diffusers) #325 
- [x] [Wan-AI/Wan2.2-TI2V-5B-Diffusers](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B-Diffusers) #329
- [ ] [Wan-AI/Wan2.1-T2V-1.3B](https://huggingface.co/Wan-AI/Wan2.1-T2V-1.3B) #201 #244 
- [ ] [HunyuanVideo-1.5](https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5) #228 #289 
- [ ] [Lightricks/LTX-2](https://huggingface.co/Lightricks/LTX-2) #674 #841 
- [ ] [OpenMOSS-Team/MOVA-720p](https://huggingface.co/OpenMOSS-Team/MOVA-720p) #1092 
- [ ] [SkyReels-V3](https://huggingface.co/collections/Skywork/skyreels-v3) #1093 

**X2S**
- [x] [Stable Audio Open (stabilityai/stable-audio-open-1.0)](https://huggingface.co/stabilityai/stable-audio-open-1.0) #324  #331 
- [ ] [zai-org/GLM-TTS](https://huggingface.co/zai-org/GLM-TTS) #821 #834 

**P1:**🙋

**Omni Pipeline**
- [ ] [LongCat-Flash-Omni](https://arxiv.org/pdf/2511.00279) #213 
- [ ] [Step-Audio2](https://github.com/stepfun-ai/Step-Audio2) #271 #464 
- [ ] [Step-Audio-EditX](https://huggingface.co/stepfun-ai/Step-Audio-EditX) #272 
- [ ] [MammothModa2-Preview](https://huggingface.co/bytedance-research/MammothModa2-Preview) #314 
- [ ] [Fun-Audio-Chat](https://huggingface.co/FunAudioLLM/Fun-Audio-Chat-8B) #452  
- [ ] [Higgs audio v2](https://huggingface.co/bosonai/higgs-audio-v2-generation-3B-base) #894 
- [ ] [Chatterbox TTS](https://huggingface.co/ResembleAI/chatterbox) #899 
- [ ] [VibeVoice-7B](https://huggingface.co/vibevoice/VibeVoice-7B) #184 #999

**X2I**

- [x] [Stable Diffusion 3](https://huggingface.co/papers/2403.03206) #439 
- [ ] [OmniGen2](https://github.com/VectorSpaceLab/OmniGen2) #225 #513 


**X2V**
- [ ] [IndexTeam/Index-anisora](https://huggingface.co/IndexTeam/Index-anisora) #670 
- [ ] [tencent/HunyuanVideo-I2V](https://huggingface.co/tencent/HunyuanVideo-I2V) #383 

**X2S**
- [ ] [index-tts2](https://github.com/index-tts/index-tts) #229 #334 
- [ ] [Fun-CosyVoice3-0.5B](https://huggingface.co/FunAudioLLM/Fun-CosyVoice3-0.5B-2512) #315 #498 


### Feedback Period.

_No response_

### CC List.

@hsliuustc0106 @ZJY0516 @SamitHuang @wtomin @david6666666 

### Any Other Things.

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://vllm-omni.readthedocs.io), which can answer lots of frequently asked questions.
