Skip to content

[New Model]: XiaomiMiMo/MiMo-Audio-7B-Instruct support#750

Open
qibaoyuan wants to merge 239 commits intovllm-project:mainfrom
qibaoyuan:feature_mimo_audio
Open

[New Model]: XiaomiMiMo/MiMo-Audio-7B-Instruct support#750
qibaoyuan wants to merge 239 commits intovllm-project:mainfrom
qibaoyuan:feature_mimo_audio

Conversation

@qibaoyuan
Copy link
Contributor

@qibaoyuan qibaoyuan commented Jan 12, 2026

The model to consider

LLM Model Weights: https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Instruct
Audio Tokenizer Weights:https://huggingface.co/XiaomiMiMo/MiMo-Audio-Tokenizer
Model Code: https://github.com/XiaomiMiMo/MiMo-Audio

Model description

This PR enables stage-based deployment for the Mimo-Audio model, aligning it with the vllm-omni architecture. Specific changes include:

Added Stage Configuration

Introduced vllm_omni/model_executor/stage_configs/mimo_audio.yaml to define the multi-stage pipeline.

Refactored Model Structure: Split the Mimo-Audio into two stages:

Stage 0 (LLM+LocalForward)

  • Uses MiMoAudioLLMForConditionalGeneration (AR mode) for multimodal understanding and text generation.

Stage 1 (code2wav):

  • Uses MiMoAudioToken2WavForConditionalGenerationVLLM for video generation.

Test plan

NOTE: The MIMO_AUDIO_TOKENIZER_PATH environment variable is mandatory due to the specialized architecture.

offline

export MIMO_AUDIO_TOKENIZER_PATH="XiaomiMiMo/MiMo-Audio-Tokenizer"

python3 -u end2end.py \
  --stage-configs-path vllm_omni/model_executor/stage_configs/mimo_audio.yaml \
  --model-name XiaomiMiMo/MiMo-Audio-7B-Instruct \
  --query-type tts_sft \
  --text "The weather is so nice today."

online

server side:

export MIMO_AUDIO_TOKENIZER_PATH="XiaomiMiMo/MiMo-Audio-Tokenizer"

vllm-omni serve XiaomiMiMo/MiMo-Audio-7B-Instruct --omni \
--served-model-name "MiMo-Audio-7B-Instruct"  \
--port 8091 --stage-configs-path ./vllm_omni/model_executor/stage_configs/mimo_audio.yaml \
--chat-template ./examples/online_serving/mimo_audio/chat_template.jinja

client:

python openai_chat_completion_client_for_multimodal_generation.py \
--query-type multi_audios \
--message-json ../../offline_inference/mimo_audio/message_base64_wav.json

Test result

TTS(with reference audio)

0_6ca65429-1027-4797-963e-963a1de6c286.wav

Audio understanding

text:

vllm_text_output:
北京今天晴转多云,气温在25到34摄氏度之间,南风三到四级。体感会比较闷热,建议你注意防暑降温,并且尽量避开中午最热的时候出门。另外,紫外线也比较强,记得做好防晒措施。

audio:

0_43fce45f-3809-48f8-b059-351072a4743c.wav

Co-author notice

Please keep the Co-authored-by lines when squashing, as this PR includes contributions from multiple authors (including @Zhang Shijin, @Ding Ning).

Thank you!

hsliuustc0106 and others added 24 commits January 12, 2026 16:58
Signed-off-by: hsliu <liuhongsheng4@huawei.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
sync

Signed-off-by: Baoyuan Qi <qibaoyuan@126.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: gcanlin <canlinguosdu@gmail.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
…ct#718)

Signed-off-by: wuzhongjian <wuzhongjian_yewu@cmss.chinamobile.com>
Signed-off-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Co-authored-by: WeiQing Chen <40507679+david6666666@users.noreply.github.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: gcanlin <canlinguosdu@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
…lm-project#735)

Signed-off-by: dongbo910220 <1275604947@qq.com>
Signed-off-by: dongbo910220 <32610838+dongbo910220@users.noreply.github.com>
Signed-off-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
…vllm-project#697)

Signed-off-by: Junhong Liu <98734602+LJH-LBJ@users.noreply.github.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: hsliu <liuhongsheng4@huawei.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: wangyu31577 <wangyu31577@hundsun.com>
Co-authored-by: wangyu31577 <wangyu31577@hundsun.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: hsliu <liuhongsheng4@huawei.com>
Signed-off-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Signed-off-by: GG-li <3226868735@qq.com>
Signed-off-by: Sihao Li <111170255+GG-li@users.noreply.github.com>
Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
…llm-project#721)

Signed-off-by: XU Mingshi <91017482+mxuax@users.noreply.github.com>
Signed-off-by: mxuax <mxuax@connect.ust.hk>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Copy link

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 734774bb95

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Co-authored-by: Dovis01(shijin zhang) <zsj1364226740@gmail.com>

Signed-off-by: Baoyuan Qi <qibaoyuan@126.com>
qibaoyuan and others added 23 commits February 2, 2026 07:52
# Conflicts:
#	vllm_omni/worker/gpu_model_runner.py
Signed-off-by: Shijin Zhang <75300765+Dovis01@users.noreply.github.com>

Co-authored-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: Shijin Zhang <75300765+Dovis01@users.noreply.github.com>

Co-authored-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: Shijin Zhang <75300765+Dovis01@users.noreply.github.com>

Co-authored-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: Shijin Zhang <75300765+Dovis01@users.noreply.github.com>

Co-authored-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: ning ding <nndding@gmail.com>
qibaoyuan and others added 6 commits February 6, 2026 07:35
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: Shijin Zhang <75300765+Dovis01@users.noreply.github.com>

Co-authored-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: ning ding <nndding@gmail.com>
Signed-off-by: 齐保元 <qibaoyuan@xiaomi.com>
Co-authored-by: shijin zhang <75300765+Dovis01@users.noreply.github.com>
Co-authored-by: ning ding <nndding@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.