Add support for mistral3 (no vision encoding)#627

Draft
alex-jw-brooks wants to merge 1 commit into vllm-project:main from alex-jw-brooks:mistral_text_only
Conversation

@alex-jw-brooks
Collaborator

Description

Supports running only the text part of multimodal mistral3 models through vLLM Spyre; I haven't been able to validate on AIU quite yet, but have at least ensured I can run it with the eager backend.

This PR is dependent on the following PR in FMS: foundation-model-stack/foundation-model-stack#501

I will follow up in both FMS and here (after the granite vision PR is merged) to add support for pixtral in separate PRs & implement the multimodal utils.

Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
@github-actions

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, run ./format.sh.
Now you are good to go 🚀.

We also recommend installing prek and configuring it to check your code before every local commit.

@alex-jw-brooks changed the title from "Add support for mistal3 (no vision encoding)" to "Add support for mistral3 (no vision encoding)" on Jan 21, 2026

# TODO (Alex) move this to utils after granite vision
# is merged.
if hasattr(cfg, "text_config"):
    return cfg.text_config.src_vocab_size
Collaborator

nit: next line can be modified to be cfg.src_vocab_size
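
For context, here is a minimal sketch of the lookup this hunk appears to implement, with the fall-through line written the way the nit suggests. The function name `get_src_vocab_size` and the `SimpleNamespace` stand-ins are hypothetical and used only for illustration; the only names taken from the diff are `cfg`, `text_config`, and `src_vocab_size`. It assumes a multimodal config nests the language model settings under `text_config`, while a text-only config exposes `src_vocab_size` at the top level.

```python
from types import SimpleNamespace


def get_src_vocab_size(cfg):
    """Return the vocab size, preferring the nested text config if present.

    Hypothetical helper name; the real code lives inline in the PR.
    """
    # Assumption: multimodal configs carry the text model settings under
    # `text_config`, as in the hunk under review.
    if hasattr(cfg, "text_config"):
        return cfg.text_config.src_vocab_size
    # Text-only configs are assumed to expose the attribute directly.
    return cfg.src_vocab_size


# Hypothetical stand-ins for model configs; the values are made up.
text_only_cfg = SimpleNamespace(src_vocab_size=32000)
multimodal_cfg = SimpleNamespace(
    text_config=SimpleNamespace(src_vocab_size=131072)
)

print(get_src_vocab_size(text_only_cfg))
print(get_src_vocab_size(multimodal_cfg))
```

Keeping the `hasattr` check in one helper would also make the planned move to utils (per the TODO) a single-call-site change.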
