Add batch generate and deploy model scripts by dongwang218 · Pull Request #127 · facebookresearch/matrix

dongwang218 · 2026-01-09T00:33:15Z

Why ?

Allow users to programmable start cluster and deploy model
offline batch processing need a familar API

How ?

Script to start cluster and deploy models
Batch generate api similar to vllm llm.generate
also fix the cache issue by making cache dir torch and vllm version specific.

Test plan

pytest tests/integration/app_server/test_qwen3vl_batch_generate.py -v -s -x

…ra_body, not working

xlei77

LGTM

…loyment

…'t show up until nodes are allocated

dongwang218 added 5 commits January 8, 2026 04:14

add script to deploy models and wait for them to be ready

5117d6a

add batch generate api

da931dc

attemp using vllm native format, and send the multi_modal_data as ext…

ba24ec2

…ra_body, not working

get rid of vllm style

7715ba2

simplify

33bbf6d

dongwang218 requested review from swdanielli and yangli5t as code owners January 9, 2026 00:33

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 9, 2026

lint

390a5f0

xlei77 approved these changes Jan 9, 2026

View reviewed changes

dongwang218 added 6 commits January 9, 2026 17:03

update deploy_models api

50e7939

test num of running replica to avoid redeploy, also remove failed dep…

dbe9830

…loyment

try to use add for deploy_models

8bb6cb0

revert total resource check, when request workers, their resource won…

4f51e5e

…'t show up until nodes are allocated

set version specific cache dir

ae631f3

lint

17ee52d

dongwang218 merged commit a0252c1 into main Jan 11, 2026
8 checks passed

dongwang218 deleted the offline_scripts branch January 11, 2026 05:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add batch generate and deploy model scripts#127

Add batch generate and deploy model scripts#127
dongwang218 merged 12 commits intomainfrom
offline_scripts

dongwang218 commented Jan 9, 2026 •

edited

Loading

Uh oh!

xlei77 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dongwang218 commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why ?

How ?

Test plan

Uh oh!

xlei77 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dongwang218 commented Jan 9, 2026 •

edited

Loading