Skip to content

Add batch generate and deploy model scripts#127

Merged
dongwang218 merged 12 commits intomainfrom
offline_scripts
Jan 11, 2026
Merged

Add batch generate and deploy model scripts#127
dongwang218 merged 12 commits intomainfrom
offline_scripts

Conversation

@dongwang218
Copy link
Contributor

@dongwang218 dongwang218 commented Jan 9, 2026

Why ?

Allow users to programmable start cluster and deploy model
offline batch processing need a familar API

How ?

Script to start cluster and deploy models
Batch generate api similar to vllm llm.generate
also fix the cache issue by making cache dir torch and vllm version specific.

Test plan

pytest tests/integration/app_server/test_qwen3vl_batch_generate.py -v -s -x

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 9, 2026
Copy link
Contributor

@xlei77 xlei77 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dongwang218 dongwang218 merged commit a0252c1 into main Jan 11, 2026
8 checks passed
@dongwang218 dongwang218 deleted the offline_scripts branch January 11, 2026 05:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants