
fix served_model_name #121

Merged
dongwang218 merged 1 commit into main from model_args on Dec 10, 2025
Conversation

@yangli5t
Contributor

Why?

To make served_model_name work properly. With this arg, a customized model name can be passed in when querying the model.
Arguments doc: https://docs.vllm.ai/en/v0.8.3/serving/openai_compatible_server.html
The implementation follows https://github.com/vllm-project/vllm/blob/73a484caa1ad320d6e695f098c25c479a71e6774/vllm/entrypoints/openai/api_server.py#L1090-L1092
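
For context, the referenced vLLM code falls back to the model path when no served name is given. A minimal sketch of that pattern (the function name and signature are approximations for illustration, not the exact vLLM source):

# Approximate sketch of vLLM's served-model-name resolution:
# if --served-model-name is set, clients query with one of those names;
# otherwise the model path itself is exposed as the model name.
def resolve_served_model_names(model: str, served_model_name: list[str] | None) -> list[str]:
    if served_model_name:
        return served_model_name
    return [model]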

How?

Simply add "served_model_name" to the application config in the deploy command (pretty-printed below for readability), e.g. python -m matrix deploy_applications --applications '[{"model_name": "meta-llama/Llama-3.1-8B-Instruct", "model_size": "8B", "name": "8B", "min_replica": 1, "max_replica": 10, "enable_tools": "true", "served_model_name": "8B-matrix"}]'
Then in query_llm.make_request, pass model="8B-matrix".
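
The same --applications payload, pretty-printed (content identical to the command above):

[
  {
    "model_name": "meta-llama/Llama-3.1-8B-Instruct",
    "model_size": "8B",
    "name": "8B",
    "min_replica": 1,
    "max_replica": 10,
    "enable_tools": "true",
    "served_model_name": "8B-matrix"
  }
]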

Test plan

End-to-end test:

from openai import OpenAI

# Placeholder endpoint; point at the deployed OpenAI-compatible server.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
messages = [{"role": "user", "content": "Hello!"}]

response = client.chat.completions.create(
        model='8B-matrix',  # the served_model_name, not the HF model path
        messages=messages,
        temperature=0.6,
        top_p=0.95,
)
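
To verify the custom name is exposed, the OpenAI-compatible model list can be checked as well (same placeholder client as above); "8B-matrix" should appear instead of the original HF path:

# List model IDs exposed by the endpoint.
for m in client.models.list():
    print(m.id)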

@meta-cla bot added the CLA Signed label on Dec 10, 2025
@dongwang218 merged commit 5b5f6b2 into main on Dec 10, 2025
8 checks passed
@dongwang218 deleted the model_args branch on December 10, 2025 at 22:28
