Skip to content

refactor to move under matrix.client.query_llm#5

Merged
dongwang218 merged 4 commits intomainfrom
dong/refactor1
Apr 16, 2025
Merged

refactor to move under matrix.client.query_llm#5
dongwang218 merged 4 commits intomainfrom
dong/refactor1

Conversation

@dongwang218
Copy link
Contributor

Why ?

query_llm was in the matrix.app_server.llm, now move it to client, so that we can add other clients.

How ?

similarly move EndpointCache to client.
move the llm default parameters into a separate file llm_config.py

Test plan

matrix deploy_applications --applications "[{'model_name': '/datasets/pretrained-llms/Llama-3.1-8B-Instruct', 'use_grpc': 'true', 'min_replica': 8, 'model_size': '8B', 'name': '8B_grpc'}]"
matrix check_health --app_name 8B_grpc

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 16, 2025
@dongwang218 dongwang218 requested a review from yangli5t April 16, 2025 06:07
@dongwang218 dongwang218 merged commit 6d11f4a into main Apr 16, 2025
5 checks passed
@dongwang218 dongwang218 deleted the dong/refactor1 branch April 16, 2025 19:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants