enable lora modules by yangli5t · Pull Request #4 · facebookresearch/matrix

yangli5t · 2025-04-14T20:02:59Z

Why ?

add a lora model example.
In this PR, we explicitly call init_static_loras() to load lora_request, this is due a change in vllm @ 0.7.0 vllm-project/vllm@ac2f3f7. Not sure why they made such change..

How ?

when deploy, need to specify lora-modules
e.g. {'lora-modules': 'sql-lora=/checkpoint/comem/huggingface_models/mistral_based_claim_extractor'}

and then in make_request, model="sql-lora"

dongwang218 · 2025-04-15T00:44:30Z

matrix/app_server/deploy_utils.py

            if "max_replica" not in app:
                app["max_replica"] = app["min_replica"]

+            if app_type == "llm":


change to if app_type in ["llm", "sglang_llm"]

dongwang218 · 2025-04-15T00:45:29Z

matrix/app_server/llm/ray_serve_vllm.py

+    async def check_health(self):
+        if self.healthy:
+            return {"status": "healthy"}
+        else:
+            raise RuntimeError("Replica unhealthy!")  # Triggers Ray Serve restart


let us merge #3 and rebase

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 14, 2025

dongwang218 reviewed Apr 15, 2025

View reviewed changes

enable lora modules

032b149

yangli5t force-pushed the new_model branch from 9da8e2d to 032b149 Compare April 15, 2025 01:08

dongwang218 approved these changes Apr 15, 2025

View reviewed changes

dongwang218 merged commit 573cb6a into main Apr 16, 2025
5 checks passed

dongwang218 deleted the new_model branch April 16, 2025 04:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

enable lora modules#4

enable lora modules#4
dongwang218 merged 1 commit intomainfrom
new_model

yangli5t commented Apr 14, 2025

Uh oh!

dongwang218 Apr 15, 2025

Uh oh!

dongwang218 Apr 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yangli5t commented Apr 14, 2025

Why ?

How ?

Uh oh!

dongwang218 Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

dongwang218 Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants