Skip to content

enable lora modules#4

Merged
dongwang218 merged 1 commit intomainfrom
new_model
Apr 16, 2025
Merged

enable lora modules#4
dongwang218 merged 1 commit intomainfrom
new_model

Conversation

@yangli5t
Copy link
Contributor

Why ?

add a lora model example.
In this PR, we explicitly call init_static_loras() to load lora_request, this is due a change in vllm @ 0.7.0 vllm-project/vllm@ac2f3f7. Not sure why they made such change..

How ?

when deploy, need to specify lora-modules
e.g. {'lora-modules': 'sql-lora=/checkpoint/comem/huggingface_models/mistral_based_claim_extractor'}

and then in make_request, model="sql-lora"

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 14, 2025
if "max_replica" not in app:
app["max_replica"] = app["min_replica"]

if app_type == "llm":
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

change to if app_type in ["llm", "sglang_llm"]

Comment on lines +344 to +348
async def check_health(self):
if self.healthy:
return {"status": "healthy"}
else:
raise RuntimeError("Replica unhealthy!") # Triggers Ray Serve restart
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let us merge #3 and rebase

@dongwang218 dongwang218 merged commit 573cb6a into main Apr 16, 2025
5 checks passed
@dongwang218 dongwang218 deleted the new_model branch April 16, 2025 04:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants