Open
Description
- Host LLMs with sleep and wake endpoints to support multiple models: vllm-project/vllm#299 (comment)
- Move LLM hosting to k8s-based service deploy
- Move Shiny apps to k8s-based deploy