🍌
- New York
Pinned Loading
-
llm-d/llm-d
llm-d/llm-d PublicAchieve state of the art inference performance with modern accelerators on Kubernetes
-
llm-d-incubation/llm-d-fast-model-actuation
llm-d-incubation/llm-d-fast-model-actuation PublicKubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

