diegocastanibm

Follow

🍌

Diego Castan diegocastanibm

🍌

Follow

3 followers · 0 following

@IBM
New York

Pinned Loading

llm-d/llm-d llm-d/llm-d Public

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3k 428
llm-d-incubation/llm-d-fast-model-actuation llm-d-incubation/llm-d-fast-model-actuation Public

Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping

Go 11 14