llm-inference

Contributions to accelerate and scale LLM inference. For now, it contains a simulator of the vLLM scheduling strategy.