Change the repository type filter
All
Repositories list
26 repositories
tpu-inference
Public- Intelligent Router for Mixture-of-Models
guidellm
Public- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- Community maintained hardware plugin for vLLM on Ascend
flash-attention
Publicrecipes
Publicvllm-openvino
Publicrfcs
Publicdashboard
Public