Neural Magic
Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM
Pinned Loading
Repositories
Showing 10 of 71 repositories
- model-validation-configs Public
neuralmagic/model-validation-configs’s past year of commit activity - compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
neuralmagic/compressed-tensors’s past year of commit activity - gateway-api-inference-extension Public Forked from kubernetes-sigs/gateway-api-inference-extension
Gateway API Inference Extension
neuralmagic/gateway-api-inference-extension’s past year of commit activity - speculators Public
neuralmagic/speculators’s past year of commit activity - lighteval Public Forked from huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
neuralmagic/lighteval’s past year of commit activity
Top languages
Loading…