This repository was archived by the owner on Aug 30, 2024. It is now read-only.
An innovative library for efficient LLM inference via low-bit quantization
intel/neural-speed