New Users: Expert Kit, A Distributed, Expert-Centric Framework for MoE LLM Inference #6153
Xuanwo
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
https://github.com/expert-kit/expert-kit
They are using opendal to save/load tensors from local fs or s3.
Expert Kit (EK)is a high-performance framework for scalable MoE (Mixture of Experts) LLM inference. The vision of EK is to provide an efficient foundation of Expert Parallelism (EP) on heterogeneous hardware (e.g., CPU and GPU) over commodity networks (e.g. PCIe, TCP, RDMA), thereby enabling easy deployment and fine-grained expert-level scaling.Beta Was this translation helpful? Give feedback.
All reactions