New Users: Expert Kit, A Distributed, Expert-Centric Framework for MoE LLM Inference #6153

Xuanwo · 2025-05-07T02:42:39Z

Xuanwo
May 7, 2025
Collaborator

https://github.com/expert-kit/expert-kit

They use OpenDAL to save and load tensors from the local filesystem or S3.
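To illustrate the idea of backend-agnostic tensor storage, here is a minimal standard-library sketch. It is not Expert Kit's code and not OpenDAL's API: `BlobStore`, `LocalFsStore`, `save_tensor`, `load_tensor`, and the toy binary layout are all hypothetical names and assumptions made for this example. In a real setup, an OpenDAL operator configured for the local filesystem or S3 would take the place of the store object.

```python
import struct
from pathlib import Path
from typing import Protocol


class BlobStore(Protocol):
    """Hypothetical minimal storage interface: whole-object write and read."""

    def write(self, path: str, data: bytes) -> None: ...
    def read(self, path: str) -> bytes: ...


class LocalFsStore:
    """Local-filesystem backend rooted at a directory (illustrative only)."""

    def __init__(self, root: str) -> None:
        self.root = Path(root)

    def write(self, path: str, data: bytes) -> None:
        target = self.root / path
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(data)

    def read(self, path: str) -> bytes:
        return (self.root / path).read_bytes()


def save_tensor(store: BlobStore, path: str,
                shape: tuple[int, ...], values: list[float]) -> None:
    # Toy layout (an assumption): u32 rank, u32 dims, then little-endian f32 values.
    header = struct.pack(f"<I{len(shape)}I", len(shape), *shape)
    body = struct.pack(f"<{len(values)}f", *values)
    store.write(path, header + body)


def load_tensor(store: BlobStore, path: str) -> tuple[tuple[int, ...], list[float]]:
    raw = store.read(path)
    (rank,) = struct.unpack_from("<I", raw, 0)
    shape = struct.unpack_from(f"<{rank}I", raw, 4)
    offset = 4 + 4 * rank
    count = (len(raw) - offset) // 4
    values = list(struct.unpack_from(f"<{count}f", raw, offset))
    return shape, values
```

Because the save and load functions only depend on `write` and `read`, swapping the local backend for an object-store backend would not change the tensor code, which is the property the post attributes to using OpenDAL.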

Expert Kit (EK) is a high-performance framework for scalable MoE (Mixture of Experts) LLM inference. The vision of EK is to provide an efficient foundation for Expert Parallelism (EP) on heterogeneous hardware (e.g., CPU and GPU) over commodity networks (e.g., PCIe, TCP, RDMA), thereby enabling easy deployment and fine-grained expert-level scaling.

Replies: 0 comments
