
llm-d

llm-d enables high-performance distributed inference in production on Kubernetes.


Find us on Slack, X (formerly Twitter), Bluesky, LinkedIn, Reddit, and YouTube.

llm-d is a Kubernetes-native high-performance distributed LLM inference framework that provides the fastest time-to-value and competitive performance per dollar. Built on vLLM, Kubernetes, and Inference Gateway, llm-d offers modular solutions for distributed inference with features like KV-cache aware routing and disaggregated serving.

🚀 Quick Start Guide

New to llm-d? Here's how to get started:

  1. 💬 Join our Slack: get your invite and visit llm-d.slack.com
  2. 📂 Explore our code: browse the GitHub Organization
  3. 📅 Join a meeting: add the community calendar
  4. 🎯 Pick your area: browse the Special Interest Groups

🗓️ Regular Meetings

All meetings are open to the public! 🌟

  • 📅 Weekly Standup: Every Wednesday at 12:30pm ET - Project updates and open discussion
  • 🎯 SIG Meetings: Various times throughout the week - See SIG details for schedules

Join to participate, ask questions, or just listen and learn!

🎯 Special Interest Groups (SIGs)

Want to dive deeper into specific areas? Our Special Interest Groups are focused teams working on different aspects of llm-d:

  • Inference Scheduler - Intelligent request routing and load balancing
  • Benchmarking - Performance testing and optimization
  • PD-Disaggregation - Prefill/decode separation patterns
  • KV-Disaggregation - KV caching and distributed storage
  • Installation - Kubernetes integration and deployment
  • Autoscaling - Traffic-aware autoscaling and resource management
  • Observability - Monitoring, logging, and metrics

View more SIG Details →

🤝 How to Contribute

Contributing Code

  1. Read Guidelines: Review our Code of Conduct and contribution process
  2. Sign Commits: All commits require DCO sign-off (git commit -s)
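
As a quick illustration of the sign-off requirement above, the sketch below creates a throwaway repository (the path, identity, and commit message are placeholders) and makes a signed-off commit; `git commit -s` appends a `Signed-off-by:` trailer built from your configured `user.name` and `user.email`, which is what the DCO check on pull requests looks for:

```shell
# Create a scratch repository (placeholder path) so the example is self-contained.
tmpdir=$(mktemp -d)
cd "$tmpdir"
git init -q .
git config user.name "Jane Developer"     # placeholder identity
git config user.email "jane@example.com"  # placeholder identity

echo "demo" > README.md
git add README.md
# -s (--signoff) appends the DCO sign-off trailer to the commit message.
git commit -q -s -m "docs: add demo README"

# Show the full commit message, including the trailer:
# docs: add demo README
#
# Signed-off-by: Jane Developer <jane@example.com>
git log -1 --format=%B
```

If you forget the flag on an existing commit, `git commit --amend -s` adds the trailer without changing the rest of the message.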

Ways to Contribute

  • 🐛 Bug fixes and small features - Submit PRs directly to component repos
  • 🚀 New features with APIs - Require project proposals
  • 📚 Documentation - Help improve guides and examples
  • 🧪 Testing & Benchmarking - Contribute to our test coverage
  • 💡 Experimental features - Start in llm-d-incubation org

🌐 Connect With Us

Follow llm-d across social platforms for updates, discussions, and community highlights:

❓ Need Help?

Questions? Ideas? Just want to chat? We're here to help! The llm-d community team is friendly and responsive.


License: Apache 2.0

📦 Repositories (showing 10 of 16)

  • llm-d (Shell): Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
  • llm-d-inference-scheduler (Go): Inference scheduler for llm-d
  • llm-d-kv-cache (Go): Distributed KV cache scheduling and offloading libraries
  • llm-d-benchmark (Python): llm-d benchmark scripts and tooling
  • llm-d-inference-sim (Go): A lightweight vLLM simulator for mocking out replicas
  • llm-d-workload-variant-autoscaler (Go): Variant optimization autoscaler for distributed inference workloads
  • llm-d-python-template (template): Python project template for llm-d repos; use "Use this template" to create a new Python project with standard CI, linting, Prow, and governance
  • llm-d-go-template (template): Go microservice template for llm-d repos; use "Use this template" to create a new Go project with standard CI, linting, Prow, and governance
  • llm-d.github.io (JavaScript): Website for llm-d; builds the site seen at llm-d.ai
  • llm-d-infra (Makefile): CI and infrastructure required to maintain llm-d org member repos