docs: update README to describe Capacity Planner and GPU Recommender #182
amito wants to merge 2 commits into llm-d-incubation:main
Conversation
@amito @jgchn @namasl If we are going to rename Capacity Planner and GPU Recommender, now is probably the time to do it. I've updated the doc to call them "Capacity Analyzer" and "Performance Analyzer", but that's just a suggestion. Please let me know what you think.
Hi @amito @anfredette thanks for starting this. I wonder if we can tie it to the LLM aspects more. Thoughts on "LLM Memory Analyzer" and "Inference Performance Analyzer"?
Actually "analyzer" makes me think that this is on benchmarked data rather than on estimated data. Maybe it's okay given that we eventually want just one unified user experience.
- **⚡ One-Click Deployment** - Generate production-ready KServe/vLLM YAML and deploy to Kubernetes
- **📈 Performance Monitoring** - Track actual deployment status and test inference in real-time
- **💻 GPU-Free Development** - vLLM simulator enables local testing without GPU hardware
- **Conversational Requirements Gathering** - Describe your use case in natural language
- **Conversational Requirements Gathering** - Describe your AI-powered use case in natural language
**Required before running `make setup`:**

- **macOS or Linux** (Windows via WSL2)
- **Docker Desktop** (must be running)
Is Docker Desktop required? I thought plain Docker or Podman would work.
Docker or Podman should do. I'll fix that.
6. **Security Hardening** - YAML validation, RBAC, network policies
7. **Multi-Tenancy** - Namespaces, resource quotas, isolation
8. **Advanced Simulation** - SimPy, Monte Carlo for what-if analysis
1. **Prefill/Decode Disaggregation** - Support P/D disaggregation as a first-class deployment topology
I would also add exposing llm-d stack-level configuration (e.g., routing) in addition to P/D, as well as finer-grained vLLM params.
The README previously only described the conversational recommendation engine (originally NeuralNav). Updated to cover all three capabilities: Capacity Planner (GPU memory estimation), GPU Recommender (roofline performance prediction), and the estimated performance fallback. Added CLI section, updated feature list, key technologies, and milestone history.

Signed-off-by: Amit Oren <amoren@redhat.com>
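The "GPU memory estimation" the commit message refers to can be sketched as weights plus KV cache plus a fixed overhead. This is only an illustrative approximation, not the project's actual code; the function name, default parameters, and overhead fraction below are all assumptions:

```python
def estimate_gpu_memory_gb(
    num_params_b: float,       # model size in billions of parameters
    bytes_per_param: int = 2,  # fp16/bf16 weights
    num_layers: int = 32,      # assumed architecture defaults below
    num_kv_heads: int = 8,
    head_dim: int = 128,
    seq_len: int = 4096,
    batch_size: int = 1,
    kv_bytes: int = 2,         # fp16 KV cache
    overhead_frac: float = 0.1,
) -> float:
    """Rough GPU memory estimate: weights + KV cache + overhead, in GB."""
    weights = num_params_b * 1e9 * bytes_per_param
    # KV cache: 2 tensors (K and V) per layer, per token, per KV head
    kv_cache = (
        2 * num_layers * num_kv_heads * head_dim
        * seq_len * batch_size * kv_bytes
    )
    return (weights + kv_cache) * (1 + overhead_frac) / 1e9

# e.g. an 8B-parameter model at fp16 with a 4K context: ~18.2 GB
print(round(estimate_gpu_memory_gb(8), 1))
```

A real planner would also account for activation memory, paged-attention block granularity, and the serving engine's memory-utilization cap.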
Rename capabilities to Planner, Capacity Analyzer, and Performance Analyzer. Rewrite overview to highlight the unified platform story and how the analyzers both stand alone and feed into the Planner workflow. Merge redundant feature sections, align future enhancements with the llm-d-planner proposal, add Linux prereq links, and fix the contributing section.

Signed-off-by: Andre Fredette <afredette@redhat.com>
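The "roofline performance prediction" mentioned for the Performance Analyzer bounds throughput by whichever of compute or memory bandwidth saturates first. A minimal sketch, where the function name and the GPU peak numbers are illustrative assumptions rather than values from the project:

```python
def roofline_tokens_per_s(
    flops_per_token: float,  # compute per generated token (~2 * params)
    bytes_per_token: float,  # HBM traffic per token (weights + KV reads)
    peak_tflops: float,      # GPU peak compute, TFLOP/s
    peak_bw_gbs: float,      # GPU peak memory bandwidth, GB/s
) -> float:
    """Roofline bound: min of the compute-limited and bandwidth-limited rates."""
    compute_bound = peak_tflops * 1e12 / flops_per_token
    memory_bound = peak_bw_gbs * 1e9 / bytes_per_token
    return min(compute_bound, memory_bound)

# 8B fp16 model, single-stream decode, illustrative H100-class peaks:
params = 8e9
tps = roofline_tokens_per_s(
    flops_per_token=2 * params,  # ~2 FLOPs per parameter per token
    bytes_per_token=2 * params,  # read every fp16 weight once per token
    peak_tflops=989,             # assumed dense fp16 peak
    peak_bw_gbs=3350,            # assumed HBM bandwidth
)
print(round(tps))
```

Single-stream decode is memory-bandwidth bound under these assumptions, which is why batching (amortizing weight reads across requests) raises throughput until the compute roof takes over.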
Force-pushed f65c47d to 6dddcb9