docs: Add a setup documentation about examples/kv-cache-index by buraksekili · Pull Request #38 · llm-d/llm-d-kv-cache

buraksekili · 2025-06-06T14:48:35Z

This PR re-adds the setup docs for examples/kv-cache-index/main.go, with minor updates:

explains the environment variables needed for examples/kv-cache-index/main.go
parses those environment variables in the code
updates context cancellation, just in case

vMaroon · 2025-06-06T15:01:08Z

Thanks @buraksekili. I think this can be promoted to a deployment doc instead of a phase-1 example, since "phase-1" could be confusing at this stage.

You can port and optionally improve the helm-guide, and perhaps extend it with the deployment of the inference-scheduler if you feel that it serves your needs. What do you think?

buraksekili · 2025-06-06T15:08:54Z

Sure, sounds good to me! Just to confirm, does that mean I can also use llm-d-inference-sim here for demonstrating the kv-cache-manager examples?

vMaroon · 2025-06-06T15:34:41Z

The simulator does not simulate KV-cache events yet. In any case, the helm-chart covers the vLLM deployments with LMCache and the Redis instance.

The kv-cache-index example can then be used, but the model name must match the one configured in the helm-chart.
It would be more interesting to have a simple router that uses the scorer example instead, but that can be a follow-up (it should be tracked in an issue). Does this make sense to you?

buraksekili · 2025-06-06T17:54:00Z

Thank you, @vMaroon, for your help here! I've just updated the docs and the example manager according to the charts.

Your suggestion about extending the current charts with the scheduler sounds good to me. However, due to limited time, I won’t be able to look into it until later next week. If no one picks it up by then, I’ll be happy to take a look.

vMaroon

Thank you for this contribution. This will certainly help other users!
Added some comments.

docs/deployment/setup.md

buraksekili · 2025-06-11T17:51:30Z

Thanks @vMaroon, I've updated the PR based on your suggestions. Could you please have a look at the PR when you have time?

vMaroon

Thank you for the update, and apologies for the delay in reviewing. Minor changes left, then ready to go!

docs/deployment/setup.md

examples/kv-cache-index/main.go

docs/deployment/setup.md

buraksekili · 2025-06-22T13:55:39Z

Thank you @vMaroon for the help! I have updated the code accordingly. Please have a look.

vMaroon

Thanks, and apologies for the long process. Final set of suggestions, then merging.

docs/deployment/setup.md

…to parse redis related environment variables
move docs to deployment subfolder for brevity
use lower-case version of the vllm model label in the vllm deployment metadata, to prevent Kubernetes issues with models that contain uppercase letters in their names
implement suggested changes according to the review
update release name for vllm helm deployment, to make it align with the purpose of the deployment
exit in case of errors in example
fix linter
Merge
add docs about kv-cache-index setup, and allow the example code base to parse redis related environment variables
Apply suggestions from code review (Co-authored-by: Maroon Ayoub <Maroonay@gmail.com>)
fix duplicate package

Signed-off-by: Burak Sekili <32663655+buraksekili@users.noreply.github.com>

vMaroon

Thank you for your contribution!

LGTM

vMaroon requested changes Jun 6, 2025

buraksekili force-pushed the docs/example-kv-cache-index branch from 4f0d841 to 3e2f6be Compare June 16, 2025 08:34

vMaroon requested changes Jun 20, 2025

buraksekili force-pushed the docs/example-kv-cache-index branch from 908ede4 to a22a39f Compare June 22, 2025 13:52

vMaroon requested changes Jun 23, 2025

buraksekili force-pushed the docs/example-kv-cache-index branch from 2e3d54a to 770ada5 Compare June 25, 2025 06:50

buraksekili force-pushed the docs/example-kv-cache-index branch from 770ada5 to 33d1260 Compare June 25, 2025 06:53

remove old file

3a977ea

Signed-off-by: Burak Sekili <32663655+buraksekili@users.noreply.github.com>

vMaroon approved these changes Jun 26, 2025

vMaroon merged commit 2d3b68d into llm-d:main Jun 26, 2025
1 check passed
