
cache config llm-d#5

Draft
DolevAdas wants to merge 2 commits into llm-d-incubation:main from DolevAdas:cache-config

Conversation

@DolevAdas

Modify cache memory settings in existing llm-d deployments without a full redeployment. Adjust GPU memory utilization, KV cache capacity, shared memory, block size, and context length to optimize performance for different workload patterns.
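The settings listed above correspond to cache-related engine flags on the vLLM model server that llm-d deploys. As a minimal sketch of how such a change could be rendered into server arguments, the snippet below maps a small config object onto vLLM's CLI flags (`--gpu-memory-utilization`, `--block-size`, `--max-model-len`, `--swap-space` are real vLLM flags; the `CacheConfig` dataclass and its defaults are assumptions for illustration, not part of llm-d):

```python
# Hypothetical sketch: render cache-related settings into vLLM CLI flags
# for an llm-d model-server container. CacheConfig is illustrative only.
from dataclasses import dataclass


@dataclass
class CacheConfig:
    gpu_memory_utilization: float = 0.90  # fraction of GPU memory for weights + KV cache
    block_size: int = 16                  # tokens per KV-cache block
    max_model_len: int = 8192             # maximum context length
    swap_space_gb: int = 4                # CPU swap space for KV cache, in GiB


def to_vllm_args(cfg: CacheConfig) -> list[str]:
    """Translate a CacheConfig into vLLM serve CLI flags."""
    return [
        f"--gpu-memory-utilization={cfg.gpu_memory_utilization}",
        f"--block-size={cfg.block_size}",
        f"--max-model-len={cfg.max_model_len}",
        f"--swap-space={cfg.swap_space_gb}",
    ]


# Example: lower memory utilization and shrink the context window
print(to_vllm_args(CacheConfig(gpu_memory_utilization=0.85, max_model_len=4096)))
```

Applying a change like this in place (e.g. by patching the container args and letting the pods roll) is what avoids a full redeployment; the shared-memory setting would instead be adjusted on the pod's `/dev/shm` volume rather than via an engine flag.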

Signed-off-by: Dolev Adas <dolev.adas@ibm.com>
@DolevAdas changed the title from "cache-config-llm-d skill" to "cache config llm-d" on Apr 5, 2026
Signed-off-by: Dolev Adas <dolev.adas@ibm.com>


