Skip to content

Support for installation of inference-sim with kv-cache enabled#305

Merged
github-actions[bot] merged 1 commit intollm-d:mainfrom
mayabar:sim-with-kvcache
Aug 20, 2025
Merged

Support for installation of inference-sim with kv-cache enabled#305
github-actions[bot] merged 1 commit intollm-d:mainfrom
mayabar:sim-with-kvcache

Conversation

@mayabar
Copy link
Contributor

@mayabar mayabar commented Aug 19, 2025

  • add example of scheduler and simulator configuration to work with kv-cache
  • update kind-dev-env.sh to support model names with '/'
  • disable kind environment creation with both PD_ENABLED and KV_CACHE_ENABED true

kfswain
kfswain previously approved these changes Aug 19, 2025
- name: PD_PROMPT_LEN_THRESHOLD
value: '${PD_PROMPT_LEN_THRESHOLD}'
- name: PYTHONHASHSEED
value: "42"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this a Hitchhikers Guide Reference? 😂 (Just wondering why we chose 42)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copied from kv-cache-manager ;)

…cache

update kind-dev-env.sh to support model names with '/'
disable kind environment creation with both PD_ENABLED and KV_CACHE_ENABED true
remove irrelevant comment

Signed-off-by: Maya Barnea <mayab@il.ibm.com>
Copy link
Collaborator

@shmuelk shmuelk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

/approve

@shmuelk
Copy link
Collaborator

shmuelk commented Aug 20, 2025

/lgtm

/approve

@github-actions github-actions bot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2025
@github-actions github-actions bot merged commit 7cf35be into llm-d:main Aug 20, 2025
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

lgtm "Looks good to me", indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants