Skip to content

Conversation

@EunjuYang
Copy link
Contributor

Dependency of the PR

Commits to be reviewed in this PR

{commit-1}
  • This patch updates KVCache save/load.
  • In the previous version, we had to update nntr_config.json for the kv_cache saving and kv_cache using.
  • This patch merges the two configurations into one.
  • Expected nntr_confg.json
{
 ...
 "system_prompt": {
   "head_prompt": "blabla",
   "tail_prompt": "blabla",
   "kvcache":{ (optional)
    "pre_computed_cache_path": "system_prompt_cache_name.bin",
    "sys_prompt_token_size"(optional): 512
   }
 }

Self evaluation:

  1. Build test: [X]Passed [ ]Failed [ ]Skipped
  2. Run test: [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Eunju Yang [email protected]

Summary

  • This patch aims to merges the two kvcache configuration files into one nntr_config.json file.
  • Expected nntr_confg.json
{
 ...
 "system_prompt"(optional): {
   "head_prompt": "blabla",
   "tail_prompt": "blabla",
   "kvcache"(optional):{ 
    "pre_computed_cache_path": "system_prompt_cache_name.bin",
    "sys_prompt_token_size"(optional): 512
   },
  "sample_input": "put your user prompt here"
 }

Signed-off-by: Eunju Yang [email protected]

- This patch updates KVCache save/load.
- This patch aims to merges the kvcache saving configuration as one
  nntr_config.json file.
- In the previous version, we had to update nntr_config.json for the
  kv_cache saving and kv_cache using.
- This patch merges the two configurations into one.
- Expected nntr_confg.json
```json
{
 ...
 "system_prompt": {
   "head_prompt": "blabla",
   "tail_prompt": "blabla",
   "kvcache":{ (optional)
    "pre_computed_cache_path": "system_prompt_cache_name.bin",
    "sys_prompt_token_size"(optional): 512
   }
 }
```

Signed-off-by: Eunju Yang <[email protected]>
@EunjuYang EunjuYang force-pushed the causallm/kv_cache_save branch from b34fe9e to a41712d Compare September 18, 2025 01:58
Copy link
Member

@skykongkong8 skykongkong8 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions
Copy link

github-actions bot commented Oct 7, 2025

This PR is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 3 days.

@github-actions github-actions bot added the Stale label Oct 7, 2025
@github-actions
Copy link

This PR was closed because it has been stalled for 3 days with no activity.

@github-actions github-actions bot closed this Oct 10, 2025
@EunjuYang EunjuYang reopened this Oct 10, 2025
@github-actions github-actions bot removed the Stale label Oct 11, 2025
@github-actions
Copy link

This PR is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 3 days.

@github-actions github-actions bot added the Stale label Oct 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants