Skip to content

refactor: unify model caching on ResponseCache#1118

Merged
Luodian merged 1 commit into
dev-v0d7from
split/pr-cache-responsecache
Feb 21, 2026
Merged

refactor: unify model caching on ResponseCache#1118
Luodian merged 1 commit into
dev-v0d7from
split/pr-cache-responsecache

Conversation

@Luodian
Copy link
Copy Markdown
Contributor

@Luodian Luodian commented Feb 20, 2026

Summary

  • remove provider-local JSON cache code paths from API/chat wrappers and rely on evaluator-level ResponseCache
  • route generate_until_agentic through ResponseCache and normalize generated output handling
  • harden cache keying by including request content hash for generation requests to prevent prompt collision on same doc/index
  • add regression coverage for agentic cache reuse and same-doc different-prompt key separation

Validation

  • uv run python -m unittest discover -s test/cache -p "test_*cache*.py"
  • uv run python -m unittest discover -s test/models -p "test_model_registry_v2.py"

Refs: LMM-276

@Luodian Luodian merged commit 4aecb82 into dev-v0d7 Feb 21, 2026
2 checks passed
@Luodian Luodian deleted the split/pr-cache-responsecache branch February 23, 2026 08:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant