feat: extract cache tokens from prompt_tokens_details for non-Claude models by lewis617 · Pull Request #1226 · netease-lcap/wave-agent

lewis617 · 2026-06-12T08:16:21Z

Add ExtendedPromptTokensDetails interface extending OpenAI's PromptTokensDetails
with cache_creation_input_tokens field
Update ClaudeUsage to override prompt_tokens_details with extended type
Update extendUsageWithCacheMetrics to extract cache tokens from
prompt_tokens_details.cached_tokens and prompt_tokens_details.cache_creation_input_tokens
as fallback when Claude top-level fields are absent (Claude top-level takes priority)
Remove supportsPromptCaching gate from aiService cache extraction —
prompt_tokens_details is OpenAI-standard, applicable to all models
Remove unused modelName param from processStreamingResponse
Update Usage type comment to reflect dual-source cache fields
Add 3 test cases for prompt_tokens_details extraction
Update spec 021 docs to reflect cross-model cache token tracking

…models - Add ExtendedPromptTokensDetails interface extending OpenAI's PromptTokensDetails with cache_creation_input_tokens field - Update ClaudeUsage to override prompt_tokens_details with extended type - Update extendUsageWithCacheMetrics to extract cache tokens from prompt_tokens_details.cached_tokens and prompt_tokens_details.cache_creation_input_tokens as fallback when Claude top-level fields are absent (Claude top-level takes priority) - Remove supportsPromptCaching gate from aiService cache extraction — prompt_tokens_details is OpenAI-standard, applicable to all models - Remove unused modelName param from processStreamingResponse - Update Usage type comment to reflect dual-source cache fields - Add 3 test cases for prompt_tokens_details extraction - Update spec 021 docs to reflect cross-model cache token tracking

…cache token extraction Update spec 021 to explicitly distinguish two concerns: - supportsPromptCaching / WAVE_PROMPT_CACHE_REGEX gates cache_control marker injection (messages/tools) — Claude-only - Cache token extraction from usage applies to ALL models — no gate Updated: spec.md (FR-001, FR-007, FR-008, edge cases, key entities), data-model.md (relationships, model detection flow), quickstart.md (phase 3 integration notes), contracts/cache-control-api.md (scope note), research.md (universal caching alternatives)

lewis617 added 2 commits June 12, 2026 16:15

lewis617 merged commit e246f07 into main Jun 12, 2026
1 check passed

lewis617 deleted the feat/prompt-tokens-details-cache branch June 12, 2026 08:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: extract cache tokens from prompt_tokens_details for non-Claude models#1226

feat: extract cache tokens from prompt_tokens_details for non-Claude models#1226
lewis617 merged 2 commits into
mainfrom
feat/prompt-tokens-details-cache

lewis617 commented Jun 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

lewis617 commented Jun 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant