fix(custom provider): enable prompt caching and accurate context window for custom endpoints serving Claude by Xiaochengzi2048 · Pull Request #50946 · NousResearch/hermes-agent

Xiaochengzi2048 · 2026-06-22T17:30:41Z

Problem

Two related issues affect users running Claude through a custom OpenAI-compatible endpoint (LiteLLM, Bedrock proxies, local gateways):

1. Prompt caching disabled

anthropic_prompt_cache_policy() returns (False, False) for provider=custom, even when the model name clearly identifies the model as Claude. As a result, every request is billed for the full prompt, with no cache hits.

2. Context window reported as 200k

lookup_models_dev_context() returns None for provider=custom (not in PROVIDER_TO_MODELS_DEV), so hermes falls back to the 200k default. Claude Opus 4 users see 200k instead of the real 1M window.

Both issues share the same root cause: hermes does not recognise custom as a provider with known model metadata.

Fix

agent/agent_runtime_helpers.py

Add a branch for provider=custom + is_claude that enables caching with envelope layout (same as the existing openrouter branch):
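A minimal sketch of what such a branch could look like, not the actual diff. It assumes the returned tuple means (enable_caching, use_native_layout), so that "envelope layout" corresponds to (True, False). The helper is_claude_model and the function name sketch_cache_policy are hypothetical stand-ins for the real code in agent/agent_runtime_helpers.py.

```python
def is_claude_model(model: str) -> bool:
    """Hypothetical check: any model name containing 'claude' counts as Claude."""
    return "claude" in model.lower()


def sketch_cache_policy(provider: str, model: str) -> tuple[bool, bool]:
    """Return (enable_caching, use_native_layout).

    The meaning of the tuple is an assumption made for this sketch.
    """
    is_claude = is_claude_model(model)

    # Existing behaviour described in the PR: openrouter + Claude gets
    # caching with the envelope layout.
    if provider == "openrouter" and is_claude:
        return True, False

    # New branch: custom endpoints serving Claude get the same treatment.
    if provider == "custom" and is_claude:
        return True, False

    # Everything else keeps caching disabled.
    return False, False
```

With this sketch, sketch_cache_policy("custom", "claude-opus-4") enables caching, while a non-Claude model on a custom endpoint still gets (False, False).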

agent/models_dev.py

When provider=custom has no mapping, fall back to the canonical provider based on model name:
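The fallback could be sketched roughly as follows. This is an illustration under assumptions, not the code from the PR: the table contents, the hint table CANONICAL_PROVIDER_BY_MODEL_HINT, and the function resolve_models_dev_provider are all hypothetical; only the name PROVIDER_TO_MODELS_DEV comes from the description above.

```python
from typing import Optional

# Hypothetical stand-in for the real table in agent/models_dev.py.
PROVIDER_TO_MODELS_DEV = {
    "anthropic": "anthropic",
    "openrouter": "openrouter",
}

# Hypothetical: substring of the model name -> canonical provider.
CANONICAL_PROVIDER_BY_MODEL_HINT = {
    "claude": "anthropic",
}


def resolve_models_dev_provider(provider: str, model: str) -> Optional[str]:
    """Pick the models.dev provider key used to look up model metadata."""
    mapped = PROVIDER_TO_MODELS_DEV.get(provider)
    if mapped is not None:
        return mapped

    # provider=custom has no mapping, so infer the canonical provider
    # from the model name instead of giving up immediately.
    if provider == "custom":
        name = model.lower()
        for hint, canonical in CANONICAL_PROVIDER_BY_MODEL_HINT.items():
            if hint in name:
                return PROVIDER_TO_MODELS_DEV.get(canonical)

    # Unknown provider and unrecognised model: the caller falls back
    # to its default context window.
    return None
```

A substring match is used here because custom gateways often prefix or rename models (for example a LiteLLM route name), which an exact lookup would miss.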

This gives Claude Opus 4 its real 1M context window and Sonnet its real 200k, instead of both hitting the 200k fallback.

Xiaochengzi2048 wants to merge 2 commits into NousResearch:main from Xiaochengzi2048:fix/custom-claude-prompt-caching
