Skip to content

Commit 83176a6

Browse files
authored
Merge pull request #5248 from JSap0914/docs/2206-claude-split-routing
docs(config): explain split Claude routing
2 parents 82bd07b + 46a09f8 commit 83176a6

1 file changed

Lines changed: 39 additions & 0 deletions

File tree

docs/reference/configuration.md

Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1038,6 +1038,45 @@ The installer prepares Git Bash with normal detection, `OMO_CODEX_GIT_BASH_PATH`
10381038

10391039
Install [`opencode-antigravity-auth`](https://github.com/NoeFabris/opencode-antigravity-auth) for Google Gemini. Provides multi-account load balancing, dual quota, and variant-based thinking.
10401040

1041+
##### Split Claude Routing
1042+
1043+
Provider path affects the effective Claude context limit. Antigravity Claude
1044+
models are the stable 200k lane. Direct Anthropic Claude models are the 1M lane
1045+
for accounts and model IDs that support long context.
1046+
1047+
Use Antigravity for cheaper or quota-balanced work where 200k context is enough.
1048+
Use direct Anthropic for long-context planning, review, and research sessions
1049+
where early compaction would lose important context.
1050+
1051+
```jsonc
1052+
{
1053+
"agents": {
1054+
// 200k lane: Google Antigravity Claude.
1055+
"explore": {
1056+
"model": "google/antigravity-claude-sonnet-4-6"
1057+
},
1058+
"librarian": {
1059+
"model": "google/antigravity-claude-sonnet-4-6"
1060+
},
1061+
1062+
// 1M lane: direct Anthropic, only for eligible long-context accounts/models.
1063+
"sisyphus": {
1064+
"model": "anthropic/claude-opus-4-6",
1065+
"variant": "max"
1066+
},
1067+
"oracle": {
1068+
"model": "anthropic/claude-opus-4-6"
1069+
}
1070+
}
1071+
}
1072+
```
1073+
1074+
If you see an error like `prompt is too long ... > 200000`, check whether the
1075+
agent is routed through `google/antigravity-*`. Move that agent to a direct
1076+
`anthropic/*` model only when the account, model, and required beta/header setup
1077+
support 1M context. Keep the Antigravity lane explicit when you want predictable
1078+
200k behavior.
1079+
10411080
#### Ollama
10421081

10431082
**Must** disable streaming to avoid JSON parse errors:

0 commit comments

Comments
 (0)