feat(models): update Gemini model context windows and output limits #602

Closed
CleoMenezesJr wants to merge 1 commit into Gitlawb:main from CleoMenezesJr:gemini-models

Conversation


@CleoMenezesJr CleoMenezesJr commented Apr 11, 2026

Summary

  • What changed: Added context window and max output token entries for native Google Gemini models in openaiContextWindows.ts — covering gemini-2.0-flash, gemini-2.5-flash, gemini-2.5-pro, gemini-3-flash, and gemini-3.1-pro (used via CLAUDE_CODE_USE_GEMINI).
  • Why it changed: Without these entries, the system falls back to an 8k default context window. This causes calculateTokenWarningState to signal a blocking limit at session start, triggering auto-compaction before any real context is consumed.
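Concretely, the missing-entry fallback described above can be sketched like this (hypothetical shapes and names — the actual structure of openaiContextWindows.ts may differ; the gemini-3.* values are omitted since they were revised during review):

```typescript
// Illustrative sketch only -- field names and table shape are assumptions,
// not the actual openaiContextWindows.ts structure.
interface ModelLimits {
  contextWindow: number;
  maxOutputTokens: number;
}

const geminiLimits: Record<string, ModelLimits> = {
  "gemini-2.0-flash": { contextWindow: 1_048_576, maxOutputTokens: 8_192 },
  "gemini-2.5-flash": { contextWindow: 1_048_576, maxOutputTokens: 65_536 },
  "gemini-2.5-pro":   { contextWindow: 1_048_576, maxOutputTokens: 65_536 },
  // gemini-3.* entries omitted here; their values changed during review.
};

// Without a matching entry, an 8k default applies -- which is why the token
// warning fires at session start for unlisted Gemini models.
const DEFAULT_CONTEXT_WINDOW = 8_192;

function contextWindowFor(model: string): number {
  return geminiLimits[model]?.contextWindow ?? DEFAULT_CONTEXT_WINDOW;
}
```

Under this sketch, any model ID not present in the table silently inherits the 8k default, and calculateTokenWarningState would treat the session as nearly full from the first message.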

Impact

  • user-facing: Gemini sessions no longer compact prematurely. Token warning thresholds now reflect the actual model limits (e.g., 1M tokens for Flash, 2M for Gemini 3.1 Pro).
  • developer/maintainer: Pure data addition — no logic changed. Entries follow the existing ascending-version ordering used for other provider families in the same file.

Testing

  • bun run build
  • bun run smoke
  • focused tests: launch with bun run dev:gemini using gemini-2.5-pro or gemini-3-flash, send a message, and confirm that the token indicator reflects the correct context window instead of triggering an immediate compaction prompt.

Notes

  • provider/model path tested: CLAUDE_CODE_USE_GEMINI=1 with gemini-2.5-pro and gemini-3-flash via native endpoint.
  • screenshots attached (if UI changed): N/A
  • follow-up work or known limitations: Output token limits for gemini-3-flash and gemini-2.0-flash are set conservatively at 8k. These should be updated if Google raises the limits in a future release.

@kevincodex1 kevincodex1 requested a review from auriti April 12, 2026 07:23
kevincodex1 previously approved these changes Apr 12, 2026
Collaborator

@Vasanthdev2004 Vasanthdev2004 left a comment


Reviewed on head d5f5ce7 — adds context window and max output token entries for native Gemini models. CI green ✅

The intent is right — without these entries, native Gemini sessions fall back to 8k context and trigger premature compaction. But two of the five values need correction based on Google's official specs:

🔴 Incorrect values:

  1. gemini-3.1-pro context window: 2,097,152 (2M) — Google's official model card states: "token context window of up to 1M". Should be 1,048,576.

  2. gemini-3-flash max output tokens: 8,192 — Per sim.ai model tracking and Google's API docs, Gemini 3 Flash supports up to 65,536 output tokens (same as 2.5 Pro and 2.5 Flash). Setting this to 8k would unnecessarily limit responses from the model.

✅ Correct values:

| Model | Context | Max Output | Source |
| --- | --- | --- | --- |
| gemini-2.0-flash | 1,048,576 | 8,192 | Correct per Google specs |
| gemini-2.5-flash | 1,048,576 | 65,536 | Correct |
| gemini-2.5-pro | 1,048,576 | 65,536 | Correct (was already in file, just reordered) |
| gemini-3-flash | 1,048,576 | ~~8,192~~ 65,536 | Needs fix |
| gemini-3.1-pro | ~~2,097,152~~ 1,048,576 | 65,536 | Needs fix |

Verdict: Needs changes — two values don't match Google's published specs. Once corrected, this is approve-ready.

Collaborator

@gnanam1990 gnanam1990 left a comment


I can’t approve this yet. The direction is right, but some of the Gemini context-window and output-limit values still need correction before merge. Since this data feeds runtime limit handling, I’d want the numbers fixed first, then I’m happy to recheck.

Collaborator

@Vasanthdev2004 Vasanthdev2004 left a comment


Re-reviewed on head 7def95e — both blockers from the previous review are fixed ✅

| Model | Context | Max Output | Status |
| --- | --- | --- | --- |
| gemini-2.0-flash | 1,048,576 | 8,192 | ✅ |
| gemini-2.5-flash | 1,048,576 | 65,536 | ✅ |
| gemini-2.5-pro | 1,048,576 | 65,536 | ✅ |
| gemini-3-flash | 1,048,576 | 65,536 | ✅ (was 8,192) |
| gemini-3.1-pro | 1,048,576 | 65,536 | ✅ (was 2,097,152) |

All values now match Google's published model cards. CI green. Pure data addition, no logic changes. Approved ✅

@kevincodex1
Contributor

@gnanam1990 kindly have a look again bro

Collaborator

@gnanam1990 gnanam1990 left a comment


.

Collaborator

@gnanam1990 gnanam1990 left a comment


Still request changes.

bun run build passes on the current head, but I’m still not comfortable approving the exact Gemini metadata values being added here. This PR changes runtime context-window and max-output tables, and those numbers directly feed warning thresholds, blocking behavior, and auto-compact decisions. Before merging, I’d want the added Gemini limits verified against a clear source of truth or tightened so we’re not shipping incorrect runtime metadata.
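One way to get the source-of-truth guarantee asked for above is a small CI check that pins the added table rows against spec values transcribed from Google's model cards, so a typo in the data fails the build instead of shipping. A sketch under assumed names and shapes (this is not the repo's actual test API):

```typescript
// Hypothetical CI guard: compare the limits table against values pinned
// from Google's published model cards. Shapes/names are illustrative.
type Limits = { context: number; maxOutput: number };

// Pinned from the review tables in this thread.
const pinnedSpecs: Record<string, Limits> = {
  "gemini-2.0-flash": { context: 1_048_576, maxOutput: 8_192 },
  "gemini-2.5-flash": { context: 1_048_576, maxOutput: 65_536 },
  "gemini-2.5-pro":   { context: 1_048_576, maxOutput: 65_536 },
};

// Returns a list of human-readable mismatches; empty means the table passes.
function validate(table: Record<string, Limits>): string[] {
  const errors: string[] = [];
  for (const [model, spec] of Object.entries(pinnedSpecs)) {
    const row = table[model];
    if (!row) {
      errors.push(`${model}: missing entry`);
      continue;
    }
    if (row.context !== spec.context) {
      errors.push(`${model}: context ${row.context} != ${spec.context}`);
    }
    if (row.maxOutput !== spec.maxOutput) {
      errors.push(`${model}: maxOutput ${row.maxOutput} != ${spec.maxOutput}`);
    }
  }
  return errors;
}
```

Wiring such a check into `bun run build` or the smoke suite would address the "clear source of truth" concern without adding any runtime logic.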

Collaborator

@auriti auriti left a comment


Thanks for adding the missing Gemini native context windows — this is a real problem (the 8k fallback triggers premature compaction).

However, most of this PR is already covered by current main and by #783 (which adds the same entries plus OpenRouter variants and output token fallbacks). Here's the overlap:

Model main (v0.5.2) PR #783 This PR
gemini-2.0-flash
gemini-2.5-flash
gemini-2.5-pro
gemini-3.1-pro
gemini-3-flash
gemini-3-flash-preview
gemini-3.1-pro-preview
google/gemini-3* (OpenRouter)

The only entry unique to this PR is gemini-3-flash (without -preview). A couple of notes on that:

  1. Naming: Google's current API uses gemini-3-flash-preview — the -preview suffix is required until the model reaches GA. gemini-3-flash without the suffix will likely not match any model ID today and would be a no-op entry. When Google promotes it to GA, the name may change — at which point we'd add it.

  2. Context window values look correct: 1M for both Flash and Pro, 65k output tokens. These match the official specs.

Suggestion: If #783 lands first, this PR would be fully superseded. If you'd like to contribute the gemini-3-flash (GA name) entry as a forward-looking addition, I'd suggest rebasing on top of #783 and adding just that one entry — but only after confirming the model ID works against the native Gemini endpoint.
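The no-op point in note 1 is easy to see under an exact-key lookup (assumed here for illustration; the codebase's actual matching may normalize or prefix-match model names):

```typescript
// Hypothetical exact-match lookup: an entry keyed by the GA name is never
// consulted while Google's API only accepts the -preview model ID.
const table: Record<string, number> = { "gemini-3-flash": 1_048_576 };

const requestedModel = "gemini-3-flash-preview"; // model ID the API accepts today
const contextWindow = table[requestedModel] ?? 8_192; // falls back to the 8k default
```

Until the GA model ID exists, the `gemini-3-flash` row is dead data and every real request still hits the fallback path.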
