docs(blog): fix Opus 4.6/4.7/4.8 thinking mode and effort guidance#255
Open
mateo-berri wants to merge 1 commit into
Open
docs(blog): fix Opus 4.6/4.7/4.8 thinking mode and effort guidance#255mateo-berri wants to merge 1 commit into
mateo-berri wants to merge 1 commit into
Conversation
The Adaptive Thinking notes told readers to pass the native thinking parameter directly to get explicit thinking budgets. That is wrong on Opus 4.7 and 4.8, where the Anthropic API rejects thinking {type: enabled, budget_tokens} with a 400 and adaptive is the only supported mode. On Opus 4.6 it still works but is deprecated. Each note now says so and points to output_config.effort for controlling thinking depth.
The 4.7 post also claimed max effort was Opus 4.6 only and unavailable on 4.7; max is in fact supported on 4.6, 4.7, and 4.8. The 4.7 effort section now lists all five levels, the 4.8 'previously 4.6 only' parenthetical is corrected, and the reasoning_effort enumeration in each note now includes max so it matches that model's effort section (4.6 gets max but not xhigh, which it never supported).
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
Collaborator
Author
|
bugbot run |
There was a problem hiding this comment.
✅ Bugbot reviewed your changes and found no new issues!
Comment @cursor review or bugbot run to trigger another review on this PR
Reviewed by Cursor Bugbot for commit c1d5fb6. Configure here.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
The day-0 blog posts for Claude Opus 4.6, 4.7, and 4.8 carried a copy-pasted "Adaptive Thinking" note that told readers to pass the native
thinkingparameter directly to use explicit thinking budgets withtype: "enabled". That guidance is wrong for Opus 4.7 and 4.8.Per Anthropic's adaptive thinking and effort docs, adaptive thinking is the only supported mode on Opus 4.7 and 4.8, and a manual
thinking: {type: "enabled", budget_tokens: N}is rejected with a 400 error. LiteLLM forwards the nativethinkingparameter unchanged on both/chat/completionsand the/v1/messagespassthrough, so following the old instructions just produces that 400. On Opus 4.6 the explicit budget still works but is deprecated and no longer recommended. Each note now states the actual behavior and points readers atoutput_config.effort, paired with adaptive thinking, as the way to control thinking depth.Effort ladder correction
While verifying the above, the 4.7 post claimed
maxeffort was "Claude Opus 4.6 only and is not available on 4.7." Anthropic's effort docs listmaxas available on Opus 4.6, 4.7, and 4.8, and LiteLLM'sREASONING_EFFORT_TO_OUTPUT_CONFIG_EFFORTmapsmaxtomaxwith no per-model gating. So the 4.7 effort section now lists all five levels (low,medium,high,xhigh,max), the 4.8 "max (previously Opus 4.6 only)" parenthetical is corrected to "also available on Opus 4.6 and 4.7," and thereasoning_effortenumeration in each Adaptive Thinking note now includesmaxso it matches that model's own effort section. The 4.6 note gainsmaxbut notxhigh, since 4.6 never supportedxhigh.Verification
Cross-checked against Anthropic's official docs (the adaptive thinking and effort pages) and the LiteLLM source.
_map_reasoning_effortshort-circuits every effort value tothinking: {type: "adaptive"}for adaptive-thinking models, thethinkingparameter is passed through verbatim, and the effort mapping applies no per-modelmaxrestriction. These are prose-only changes to three blog markdown files, with no code or behavior changes.Type
Documentation
Note
Low Risk
Markdown-only documentation corrections with no code or configuration changes.
Overview
Corrects Adaptive Thinking and Effort Levels guidance in the Opus 4.6, 4.7, and 4.8 day-0 blog posts so it matches Anthropic’s current API behavior and LiteLLM’s passthrough.
The shared note no longer tells readers to use native
thinking: {type: "enabled", budget_tokens: ...}as the primary path. Opus 4.7 and 4.8 are documented as adaptive-only (explicit budgets → 400); depth should be tuned withoutput_config.effortalongside adaptive thinking. Opus 4.6 keeps explicit budgets as still accepted but deprecated, with the same effort-based recommendation.Effort ladder fixes:
reasoning_effortlists now includemaxwhere appropriate (4.6 addsmax, notxhigh; 4.7/4.8 includemax). 4.7 drops the wrong claim thatmaxis 4.6-only and documents five levels includingmax, with updated guidance onxhighvsmax. 4.8 fixes the parenthetical thatmaxwas “previously Opus 4.6 only.”Prose-only; no runtime or gateway behavior changes.
Reviewed by Cursor Bugbot for commit c1d5fb6. Bugbot is set up for automated code reviews on this repo. Configure here.