fix: handle empty text and malformed JSON in parse_text for thinking+tools by gn00295120 · Pull Request #1240 · anthropics/anthropic-sdk-python

gn00295120 · 2026-03-12T05:20:00Z

Summary

Fixes #1204 — parse_text() crashes when used with structured output + extended thinking + tool use.

Two bugs addressed:

Empty text crash: When the model returns stop_reason="end_turn" with only a thinking block and an empty text block, parse_text("") calls validate_json("") which raises ValidationError.
Malformed JSON prefix: When the model prefixes the JSON payload with reasoning text or a partial generation artifact, validate_json() fails on the full string even though valid JSON exists at the end.
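The two failure modes above can be reproduced in miniature. This sketch uses the standard library's `json.loads` as a stand-in for the strict `validate_json()` call (an assumption made so the example runs without third-party packages; the real code raises `ValidationError` rather than `json.JSONDecodeError`). The helper names `strict_parse` and `fails` are hypothetical.

```python
import json


def strict_parse(text: str):
    """Stand-in for a strict validate_json(): the whole string must be JSON."""
    return json.loads(text)


def fails(text: str) -> bool:
    """Report whether strict parsing rejects the given text."""
    try:
        strict_parse(text)
    except json.JSONDecodeError:
        return True
    return False


# Bug 1: an empty text block is not valid JSON.
empty_text = ""
# Bug 2: valid JSON at the end, but reasoning text in front of it.
prefixed_text = 'Let me think.\n\n{"answer": 42}'

print(fails(empty_text))         # strict parsing rejects the empty string
print(fails(prefixed_text))      # strict parsing rejects the prefixed payload
print(fails('{"answer": 42}'))   # the bare payload parses cleanly
```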

Changes

src/anthropic/lib/_parse/_response.py:

parse_text(): Return None for empty/whitespace text instead of crashing
parse_text(): Add fallback JSON extraction via _extract_last_json() — finds the last valid JSON object/array in malformed text
parse_response() / parse_beta_response(): Skip structured output parsing on stop_reason="tool_use" turns (intermediate tool-calling turns shouldn't have text parsed as structured output)
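A minimal sketch of the control flow these three changes describe, not the actual diff. It assumes `json.loads` in place of the SDK's schema validation, and the names `parse_text_sketch`, `find_trailing_json`, and `should_parse_structured_output` are hypothetical. The fallback here simply retries on trailing suffixes; it is an illustration of "recover the JSON at the end", not the PR's `_extract_last_json()`.

```python
import json
from typing import Any, Optional


def find_trailing_json(text: str) -> Optional[str]:
    """Return the shortest suffix starting at '{' or '[' that parses as JSON, else None."""
    for start in range(len(text) - 1, -1, -1):
        if text[start] in "{[":
            try:
                json.loads(text[start:])
            except json.JSONDecodeError:
                continue
            return text[start:]
    return None


def parse_text_sketch(text: str) -> Optional[Any]:
    """Parse text as JSON; return None for empty input, recover a trailing payload if possible."""
    if not text.strip():
        return None  # empty or whitespace-only text: nothing to parse
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        candidate = find_trailing_json(text)
        if candidate is None:
            raise  # no recoverable payload: surface the original error
        return json.loads(candidate)


def should_parse_structured_output(stop_reason: Optional[str]) -> bool:
    """Intermediate tool-calling turns are skipped; other turns are parsed."""
    return stop_reason != "tool_use"
```

Under these assumptions, `parse_text_sketch("")` returns `None`, a prefixed payload is recovered, and text with no JSON at all still raises, which mirrors the behaviour listed in the test plan.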

tests/lib/_parse/test_parse_text.py (new):

19 tests covering all edge cases: empty text, whitespace, valid JSON, malformed prefix recovery, schema validation failure, tool_use turn skipping, both parse_response and parse_beta_response

Test plan

parse_text("") returns None (not crash)
parse_text(" \n\t ") returns None
parse_text('partial garbage\n\n{"valid": "json"}') recovers correctly
parse_text('not json') still raises ValidationError
parse_response() with stop_reason="tool_use" skips text parsing
parse_response() with stop_reason="end_turn" parses normally
All 19 new tests pass; 158 existing tests pass (1 pre-existing failure unrelated to changes)

…tools

When using structured output with thinking and tool_use, parse_text() crashes on empty text blocks (from intermediate thinking-only turns) and on malformed JSON (from model generation artifacts).

- Skip parsing when text is empty/whitespace
- Skip structured output parsing on tool_use turns (intermediate)
- Add fallback JSON extraction for malformed text blocks
- Add comprehensive tests for all edge cases

Fixes anthropics#1204

Copilot

Pull request overview

This PR addresses crashes and parsing failures in the structured-output response parsing pipeline when used alongside extended thinking and tool use (Issue #1204). It hardens parse_text() against empty text blocks and attempts to recover valid JSON when the model prepends malformed content, and it avoids parsing structured output on intermediate tool_use turns.

Changes:

Update parse_text() to return None for empty/whitespace-only text and add a recovery path that attempts to extract the last JSON payload from malformed text.
Update parse_response() / parse_beta_response() to skip structured parsing for stop_reason="tool_use" turns.
Add a new unit test module covering parse_text(), _extract_last_json(), and parse_response()/parse_beta_response() edge cases.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| `src/anthropic/lib/_parse/_response.py` | Adds empty-text handling, JSON recovery extraction, and skips structured parsing on `tool_use` turns. |
| `tests/lib/_parse/test_parse_text.py` | Adds new tests for empty text, malformed prefix recovery, and tool-use turn behavior. |


src/anthropic/lib/_parse/_response.py

tests/lib/_parse/test_parse_text.py

Previously `_extract_last_json()` used `str.find()` to locate the *first* opening brace/bracket before the last closing token. When the input contains a malformed/partial JSON object in a prefix (e.g. an unterminated string), `find()` would return that broken start position and depth counting would produce a non-zero result, so the function fell through and the final valid JSON was never recovered.

Fix: replace the forward `find()` scan with a backward loop over every candidate `open_char` position (from `last_close` down to 0). We return the first (rightmost) position at which the depth count is balanced, which is always the last complete JSON payload in the text, regardless of what broken fragments appear before it.

Also remove two unused imports (`typing.Optional`, `unittest.mock.MagicMock`) from the test file that would fail `ruff check --select F401`, and add a new test `test_malformed_prefix_with_partial_object_recovers_last_json` that exercises the exact scenario the review raised.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
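The backward scan described in this commit message might look roughly like the following. This is a sketch reconstructed from the description only, not the code in the PR: the function name `extract_last_json_sketch` is hypothetical, and the naive depth count ignores braces that appear inside JSON string literals, so a caller would still need to validate the returned candidate.

```python
from typing import Optional

# Map each closing token to its opening counterpart.
_OPEN_FOR_CLOSE = {"}": "{", "]": "["}


def extract_last_json_sketch(text: str) -> Optional[str]:
    """Return the last bracket-balanced object/array candidate in text, or None."""
    last_close = max(text.rfind("}"), text.rfind("]"))
    if last_close == -1:
        return None  # no closing token, so no candidate payload
    close_char = text[last_close]
    open_char = _OPEN_FOR_CLOSE[close_char]

    # Walk candidate start positions from right to left, so broken
    # fragments earlier in the text cannot shadow the final payload.
    for start in range(last_close - 1, -1, -1):
        if text[start] != open_char:
            continue
        depth = 0
        for ch in text[start : last_close + 1]:
            if ch == open_char:
                depth += 1
            elif ch == close_char:
                depth -= 1
        if depth == 0:
            return text[start : last_close + 1]
    return None


# A partial object in the prefix no longer blocks recovery of the final payload.
sample = 'broken {"unterminated: \n\n{"valid": "json"}'
print(extract_last_json_sketch(sample))
```

Scanning right to left means the first balanced position found is the rightmost one, which is why an unterminated fragment in the prefix is never reached in this example.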

gn00295120 requested a review from a team as a code owner March 12, 2026 05:20

Copilot AI review requested due to automatic review settings March 12, 2026 05:20

Copilot started reviewing on behalf of gn00295120 March 12, 2026 05:20 View session

Copilot AI reviewed Mar 12, 2026

gn00295120 wants to merge 2 commits into anthropics:main from gn00295120:fix/parse-text-empty-crash
