fix(vercel-ai): strip retry suffix from UI errorText, preserve LLM cache#4869
tijmenhammer wants to merge 1 commit into pydantic:main
Conversation
… via metadata

Add RetryPromptPart.error_description() to separate UI-facing error text from the LLM-facing "Fix the errors and try again." suffix. The Vercel AI adapter uses error_description() for errorText display and stores the full model_response() in call_provider_metadata for round-trip cache fidelity.
```diff
                 yield ToolOutputDeniedChunk(tool_call_id=tool_call_id)
             elif isinstance(part, RetryPromptPart):
-                yield ToolOutputErrorChunk(tool_call_id=tool_call_id, error_text=part.model_response())
+                yield ToolOutputErrorChunk(tool_call_id=tool_call_id, error_text=part.error_description())
```
🔴 Streaming path silently drops retry suffix, breaking cache fidelity for the primary Vercel AI use case
The PR's stated goal is to preserve LLM cache fidelity by storing the full model_response() (with "Fix the errors and try again." suffix) in metadata. This works for the dump_messages path (_adapter.py:660), but the streaming path at _event_stream.py:281 sends error_description() (no suffix) via ToolOutputErrorChunk, which has no provider_metadata field to carry the full text. When a Vercel AI client reconstructs its message history from stream events and sends them back on the next turn, load_messages at _adapter.py:396 does provider_meta.get('model_response') or part.error_text — since no model_response key exists in the stream-derived metadata, it falls back to part.error_text (the suffix-free version). The resulting ToolReturnPart.content now differs from what the model originally saw during the agent run, invalidating prompt caches (e.g. Anthropic) and changing the model's view of its own history.
Before this PR, _event_stream.py:281 used part.model_response() (with suffix), so the stream→load round-trip was consistent. This PR introduces the regression specifically for the streaming path, which is the primary path for Vercel AI SDK (useChat) users.
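The round-trip failure can be sketched in a few lines. This is a toy model of the code paths described above, not the actual pydantic-ai API; all function and key names here are illustrative stand-ins.

```python
# Sketch of the stream -> load round-trip described above.
# All names are illustrative stand-ins, not the real pydantic-ai API.

RETRY_SUFFIX = '\n\nFix the errors and try again.'


def error_description(error: str) -> str:
    return error  # UI-facing text, no retry suffix


def model_response(error: str) -> str:
    return error + RETRY_SUFFIX  # what the model originally saw


def stream_chunk(error: str) -> dict:
    # Streaming path: the chunk carries only error_text and has no
    # provider_metadata slot to carry the full model_response().
    return {'error_text': error_description(error)}


def load_content(chunk: dict) -> str:
    # load_messages: prefers metadata, falls back to error_text.
    meta = chunk.get('provider_metadata', {})
    return meta.get('model_response') or chunk['error_text']


original = model_response('Invalid JSON')              # seen by the model
reloaded = load_content(stream_chunk('Invalid JSON'))  # after the round-trip
assert reloaded != original  # suffix lost -> prompt cache invalidated
```

The dump_messages path avoids this only because it has somewhere (call_provider_metadata) to stash the full text; the stream chunk does not.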
Prompt for agents
In pydantic_ai_slim/pydantic_ai/ui/vercel_ai/_event_stream.py at line 281, the ToolOutputErrorChunk is created with error_description() (no retry suffix), but there is no way to attach the full model_response() as metadata since ToolOutputErrorChunk lacks a provider_metadata field.
Two possible fixes:
1. (Minimal) Revert line 281 to use part.model_response() instead of part.error_description(). This keeps the streaming path consistent with the pre-PR behavior and ensures cache fidelity, at the cost of showing the LLM-facing suffix in the UI during streaming.
2. (Complete) Add a provider_metadata field to ToolOutputErrorChunk (in response_types.py), populate it with the model_response in handle_function_tool_result, and update load_messages to extract it. This would achieve the PR's goal of clean UI text + cache fidelity for both streaming and dump_messages paths.
The test at tests/test_vercel_ai.py line 2248 (test_run_stream_response_error) would need to be updated accordingly.
Was this helpful? React with 👍 or 👎 to provide feedback.
Summary
RetryPromptPart.model_response() appends "\n\nFix the errors and try again." — an instruction meant for the LLM, not for end users. This suffix was leaking into the Vercel AI errorText in both the streaming and dump paths, breaking frontend logic that checks errorText values (e.g. errorText === "Cancelled" to distinguish cancelled vs failed state).

Core change: Add RetryPromptPart.error_description(), which returns the error text without the retry suffix. model_response() now delegates to it, so all existing callers are unaffected.

Vercel AI adapter:
- Uses error_description() for the UI-facing error_text and stores the full model_response() in call_provider_metadata under pydantic_ai.model_response for round-trip cache fidelity
- load_messages prefers provider_meta['model_response'] for ToolReturnPart.content and falls back to error_text, so the LLM prompt cache is preserved after dump → load
- The streaming path uses error_description() directly

This ensures the LLM sees identical text before and after a dump/load round-trip (no cache break), while the UI gets clean error text without the retry instruction.
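The core change can be sketched as follows. This is a simplified stand-in for the real RetryPromptPart (which also handles structured validation errors, not just a string), showing only the delegation described above.

```python
from dataclasses import dataclass


@dataclass
class RetryPromptPart:
    # Simplified: the real part also accepts structured validation error details.
    content: str

    def error_description(self) -> str:
        # UI-facing text: the error itself, without the retry instruction.
        return self.content

    def model_response(self) -> str:
        # LLM-facing text: delegates to error_description() and appends the
        # instruction, so existing callers see unchanged output.
        return f'{self.error_description()}\n\nFix the errors and try again.'
```

Because model_response() is now a thin wrapper, the UI and the LLM paths can diverge at a single, well-defined point.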
Test plan
- test_vercel_ai.py tests pass
- ruff check / ruff format / pyright clean on changed files