ReAct (Reason + Act) — Overview

ReAct is the foundational agent pattern: a loop where the LLM reasons about what to do, acts by calling a tool, observes the result, and repeats until the task is complete. The LLM controls when to act and when to stop.

Evolves from: Prompt Chaining — adds dynamic tool selection and LLM-controlled looping.

Architecture

graph TD
    Input([User Task]) -->|"goal"| Loop[Agent Loop]
    Loop --> Think[Think:<br/>Reason about state + goal]
    Think --> Decide{Done?}
    Decide -->|"No"| ToolCall["Select & call tool"]
    ToolCall -->|"tool request"| Execute[Execute tool]
    Execute -->|"observation"| Loop
    Decide -->|"Yes"| Output([Final Answer])
    Guard[/"Max Iterations"/] -.->|"force stop"| Output

    style Input fill:#e3f2fd
    style Loop fill:#f3e5f5
    style Think fill:#fff3e0
    style Decide fill:#fce4ec
    style ToolCall fill:#e8f5e9
    style Execute fill:#e8f5e9
    style Output fill:#e3f2fd
    style Guard fill:#fff8e1

Figure: The ReAct loop. The LLM thinks, decides whether to act or respond, executes a tool if needed, and observes the result. A max iteration guard prevents infinite loops.

How It Works

The LLM receives the task and the available tool schemas
It generates a reasoning step ("I need to search for X because...")
It selects a tool and provides arguments
Your code executes the tool and returns the observation
The LLM reasons about the observation and decides the next action
Repeat until the LLM produces a final answer or hits the iteration limit

The key insight: the LLM interleaves thinking with acting. It doesn't just plan all steps upfront — it adapts based on what it discovers.

Minimal Example

Answer a compound question using search and a calculator — the agent decides which tools to call and when to stop.

from patterns.react.code.python.react_agent import ReActAgent, Tool

agent = ReActAgent(
    llm=your_llm,
    tools=[
        Tool("search",     "Search the web for current information", lambda q: search_api(q)),
        Tool("calculator", "Evaluate a math expression",             lambda expr: str(eval(expr))),
    ],
    max_steps=8,
)

result = agent.run(
    "What is the compound interest on $5,000 at the current US federal funds rate for 10 years?"
)
# result.answer            → final answer once the agent calls "Final Answer:"
# result.steps             → full Thought / Action / Observation trace
# result.stopped_by_guard  → True if max_steps was hit before a final answer

Example trace:

Thought: I need the current federal funds rate first.
Action: search | Input: "current US federal funds rate 2024"
Observation: The federal funds rate is 5.25–5.50% as of late 2024.

Thought: Now I'll calculate compound interest.
Action: calculator | Input: 5000 * (1 + 0.0525) ** 10
Observation: 8292.87

Thought: I now know the final answer.
Final Answer: At 5.25%, $5,000 grows to approximately $8,293 over 10 years.

Implementations

Variant	Language	File
Reference (MockLLM, framework-agnostic)	Python	`code/_reference.py`
Pydantic AI	Python	`code/python/pydantic-ai/react.py`
LangGraph (`create_react_agent`)	Python	`code/python/langgraph/react.py`
LangChain (`create_tool_calling_agent` + `AgentExecutor`)	Python	`code/python/langchain/react.py`
Vercel AI SDK (`generateText` + `tools`)	TypeScript	`code/typescript/vercel-ai-sdk/react.ts`

The reference file is the canonical control-flow doc — read it with design.md. The framework-specific files share an identical task (look up a word's definition via a single tool) so they're diff-friendly across stacks. The per-framework layout convention is documented in meta/style-guide.md.

Input / Output

Input: A user task/question + a set of available tools (with schemas)
Output: A final answer after zero or more tool calls
State: Message history accumulating reasoning steps and observations

Key Tradeoffs

Strength	Limitation
Handles open-ended, exploratory tasks	Unpredictable number of steps and cost
Adapts strategy based on observations	Can get stuck in loops or repeat failed actions
Simple to implement — one loop, one LLM	No upfront planning — may take inefficient paths
General-purpose — works for many task types	Reasoning quality degrades with long histories
Easy to add new tools without structural changes	Hard to test deterministically

When to Use

Open-ended tasks where the steps aren't known in advance
Tasks requiring tool use with adaptive behavior
Question-answering that may need multiple information sources
When you want the simplest possible agent architecture
As the starting point before deciding you need a more complex pattern

When NOT to Use

When steps are known in advance — use Prompt Chaining
When the task needs upfront strategic planning — use Plan & Execute
When quality needs iterative self-improvement — use Reflection
When multiple specialized capabilities are needed — use Multi-Agent

Related Patterns

Evolves from: Prompt Chaining — see evolution.md
Builds on: Tool Use — ReAct requires tool use as a component
Extends into: Plan & Execute (add planning), Reflection (add self-critique), RAG (add retrieval), Memory (add persistence)

Deeper Dive

Design — Loop mechanics, message history management, tool dispatch, termination strategies
Implementation — Pseudocode, interfaces, prompt templates, testing approach
Evolution — How ReAct emerges from prompt chaining

When NOT to use this pattern

Steps are predictable in advance — use a workflow (prompt chaining or orchestrator-worker).
Latency budget is tight — ReAct loops are unbounded by default and unpredictable.
You can't enforce a tool allow-list — ReAct's freedom amplifies the blast radius of any unsafe tool.

Next steps

Production version: see Blueprints → Deployments for the deployment agents that use this pattern.
Generate a starter project: see Blueprint → Spec → Scaffold.
Combine with other patterns: see the Composition guide.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ReAct (Reason + Act) — Overview

Architecture

How It Works

Minimal Example

Implementations

Input / Output

Key Tradeoffs

When to Use

When NOT to Use

Related Patterns

Deeper Dive

When NOT to use this pattern

Next steps

FilesExpand file tree

overview.md

Latest commit

History

overview.md

File metadata and controls

ReAct (Reason + Act) — Overview

Architecture

How It Works

Minimal Example

Implementations

Input / Output

Key Tradeoffs

When to Use

When NOT to Use

Related Patterns

Deeper Dive

When NOT to use this pattern

Next steps