Add tool_search_alternate_approaches notebook by noble-ant · Pull Request #306 · anthropics/claude-cookbooks

noble-ant · 2025-11-26T20:54:14Z

Description

Adds a new cookbook demonstrating alternate approaches to tool discovery with Claude. This cookbook complements the existing "Tool Search with Embeddings" cookbook by showing simpler patterns that
don't require embeddings infrastructure.

Key concepts demonstrated:

Dynamic tool loading: Tools don't need to be in the tools list until Claude needs them
defer_loading=True: Preserves prompt caching when adding discovered tools mid-conversation
tool_reference in results: How to signal tool availability to Claude

The cookbook uses a describe_tool example but emphasizes this is just one flavor—the same pattern applies to list_tools, hierarchical discovery, or hybrid approaches.

Type of Change

New cookbook
Bug fix (fixes an issue in existing cookbook)
Documentation update
Code quality improvement (refactoring, optimization)
Dependency update
Other (please describe):

Cookbook Checklist (if applicable)

Cookbook has a clear, descriptive title
Includes a problem statement or use case description
Code is well-commented and easy to follow
Includes expected outputs or results

Testing

I have tested this cookbook/change locally
All cells execute without errors

Additional Context

Companion to tool_use/tool_search_with_embeddings.ipynb
Uses the anthropic-beta: advanced-tool-use-2025-11-20 header for tool_reference support
Intentionally concise (~13 cells) compared to the embeddings cookbook

github-actions · 2025-11-26T20:54:39Z

Notebook Changes

This PR modifies the following notebooks:

📓 `tool_use/tool_search_alternate_approaches.ipynb`

View diff

nbdiff /dev/null tool_use/tool_search_alternate_approaches.ipynb (995c9655032cc74d78099f09285e4811e5da973b)
--- /dev/null  2025-11-26 20:53:47.328823
+++ tool_use/tool_search_alternate_approaches.ipynb (995c9655032cc74d78099f09285e4811e5da973b)  (no timestamp)
## added /cells:
+  markdown cell:
+    source:
+      # Tool Search: Alternate Approaches
+      
+      **Recommend first reading the cookbook: [Tool Search with Embeddings](./tool_search_with_embeddings.ipynb).**
+      
+      The goal of this cookbook is to show alternate approaches to tool search (or "tool discovery") with Claude. We'll demonstrate two useful techniques:
+      
+      1. **Tools can be discovered without "search"**. Instead of semantic search, we'll include all tool names in Claude's system prompt and provide a `describe_tool` tool for Claude to load specific tools into context.
+      2. **Tools don't need to be in the `tools` list initially**. Using `defer_loading=True`, tools can be added dynamically while preserving prompt caching.
+      
+      This cookbook demonstrates these techniques using a describe_tool example, but the same pattern applies to many discovery mechanisms (searching, listing by category, etc.).
+      
+      ## Prerequisites
+      
+      - Python 3.11+
+      - Anthropic API key
+      - Basic understanding of [Claude Tool Use](https://docs.anthropic.com/en/docs/build-with-claude/tool-use)
+      
+      ## Setup
+  code cell:
+    source:
+      %pip install -q anthropic python-dotenv
+  code cell:
+    source:
+      import anthropic
+      import json
+      from dotenv import load_dotenv
+      
+      load_dotenv()
+      
+      MODEL = "claude-sonnet-4-5-20250929"
+      client = anthropic.Anthropic()
+      
+      print("✓ Client initialized")
+  markdown cell:
+    source:
+      ## Define Tool Library
+      
+      We'll define 5 simple tools. In production, this could be hundreds or thousands of tools stored in a database or configuration file.
+  code cell:
+    source:
+      TOOL_LIBRARY = {
+          "get_weather": {
+              "name": "get_weather",
+              "description": "Get current weather for a city",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "city": {"type": "string", "description": "City name"},
+                  },
+                  "required": ["city"],
+              },
+          },
+          "get_stock_price": {
+              "name": "get_stock_price",
+              "description": "Get current stock price for a ticker symbol",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "ticker": {"type": "string", "description": "Stock ticker (e.g., AAPL)"},
+                  },
+                  "required": ["ticker"],
+              },
+          },
+          "convert_currency": {
+              "name": "convert_currency",
+              "description": "Convert amount between currencies",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "amount": {"type": "number"},
+                      "from_currency": {"type": "string"},
+                      "to_currency": {"type": "string"},
+                  },
+                  "required": ["amount", "from_currency", "to_currency"],
+              },
+          },
+          "calculate_tip": {
+              "name": "calculate_tip",
+              "description": "Calculate tip amount for a bill",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "bill_amount": {"type": "number"},
+                      "tip_percent": {"type": "number", "default": 20},
+                  },
+                  "required": ["bill_amount"],
+              },
+          },
+          "send_email": {
+              "name": "send_email",
+              "description": "Send an email to a recipient",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "to": {"type": "string", "description": "Recipient email"},
+                      "subject": {"type": "string"},
+                      "body": {"type": "string"},
+                  },
+                  "required": ["to", "subject", "body"],
+              },
+          },
+      }
+      
+      print(f"✓ Defined {len(TOOL_LIBRARY)} tools: {list(TOOL_LIBRARY.keys())}")
+  markdown cell:
+    source:
+      ## The `describe_tool` Tool
+      
+      Instead of semantic search, we give Claude a simple `describe_tool` tool. Claude calls this with a tool name to load that tool into context.
+      
+      The system prompt lists all available tool names, so Claude knows what's available without needing embeddings or search.
+  code cell:
+    source:
+      DESCRIBE_TOOL = {
+          "name": "describe_tool",
+          "description": "Load a tool's full definition into context. Call this before using any tool for the first time.",
+          "input_schema": {
+              "type": "object",
+              "properties": {
+                  "tool_name": {
+                      "type": "string",
+                      "description": "Name of the tool to load",
+                  },
+              },
+              "required": ["tool_name"],
+          },
+      }
+      
+      # Build system prompt with tool catalog
+      tool_names = list(TOOL_LIBRARY.keys())
+      SYSTEM_PROMPT = f"""You are a helpful assistant with access to various tools.
+      
+      Available tools: {', '.join(tool_names)}
+      
+      Before using any tool, you must first call describe_tool with the tool name to load it."""
+      
+      print("System prompt:")
+      print(SYSTEM_PROMPT)
+  markdown cell:
+    source:
+      ## Mock Tool Execution
+      
+      Simple mock responses for demonstration:
+  code cell:
+    source:
+      def execute_tool(name: str, inputs: dict) -> str:
+          """Mock tool execution."""
+          if name == "get_weather":
+              return json.dumps({"city": inputs["city"], "temp": "72°F", "conditions": "Sunny"})
+          elif name == "get_stock_price":
+              return json.dumps({"ticker": inputs["ticker"], "price": 185.50, "change": "+1.2%"})
+          elif name == "convert_currency":
+              rate = 0.92 if inputs["to_currency"] == "EUR" else 1.0
+              converted = inputs["amount"] * rate
+              return json.dumps({"converted": round(converted, 2), "to": inputs["to_currency"]})
+          elif name == "calculate_tip":
+              tip = inputs["bill_amount"] * (inputs.get("tip_percent", 20) / 100)
+              return json.dumps({"tip": round(tip, 2), "total": round(inputs["bill_amount"] + tip, 2)})
+          elif name == "send_email":
+              return json.dumps({"status": "sent", "to": inputs["to"]})
+          return json.dumps({"error": f"Unknown tool: {name}"})
+      
+      print("✓ Mock execution ready")
+  markdown cell:
+    source:
+      ## Conversation Loop with Dynamic Tool Loading
+      
+      The key pattern here:
+      
+      1. **Start with only `describe_tool`** in the tools list
+      2. **When Claude calls `describe_tool`**, return a `tool_reference` AND add the tool to `active_tools` with `defer_loading=True`
+      3. **`defer_loading=True` is critical** - it keeps the tool definition out of the cached prompt prefix, avoiding cache invalidation when tools are discovered
+      
+      Every time Claude sees a `tool_reference` in the conversation, the full tool definition is loaded into Claude's context at that point in the conversation.
+  code cell:
+    source:
+      def run_conversation(user_message: str, max_turns: int = 10):
+          """Run a conversation with dynamic tool loading."""
+          print(f"\n{'='*60}")
+          print(f"USER: {user_message}")
+          print(f"{'='*60}\n")
+      
+          messages = [{"role": "user", "content": user_message}]
+          
+          # Start with ONLY describe_tool - no other tools in the request
+          active_tools = [DESCRIBE_TOOL]
+          loaded_tools = set()  # Track which tools we've added
+      
+          for turn in range(max_turns):
+              print(f"--- Turn {turn + 1} (tools in request: {len(active_tools)}) ---")
+      
+              response = client.messages.create(
+                  model=MODEL,
+                  max_tokens=1024,
+                  system=SYSTEM_PROMPT,
+                  tools=active_tools,
+                  messages=messages,
+                  extra_headers={"anthropic-beta": "advanced-tool-use-2025-11-20"},
+              )
+      
+              messages.append({"role": "assistant", "content": response.content})
+      
+              if response.stop_reason == "end_turn":
+                  for block in response.content:
+                      if block.type == "text":
+                          print(f"\nASSISTANT: {block.text}")
+                  break
+      
+              # Process tool calls
+              tool_results = []
+              for block in response.content:
+                  if block.type == "text" and block.text:
+                      print(f"ASSISTANT: {block.text}")
+                  
+                  elif block.type == "tool_use":
+                      tool_name = block.name
+                      tool_input = block.input
+      
+                      if tool_name == "describe_tool":
+                          requested_tool = tool_input["tool_name"]
+                          print(f"🔍 describe_tool({requested_tool})")
+      
+                          if requested_tool in TOOL_LIBRARY:
+                              # Add tool to active_tools with defer_loading=True
+                              # This is critical for prompt caching!
+                              if requested_tool not in loaded_tools:
+                                  tool_def = {**TOOL_LIBRARY[requested_tool], "defer_loading": True}
+                                  active_tools.append(tool_def)
+                                  loaded_tools.add(requested_tool)
+                                  print(f"   ✓ Added {requested_tool} to tools (defer_loading=True)")
+      
+                              # Return tool_reference so Claude can use it
+                              tool_results.append({
+                                  "type": "tool_result",
+                                  "tool_use_id": block.id,
+                                  "content": [{"type": "tool_reference", "tool_name": requested_tool}],
+                              })
+                          else:
+                              tool_results.append({
+                                  "type": "tool_result",
+                                  "tool_use_id": block.id,
+                                  "content": f"Tool '{requested_tool}' not found.",
+                              })
+                      else:
+                          # Execute discovered tool
+                          print(f"🔧 {tool_name}({json.dumps(tool_input)})")
+                          result = execute_tool(tool_name, tool_input)
+                          print(f"   → {result}")
+                          tool_results.append({
+                              "type": "tool_result",
+                              "tool_use_id": block.id,
+                              "content": result,
+                          })
+      
+              if tool_results:
+                  messages.append({"role": "user", "content": tool_results})
+      
+          print(f"\n{'='*60}\n")
+      
+      print("✓ Conversation loop ready")
+  markdown cell:
+    source:
+      ## Example: Weather Query
+      
+      Watch how Claude:
+      1. Sees `get_weather` in the system prompt's tool list
+      2. Calls `describe_tool("get_weather")` to load it
+      3. Receives the `tool_reference` and can now use the tool
+  code cell:
+    source:
+      run_conversation("What's the weather in Tokyo?")
+  markdown cell:
+    source:
+      ## Example: Multi-Tool Query
+      
+      Claude can load multiple tools as needed:
+  code cell:
+    source:
+      run_conversation("Convert $100 to EUR, then calculate a 20% tip on a $85 dinner bill.")
+  markdown cell:
+    source:
+      ## Why `defer_loading=True` Matters
+      
+      When you add a tool to the `tools` list, the tool definition is normally loaded into the very beginning of Claude's context window. If you add new tools, you will lose most of the cache because the very beginning of the context window has changed.
+      
+      **With `defer_loading=True`:**
+      - The tool definition is NOT included into the beginning of Claude's context window.
+      - Instead, it's loaded into context when Claude sees the `tool_reference`
+      - This means your system prompt and initial tools stay cached even as you discover new tools
+      
+      **The pattern:**
+      ```python
+      # Initial request - only describe_tool, system prompt is cached
+      tools = [DESCRIBE_TOOL]
+      
+      # After Claude calls describe_tool("get_weather")
+      # Add with defer_loading to preserve cache
+      tools.append({**TOOL_LIBRARY["get_weather"], "defer_loading": True})
+      
+      # Return tool_reference so Claude knows it's available
+      tool_result = [{"type": "tool_reference", "tool_name": "get_weather"}]
+      ```
+      
+      This is essential for applications with hundreds or thousands of tools where you want to:
+      - Keep initial request size small
+      - Preserve prompt caching across tool discoveries
+      - Only load tools Claude actually needs
+  markdown cell:
+    source:
+      ## Conclusion
+      
+      The key insight from this cookbook: **tools don't need to be in the `tools` list until Claude needs them**. Combined with `defer_loading=True` and `tool_reference`, this lets you scale to thousands of tools while keeping requests small and preserving prompt caching. However, in this case your client will need to provide a tool discovery mechanism.
+      
+      The `describe_tool` approach shown here is just one flavor. Other patterns include:
+      - **`list_tools`** - Returns tool names matching a category or keyword
+      - **Hierarchical discovery** - Browse tool categories, then load specific tools
+      - **Hybrid** - Combine listing with semantic search for large catalogs
+      
+      The core pattern is always:
+      1. Return `tool_reference` when a tool is discovered
+      2. Add the tool with `defer_loading=True` to preserve caching
+      3. Claude can then use the tool immediately
+      
+      See [Tool Search with Embeddings](./tool_search_with_embeddings.ipynb) for the embeddings-based approach.

Generated by nbdime

github-actions · 2025-11-26T20:54:54Z

Summary

Status	Count
🔍 Total	3
✅ Successful	0
⏳ Timeouts	0
🔀 Redirected	0
👻 Excluded	0
❓ Unknown	0
🚫 Errors	3
⛔ Unsupported	0

Errors per input

Errors in temp_md/tool_search_alternate_approaches.md

[ERROR] file:///home/runner/work/claude-cookbooks/claude-cookbooks/temp_md/tool_search_with_embeddings.ipynb | Cannot find file: File not found. Check if file exists and path is correct
[200] https://docs.anthropic.com/claude/reference/getting-started-with-the-api | Rejected status code (this depends on your "accept" configuration): OK
[200] https://docs.anthropic.com/en/docs/build-with-claude/tool-use | Rejected status code (this depends on your "accept" configuration): OK
Full Github Actions output

github-actions · 2025-11-26T20:58:41Z

Notebook Changes

This PR modifies the following notebooks:

📓 `tool_use/tool_search_alternate_approaches.ipynb`

View diff

nbdiff /dev/null tool_use/tool_search_alternate_approaches.ipynb (2d09aba33d5dd397f7a5f49f57dc51174213bfbb)
--- /dev/null  2025-11-26 20:56:33.925793
+++ tool_use/tool_search_alternate_approaches.ipynb (2d09aba33d5dd397f7a5f49f57dc51174213bfbb)  (no timestamp)
## added /cells:
+  markdown cell:
+    source:
+      # Tool Search: Alternate Approaches
+      
+      **Recommend first reading see the cookbook: Tool Search with Embeddings.**
+      
+      The goal of this cookbook is to show of some alternate approaches to using tool search (or really "tool discovery") with Claude. In this cookbook we'll demonstrate two useful techniques:
+      
+      1. Tools can be discovered without "Search". In this cookbook, we'll include all of the tool names in Claude's system prompt and provide Claude with decribe_tool_tool to load the tool fully into 
+      Claude's context.
+      2. Tools do not have to be passed in the request's `tools` list if they have not been loaded into Claude's context yet. This can be a bit more application complexity to manage, but can allow your 
+      application to keep requests small, even while Claude has potential access to thousands of tools.
+      
+      Users have a lot of flexibility to design tool search to keep Claude's context (and Messages requests) as focused as possible.
+      
+      ## Prerequisites
+      
+      Before following this guide, ensure you have:
+      
+      **Required Knowledge**
+      - Python fundamentals - comfortable with functions, dictionaries, and basic data structures
+      - Basic understanding of Claude tool use - we recommend reading the [Tool Use Guide](https://docs.anthropic.com/en/docs/build-with-claude/tool-use) first
+      
+      **Required Tools**
+      - Python 3.11 or higher
+      - Anthropic API key ([get one here](https://docs.anthropic.com/claude/reference/getting-started-with-the-api))
+  code cell:
+    source:
+      %pip install -q anthropic python-dotenv
+  code cell:
+    source:
+      import anthropic
+      import json
+      from dotenv import load_dotenv
+      
+      load_dotenv()
+      
+      MODEL = "claude-sonnet-4-5-20250929"
+      client = anthropic.Anthropic()
+      
+      print("✓ Client initialized")
+  markdown cell:
+    source:
+      ## Define Tool Library
+      
+      We'll define 5 simple tools. In production, this could be hundreds or thousands of tools stored in a database or configuration file.
+  code cell:
+    source:
+      TOOL_LIBRARY = {
+          "get_weather": {
+              "name": "get_weather",
+              "description": "Get current weather for a city",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "city": {"type": "string", "description": "City name"},
+                  },
+                  "required": ["city"],
+              },
+          },
+          "get_stock_price": {
+              "name": "get_stock_price",
+              "description": "Get current stock price for a ticker symbol",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "ticker": {"type": "string", "description": "Stock ticker (e.g., AAPL)"},
+                  },
+                  "required": ["ticker"],
+              },
+          },
+          "convert_currency": {
+              "name": "convert_currency",
+              "description": "Convert amount between currencies",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "amount": {"type": "number"},
+                      "from_currency": {"type": "string"},
+                      "to_currency": {"type": "string"},
+                  },
+                  "required": ["amount", "from_currency", "to_currency"],
+              },
+          },
+          "calculate_tip": {
+              "name": "calculate_tip",
+              "description": "Calculate tip amount for a bill",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "bill_amount": {"type": "number"},
+                      "tip_percent": {"type": "number", "default": 20},
+                  },
+                  "required": ["bill_amount"],
+              },
+          },
+          "send_email": {
+              "name": "send_email",
+              "description": "Send an email to a recipient",
+              "input_schema": {
+                  "type": "object",
+                  "properties": {
+                      "to": {"type": "string", "description": "Recipient email"},
+                      "subject": {"type": "string"},
+                      "body": {"type": "string"},
+                  },
+                  "required": ["to", "subject", "body"],
+              },
+          },
+      }
+      
+      print(f"✓ Defined {len(TOOL_LIBRARY)} tools: {list(TOOL_LIBRARY.keys())}")
+  markdown cell:
+    source:
+      ## The `describe_tool` Tool
+      
+      Instead of semantic search, we give Claude a simple `describe_tool` tool. Claude calls this with a tool name to load that tool into context.
+      
+      The system prompt lists all available tool names, so Claude knows what's available without needing embeddings or search.
+  code cell:
+    source:
+      DESCRIBE_TOOL = {
+          "name": "describe_tool",
+          "description": "Load a tool's full definition into context. Call this before using any tool for the first time.",
+          "input_schema": {
+              "type": "object",
+              "properties": {
+                  "tool_name": {
+                      "type": "string",
+                      "description": "Name of the tool to load",
+                  },
+              },
+              "required": ["tool_name"],
+          },
+      }
+      
+      # Build system prompt with tool catalog
+      tool_names = list(TOOL_LIBRARY.keys())
+      SYSTEM_PROMPT = f"""You are a helpful assistant with access to various tools.
+      
+      Available tools: {', '.join(tool_names)}
+      
+      Before using any tool, you must first call describe_tool with the tool name to load it."""
+      
+      print("System prompt:")
+      print(SYSTEM_PROMPT)
+  markdown cell:
+    source:
+      ## Mock Tool Execution
+      
+      Simple mock responses for demonstration:
+  code cell:
+    source:
+      def execute_tool(name: str, inputs: dict) -> str:
+          """Mock tool execution."""
+          if name == "get_weather":
+              return json.dumps({"city": inputs["city"], "temp": "72°F", "conditions": "Sunny"})
+          elif name == "get_stock_price":
+              return json.dumps({"ticker": inputs["ticker"], "price": 185.50, "change": "+1.2%"})
+          elif name == "convert_currency":
+              rate = 0.92 if inputs["to_currency"] == "EUR" else 1.0
+              converted = inputs["amount"] * rate
+              return json.dumps({"converted": round(converted, 2), "to": inputs["to_currency"]})
+          elif name == "calculate_tip":
+              tip = inputs["bill_amount"] * (inputs.get("tip_percent", 20) / 100)
+              return json.dumps({"tip": round(tip, 2), "total": round(inputs["bill_amount"] + tip, 2)})
+          elif name == "send_email":
+              return json.dumps({"status": "sent", "to": inputs["to"]})
+          return json.dumps({"error": f"Unknown tool: {name}"})
+      
+      print("✓ Mock execution ready")
+  markdown cell:
+    source:
+      ## Conversation Loop with Dynamic Tool Loading
+      
+      The key pattern here:
+      
+      1. **Start with only `describe_tool`** in the tools list
+      2. **When Claude calls `describe_tool`**, return a `tool_reference` AND add the tool to `active_tools` with `defer_loading=True`
+      3. **`defer_loading=True` is critical** - it keeps the tool definition out of the cached prompt prefix, avoiding cache invalidation when tools are discovered
+      
+      Every time Claude sees a `tool_reference` in the conversation, the full tool definition is loaded into Claude's context at that point in the conversation.
+  code cell:
+    source:
+      def run_conversation(user_message: str, max_turns: int = 10):
+          """Run a conversation with dynamic tool loading."""
+          print(f"\n{'='*60}")
+          print(f"USER: {user_message}")
+          print(f"{'='*60}\n")
+      
+          messages = [{"role": "user", "content": user_message}]
+          
+          # Start with ONLY describe_tool - no other tools in the request
+          active_tools = [DESCRIBE_TOOL]
+          loaded_tools = set()  # Track which tools we've added
+      
+          for turn in range(max_turns):
+              print(f"--- Turn {turn + 1} (tools in request: {len(active_tools)}) ---")
+      
+              response = client.messages.create(
+                  model=MODEL,
+                  max_tokens=1024,
+                  system=SYSTEM_PROMPT,
+                  tools=active_tools,
+                  messages=messages,
+                  extra_headers={"anthropic-beta": "advanced-tool-use-2025-11-20"},
+              )
+      
+              messages.append({"role": "assistant", "content": response.content})
+      
+              if response.stop_reason == "end_turn":
+                  for block in response.content:
+                      if block.type == "text":
+                          print(f"\nASSISTANT: {block.text}")
+                  break
+      
+              # Process tool calls
+              tool_results = []
+              for block in response.content:
+                  if block.type == "text" and block.text:
+                      print(f"ASSISTANT: {block.text}")
+                  
+                  elif block.type == "tool_use":
+                      tool_name = block.name
+                      tool_input = block.input
+      
+                      if tool_name == "describe_tool":
+                          requested_tool = tool_input["tool_name"]
+                          print(f"🔍 describe_tool({requested_tool})")
+      
+                          if requested_tool in TOOL_LIBRARY:
+                              # Add tool to active_tools with defer_loading=True
+                              # This is critical for prompt caching!
+                              if requested_tool not in loaded_tools:
+                                  tool_def = {**TOOL_LIBRARY[requested_tool], "defer_loading": True}
+                                  active_tools.append(tool_def)
+                                  loaded_tools.add(requested_tool)
+                                  print(f"   ✓ Added {requested_tool} to tools (defer_loading=True)")
+      
+                              # Return tool_reference so Claude can use it
+                              tool_results.append({
+                                  "type": "tool_result",
+                                  "tool_use_id": block.id,
+                                  "content": [{"type": "tool_reference", "tool_name": requested_tool}],
+                              })
+                          else:
+                              tool_results.append({
+                                  "type": "tool_result",
+                                  "tool_use_id": block.id,
+                                  "content": f"Tool '{requested_tool}' not found.",
+                              })
+                      else:
+                          # Execute discovered tool
+                          print(f"🔧 {tool_name}({json.dumps(tool_input)})")
+                          result = execute_tool(tool_name, tool_input)
+                          print(f"   → {result}")
+                          tool_results.append({
+                              "type": "tool_result",
+                              "tool_use_id": block.id,
+                              "content": result,
+                          })
+      
+              if tool_results:
+                  messages.append({"role": "user", "content": tool_results})
+      
+          print(f"\n{'='*60}\n")
+      
+      print("✓ Conversation loop ready")
+  markdown cell:
+    source:
+      ## Example: Weather Query
+      
+      Watch how Claude:
+      1. Sees `get_weather` in the system prompt's tool list
+      2. Calls `describe_tool("get_weather")` to load it
+      3. Receives the `tool_reference` and can now use the tool
+  code cell:
+    source:
+      run_conversation("What's the weather in Tokyo?")
+  markdown cell:
+    source:
+      ## Example: Multi-Tool Query
+      
+      Claude can load multiple tools as needed:
+  code cell:
+    source:
+      run_conversation("Convert $100 to EUR, then calculate a 20% tip on a $85 dinner bill.")
+  markdown cell:
+    source:
+      ## Why `defer_loading=True` Matters
+      
+      When you add a tool to the `tools` list, the tool definition is normally loaded into the very beginning of Claude's context window. If you add new tools, you will lose most of the cache because the very beginning of the context window has changed.
+      
+      **With `defer_loading=True`:**
+      - The tool definition is NOT included into the beginning of Claude's context window.
+      - Instead, it's loaded into context when Claude sees the `tool_reference`
+      - This means your system prompt and initial tools stay cached even as you discover new tools
+      
+      **The pattern:**
+      ```python
+      # Initial request - only describe_tool, system prompt is cached
+      tools = [DESCRIBE_TOOL]
+      
+      # After Claude calls describe_tool("get_weather")
+      # Add with defer_loading to preserve cache
+      tools.append({**TOOL_LIBRARY["get_weather"], "defer_loading": True})
+      
+      # Return tool_reference so Claude knows it's available
+      tool_result = [{"type": "tool_reference", "tool_name": "get_weather"}]
+      ```
+      
+      This is essential for applications with hundreds or thousands of tools where you want to:
+      - Keep initial request size small
+      - Preserve prompt caching across tool discoveries
+      - Only load tools Claude actually needs
+  markdown cell:
+    source:
+      ## Conclusion
+      
+      The key insight from this cookbook: **tools don't need to be in the `tools` list until Claude needs them**. Combined with `defer_loading=True` and `tool_reference`, this lets you scale to thousands of tools while keeping requests small and preserving prompt caching. However, in this case your client will need to provide a tool discovery mechanism.
+      
+      The `describe_tool` approach shown here is just one flavor. Other patterns include:
+      - **`list_tools`** - Returns tool names matching a category or keyword
+      - **Hierarchical discovery** - Browse tool categories, then load specific tools
+      - **Hybrid** - Combine listing with semantic search for large catalogs
+      
+      The core pattern is always:
+      1. Return `tool_reference` when a tool is discovered
+      2. Add the tool with `defer_loading=True` to preserve caching
+      3. Claude can then use the tool immediately
+      
+      See [Tool Search with Embeddings](./tool_search_with_embeddings.ipynb) for the embeddings-based approach.

Generated by nbdime

PedramNavid · 2025-11-26T22:05:19Z

Hi! @noble-ant not sure why the notebook review didnt post! but here it is

Notebook Review: tool_search_alternate_approaches.ipynb
Overall Assessment: EXCELLENT
This is a well-crafted, technically sound notebook that effectively demonstrates an alternative approach to tool discovery with Claude. The code is production-ready and the explanations are clear and thorough.

Strengths

Clear Structure & Pedagogy
Excellent progression from simple concepts to advanced patterns
Well-organized with clear section headers
Strong prerequisite section helps readers assess readiness
Good use of incremental examples (single tool, then multi-tool)
Technical Excellence
Correctly uses the defer_loading=True parameter
Proper use of tool_reference in tool results
Correct API usage with the beta header: anthropic-beta: advanced-tool-use-2025-11-20
Uses the latest model: claude-sonnet-4-5-20250929
Documentation Quality
Clear explanation of WHY this approach matters (prompt caching preservation)
Good comparison with alternative patterns (list_tools, hierarchical, hybrid)
Helpful visual formatting with emojis in output (🔍, 🔧)
Code comments are concise and informative
Practical Implementation
Realistic mock tools covering diverse use cases
Complete conversation loop with proper turn management
Good tracking of loaded tools to avoid duplicates
Proper error handling for unknown tools
Issues Found

CRITICAL ISSUES: None

MAJOR ISSUES:

Typo in cell-0 (tool_search_alternate_approaches.ipynb:7)
Text says: "provide Claude with decribe_tool_tool to load the tool fully"
Should be: "provide Claude with describe_tool to load the tool fully"
Missing underscore and has duplicate "tool"

MINOR ISSUES:

Inconsistent reference in cell-0 (tool_search_alternate_approaches.ipynb:3)

States: "Recommend first reading see the cookbook: Tool Search with Embeddings"
Grammar is awkward ("reading see")
Suggested fix: "We recommend first reading the cookbook: Tool Search with Embeddings"
Link reference style (tool_search_alternate_approaches.ipynb:274)

Uses relative path: ./tool_search_with_embeddings.ipynb

Consider verifying this file exists at that path
Print statement formatting (cell-10, tool_search_alternate_approaches.ipynb:162)

The turn counter shows "tools in request" but doesn't show which tools

Could be more informative: f"--- Turn {turn + 1} (tools: {[t['name'] for t in active_tools]}) ---"
Recommendations
Educational Enhancements:

Consider adding a cell showing cache metrics or token counts to demonstrate the caching benefit

Could add a visualization comparing request sizes with/without defer_loading
Might benefit from a troubleshooting section for common mistakes

Code Quality:

The execute_tool function could use type hints for better IDE support
Consider adding a constant for max_turns default value
The hardcoded rate in execute_tool (0.92) could be noted as mock data more explicitly

Documentation:

Could explicitly mention that this pattern requires the "advanced-tool-use-2025-11-20" beta
Consider adding a performance comparison section (when to use this vs embeddings approach)

Code Correctness Verification
All code appears correct:

API calls use proper parameters
Tool schemas follow correct JSON Schema format
Message structure matches API requirements
Tool result format is correct with tool_reference
Loop logic handles all stop reasons appropriately

Security & Best Practices
✓ No hardcoded API keys (uses environment variables)
✓ No execution of arbitrary code
✓ Mock implementations are safe
✓ No sensitive data in examples
✓ Proper input validation in tool schemas

Conclusion
This is an excellent notebook that demonstrates advanced Claude API features with clarity and technical precision. The only required fix is the typo in cell-0. The minor grammar issue could be addressed, but the notebook is otherwise publication-ready.

Recommendation: APPROVE with minor corrections

Add tool_search_alternate_approaches notebook

995c965

noble-ant requested review from henrykeetay and noahp-anthropic November 26, 2025 20:54

noble-ant marked this pull request as ready for review November 26, 2025 20:54

Update the intro

2d09aba

henrykeetay approved these changes Nov 26, 2025

View reviewed changes

noble-ant merged commit 7dc310a into main Jan 5, 2026
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add tool_search_alternate_approaches notebook#306

Add tool_search_alternate_approaches notebook#306
noble-ant merged 2 commits into
mainfrom
noble/tst_alt

noble-ant commented Nov 26, 2025

Uh oh!

github-actions Bot commented Nov 26, 2025

Uh oh!

github-actions Bot commented Nov 26, 2025 •

edited

Loading

Uh oh!

github-actions Bot commented Nov 26, 2025

Uh oh!

PedramNavid commented Nov 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

noble-ant commented Nov 26, 2025

Description

Type of Change

Cookbook Checklist (if applicable)

Testing

Additional Context

Uh oh!

github-actions Bot commented Nov 26, 2025

Notebook Changes

📓 tool_use/tool_search_alternate_approaches.ipynb

Uh oh!

github-actions Bot commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Errors per input

Errors in temp_md/tool_search_alternate_approaches.md

Uh oh!

github-actions Bot commented Nov 26, 2025

Notebook Changes

📓 tool_use/tool_search_alternate_approaches.ipynb

Uh oh!

PedramNavid commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

📓 `tool_use/tool_search_alternate_approaches.ipynb`

github-actions Bot commented Nov 26, 2025 •

edited

Loading

📓 `tool_use/tool_search_alternate_approaches.ipynb`

PedramNavid commented Nov 26, 2025 •

edited

Loading