Skip to content

Commit 71169b5

Browse files
fix: remove multi-tool query — agent reflection loop cannot chain tools
AutoGen's reflection loop returns 500 ("Reflect on tool use produced no valid text response") when two tools are called in a single turn. Revert the multi-tool query until the agent supports chained tool calls. Tested: 11/11 behavioral tests pass against agentic-mcp cluster. Co-authored-by: Cursor <cursoragent@cursor.com>
1 parent 421463c commit 71169b5

2 files changed

Lines changed: 0 additions & 11 deletions

File tree

agents/autogen/mcp_agent/evalhub/tool_use.yaml

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -22,10 +22,6 @@ queries:
2222
expected_tools: ["add"]
2323
expected_elements: ["300"]
2424

25-
- query: "First, use the add tool to compute 100 + 200. Then, use the subtract tool to subtract 50 from that result."
26-
expected_tools: ["add", "sub"]
27-
expected_elements: ["250"]
28-
2925
- query: "Hello, how are you today?"
3026
expected_tools: []
3127
expected_elements: []

agents/autogen/mcp_agent/tests/behavioral/fixtures/golden_queries.yaml

Lines changed: 0 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -33,13 +33,6 @@ queries:
3333
difficulty: medium
3434
category: sub
3535

36-
# --- Multi-tool: add then sub (exercises tool_sequence scorer) ---
37-
- query: "First, use the add tool to compute 100 + 200. Then, use the subtract tool to subtract 50 from that result."
38-
expected_tools: ["add", "sub"]
39-
expected_elements: ["250"]
40-
difficulty: medium
41-
category: multi_tool
42-
4336
# --- Greeting (no tools) ---
4437
- query: "Hello"
4538
expected_tools: []

0 commit comments

Comments
 (0)