Commit 6d74274
fix: default EvalHub adapter to stream=false for reliable tool scoring
Non-streaming responses include tool_invocations/tool_calls in the JSON
body. Streaming relies on delta.tool_calls which not all agents emit
(e.g. AutoGen uses a custom mcp.tool_usage SSE event). Defaulting to
false ensures tool scorers work for all agents out of the box; jobs can
still opt in to streaming via job parameters.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 71169b5 commit 6d74274
1 file changed
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
96 | 96 | | |
97 | 97 | | |
98 | 98 | | |
99 | | - | |
| 99 | + | |
100 | 100 | | |
101 | 101 | | |
102 | 102 | | |
| |||
0 commit comments