Skip to content

feat: add SWE-bench and TAU-bench benchmark suite #54

feat: add SWE-bench and TAU-bench benchmark suite

feat: add SWE-bench and TAU-bench benchmark suite #54

Triggered via pull request February 27, 2026 09:37
Status Success
Total duration 2m 30s
Artifacts

ci.yml

on: pull_request
Lint & TypeCheck
26s
Lint & TypeCheck
Unit Tests
53s
Unit Tests
Matrix: Provider E2E
Agent Integration
27s
Agent Integration
Comprehensive Agent Tests
19s
Comprehensive Agent Tests
CI Summary
4s
CI Summary
Fit to window
Zoom out
Zoom in