Skip to content

Commit 70d3de3

Browse files
committed
feat: add SWE-bench and TAU-bench benchmark suite, fix OpenAI baseUrl version path matching
feat(benchmark): use official SWE-bench Docker images and expand to 12 instances
1 parent 102dd40 commit 70d3de3

37 files changed

Lines changed: 5505 additions & 8 deletions

.env.test.example

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -63,3 +63,9 @@ E2B_TEMPLATE=base
6363

6464
# E2B 沙箱超时时间(毫秒,可选,默认 300000)
6565
E2B_TIMEOUT_MS=300000
66+
67+
# =============================================================================
68+
# Benchmark (for benchmark tests)
69+
# =============================================================================
70+
# Docker 代理(可选,SWE full 模式 git clone 和 Docker 容器使用)
71+
# BENCHMARK_DOCKER_PROXY=http://127.0.0.1:7897

0 commit comments

Comments
 (0)