 summary: "AI model coding capability rankings based on SWE-bench Bash-Only"
 data_source: "https://www.swebench.com/"
 benchmarks: [SWE-bench Bash-Only]
-last_updated: "2026-05-16"
+last_updated: "2026-05-23"
 auto_updated: true
-date: "2026-05-16"
+date: "2026-05-23"
 ---

+> Data source: SWE-bench official Bash-Only leaderboard (mini-SWE-agent v2.0.0, 500 instances, single attempt). Data retrieved February 2026. LMSYS Chatbot Arena and other leaderboards temporarily unavailable due to network restrictions.
+
 | Rank | Model | Vendor | SWE-bench | Type |
 |---|---|---|---|---|
 | 🥇 | Claude 4.5 Opus | Anthropic | 76.8% | Closed |