Skip to content

feat: add ARC-AGI-1, ARC-AGI-2, and BrowseComp benchmark tasks #39

feat: add ARC-AGI-1, ARC-AGI-2, and BrowseComp benchmark tasks

feat: add ARC-AGI-1, ARC-AGI-2, and BrowseComp benchmark tasks #39

compare-task-input-boundary

succeeded Feb 23, 2026 in 1m 56s