Repositories list
70 repositories
sae-trait-annotation (Public)
Mind2Web-2 (Public)
QUEST (Public)
ACuRL (Public)
GUI-Agents-Paper-List (Public)
saev (Public)
FREA (Public)
D3-Gym (Public)
ScienceAgentBench (Public): [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Loop-Think-Generalize (Public)
LLM-IOAA (Public): Code and data for the paper "Large Language Models Achieve Gold Medal Performance at the International Olympiad on Astronomy & Astrophysics (IOAA)" (https://arx…
UGround (Public): [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Explorer (Public): [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
AutoElicit (Public)
RedTeamCUA (Public)
cobalt (Public): Code and data for the paper "Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation"
GUI-Drag (Public)
Online-Mind2Web (Public)
SciNav (Public)
TravelPlanner (Public): [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
Mind2Web (Public)
AgentSafety (Public)
WebDreamer (Public)
HippoRAG (Public)
AttributionBench (Public)
hal-harness (Public)
AutoSDT (Public): [EMNLP'25] AutoSDT is a fully automatic pipeline to collect data-driven scientific coding tasks to train co-scientist models.
WebGuard (Public)
GrokkedTransformer (Public)