Skip to content

Commit d7cab0c

Browse files
authored
Merge pull request #7 from ShiinaHiiragi/main
Add Paper: ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
2 parents a5e6efd + b87206a commit d7cab0c

File tree

1 file changed

+8
-2
lines changed

1 file changed

+8
-2
lines changed

data/benchmark.json

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -510,7 +510,13 @@
510510
"Paper_Url": "https://arxiv.org/abs/2503.07919",
511511
"Highlight": "Emphasizes interaction with live web pages and includes multimodal tasks (e.g., video, audio, 3D) that cannot be solved by text-only methods, addressing limitations of prior benchmarks relying on static or simulated environments.",
512512
"Code_Url": "https://bear-cubs.github.io"
513+
},
514+
{
515+
"Name": "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows",
516+
"Platform": "Desktop",
517+
"Date": "May 2025",
518+
"Paper_Url": "https://arxiv.org/abs/2505.19897",
519+
"Highlight": "A first-of-its-kind evaluation platform for multimodal agents in scientific workflows.",
520+
"Code_Url": "https://github.com/OS-Copilot/ScienceBoard"
513521
}
514-
515-
516522
]

0 commit comments

Comments
 (0)