Pull requests: openai/simple-evals

Pull requests list

feat: add len_var scorer (B-0)
#71 opened May 8, 2025 by Yuu6798
fix regex bug in browsecomp
#67 opened Apr 22, 2025 by tengyaolong2000
fix: import collision for types
#66 opened Apr 21, 2025 by Ithanil
Small typo in grader
#57 opened Mar 25, 2025 by chiruu12
add aime task
#55 opened Mar 12, 2025 by jason9693
Add the F-score metric from the simpleqa paper.
#53 opened Mar 10, 2025 by wbaek
Initial commit
#45 opened Feb 1, 2025 by osmanjamalfarag
Grok Sampler
#40 opened Jan 9, 2025 by rolandgvc
correct string spelling error
#37 opened Dec 27, 2024 by owos
Use correct _pack_message function name
#12 opened May 20, 2024 by andrewmbenton
fix typo
#10 opened May 20, 2024 by dongZheX
Added Chartqa Dataset
#6 opened Apr 14, 2024 by tarunamasa
Remove blobfile dep, load directly from URL
#4 opened Apr 12, 2024 by arkadyark-cohere