Skip to content

Pull requests: openai/simple-evals

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Enhance simple-evals for beginner to run
#87 opened Jun 5, 2025 by ECNU3D Loading…
Jules version of beginner friendly README
#81 opened May 20, 2025 by rarhs Loading…
feat: add len_var scorer (B-0)
#71 opened May 8, 2025 by Yuu6798 Loading…
fix regex bug in browsecomp
#67 opened Apr 22, 2025 by tengyaolong2000 Loading…
fix: import collision for types
#66 opened Apr 21, 2025 by Ithanil Loading…
Small typo in grader
#57 opened Mar 25, 2025 by chiruu12 Loading…
add aime task
#55 opened Mar 12, 2025 by jason9693 Loading…
Add the F-score metric from the simpleqa paper.
#53 opened Mar 10, 2025 by wbaek Loading…
Initial commit
#45 opened Feb 1, 2025 by osmanjamalfarag Loading…
Grok Sampler
#40 opened Jan 9, 2025 by rolandgvc Loading…
correct string spelling error
#37 opened Dec 27, 2024 by owos Loading…
Use correct _pack_message function name
#12 opened May 20, 2024 by andrewmbenton Loading…
fix typo
#10 opened May 20, 2024 by dongZheX Loading…
Added Chartqa Dataset
#6 opened Apr 14, 2024 by tarunamasa Loading…
Remove blobfile dep, load directly from URL
#4 opened Apr 12, 2024 by arkadyark-cohere Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.