
Pull requests: EleutherAI/lm-evaluation-harness

Pull requests list

Math 500
#3381 opened Nov 1, 2025 by seldereyy
[fix] crows_pairs dataset
#3378 opened Oct 31, 2025 by jannalulu
[feat] add graphwalks
#3377 opened Oct 31, 2025 by jannalulu
fix trust_remote_code=True for longbench
#3361 opened Oct 22, 2025 by jannalulu
Longbench group fix
#3359 opened Oct 22, 2025 by jannalulu
Fix issue 3355 assertion error
#3356 opened Oct 20, 2025 by marksverdhei
Add gsm_symbolic and gsm_symbolic_cot tasks
#3354 opened Oct 19, 2025 by MengAiDev
fix(tasks):pin correct MMLUSR version
#3350 opened Oct 16, 2025 by christinaexyou
added azure openai support
#3349 opened Oct 16, 2025 by zinccat
Added ULQA benchmark
#3340 opened Oct 13, 2025 by keramjan
Add MATH500
#3311 opened Sep 26, 2025 by jannalulu
Support torchrun vllm DP
#3304 opened Sep 19, 2025 by luccafong
Gemini evaluation support
#3300 opened Sep 15, 2025 by IsraelAbebe
Fix lambada_multilingual_stablelm
#3294 opened Sep 11, 2025 by jmichaelov
Adding SPaRC to lm eval harness
#3262 opened Aug 25, 2025 by lkaesberg