Skip to content

Pull requests: OpenGPTX/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Implemented Polyglotoxicityprompts
#109 opened Aug 23, 2024 by jjbuschhoff Loading…
Crowspairsde
#108 opened Feb 14, 2024 by NAM00 Loading…
Implemented EU22/EU5 translations
#105 opened Jan 11, 2024 by jjbuschhoff Loading…
Implemented Belebele benchmark
#101 opened Nov 15, 2023 by jjbuschhoff Loading…
Fix unnatural tokenizations if possible
#100 opened Nov 8, 2023 by KlaudiaTH Loading…
Megatronlm client
#94 opened Sep 18, 2023 by KlaudiaTH Loading…
Batch evaluation script
#92 opened Sep 12, 2023 by jjbuschhoff Loading…
Enables self-hosted test runner
#69 opened Jan 17, 2023 by malteos Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.