Skip to content

Pull requests: huggingface/lighteval

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add LICA-Bench: graphic design VLM evaluation (39 tasks, 7 domains)
#1212 opened Apr 15, 2026 by purvanshi Loading…
3 of 4 tasks
POLLUX LLM-Judge metric
#1210 opened Apr 10, 2026 by ulyanaisaeva Loading…
catch task has no docs instead of throw
#1207 opened Apr 8, 2026 by BuiHoangTu Loading…
add multilingual flag to vllm
#1206 opened Apr 8, 2026 by BuiHoangTu Loading…
Add --load-tasks-multilingual and fix --custom-tasks for inspect backend
#1199 opened Mar 25, 2026 by dzautner Loading…
4 tasks done
[Bugfix] Check all responses when n>1 instead of only the first one
#1197 opened Mar 23, 2026 by eldarkurtic Contributor Loading…
[Litellm Enhancement] Enable extra sampling args for litellm backend
#1195 opened Mar 20, 2026 by eldarkurtic Contributor Loading…
Fix litellm connection pool limiting concurrent_requests
#1190 opened Mar 18, 2026 by sihyeonn Loading…
Update vllm version requirement to 0.17.0
#1183 opened Mar 9, 2026 by NathanHB Member Loading…
Fail fast on non-retriable LiteLLM status codes
#1182 opened Mar 8, 2026 by yangbaechu Loading…
Korean completed and Basque fixed
#1179 opened Mar 6, 2026 by inakiLakunza Loading…
ProTip! Adding no:label will show everything without a label.