Commit 1892a79

initial implementation of leaderboard. Lots of stuff can be improved but this brings the core idea
Signed-off-by: Oleg Silkin <97077423+RobotSail@users.noreply.github.com>
1 parent 84748e1 commit 1892a79

3 files changed

Lines changed: 587 additions & 1 deletion

pyproject.toml

Lines changed: 1 addition & 0 deletions
@@ -42,6 +42,7 @@ issues = "https://github.com/instructlab/eval/issues"
 "mmlu_branch" = "instructlab.eval.mmlu:MMLUBranchEvaluator"
 "mt_bench" = "instructlab.eval.mt_bench:MTBenchEvaluator"
 "mt_bench_branch" = "instructlab.eval.mt_bench:MTBenchBranchEvaluator"
+"leaderboard_v2" = "instructlab.eval.leaderboard:LeaderboardV2Evaluator"
 
 [tool.setuptools_scm]
 version_file = "src/instructlab/eval/_version.py"
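
The added line registers an entry point whose value has the form `module:attribute`. As a rough illustration of how such a value resolves to a class, the sketch below imports the module and walks the attribute path. `load_entry_point_value` is a hypothetical helper, not part of instructlab, and the demo resolves a standard-library target so it runs without the package installed. With the package installed, `importlib.metadata.entry_points()` would normally do the discovery; the entry-point group name sits above this hunk and is not shown here.

```python
from importlib import import_module


def load_entry_point_value(value: str):
    """Resolve a 'module:attribute' string to the object it names.

    Hypothetical helper for illustration only; it mirrors the
    'module:attr' convention used by entry-point values.
    """
    module_name, _, attr_path = value.partition(":")
    obj = import_module(module_name)
    # The attribute part may be dotted (e.g. "Outer.Inner"); walk each piece.
    for attr in filter(None, attr_path.split(".")):
        obj = getattr(obj, attr)
    return obj


# Demo with a standard-library target. The value registered in this commit,
# "instructlab.eval.leaderboard:LeaderboardV2Evaluator", would resolve the
# same way once the package is installed.
decoder_cls = load_entry_point_value("json.decoder:JSONDecoder")
print(decoder_cls.__name__)
```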

requirements.txt

Lines changed: 3 additions & 1 deletion
@@ -8,6 +8,8 @@ transformers
 accelerate
 pandas
 pandas-stubs
-lm-eval>=0.4.4
+# All optional dependencies like this can be found in lm-eval:
+# https://github.com/EleutherAI/lm-evaluation-harness/blob/main/pyproject.toml
+lm-eval[math,ifeval,sentencepiece,vllm]>=0.4.4
 httpx
 ragas>=0.2.11
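
The changed requirement uses the `name[extra1,extra2]specifier` syntax to pull in optional dependency groups. The sketch below splits such a line into its parts to make the structure explicit. `split_requirement` is a hypothetical helper written for this note, and it only handles the simple forms seen in this file; a real tool would more likely rely on `packaging.requirements.Requirement`.

```python
import re

# Simplified pattern: package name, optional [extras], then the rest as the
# version specifier. This is an assumption-laden sketch, not a full parser.
_REQ = re.compile(
    r"^(?P<name>[A-Za-z0-9][A-Za-z0-9._-]*)\s*"
    r"(?:\[(?P<extras>[^\]]*)\])?\s*"
    r"(?P<spec>.*)$"
)


def split_requirement(line: str):
    """Return (name, extras, specifier), or None for blank/comment lines."""
    line = line.split("#", 1)[0].strip()  # drop comments
    if not line:
        return None
    match = _REQ.match(line)
    if match is None:
        raise ValueError(f"unrecognized requirement: {line!r}")
    extras = [e.strip() for e in (match.group("extras") or "").split(",") if e.strip()]
    return match.group("name"), extras, match.group("spec").strip()


old = split_requirement("lm-eval>=0.4.4")
new = split_requirement("lm-eval[math,ifeval,sentencepiece,vllm]>=0.4.4")
print(old)
print(new)
```

Both lines keep the same package name and version floor; only the list of extras differs.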
