MMLU-ProX
general
text
About
MMLU-ProX benchmark
Evaluation Stats
Total Models5
Organizations2
Verified Results0
Self-Reported5
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
5 models
Top Score
79.4%
Average Score
27.1%
High Performers (80%+)
0Top Organizations
#1Alibaba
1 model
79.4%
#2Google
4 models
14.0%
Leaderboard
Top 5 models ranked by performance
79.4%
Raw: 0.794
Self-reported
19.9%
Raw: 0.199
Self-reported
19.9%
Raw: 0.199
Self-reported
8.1%
Raw: 0.081
Self-reported
8.1%
Raw: 0.081
Self-reported