MultiPL-E

general
text
About

MultiPL-E benchmark

Evaluation Stats
Total Models10
Organizations2
Verified Results0
Self-Reported10
Benchmark Details
Max Score100
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

10 models
Top Score
87.9%
Average Score
72.7%
High Performers (80%+)
2

Top Organizations

#1Moonshot AI
1 model
85.7%
#2Alibaba
9 models
71.3%
Leaderboard
Top 10 models ranked by performance
87.9%
Raw: 87.9
Self-reported
85.7%
Raw: 85.7
Self-reported
75.4%
Raw: 75.4
Self-reported
75.1%
Raw: 75.1
Self-reported
72.8%
Raw: 72.8
Self-reported
70.4%
Raw: 70.39999999999999
Self-reported
69.2%
Raw: 69.19999999999999
Self-reported
65.9%
Raw: 65.94
Self-reported
65.8%
Raw: 65.8
Self-reported
59.1%
Raw: 59.09999999999999
Self-reported