Multipl-E MBPP

code
text
About

Multipl-E MBPP benchmark

Evaluation Stats
Total Models3
Organizations1
Verified Results0
Self-Reported3
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

3 models
Top Score
65.7%
Average Score
60.0%
High Performers (80%+)
0

Top Organizations

#1Meta
3 models
60.0%
Leaderboard
Top 3 models ranked by performance
65.7%
Raw: 0.657
Self-reported
62.0%
Raw: 0.62
Self-reported
52.4%
Raw: 0.524
Self-reported