MultiChallenge
general
text
About
MultiChallenge benchmark
Evaluation Stats
Total Models6
Organizations2
Verified Results0
Self-Reported6
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
6 models
Top Score
54.1%
Average Score
37.8%
High Performers (80%+)
0Top Organizations
#1Moonshot AI
1 model
54.1%
#2OpenAI
5 models
34.6%
Leaderboard
Top 6 models ranked by performance
54.1%
Raw: 0.541
Self-reported
35.8%
Raw: 0.358
Self-reported
15.0%
Raw: 0.15
Self-reported