MultiChallenge

general
text
About

MultiChallenge benchmark

Evaluation Stats
Total Models6
Organizations2
Verified Results0
Self-Reported6
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

6 models
Top Score
54.1%
Average Score
37.8%
High Performers (80%+)
0

Top Organizations

#1Moonshot AI
1 model
54.1%
#2OpenAI
5 models
34.6%
Leaderboard
Top 6 models ranked by performance
54.1%
Raw: 0.541
Self-reported
43.8%
Raw: 0.438
Self-reported
39.9%
Raw: 0.399
Self-reported
38.3%
Raw: 0.383
Self-reported
35.8%
Raw: 0.358
Self-reported
15.0%
Raw: 0.15
Self-reported