SuperGPQA

general
text
About

SuperGPQA benchmark

Evaluation Stats
Total Models4
Organizations2
Verified Results0
Self-Reported4
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

4 models
Top Score
62.6%
Average Score
52.1%
High Performers (80%+)
0

Top Organizations

#1Alibaba
2 models
53.3%
#2Moonshot AI
2 models
50.9%
Leaderboard
Top 4 models ranked by performance
62.6%
Raw: 0.626
Self-reported
57.2%
Raw: 0.572
Self-reported
44.7%
Raw: 0.447
Self-reported
44.1%
Raw: 0.4406
Self-reported