SimpleQA

general
text
About

SimpleQA benchmark

Evaluation Stats
Total Models23
Organizations7
Verified Results0
Self-Reported23
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

23 models
Top Score
62.5%
Average Score
26.8%
High Performers (80%+)
0

Top Organizations

#1Alibaba
1 model
54.3%
#2OpenAI
5 models
41.0%
#3Moonshot AI
2 models
33.1%
#4DeepSeek
3 models
27.6%
#5Google
9 models
20.3%
Leaderboard
Top 20 models ranked by performance
62.5%
Raw: 0.625
Self-reported
54.3%
Raw: 0.543
Self-reported
54.0%
Raw: 0.54
Self-reported
47.0%
Raw: 0.47
Self-reported
42.5%
Raw: 0.425
Self-reported
42.4%
Raw: 0.424
Self-reported
38.2%
Raw: 0.382
Self-reported
35.3%
Raw: 0.353
Self-reported
31.0%
Raw: 0.31
Self-reported
30.1%
Raw: 0.301
Self-reported
27.8%
Raw: 0.278
Self-reported
26.9%
Raw: 0.269
Self-reported
24.9%
Raw: 0.249
Self-reported
21.7%
Raw: 0.217
Self-reported
15.0%
Raw: 0.15
Self-reported
12.1%
Raw: 0.121
Self-reported
10.8%
Raw: 0.108
Self-reported
10.7%
Raw: 0.107
Self-reported
10.4%
Raw: 0.1043
Self-reported
10.0%
Raw: 0.1
Self-reported
Showing top 20 of 23 models