SimpleQA
general
text
About
SimpleQA benchmark
Evaluation Stats
Total Models: 23
Organizations: 7
Verified Results: 0
Self-Reported: 23
Benchmark Details
Max Score: 1
Language: en
Performance Overview
Score distribution and top performers
Score Distribution
23 models
Top Score: 62.5%
Average Score: 26.8%
High Performers (80%+): 0
Top Organizations
#1 Alibaba: 1 model, 54.3%
#2 OpenAI: 5 models, 41.0%
#3 Moonshot AI: 2 models, 33.1%
#4 DeepSeek: 3 models, 27.6%
#5 Google: 9 models, 20.3%
Leaderboard
Top 20 models ranked by performance
54.3% (raw: 0.543), self-reported
54.0% (raw: 0.54), self-reported
42.5% (raw: 0.425), self-reported
#6 42.4% (raw: 0.424), self-reported
35.3% (raw: 0.353), self-reported
31.0% (raw: 0.31), self-reported
#10 30.1% (raw: 0.301), self-reported
27.8% (raw: 0.278), self-reported
26.9% (raw: 0.269), self-reported
#13 24.9% (raw: 0.249), self-reported
21.7% (raw: 0.217), self-reported
12.1% (raw: 0.121), self-reported
#17 10.8% (raw: 0.108), self-reported
10.7% (raw: 0.107), self-reported
10.4% (raw: 0.1043), self-reported
#20 10.0% (raw: 0.1), self-reported
Showing top 20 of 23 models
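The percentages above appear to be the raw scores rescaled against the maximum score of 1 and rounded to one decimal place. A minimal sketch of that conversion, and of the 80% "High Performers" count, follows. The function names are hypothetical, and the rounding rule and the reading of the threshold as 0.80 on the raw scale are assumptions, not something the page states. The list contains only the raw values visible in the rows above, not all 23 models, so it will not reproduce the page's top score or average.

```python
# Hypothetical helpers for illustration; not taken from the leaderboard's own code.
MAX_SCORE = 1.0  # "Max Score" listed under Benchmark Details


def to_percent(raw: float, max_score: float = MAX_SCORE) -> str:
    """Rescale a raw score to a percentage string with one decimal place."""
    return f"{raw / max_score * 100:.1f}%"


def count_high_performers(raw_scores, threshold: float = 0.80) -> int:
    """Count scores at or above the threshold (assumed to mean 80% of max)."""
    return sum(1 for s in raw_scores if s / MAX_SCORE >= threshold)


# Raw values copied from the rows shown on this page (a subset of the 23 models).
visible_raw = [
    0.543, 0.54, 0.425, 0.424, 0.353, 0.31, 0.301, 0.278,
    0.269, 0.249, 0.217, 0.121, 0.108, 0.107, 0.1043, 0.1,
]

for raw in visible_raw:
    print(f"{to_percent(raw)} (raw: {raw})")

print("High performers among visible rows:", count_high_performers(visible_raw))
```

Under these assumptions, a raw value of 0.1043 is displayed as 10.4%, which matches the corresponding row, and none of the visible rows reaches the 80% threshold.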