CSimpleQA
general
text
About
CSimpleQA benchmark
Evaluation Stats
Total Models: 5
Organizations: 3
Verified Results: 0
Self-Reported: 5
Benchmark Details
Max Score: 1
Language: en
Performance Overview
Score distribution and top performers
Score Distribution
5 models
Top Score: 84.3%
Average Score: 73.8%
High Performers (80%+): 1
Top Organizations
#1 Alibaba: 1 model, 84.3%
#2 Moonshot AI: 2 models, 78.0%
#3 DeepSeek: 2 models, 64.3%
Leaderboard
Top 5 models ranked by performance
84.3% (raw: 0.843), self-reported
78.4% (raw: 0.784), self-reported
77.6% (raw: 0.776), self-reported
64.8% (raw: 0.648), self-reported
63.7% (raw: 0.637), self-reported
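The summary figures in the Performance Overview appear to follow directly from the five raw leaderboard scores. The sketch below is a minimal illustration, assuming that each percentage is the raw score divided by the max score of 1, that the average is a plain arithmetic mean, and that "high performer" means a score of at least 80%. The names `raw_scores` and `to_percent` are hypothetical and not taken from the page.

```python
# Raw scores as listed in the leaderboard (fractions of the max score of 1).
raw_scores = [0.843, 0.784, 0.776, 0.648, 0.637]


def to_percent(raw, max_score=1.0):
    """Convert a raw score to a percentage of the maximum score (assumed formula)."""
    return raw / max_score * 100


percents = [to_percent(r) for r in raw_scores]

top_score = max(percents)
# Assumption: the page's average is an unweighted mean over all listed models.
average_score = sum(percents) / len(percents)
# Assumption: the "80%+" bucket includes scores equal to 80%.
high_performers = sum(1 for p in percents if p >= 80.0)

print(f"Top score: {top_score:.1f}%")
print(f"Average score: {average_score:.1f}%")
print(f"High performers (80%+): {high_performers}")
```

Under these assumptions the script reproduces the page's top score of 84.3%, average of 73.8%, and a single high performer.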