MATH-500
math
text
About
MATH-500 benchmark
Evaluation Stats
Total Models22
Organizations8
Verified Results0
Self-Reported22
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
22 models
Top Score
97.4%
Average Score
91.3%
High Performers (80%+)
20Top Organizations
#1Moonshot AI
2 models
96.8%
#2NVIDIA
3 models
96.3%
#3Anthropic
1 model
96.2%
#4Microsoft
1 model
94.6%
#5DeepSeek
10 models
92.6%
Leaderboard
Top 20 models ranked by performance
97.4%
Raw: 0.974
Self-reported
97.3%
Raw: 0.973
Self-reported
97.0%
Raw: 0.97
Self-reported
96.6%
Raw: 0.966
Self-reported
96.2%
Raw: 0.962
Self-reported
96.2%
Raw: 0.962
Self-reported
95.9%
Raw: 0.959
Self-reported
95.4%
Raw: 0.954
Self-reported
94.6%
Raw: 0.946
Self-reported
94.5%
Raw: 0.945
Self-reported
94.3%
Raw: 0.943
Self-reported
94.0%
Raw: 0.94
Self-reported
93.9%
Raw: 0.939
Self-reported
92.8%
Raw: 0.928
Self-reported
90.6%
Raw: 0.906
Self-reported
17
90.2%
Raw: 0.902
Self-reported
89.1%
Raw: 0.891
Self-reported
83.9%
Raw: 0.839
Self-reported
Showing top 20 of 22 models