MATH-500

math
text
About

MATH-500 benchmark

Evaluation Stats
Total Models22
Organizations8
Verified Results0
Self-Reported22
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

22 models
Top Score
97.4%
Average Score
91.3%
High Performers (80%+)
20

Top Organizations

#1Moonshot AI
2 models
96.8%
#2NVIDIA
3 models
96.3%
#3Anthropic
1 model
96.2%
#4Microsoft
1 model
94.6%
#5DeepSeek
10 models
92.6%
Leaderboard
Top 20 models ranked by performance
97.4%
Raw: 0.974
Self-reported
97.3%
Raw: 0.973
Self-reported
97.0%
Raw: 0.97
Self-reported
96.6%
Raw: 0.966
Self-reported
96.2%
Raw: 0.962
Self-reported
96.2%
Raw: 0.962
Self-reported
95.9%
Raw: 0.959
Self-reported
95.4%
Raw: 0.954
Self-reported
94.6%
Raw: 0.946
Self-reported
94.5%
Raw: 0.945
Self-reported
94.3%
Raw: 0.943
Self-reported
94.0%
Raw: 0.94
Self-reported
93.9%
Raw: 0.939
Self-reported
92.8%
Raw: 0.928
Self-reported
90.6%
Raw: 0.906
Self-reported
90.6%
Raw: 0.906
Self-reported
90.2%
Raw: 0.902
Self-reported
90.0%
Raw: 0.9
Self-reported
89.1%
Raw: 0.891
Self-reported
83.9%
Raw: 0.839
Self-reported
Showing top 20 of 22 models