ARC-AGI v2

reasoning
text
About

ARC-AGI v2 benchmark

Evaluation Stats
Total Models5
Organizations5
Verified Results0
Self-Reported1
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

5 models
Top Score
15.9%
Average Score
7.4%
High Performers (80%+)
0

Top Organizations

#1xAI
1 model
15.9%
#2Anthropic
1 model
8.6%
#3OpenAI
1 model
6.5%
#4Google
1 model
4.9%
#5DeepSeek
1 model
1.3%
Leaderboard
Top 5 models ranked by performance
15.9%
Raw: 0.159
Self-reported
8.6%
Raw: 0.086
6.5%
Raw: 0.065
4.9%
Raw: 0.049
1.3%
Raw: 0.013