ARC-AGI v2
reasoning
text
About
ARC-AGI v2 benchmark
Evaluation Stats
Total Models5
Organizations5
Verified Results0
Self-Reported1
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
5 models
Top Score
15.9%
Average Score
7.4%
High Performers (80%+)
0Top Organizations
#1xAI
1 model
15.9%
#2Anthropic
1 model
8.6%
#3OpenAI
1 model
6.5%
#4Google
1 model
4.9%
#5DeepSeek
1 model
1.3%
Leaderboard
Top 5 models ranked by performance
8.6%
Raw: 0.086
4.9%
Raw: 0.049
1.3%
Raw: 0.013