ARC-AGI v2

reasoning

text

About

ARC-AGI v2 benchmark

Evaluation Stats

Total Models5

Organizations5

Verified Results0

Self-Reported1

Benchmark Details

Max Score1

Language

en

Performance Overview

Score distribution and top performers

Score Distribution

5 models

Top Score

15.9%

Average Score

7.4%

High Performers (80%+)

0

Top Organizations

#1xAI

1 model

15.9%

#2Anthropic

1 model

8.6%

#3OpenAI

1 model

6.5%

#4Google

1 model

4.9%

#5DeepSeek

1 model

1.3%

Leaderboard

Top 5 models ranked by performance

1

by xAI

15.9%

Raw: 0.159

Self-reported

2

8.6%

Raw: 0.086

3

6.5%

Raw: 0.065

4

4.9%

Raw: 0.049

5

1.3%

Raw: 0.013