ARC-AGI

reasoning

text

About

ARC-AGI benchmark

Evaluation Stats

Total Models2

Organizations2

Verified Results0

Self-Reported2

Benchmark Details

Max Score1

Language

en

Performance Overview

Score distribution and top performers

Score Distribution

2 models

Top Score

88.0%

Average Score

64.9%

High Performers (80%+)

1

Top Organizations

#1OpenAI

1 model

88.0%

#2Alibaba

1 model

41.8%

Leaderboard

Top 2 models ranked by performance

1

88.0%

Raw: 0.88

Self-reported

2

Qwen3-235B-A22B-Instruct-2507

41.8%

Raw: 0.418

Self-reported