AI2D
general
text
About
AI2D benchmark
Evaluation Stats
Total Models17
Organizations9
Verified Results0
Self-Reported17
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
17 models
Top Score
94.7%
Average Score
85.6%
High Performers (80%+)
14Top Organizations
#1Anthropic
1 model
94.7%
#2OpenAI
1 model
94.2%
#3Mistral AI
2 models
93.4%
#4Meta
2 models
91.7%
#5xAI
1 model
88.3%
Leaderboard
Top 17 models ranked by performance
94.7%
Raw: 0.947
Self-reported
93.8%
Raw: 0.938
Self-reported
92.9%
Raw: 0.9291
Self-reported
92.3%
Raw: 0.923
Self-reported
91.1%
Raw: 0.911
Self-reported
88.4%
Raw: 0.884
Self-reported
84.5%
Raw: 0.845
Self-reported
10
84.2%
Raw: 0.842
Self-reported
83.2%
Raw: 0.832
Self-reported
82.3%
Raw: 0.823
Self-reported
13
81.4%
Raw: 0.814
Self-reported
80.0%
Raw: 0.8
Self-reported
78.1%
Raw: 0.781
Self-reported
16
74.8%
Raw: 0.748
Self-reported
71.6%
Raw: 0.716
Self-reported