AI2D

Tags: general, text
About

AI2D (AI2 Diagrams) benchmark: multiple-choice question answering over grade-school science diagrams.

Evaluation Stats
Total Models: 17
Organizations: 9
Verified Results: 0
Self-Reported: 17
Benchmark Details
Max Score: 1
Language: en
Performance Overview
Score distribution and top performers

Score Distribution

17 models
Top Score: 94.7%
Average Score: 85.6%
High Performers (80%+): 14

Top Organizations

#1 Anthropic: 1 model, 94.7%
#2 OpenAI: 1 model, 94.2%
#3 Mistral AI: 2 models, 93.4%
#4 Meta: 2 models, 91.7%
#5 xAI: 1 model, 88.3%
Leaderboard
Top 17 models ranked by performance
1. 94.7% (Raw: 0.947, Self-reported)
2. 94.2% (Raw: 0.942, Self-reported)
3. 93.8% (Raw: 0.938, Self-reported)
4. 92.9% (Raw: 0.9291, Self-reported)
5. 92.3% (Raw: 0.923, Self-reported)
6. 91.1% (Raw: 0.911, Self-reported)
7. 88.4% (Raw: 0.884, Self-reported)
8. 88.3% (Raw: 0.883, Self-reported)
9. 84.5% (Raw: 0.845, Self-reported)
10. 84.2% (Raw: 0.842, Self-reported)
11. 83.2% (Raw: 0.832, Self-reported)
12. 82.3% (Raw: 0.823, Self-reported)
13. 81.4% (Raw: 0.814, Self-reported)
14. 80.0% (Raw: 0.8, Self-reported)
15. 78.1% (Raw: 0.781, Self-reported)
16. 74.8% (Raw: 0.748, Self-reported)
17. 71.6% (Raw: 0.716, Self-reported)
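
The Score Distribution figures can be cross-checked against the raw leaderboard values. The sketch below is illustrative only. It assumes that the average is a plain arithmetic mean of the raw scores and that the 80%+ threshold is inclusive, neither of which the page states. The names `RAW_SCORES` and `summarize` are hypothetical.

```python
from statistics import mean

# Raw scores copied from the leaderboard above (scale 0 to 1, Max Score = 1).
RAW_SCORES = [
    0.947, 0.942, 0.938, 0.9291, 0.923, 0.911, 0.884, 0.883, 0.845,
    0.842, 0.832, 0.823, 0.814, 0.8, 0.781, 0.748, 0.716,
]

# Assumption: "High Performers (80%+)" counts scores >= 0.80 (inclusive).
HIGH_PERFORMER_THRESHOLD = 0.80


def summarize(scores, threshold=HIGH_PERFORMER_THRESHOLD):
    """Recompute the summary statistics from a list of raw scores."""
    return {
        "models": len(scores),
        # Percentages are rounded to one decimal, as displayed on the page.
        "top_pct": round(max(scores) * 100, 1),
        # Assumption: the page average is an unweighted arithmetic mean.
        "average_pct": round(mean(scores) * 100, 1),
        "high_performers": sum(1 for s in scores if s >= threshold),
    }


if __name__ == "__main__":
    print(summarize(RAW_SCORES))
```

Under these assumptions the recomputed values match the page: 17 models, a top score of 94.7%, an average of 85.6%, and 14 high performers.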