OpenBookQA

general
text
About

OpenBookQA benchmark

Evaluation Stats
Total Models4
Organizations2
Verified Results0
Self-Reported4
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

4 models
Top Score
89.6%
Average Score
77.2%
High Performers (80%+)
1

Top Organizations

#1Microsoft
3 models
82.7%
#2Mistral AI
1 model
60.6%
Leaderboard
Top 4 models ranked by performance
89.6%
Raw: 0.896
Self-reported
79.2%
Raw: 0.792
Self-reported
79.2%
Raw: 0.792
Self-reported
60.6%
Raw: 0.606
Self-reported