Natural Questions

general
text
About

Natural Questions benchmark

Evaluation Stats
Total Models7
Organizations2
Verified Results0
Self-Reported7
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

7 models
Top Score
34.5%
Average Score
24.0%
High Performers (80%+)
0

Top Organizations

#1Mistral AI
1 model
31.2%
#2Google
6 models
22.8%
Leaderboard
Top 7 models ranked by performance
34.5%
Raw: 0.345
Self-reported
31.2%
Raw: 0.312
Self-reported
29.2%
Raw: 0.292
Self-reported
20.9%
Raw: 0.209
Self-reported
20.9%
Raw: 0.209
Self-reported
15.5%
Raw: 0.155
Self-reported
15.5%
Raw: 0.155
Self-reported