Natural Questions

general

text

About

Natural Questions benchmark

Evaluation Stats

Total Models7

Organizations2

Verified Results0

Self-Reported7

Benchmark Details

Max Score1

Language

en

Performance Overview

Score distribution and top performers

Score Distribution

7 models

Top Score

34.5%

Average Score

24.0%

High Performers (80%+)

0

Top Organizations

#1Mistral AI

1 model

31.2%

#2Google

6 models

22.8%

Leaderboard

Top 7 models ranked by performance

1

34.5%

Raw: 0.345

Self-reported

2

Mistral NeMo Instruct

31.2%

Raw: 0.312

Self-reported

3

29.2%

Raw: 0.292

Self-reported

4

20.9%

Raw: 0.209

Self-reported

5

Gemma 3n E4B Instructed LiteRT Preview

20.9%

Raw: 0.209

Self-reported

6

Gemma 3n E2B Instructed LiteRT (Preview)

15.5%

Raw: 0.155

Self-reported

7

15.5%

Raw: 0.155

Self-reported