TriviaQA

general
text
About

TriviaQA is an English question-answering benchmark built from trivia questions paired with evidence documents.

Evaluation Stats
Total Models: 13
Organizations: 4
Verified Results: 0
Self-Reported: 13
Benchmark Details
Max Score: 1
Language: en
Performance Overview
Score distribution and top performers

Score Distribution

13 models
Top Score: 85.1%
Average Score: 74.3%
High Performers (80%+): 5

Top Organizations

#1 Moonshot AI: 1 model, 85.1%
#2 IBM: 1 model, 78.2%
#3 Mistral AI: 5 models, 76.1%
#4 Google: 6 models, 70.4%
Leaderboard
Top 13 models ranked by performance
Rank  Score   Raw      Source
1     85.1%   0.851    Self-reported
2     83.7%   0.837    Self-reported
3     80.5%   0.805    Self-reported
4     80.5%   0.805    Self-reported
5     80.3%   0.8032   Self-reported
6     78.2%   0.7818   Self-reported
7     76.6%   0.766    Self-reported
8     73.8%   0.738    Self-reported
9     70.2%   0.702    Self-reported
10    70.2%   0.702    Self-reported
11    65.5%   0.655    Self-reported
12    60.8%   0.608    Self-reported
13    60.8%   0.608    Self-reported
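
The summary figures in the Score Distribution section (top score, average score, and the count of models at 80% or above) can be recomputed from the raw leaderboard scores. The following is a minimal sketch of that arithmetic, not the site's actual method. The function name `summarize` and the `threshold` parameter are hypothetical, and the list simply copies the 13 raw values shown in the leaderboard.

```python
from statistics import mean

# Raw scores copied from the leaderboard, on the 0-to-1 scale.
raw_scores = [
    0.851, 0.837, 0.805, 0.805, 0.8032, 0.7818, 0.766,
    0.738, 0.702, 0.702, 0.655, 0.608, 0.608,
]


def summarize(scores, threshold=0.80):
    """Return summary statistics for a list of 0-to-1 scores.

    `threshold` is an assumed cutoff for "high performers" (80%+).
    Percentages are rounded to one decimal place, as on the page.
    """
    return {
        "count": len(scores),
        "top_pct": round(max(scores) * 100, 1),
        "average_pct": round(mean(scores) * 100, 1),
        "high_performers": sum(1 for s in scores if s >= threshold),
    }


summary = summarize(raw_scores)
print(summary)
```

Under these assumptions the recomputed values agree with the figures reported on the page: 13 models, a top score of 85.1%, an average of 74.3%, and 5 models at or above 80%.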