BIG-Bench Extra Hard

general
text
About

BIG-Bench Extra Hard benchmark

Evaluation Stats
Total Models5
Organizations1
Verified Results0
Self-Reported5
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

5 models
Top Score
19.3%
Average Score
13.8%
High Performers (80%+)
0

Top Organizations

#1Google
5 models
13.8%
Leaderboard
Top 5 models ranked by performance
19.3%
Raw: 0.193
Self-reported
16.3%
Raw: 0.163
Self-reported
15.0%
Raw: 0.15
Self-reported
11.0%
Raw: 0.11
Self-reported
7.2%
Raw: 0.072
Self-reported