Wild Bench

general
text
About

Wild Bench benchmark

Evaluation Stats
Total Models4
Organizations2
Verified Results0
Self-Reported4
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

4 models
Top Score
65.3%
Average Score
52.1%
High Performers (80%+)
0

Top Organizations

#1Mistral AI
2 models
58.8%
#2AI21 Labs
2 models
45.5%
Leaderboard
Top 4 models ranked by performance
65.3%
Raw: 0.6533
Self-reported
52.2%
Raw: 0.522
Self-reported
48.5%
Raw: 0.485
Self-reported
42.4%
Raw: 0.424
Self-reported