Wild Bench
general
text
About
Wild Bench benchmark
Evaluation Stats
Total Models
4
Organizations
2
Verified Results
0
Self-Reported
4
Benchmark Details
Max Score
1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
4 models
Top Score
65.3%
Average Score
52.1%
High Performers (80%+)
0
Top Organizations
#1 Mistral AI
2 models
58.8%
#2 AI21 Labs
2 models
45.5%
Leaderboard
Top 4 models ranked by performance
#1 65.3% (Raw: 0.6533, Self-reported)
#2 52.2% (Raw: 0.522, Self-reported)
#3 48.5% (Raw: 0.485, Self-reported)
#4 42.4% (Raw: 0.424, Self-reported)
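
The summary figures under Performance Overview appear to follow directly from the four raw leaderboard scores. The sketch below is not the site's own code. It assumes the average is a plain arithmetic mean, that "High Performers (80%+)" counts raw scores of at least 0.80, and that percentages are rounded to one decimal place. The function name `summarize` and its parameter are hypothetical.

```python
# Raw leaderboard scores as listed above (Max Score is 1).
raw_scores = [0.6533, 0.522, 0.485, 0.424]


def summarize(scores, high_threshold=0.80):
    """Return top score, mean score (both as percentages) and the
    number of scores at or above the high-performer threshold.

    Assumption: the page uses an unweighted mean and rounds to one decimal.
    """
    top = max(scores)
    mean = sum(scores) / len(scores)
    high = sum(1 for s in scores if s >= high_threshold)
    return {
        "top_pct": round(top * 100, 1),
        "avg_pct": round(mean * 100, 1),
        "high_performers": high,
    }


print(summarize(raw_scores))
```

Under these assumptions the result matches the figures reported on the page: a top score of 65.3%, an average of 52.1%, and no models at or above 80%.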