Include

general
text
About

Include benchmark

Evaluation Stats
Total Models6
Organizations2
Verified Results0
Self-Reported6
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

6 models
Top Score
79.5%
Average Score
57.4%
High Performers (80%+)
0

Top Organizations

#1Alibaba
2 models
76.5%
#2Google
4 models
47.9%
Leaderboard
Top 6 models ranked by performance
79.5%
Raw: 0.795
Self-reported
73.5%
Raw: 0.7346
Self-reported
57.2%
Raw: 0.572
Self-reported
57.2%
Raw: 0.572
Self-reported
38.6%
Raw: 0.386
Self-reported
38.6%
Raw: 0.386
Self-reported