Social IQa
general
text
About
Social IQa benchmark
Evaluation Stats
Total Models9
Organizations2
Verified Results0
Self-Reported9
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
9 models
Top Score
78.0%
Average Score
58.9%
High Performers (80%+)
0Top Organizations
#1Microsoft
3 models
75.1%
#2Google
6 models
50.8%
Leaderboard
Top 9 models ranked by performance
78.0%
Raw: 0.78
Self-reported
74.7%
Raw: 0.747
Self-reported
72.5%
Raw: 0.725
Self-reported
53.7%
Raw: 0.537
Self-reported
5
53.4%
Raw: 0.534
Self-reported
50.0%
Raw: 0.5
Self-reported
50.0%
Raw: 0.5
Self-reported
48.8%
Raw: 0.488
Self-reported
48.8%
Raw: 0.488
Self-reported