PIQA
general
text
About
PIQA benchmark
Evaluation Stats
Total Models9
Organizations2
Verified Results0
Self-Reported9
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
9 models
Top Score
88.6%
Average Score
81.3%
High Performers (80%+)
6Top Organizations
#1Microsoft
3 models
82.4%
#2Google
6 models
80.8%
Leaderboard
Top 9 models ranked by performance
88.6%
Raw: 0.886
Self-reported
83.2%
Raw: 0.832
Self-reported
3
81.7%
Raw: 0.817
Self-reported
81.0%
Raw: 0.81
Self-reported
81.0%
Raw: 0.81
Self-reported
81.0%
Raw: 0.81
Self-reported
78.9%
Raw: 0.789
Self-reported
78.9%
Raw: 0.789
Self-reported
77.6%
Raw: 0.776
Self-reported