LiveCodeBench v6
code
text
About
LiveCodeBench v6 benchmark
Evaluation Stats
Total Models
3
Organizations
2
Verified Results
0
Self-Reported
3
Benchmark Details
Max Score
1
Language
en
Performance Overview
Score distribution and top performers
Score Distribution
3 models
Top Score
53.7%
Average Score
43.9%
High Performers (80%+)
0
Top Organizations
#1 Alibaba
1 model
51.8%
#2 Moonshot AI
2 models
40.0%
Leaderboard
Top 3 models ranked by performance
53.7%
Raw: 0.537
Self-reported
51.8%
Raw: 0.518
Self-reported
26.3%
Raw: 0.263
Self-reported
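The summary figures above appear to follow directly from the three raw scores. The sketch below recomputes them, assuming the average is a plain arithmetic mean, that percentages are the raw score divided by the max score of 1 and rounded to one decimal place, and that "High Performers" counts scores at or above 80%. The names `raw_scores`, `to_percent`, and `HIGH_PERFORMER_THRESHOLD` are illustrative and do not come from the page.

```python
# Raw scores as listed in the leaderboard (scale: 0 to MAX_SCORE).
raw_scores = [0.537, 0.518, 0.263]

MAX_SCORE = 1.0                  # "Max Score" from the benchmark details
HIGH_PERFORMER_THRESHOLD = 0.80  # assumed meaning of "High Performers (80%+)"


def to_percent(raw: float, max_score: float = MAX_SCORE) -> float:
    """Convert a raw score to a percentage rounded to one decimal place."""
    return round(raw / max_score * 100, 1)


top_score = max(raw_scores)
average_score = sum(raw_scores) / len(raw_scores)  # assumed: arithmetic mean
high_performers = [
    s for s in raw_scores if s / MAX_SCORE >= HIGH_PERFORMER_THRESHOLD
]

print(f"Top score:       {to_percent(top_score)}%")
print(f"Average score:   {to_percent(average_score)}%")
print(f"High performers: {len(high_performers)}")
```

Under these assumptions the recomputed values agree with the reported top score, average score, and high-performer count.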