CharXiv-R

general
text
About

CharXiv-R benchmark

Evaluation Stats
Total Models8
Organizations1
Verified Results0
Self-Reported8
Benchmark Details
Max Score1
Language
en
Performance Overview
Score distribution and top performers

Score Distribution

8 models
Top Score
81.1%
Average Score
62.5%
High Performers (80%+)
1

Top Organizations

#1OpenAI
8 models
62.5%
Leaderboard
Top 8 models ranked by performance
81.1%
Raw: 0.811
Self-reported
78.6%
Raw: 0.786
Self-reported
72.0%
Raw: 0.72
Self-reported
58.8%
Raw: 0.588
Self-reported
56.8%
Raw: 0.568
Self-reported
56.7%
Raw: 0.567
Self-reported
55.4%
Raw: 0.554
Self-reported
40.5%
Raw: 0.405
Self-reported