BrowseComp-zh
agents
text
About
Chinese version of BrowseComp benchmark for evaluating web browsing and comprehension capabilities
Evaluation Stats
Total Models2
Organizations1
Verified Results0
Self-Reported2
Benchmark Details
Max Score1
Language
zh
Performance Overview
Score distribution and top performers
Score Distribution
2 models
Top Score
49.2%
Average Score
42.4%
High Performers (80%+)
0Top Organizations
#1DeepSeek
2 models
42.4%
Leaderboard
2 models ranked by performance on BrowseComp-zh
License | Links | ||||
---|---|---|---|---|---|
Jan 10, 2025 | MIT | 49.2% | |||
May 28, 2025 | MIT | 35.7% |