BrowseComp Long Context 256k

long_context

text

About

BrowseComp long-context (256k) variant

Evaluation Stats

Total Models1

Organizations1

Verified Results0

Self-Reported1

Benchmark Details

Max Score1

Language

en

Performance Overview

Score distribution and top performers

Score Distribution

1 models

Top Score

88.8%

Average Score

88.8%

High Performers (80%+)

1

Top Organizations

#1OpenAI

1 model

88.8%

Leaderboard

Top 1 models ranked by performance

1

88.8%

Raw: 0.888

Self-reported