MM IF-Eval

code

text

About

MM IF-Eval benchmark

Evaluation Stats

Total Models1

Organizations1

Verified Results0

Self-Reported1

Benchmark Details

Max Score1

Language

en

Performance Overview

Score distribution and top performers

Score Distribution

1 models

Top Score

52.7%

Average Score

52.7%

High Performers (80%+)

0

Top Organizations

#1Mistral AI

1 model

52.7%

Leaderboard

Top 1 models ranked by performance

1

52.7%

Raw: 0.527

Self-reported