Mistral AI

Mistral Small 3.2 24B Instruct

Multimodal
Zero-eval
#1HumanEval Plus
#1IF
#1MBPP Plus
+1 more

by Mistral AI

About

Mistral Small 3.2 24B Instruct is a multimodal language model developed by Mistral AI. It achieves strong performance with an average score of 69.8% across 15 benchmarks. It excels particularly in DocVQA (94.9%), AI2D (92.9%), HumanEval Plus (92.9%). The model shows particular specialization in code tasks with an average performance of 85.6%. As a multimodal model, it can process and understand text, images, and other input formats seamlessly. It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents Mistral AI's latest advancement in AI technology.

Timeline
AnnouncedJun 20, 2025
ReleasedJun 20, 2025
Knowledge CutoffOct 1, 2023
Specifications
Capabilities
Multimodal
License & Family
License
Apache 2.0
Base ModelMistral Small 3.1 24B Base
Benchmark Performance Overview
Performance metrics and category breakdown

Overall Performance

15 benchmarks
Average Score
69.8%
Best Score
94.9%
High Performers (80%+)
6

Top Categories

code
85.6%
vision
78.7%
math
68.3%
general
64.6%
Benchmark Performance
Top benchmark scores with normalized values (0-100%)
Ranking Across Benchmarks
Position relative to other models on each benchmark

DocVQA

Rank #5 of 26
#2Claude 3.5 Sonnet
95.2%
#3Qwen2.5-Omni-7B
95.2%
#4Qwen2.5 VL 7B Instruct
95.7%
#5Mistral Small 3.2 24B Instruct
94.9%
#6Qwen2.5 VL 32B Instruct
94.8%
#7Llama 4 Maverick
94.4%
#8Llama 4 Scout
94.4%

AI2D

Rank #4 of 17
#1Pixtral Large
93.8%
#2GPT-4o
94.2%
#3Claude 3.5 Sonnet
94.7%
#4Mistral Small 3.2 24B Instruct
92.9%
#5Llama 3.2 90B Instruct
92.3%
#6Llama 3.2 11B Instruct
91.1%
#7Qwen2.5 VL 72B Instruct
88.4%

HumanEval Plus

Rank #1 of 1
#1Mistral Small 3.2 24B Instruct
92.9%

ChartQA

Rank #8 of 24
#5Pixtral Large
88.1%
#6Qwen2-VL-72B-Instruct
88.3%
#7Llama 4 Scout
88.8%
#8Mistral Small 3.2 24B Instruct
87.4%
#9Qwen2.5 VL 7B Instruct
87.3%
#10Nova Lite
86.8%
#11DeepSeek VL2
86.0%

IF

Rank #1 of 1
#1Mistral Small 3.2 24B Instruct
84.8%
All Benchmark Results for Mistral Small 3.2 24B Instruct
Complete list of benchmark scores with detailed information
DocVQA
DocVQA benchmark
vision
multimodal
0.95
94.9%
Self-reported
AI2D
AI2D benchmark
general
text
0.93
92.9%
Self-reported
HumanEval Plus
HumanEval Plus benchmark
code
text
0.93
92.9%
Self-reported
ChartQA
ChartQA benchmark
general
multimodal
0.87
87.4%
Self-reported
IF
IF benchmark
general
text
0.85
84.8%
Self-reported
MMLU
MMLU benchmark
general
text
0.81
80.5%
Self-reported
MBPP Plus
MBPP Plus benchmark
code
text
0.78
78.3%
Self-reported
MATH
MATH benchmark
math
text
0.69
69.4%
Self-reported
MMLU-Pro
MMLU-Pro benchmark
general
text
0.69
69.1%
Self-reported
MathVista
MathVista benchmark
math
text
0.67
67.1%
Self-reported
Showing 1 to 10 of 15 benchmarks