DeepSeek VL2 Small

Multimodal
Zero-eval
#2 MMBench-V1.1
#2 MME
#3 MMT-Bench

by DeepSeek

About

DeepSeek VL2 Small is a multimodal language model developed by DeepSeek and released in December 2024. It averages 69.6% across 14 benchmarks, with its strongest results on DocVQA (92.3%), ChartQA (84.5%), and TextVQA (83.4%). As a multimodal model, it accepts both text and images as input.

Timeline
Announced: Dec 13, 2024
Released: Dec 13, 2024
Specifications
Capabilities: Multimodal
License & Family
License: deepseek
Benchmark Performance Overview
Performance metrics and category breakdown

Overall Performance

Benchmarks: 14
Average Score: 69.6%
Best Score: 92.3%
High Performers (80%+): 6

Top Categories

vision: 74.9%
general: 68.9%
roleplay: 62.9%
math: 60.7%
Benchmark Performance
Top benchmark scores with normalized values (0-100%)
Ranking Across Benchmarks
Position relative to other models on each benchmark

DocVQA

Rank #17 of 26
#14 Nova Lite: 92.4%
#15 GPT-4o: 92.8%
#16 Phi-4-multimodal-instruct: 93.2%
#17 DeepSeek VL2 Small: 92.3%
#18 Pixtral-12B: 90.7%
#19 Llama 3.2 90B Instruct: 90.1%
#20 DeepSeek VL2 Tiny: 88.9%

ChartQA

Rank #15 of 24
#12 Qwen2.5-Omni-7B: 85.3%
#13 Llama 3.2 90B Instruct: 85.5%
#14 GPT-4o: 85.7%
#15 DeepSeek VL2 Small: 84.5%
#16 Llama 3.2 11B Instruct: 83.4%
#17 Pixtral-12B: 81.8%
#18 Phi-3.5-vision-instruct: 81.8%

TextVQA

Rank #5 of 15
#2 DeepSeek VL2: 84.2%
#3 Qwen2.5-Omni-7B: 84.4%
#4 Qwen2.5 VL 7B Instruct: 84.9%
#5 DeepSeek VL2 Small: 83.4%
#6 Nova Pro: 81.5%
#7 DeepSeek VL2 Tiny: 80.7%
#8 Nova Lite: 80.2%

OCRBench

Rank #5 of 7
#2 Phi-4-multimodal-instruct: 84.4%
#3 Qwen2.5 VL 7B Instruct: 86.4%
#4 Qwen2-VL-72B-Instruct: 87.7%
#5 DeepSeek VL2 Small: 83.4%
#6 DeepSeek VL2: 81.1%
#7 DeepSeek VL2 Tiny: 80.9%

MMBench

Rank #5 of 7
#2 Phi-3.5-vision-instruct: 81.9%
#3 Qwen2.5 VL 7B Instruct: 84.3%
#4 Phi-4-multimodal-instruct: 86.7%
#5 DeepSeek VL2 Small: 80.3%
#6 DeepSeek VL2: 79.6%
#7 DeepSeek VL2 Tiny: 69.2%
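
A position such as "Rank #5 of 7" can be read as the model's place when all scores on a benchmark are sorted from highest to lowest. The sketch below illustrates this with the six OCRBench scores listed above. It is only an illustration: the helper `rank_of` and the constant `UNLISTED_ABOVE` are hypothetical names, and the code assumes that the one model not shown on the page scores higher than every listed model.

```python
# OCRBench scores (percent) for the six models shown in the ranking list.
# The seventh model is not shown on the page.
ocrbench = {
    "Phi-4-multimodal-instruct": 84.4,
    "Qwen2.5 VL 7B Instruct": 86.4,
    "Qwen2-VL-72B-Instruct": 87.7,
    "DeepSeek VL2 Small": 83.4,
    "DeepSeek VL2": 81.1,
    "DeepSeek VL2 Tiny": 80.9,
}

# Assumption: exactly one unlisted model scores above all listed models.
UNLISTED_ABOVE = 1


def rank_of(model: str, scores: dict[str, float], offset: int = 0) -> int:
    """Return the 1-based rank of `model` when scores are sorted descending."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return ordered.index(model) + 1 + offset


rank = rank_of("DeepSeek VL2 Small", ocrbench, UNLISTED_ABOVE)
total = len(ocrbench) + UNLISTED_ABOVE
print(f"Rank #{rank} of {total}")
```

Under those assumptions the sorted position of DeepSeek VL2 Small agrees with the rank shown for OCRBench.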
All Benchmark Results for DeepSeek VL2 Small
Complete list of benchmark scores with detailed information
Benchmark      Category   Modality     Score (0-1)   Score (%)   Source
DocVQA         vision     multimodal   0.92          92.3%       Self-reported
ChartQA        general    multimodal   0.84          84.5%       Self-reported
TextVQA        vision     multimodal   0.83          83.4%       Self-reported
OCRBench       general    text         0.83          83.4%       Self-reported
MMBench        general    text         0.80          80.3%       Self-reported
AI2D           general    text         0.80          80.0%       Self-reported
MMBench-V1.1   general    text         0.79          79.3%       Self-reported
InfoVQA        vision     multimodal   0.76          75.8%       Self-reported
RealWorldQA    general    text         0.65          65.4%       Self-reported
MMT-Bench      roleplay   text         0.63          62.9%       Self-reported
Showing 1 to 10 of 14 benchmarks
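
The summary figures reported earlier (best score, number of benchmarks at 80% or higher) can be checked against the ten results listed here. The following is a minimal sketch, not the site's actual method: the names `listed_scores`, `normalize`, and `HIGH_PERFORMER_THRESHOLD` are illustrative, and the four benchmarks not shown on this page are necessarily left out.

```python
# Scores (percent) for the ten benchmarks listed on this page.
# The other four of the 14 benchmarks are not shown, so they are absent here.
listed_scores = {
    "DocVQA": 92.3,
    "ChartQA": 84.5,
    "TextVQA": 83.4,
    "OCRBench": 83.4,
    "MMBench": 80.3,
    "AI2D": 80.0,
    "MMBench-V1.1": 79.3,
    "InfoVQA": 75.8,
    "RealWorldQA": 65.4,
    "MMT-Bench": 62.9,
}

# Cutoff taken from the "High Performers (80%+)" label.
HIGH_PERFORMER_THRESHOLD = 80.0


def normalize(percent: float) -> float:
    """Convert a percentage score to the 0-1 scale."""
    return percent / 100.0


high_performers = [
    name for name, score in listed_scores.items()
    if score >= HIGH_PERFORMER_THRESHOLD
]
best_name, best_score = max(listed_scores.items(), key=lambda item: item[1])
listed_mean = sum(listed_scores.values()) / len(listed_scores)

print(f"High performers: {len(high_performers)}")
print(f"Best: {best_name} at {best_score}% ({normalize(best_score):.3f})")
print(f"Mean of listed benchmarks: {listed_mean:.1f}%")
```

The mean of the ten listed scores comes out higher than the 69.6% average reported across all 14 benchmarks, which implies that the four unlisted benchmarks score lower on average than the listed ones.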