DeepSeek VL2 Small

Name: DeepSeek VL2 Small
Rating: 69.6 (14 reviews)
Author: DeepSeek

Multimodal

Zero-eval

#2MMBench-V1.1

#2MME

#3MMT-Bench

by DeepSeek

About

DeepSeek VL2 Small is a multimodal language model developed by DeepSeek. It achieves strong performance with an average score of 69.6% across 14 benchmarks. It excels particularly in DocVQA (92.3%), ChartQA (84.5%), TextVQA (83.4%). As a multimodal model, it can process and understand text, images, and other input formats seamlessly. Released in 2024, it represents DeepSeek's latest advancement in AI technology.

Timeline

AnnouncedDec 13, 2024

ReleasedDec 13, 2024

Specifications

Capabilities

Multimodal

License & Family

License

deepseek

Benchmark Performance Overview

Performance metrics and category breakdown

Overall Performance

14 benchmarks

Average Score

69.6%

Best Score

92.3%

High Performers (80%+)

Top Categories

vision

74.9%

general

68.9%

roleplay

62.9%

math

60.7%

Benchmark Performance

Top benchmark scores with normalized values (0-100%)

Ranking Across Benchmarks

Position relative to other models on each benchmark

DocVQA

Rank #17 of 26

#14Nova Lite

92.4%

#15GPT-4o

92.8%

#16Phi-4-multimodal-instruct

93.2%

#17DeepSeek VL2 Small

92.3%

#18Pixtral-12B

90.7%

#19Llama 3.2 90B Instruct

90.1%

#20DeepSeek VL2 Tiny

88.9%

ChartQA

Rank #15 of 24

#12Qwen2.5-Omni-7B

85.3%

#13Llama 3.2 90B Instruct

85.5%

#14GPT-4o

85.7%

#15DeepSeek VL2 Small

84.5%

#16Llama 3.2 11B Instruct

83.4%

#17Pixtral-12B

81.8%

#18Phi-3.5-vision-instruct

81.8%

TextVQA

Rank #5 of 15

#2DeepSeek VL2

84.2%

#3Qwen2.5-Omni-7B

84.4%

#4Qwen2.5 VL 7B Instruct

84.9%

#5DeepSeek VL2 Small

83.4%

#6Nova Pro

81.5%

#7DeepSeek VL2 Tiny

80.7%

#8Nova Lite

80.2%

OCRBench

Rank #5 of 7

#2Phi-4-multimodal-instruct

84.4%

#3Qwen2.5 VL 7B Instruct

86.4%

#4Qwen2-VL-72B-Instruct

87.7%

#5DeepSeek VL2 Small

83.4%

#6DeepSeek VL2

81.1%

#7DeepSeek VL2 Tiny

80.9%

MMBench

Rank #5 of 7

#2Phi-3.5-vision-instruct

81.9%

#3Qwen2.5 VL 7B Instruct

84.3%

#4Phi-4-multimodal-instruct

86.7%

#5DeepSeek VL2 Small

80.3%

#6DeepSeek VL2

79.6%

#7DeepSeek VL2 Tiny

69.2%

All Benchmark Results for DeepSeek VL2 Small

Complete list of benchmark scores with detailed information


DocVQA DocVQA benchmark	vision	multimodal	0.92	92.3%	Self-reported
ChartQA ChartQA benchmark	general	multimodal	0.84	84.5%	Self-reported
TextVQA TextVQA benchmark	vision	multimodal	0.83	83.4%	Self-reported
OCRBench OCRBench benchmark	general	text	0.83	83.4%	Self-reported
MMBench MMBench benchmark	general	text	0.80	80.3%	Self-reported
AI2D AI2D benchmark	general	text	0.80	80.0%	Self-reported
MMBench-V1.1 MMBench-V1.1 benchmark	general	text	0.79	79.3%	Self-reported
InfoVQA InfoVQA benchmark	vision	multimodal	0.76	75.8%	Self-reported
RealWorldQA RealWorldQA benchmark	general	text	0.65	65.4%	Self-reported
MMT-Bench MMT-Bench benchmark	roleplay	text	0.63	62.9%	Self-reported

Showing 1 to 10 of 14 benchmarks

Resources

API Reference Playground Research Paper Repository Model Weights