DeepSeek-V3 0324

by DeepSeek

About

DeepSeek-V3 0324 is a language model developed by DeepSeek and released on March 25, 2025. It averages 70.4% across the 5 benchmarks listed here, with its strongest scores on MATH-500 (94.0%), MMLU-Pro (81.2%), and GPQA (68.4%). All scores are self-reported.

Timeline
Announced: Mar 25, 2025
Released: Mar 25, 2025

Specifications
Training Tokens: 14.8T

License & Family
License: MIT + Model License (commercial use allowed)

Benchmark Performance Overview
Performance metrics and category breakdown

Overall Performance

Benchmarks: 5
Average Score: 70.4%
Best Score: 94.0%
High Performers (80%+): 2

Top Categories

math: 94.0%
general: 69.7%
code: 49.2%
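
The summary figures above can be reproduced from the five self-reported scores. The following is a minimal Python sketch, assuming the average is an unweighted mean and each category score is the unweighted mean of the benchmarks in that category. The page does not state its aggregation method, and `summarize` and its `threshold` parameter are illustrative names, not part of any published tool.

```python
from statistics import mean

# Scores and category labels copied from this page's results list.
scores = {
    "MATH-500": ("math", 94.0),
    "MMLU-Pro": ("general", 81.2),
    "GPQA": ("general", 68.4),
    "AIME 2024": ("general", 59.4),
    "LiveCodeBench": ("code", 49.2),
}


def summarize(results, threshold=80.0):
    """Assumed aggregation: unweighted means, rounded to one decimal."""
    values = [score for _, score in results.values()]

    # Group scores by category label.
    by_category = {}
    for category, score in results.values():
        by_category.setdefault(category, []).append(score)

    return {
        "average": round(mean(values), 1),
        "best": max(values),
        "high_performers": sum(v >= threshold for v in values),
        "categories": {c: round(mean(v), 1) for c, v in by_category.items()},
    }


summary = summarize(scores)
print(summary)
```

Under these assumptions the sketch gives an average of 70.4, a best score of 94.0, two benchmarks at or above 80%, and category means of 94.0 (math), 69.7 (general), and 49.2 (code), matching the figures on the page.
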
Benchmark Performance
[Chart: top benchmark scores, normalized to 0-100%]

Ranking Across Benchmarks
Position relative to other models on each benchmark

MATH-500

Rank #12 of 22
#9 DeepSeek R1 Distill Qwen 32B: 94.3%
#10 DeepSeek R1 Distill Llama 70B: 94.5%
#11 Phi 4 Mini Reasoning: 94.6%
#12 DeepSeek-V3 0324: 94.0%
#13 DeepSeek R1 Distill Qwen 14B: 93.9%
#14 DeepSeek R1 Distill Qwen 7B: 92.8%
#15 QwQ-32B: 90.6%

MMLU-Pro

Rank #4 of 60
#1 Qwen3-235B-A22B-Instruct-2507: 83.0%
#2 DeepSeek-R1: 84.0%
#3 DeepSeek-R1-0528: 85.0%
#4 DeepSeek-V3 0324: 81.2%
#5 Kimi K2 Instruct: 81.1%
#6 Llama 4 Maverick: 80.5%
#7 Claude 3.5 Sonnet: 77.6%

GPQA

Rank #33 of 115
#30 Phi 4 Reasoning Plus: 68.9%
#31 GPT-4.5: 69.5%
#32 Llama 4 Maverick: 69.8%
#33 DeepSeek-V3 0324: 68.4%
#34 Magistral Small 2506: 68.2%
#35 Claude 3.5 Sonnet: 67.2%
#36 Llama-3.3 Nemotron Super 49B v1: 66.7%

AIME 2024

Rank #32 of 41
#29 Kimi K2 Instruct: 69.6%
#30 Magistral Small 2506: 70.7%
#31 Gemini 2.0 Flash Thinking: 73.3%
#32 DeepSeek-V3 0324: 59.4%
#33 DeepSeek R1 Distill Qwen 1.5B: 52.7%
#34 QwQ-32B-Preview: 50.0%
#35 GPT-4.1 mini: 49.6%

LiveCodeBench

Rank #23 of 44
#20 QwQ-32B-Preview: 50.0%
#21 DeepSeek R1 Zero: 50.0%
#22 Magistral Medium: 50.3%
#23 DeepSeek-V3 0324: 49.2%
#24 Llama 4 Maverick: 43.4%
#25 DeepSeek R1 Distill Llama 8B: 39.6%
#26 DeepSeek-V3: 37.6%

All Benchmark Results for DeepSeek-V3 0324
Complete list of benchmark scores with detailed information
MATH-500 (math, text): 94.0% (normalized 0.94), self-reported
MMLU-Pro (general, text): 81.2% (normalized 0.81), self-reported
GPQA (general, text): 68.4% (normalized 0.68), self-reported
AIME 2024 (general, text): 59.4% (normalized 0.59), self-reported
LiveCodeBench (code, text): 49.2% (normalized 0.49), self-reported
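
Each result above appears both as a percentage and as a value between 0 and 1. The following is a short Python sketch of how the two could relate, assuming the 0-1 value is the percentage divided by 100 and rounded to two decimals. That rule is an inference from the five rows, not something the page documents, and `normalize` is a hypothetical helper name.

```python
# Percentages copied from the results list on this page.
percentages = {
    "MATH-500": 94.0,
    "MMLU-Pro": 81.2,
    "GPQA": 68.4,
    "AIME 2024": 59.4,
    "LiveCodeBench": 49.2,
}


def normalize(percent):
    """Map a 0-100 percentage to a 0-1 fraction, rounded to two decimals (assumed rule)."""
    return round(percent / 100, 2)


normalized = {name: normalize(p) for name, p in percentages.items()}
print(normalized)
```

Under this assumption the five results map to 0.94, 0.81, 0.68, 0.59, and 0.49, which agrees with the values listed.
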