DeepSeek-V3 0324

Author: DeepSeek

About

DeepSeek-V3 0324 is a language model developed by DeepSeek. It averages 70.4% across 5 benchmarks, with its strongest results on MATH-500 (94.0%), MMLU-Pro (81.2%), and GPQA (68.4%). It was announced and released on March 25, 2025.

Timeline

Announced: Mar 25, 2025

Released: Mar 25, 2025

Specifications

Training Tokens: 14.8T

License & Family

License

MIT + Model License (Commercial use allowed)

Benchmark Performance Overview

Performance metrics and category breakdown

Overall Performance

5 benchmarks

Average Score

70.4%

Best Score

94.0%

High Performers (80%+)

2 of 5 benchmarks (MATH-500, MMLU-Pro)

Top Categories

math

94.0%

general

69.7%

code

49.2%
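
The overview figures above can be reproduced from the five per-benchmark scores listed on this page. The sketch below assumes an unweighted arithmetic mean, which matches the reported numbers, though the page does not state how they were computed. The function names are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Scores (percent) and categories exactly as listed on this page.
SCORES = {
    "MATH-500": ("math", 94.0),
    "MMLU-Pro": ("general", 81.2),
    "GPQA": ("general", 68.4),
    "AIME 2024": ("general", 59.4),
    "LiveCodeBench": ("code", 49.2),
}


def overall_average(scores):
    """Unweighted mean over all benchmarks (assumed aggregation method)."""
    values = [pct for _, pct in scores.values()]
    return sum(values) / len(values)


def category_averages(scores):
    """Unweighted mean per category (assumed aggregation method)."""
    buckets = defaultdict(list)
    for category, pct in scores.values():
        buckets[category].append(pct)
    return {cat: sum(vals) / len(vals) for cat, vals in buckets.items()}


def high_performers(scores, threshold=80.0):
    """Benchmarks whose score is at or above the threshold."""
    return [name for name, (_, pct) in scores.items() if pct >= threshold]


if __name__ == "__main__":
    print(f"average: {overall_average(SCORES):.1f}%")
    for cat, avg in category_averages(SCORES).items():
        print(f"{cat}: {avg:.1f}%")
    print("80%+:", ", ".join(high_performers(SCORES)))
```

Rounded to one decimal place, the overall mean and the per-category means agree with the Average Score and Top Categories values shown above.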

Benchmark Performance

Top benchmark scores with normalized values (0-100%)

Ranking Across Benchmarks

Position relative to other models on each benchmark

MATH-500

Rank #12 of 22

#9 Phi 4 Mini Reasoning: 94.6%
#10 DeepSeek R1 Distill Llama 70B: 94.5%
#11 DeepSeek R1 Distill Qwen 32B: 94.3%
#12 DeepSeek-V3 0324: 94.0%
#13 DeepSeek R1 Distill Qwen 14B: 93.9%
#14 DeepSeek R1 Distill Qwen 7B: 92.8%
#15 QwQ-32B: 90.6%

MMLU-Pro

Rank #4 of 60

#1 DeepSeek-R1-0528: 85.0%
#2 DeepSeek-R1: 84.0%
#3 Qwen3-235B-A22B-Instruct-2507: 83.0%
#4 DeepSeek-V3 0324: 81.2%
#5 Kimi K2 Instruct: 81.1%
#6 Llama 4 Maverick: 80.5%
#7 Claude 3.5 Sonnet: 77.6%

GPQA

Rank #33 of 115

#30 Llama 4 Maverick: 69.8%
#31 GPT-4.5: 69.5%
#32 Phi 4 Reasoning Plus: 68.9%
#33 DeepSeek-V3 0324: 68.4%
#34 Magistral Small 2506: 68.2%
#35 Claude 3.5 Sonnet: 67.2%
#36 Llama-3.3 Nemotron Super 49B v1: 66.7%

AIME 2024

Rank #32 of 41

#29 Gemini 2.0 Flash Thinking: 73.3%
#30 Magistral Small 2506: 70.7%
#31 Kimi K2 Instruct: 69.6%
#32 DeepSeek-V3 0324: 59.4%
#33 DeepSeek R1 Distill Qwen 1.5B: 52.7%
#34 QwQ-32B-Preview: 50.0%
#35 GPT-4.1 mini: 49.6%

LiveCodeBench

Rank #23 of 44

#20 Magistral Medium: 50.3%
#21 DeepSeek R1 Zero: 50.0%
#22 QwQ-32B-Preview: 50.0%
#23 DeepSeek-V3 0324: 49.2%
#24 Llama 4 Maverick: 43.4%
#25 DeepSeek R1 Distill Llama 8B: 39.6%
#26 DeepSeek-V3: 37.6%
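
A position of this kind can be derived by sorting models by score, highest first, and numbering the result. The sketch below uses the seven MMLU-Pro entries listed on this page. It assumes that rank follows descending score and that tied scores keep their input order; the page does not state its tie-breaking rule. The function name `assign_ranks` is hypothetical.

```python
# (model, score in percent) pairs for the MMLU-Pro entries listed on this page.
MMLU_PRO_WINDOW = [
    ("Qwen3-235B-A22B-Instruct-2507", 83.0),
    ("DeepSeek-R1", 84.0),
    ("DeepSeek-R1-0528", 85.0),
    ("DeepSeek-V3 0324", 81.2),
    ("Kimi K2 Instruct", 81.1),
    ("Llama 4 Maverick", 80.5),
    ("Claude 3.5 Sonnet", 77.6),
]


def assign_ranks(entries, first_rank=1):
    """Number entries from first_rank in order of descending score.

    Python's sort is stable, so tied scores keep their input order
    (an assumed tie-breaking rule, not one stated by the page).
    """
    ordered = sorted(entries, key=lambda entry: entry[1], reverse=True)
    return [
        (first_rank + offset, model, score)
        for offset, (model, score) in enumerate(ordered)
    ]


if __name__ == "__main__":
    for rank, model, score in assign_ranks(MMLU_PRO_WINDOW):
        print(f"#{rank} {model}: {score:.1f}%")
```

For a window that does not start at the top of a leaderboard, such as the MATH-500 entries, `first_rank` can be set to the rank of the window's first entry.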

All Benchmark Results for DeepSeek-V3 0324

Complete list of benchmark scores with detailed information


Benchmark	Category	Modality	Score	Normalized	Source
MATH-500	math	text	0.94	94.0%	Self-reported
MMLU-Pro	general	text	0.81	81.2%	Self-reported
GPQA	general	text	0.68	68.4%	Self-reported
AIME 2024	general	text	0.59	59.4%	Self-reported
LiveCodeBench	code	text	0.49	49.2%	Self-reported
