🚀 Website under development • Launching soon

Moonshot AI

Kimi K2 0905

Zero-eval
#1HumanEval

by Moonshot AI

About

Kimi K2 0905 is a language model developed by Moonshot AI. This model demonstrates exceptional performance with an average score of 84.0% across 6 benchmarks. It excels particularly in HumanEval (94.5%), MMLU (90.2%), MATH (89.1%). It supports a 524K token context window for handling large documents. The model is available through 1 API provider. Released in 2025, it represents Moonshot AI's latest advancement in AI technology.

Pricing Range
Input (per 1M)$0.60 -$0.60
Output (per 1M)$2.50 -$2.50
Providers1
Timeline
AnnouncedSep 5, 2025
ReleasedSep 5, 2025
Specifications
License & Family
License
Proprietary
Base ModelKimi K2 Instruct
Benchmark Performance Overview
Performance metrics and category breakdown

Overall Performance

6 benchmarks
Average Score
84.0%
Best Score
94.5%
High Performers (80%+)
4

Performance Metrics

Max Context Window
524.3K

Top Categories

code
94.5%
math
89.1%
general
80.1%
Additional Information

Content coming soon...

All Benchmark Results for Kimi K2 0905
Complete list of benchmark scores with detailed information
HumanEval
code
text
0.94
94.5%
Self-reported
MMLU
general
text
0.90
90.2%
Self-reported
MATH
math
text
0.89
89.1%
Self-reported
MMLU-Pro
general
text
0.82
82.5%
Self-reported
GPQA
general
text
0.76
75.8%
Self-reported
AIME 2024
general
text
0.72
72.0%
Self-reported