Google

Gemini 2.5 Flash

Multimodal
Zero-eval
#2FACTS Grounding
#2LiveCodeBench v5
#3Global-MMLU-Lite
+1 more

by Google

About

Gemini 2.5 Flash is a multimodal language model developed by Google. It achieves strong performance with an average score of 62.5% across 14 benchmarks. It excels particularly in Global-MMLU-Lite (88.4%), AIME 2024 (88.0%), FACTS Grounding (85.3%). With a 1.1M token context window, it can handle extensive documents and complex multi-turn conversations. The model is available through 2 API providers. As a multimodal model, it can process and understand text, images, and other input formats seamlessly. Released in 2025, it represents Google's latest advancement in AI technology.

Pricing Range
Input (per 1M)$0.30 -$0.30
Output (per 1M)$2.50 -$2.50
Providers2
Timeline
AnnouncedMay 20, 2025
ReleasedMay 20, 2025
Knowledge CutoffJan 31, 2025
Specifications
Capabilities
Multimodal
License & Family
License
Proprietary
Benchmark Performance Overview
Performance metrics and category breakdown

Overall Performance

14 benchmarks
Average Score
62.5%
Best Score
88.4%
High Performers (80%+)
4

Performance Metrics

Max Context Window
1.1M
Avg Throughput
85.0 tok/s
Avg Latency
1ms

Top Categories

factuality
85.3%
vision
79.7%
code
64.7%
general
58.0%
Benchmark Performance
Top benchmark scores with normalized values (0-100%)
Ranking Across Benchmarks
Position relative to other models on each benchmark

Global-MMLU-Lite

Rank #3 of 14
#1Gemini 2.5 Pro
88.6%
#2Gemini 2.5 Pro Preview 06-05
89.2%
#3Gemini 2.5 Flash
88.4%
#4Gemini 2.5 Flash-Lite
81.1%
#5Gemini 2.0 Flash-Lite
78.2%
#6Gemma 3 27B
75.1%

AIME 2024

Rank #7 of 41
#4DeepSeek-R1-0528
91.4%
#5o3
91.6%
#6Gemini 2.5 Pro
92.0%
#7Gemini 2.5 Flash
88.0%
#8o3-mini
87.3%
#9DeepSeek R1 Distill Llama 70B
86.7%
#10DeepSeek R1 Zero
86.7%

FACTS Grounding

Rank #2 of 9
#1Gemini 2.5 Pro Preview 06-05
87.8%
#2Gemini 2.5 Flash
85.3%
#3Gemini 2.5 Flash-Lite
84.1%
#4Gemini 2.0 Flash
83.6%
#5Gemini 2.0 Flash-Lite
83.6%

GPQA

Rank #10 of 115
#7Gemini 2.5 Pro
83.0%
#8o3
83.3%
#9Grok-3 Mini
84.0%
#10Gemini 2.5 Flash
82.8%
#11GPT-5 mini
82.3%
#12o4-mini
81.4%
#13DeepSeek-R1-0528
81.0%

MMMU

Rank #5 of 52
#2o4-mini
81.6%
#3Gemini 2.5 Pro Preview 06-05
82.0%
#4o3
82.9%
#5Gemini 2.5 Flash
79.7%
#6Gemini 2.5 Pro
79.6%
#7Grok-3
78.0%
#8o1
77.6%
All Benchmark Results for Gemini 2.5 Flash
Complete list of benchmark scores with detailed information
Global-MMLU-Lite
Global-MMLU-Lite benchmark
general
text
0.88
88.4%
Self-reported
AIME 2024
AIME 2024 benchmark
general
text
0.88
88.0%
Self-reported
FACTS Grounding
FACTS Grounding benchmark
factuality
text
0.85
85.3%
Self-reported
GPQA
GPQA benchmark
general
text
0.83
82.8%
Self-reported
MMMU
MMMU benchmark
vision
multimodal
0.80
79.7%
Self-reported
AIME 2025
AIME 2025 benchmark
general
text
0.72
72.0%
Self-reported
Vibe-Eval
Vibe-Eval benchmark
code
text
0.65
65.4%
Self-reported
LiveCodeBench v5
LiveCodeBench v5 benchmark
code
text
0.64
63.9%
Self-reported
Aider-Polyglot
Aider-Polyglot benchmark
general
text
0.62
61.9%
Self-reported
SWE-Bench Verified
SWE-Bench Verified benchmark
general
text
0.60
60.4%
Self-reported
Showing 1 to 10 of 14 benchmarks