
Gemini 2.5 Flash
Multimodal
Zero-eval
#2FACTS Grounding
#2LiveCodeBench v5
#3Global-MMLU-Lite
+1 more
by Google
About
Gemini 2.5 Flash is a multimodal language model developed by Google. It achieves strong performance with an average score of 62.5% across 14 benchmarks. It excels particularly in Global-MMLU-Lite (88.4%), AIME 2024 (88.0%), FACTS Grounding (85.3%). With a 1.1M token context window, it can handle extensive documents and complex multi-turn conversations. The model is available through 2 API providers. As a multimodal model, it can process and understand text, images, and other input formats seamlessly. Released in 2025, it represents Google's latest advancement in AI technology.
Pricing Range
Input (per 1M)$0.30 -$0.30
Output (per 1M)$2.50 -$2.50
Providers2
Timeline
AnnouncedMay 20, 2025
ReleasedMay 20, 2025
Knowledge CutoffJan 31, 2025
Specifications
Capabilities
Multimodal
License & Family
License
Proprietary
Benchmark Performance Overview
Performance metrics and category breakdown
Overall Performance
14 benchmarks
Average Score
62.5%
Best Score
88.4%
High Performers (80%+)
4Performance Metrics
Max Context Window
1.1MAvg Throughput
85.0 tok/sAvg Latency
1msTop Categories
factuality
85.3%
vision
79.7%
code
64.7%
general
58.0%
Benchmark Performance
Top benchmark scores with normalized values (0-100%)
Ranking Across Benchmarks
Position relative to other models on each benchmark
Global-MMLU-Lite
Rank #3 of 14
#1Gemini 2.5 Pro
88.6%
#2Gemini 2.5 Pro Preview 06-05
89.2%
#3Gemini 2.5 Flash
88.4%
#4Gemini 2.5 Flash-Lite
81.1%
#5Gemini 2.0 Flash-Lite
78.2%
#6Gemma 3 27B
75.1%
AIME 2024
Rank #7 of 41
#4DeepSeek-R1-0528
91.4%
#5o3
91.6%
#6Gemini 2.5 Pro
92.0%
#7Gemini 2.5 Flash
88.0%
#8o3-mini
87.3%
#9DeepSeek R1 Distill Llama 70B
86.7%
#10DeepSeek R1 Zero
86.7%
FACTS Grounding
Rank #2 of 9
#1Gemini 2.5 Pro Preview 06-05
87.8%
#2Gemini 2.5 Flash
85.3%
#3Gemini 2.5 Flash-Lite
84.1%
#4Gemini 2.0 Flash
83.6%
#5Gemini 2.0 Flash-Lite
83.6%
GPQA
Rank #10 of 115
#7Gemini 2.5 Pro
83.0%
#8o3
83.3%
#9Grok-3 Mini
84.0%
#10Gemini 2.5 Flash
82.8%
#11GPT-5 mini
82.3%
#12o4-mini
81.4%
#13DeepSeek-R1-0528
81.0%
MMMU
Rank #5 of 52
#2o4-mini
81.6%
#3Gemini 2.5 Pro Preview 06-05
82.0%
#4o3
82.9%
#5Gemini 2.5 Flash
79.7%
#6Gemini 2.5 Pro
79.6%
#7Grok-3
78.0%
#8o1
77.6%
All Benchmark Results for Gemini 2.5 Flash
Complete list of benchmark scores with detailed information
Global-MMLU-Lite Global-MMLU-Lite benchmark | general | text | 0.88 | 88.4% | Self-reported |
AIME 2024 AIME 2024 benchmark | general | text | 0.88 | 88.0% | Self-reported |
FACTS Grounding FACTS Grounding benchmark | factuality | text | 0.85 | 85.3% | Self-reported |
GPQA GPQA benchmark | general | text | 0.83 | 82.8% | Self-reported |
MMMU MMMU benchmark | vision | multimodal | 0.80 | 79.7% | Self-reported |
AIME 2025 AIME 2025 benchmark | general | text | 0.72 | 72.0% | Self-reported |
Vibe-Eval Vibe-Eval benchmark | code | text | 0.65 | 65.4% | Self-reported |
LiveCodeBench v5 LiveCodeBench v5 benchmark | code | text | 0.64 | 63.9% | Self-reported |
Aider-Polyglot Aider-Polyglot benchmark | general | text | 0.62 | 61.9% | Self-reported |
SWE-Bench Verified SWE-Bench Verified benchmark | general | text | 0.60 | 60.4% | Self-reported |
Showing 1 to 10 of 14 benchmarks
Resources