Gemini 2.5 Flash

Name: Gemini 2.5 Flash
Price: 0.3 USD
Rating: 62.5 (14 reviews)
Author: Google

Multimodal

Zero-eval

#2FACTS Grounding

#2LiveCodeBench v5

#3Global-MMLU-Lite

+1 more

by Google

About

Gemini 2.5 Flash is a multimodal language model developed by Google. It achieves strong performance with an average score of 62.5% across 14 benchmarks. It excels particularly in Global-MMLU-Lite (88.4%), AIME 2024 (88.0%), FACTS Grounding (85.3%). With a 1.1M token context window, it can handle extensive documents and complex multi-turn conversations. The model is available through 2 API providers. As a multimodal model, it can process and understand text, images, and other input formats seamlessly. Released in 2025, it represents Google's latest advancement in AI technology.

Pricing Range

Input (per 1M)$0.30 -$0.30

Output (per 1M)$2.50 -$2.50

Providers2

Timeline

AnnouncedMay 20, 2025

ReleasedMay 20, 2025

Knowledge CutoffJan 31, 2025

Specifications

Capabilities

Multimodal

License & Family

License

Proprietary

Benchmark Performance Overview

Performance metrics and category breakdown

Overall Performance

14 benchmarks

Average Score

62.5%

Best Score

88.4%

High Performers (80%+)

Performance Metrics

Max Context Window

1.1M

Avg Throughput

85.0 tok/s

Avg Latency

1ms

Top Categories

factuality

85.3%

vision

79.7%

code

64.7%

general

58.0%

Benchmark Performance

Top benchmark scores with normalized values (0-100%)

Ranking Across Benchmarks

Position relative to other models on each benchmark

Global-MMLU-Lite

Rank #3 of 14

#1Gemini 2.5 Pro

88.6%

#2Gemini 2.5 Pro Preview 06-05

89.2%

#3Gemini 2.5 Flash

88.4%

#4Gemini 2.5 Flash-Lite

81.1%

#5Gemini 2.0 Flash-Lite

78.2%

#6Gemma 3 27B

75.1%

AIME 2024

Rank #7 of 41

#4DeepSeek-R1-0528

91.4%

#5o3

91.6%

#6Gemini 2.5 Pro

92.0%

#7Gemini 2.5 Flash

88.0%

#8o3-mini

87.3%

#9DeepSeek R1 Distill Llama 70B

86.7%

#10DeepSeek R1 Zero

86.7%

FACTS Grounding

Rank #2 of 9

#1Gemini 2.5 Pro Preview 06-05

87.8%

#2Gemini 2.5 Flash

85.3%

#3Gemini 2.5 Flash-Lite

84.1%

#4Gemini 2.0 Flash

83.6%

#5Gemini 2.0 Flash-Lite

83.6%

GPQA

Rank #10 of 115

#7Gemini 2.5 Pro

83.0%

#8o3

83.3%

#9Grok-3 Mini

84.0%

#10Gemini 2.5 Flash

82.8%

#11GPT-5 mini

82.3%

#12o4-mini

81.4%

#13DeepSeek-R1-0528

81.0%

MMMU

Rank #5 of 52

#2o4-mini

81.6%

#3Gemini 2.5 Pro Preview 06-05

82.0%

#4o3

82.9%

#5Gemini 2.5 Flash

79.7%

#6Gemini 2.5 Pro

79.6%

#7Grok-3

78.0%

#8o1

77.6%

All Benchmark Results for Gemini 2.5 Flash

Complete list of benchmark scores with detailed information


Global-MMLU-Lite Global-MMLU-Lite benchmark	general	text	0.88	88.4%	Self-reported
AIME 2024 AIME 2024 benchmark	general	text	0.88	88.0%	Self-reported
FACTS Grounding FACTS Grounding benchmark	factuality	text	0.85	85.3%	Self-reported
GPQA GPQA benchmark	general	text	0.83	82.8%	Self-reported
MMMU MMMU benchmark	vision	multimodal	0.80	79.7%	Self-reported
AIME 2025 AIME 2025 benchmark	general	text	0.72	72.0%	Self-reported
Vibe-Eval Vibe-Eval benchmark	code	text	0.65	65.4%	Self-reported
LiveCodeBench v5 LiveCodeBench v5 benchmark	code	text	0.64	63.9%	Self-reported
Aider-Polyglot Aider-Polyglot benchmark	general	text	0.62	61.9%	Self-reported
SWE-Bench Verified SWE-Bench Verified benchmark	general	text	0.60	60.4%	Self-reported

Showing 1 to 10 of 14 benchmarks

Resources

API Reference Playground Blog Post