Google

Major Platform

ai.google.dev

Platform Stats

Total Models25

Organizations5

Verified Benchmarks0

Multimodal Models17

Pricing Overview

Avg Input (per 1M)$2.48

Avg Output (per 1M)$11.90

Cheapest Model$0.07

Premium Model$15.00

Supported Features

Number of models supporting each feature

web Search

function Calling

structured Output

code Execution

batch Inference

finetuning

Input Modalities

Models supporting different input types

text

25 (100%)

image

17 (68%)

audio

0 (0%)

video

0 (0%)

Models Overview

Top performers and pricing distribution

Pricing Distribution

Input pricing per 1M tokens

$0-1

12 models

$1-5

10 models

$5-15

1 models

$15+

2 models

Top Performing Models

By benchmark avg

#1Mistral Large 2

87.6%

#2Claude 3.5 Sonnet

84.1%

#3Claude 3 Opus

81.6%

#4Llama 3.1 405B Instruct

79.2%

#5Claude 3.7 Sonnet

74.1%

Most Affordable Models

Gemini 2.0 Flash-Lite

$0.07/1M

Gemini 1.5 Flash 8B

$0.07/1M

Gemini 2.5 Flash-Lite

$0.10/1M

Available Models

25 models available through Google

			License
Claude Sonnet 4 Claude Sonnet 4, part of the Claude 4 family, is a significant upgrade to Claude Sonnet 3.7. It excels in coding (72.7% on SWE-bench) and reasoning, responding more precisely to instructions. Sonnet 4 offers an optimal mix of capability and practicality, with enhanced steerability, and supports extended thinking with tool use.	Anthropic	May 22, 2025	Proprietary	72.7%	-	-	-	-
Claude Opus 4 Claude Opus 4 is Anthropic's most powerful model and the world's best coding model, part of the Claude 4 family. It delivers sustained performance on complex, long-running tasks and agent workflows. Opus 4 excels at coding, advanced reasoning, and can use tools (like web search) during extended thinking. It supports parallel tool execution and has improved memory capabilities.	Anthropic	May 22, 2025	Proprietary	72.5%	-	-	-	-
Claude 3.7 Sonnet The most intelligent Claude model and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. Shows particularly strong improvements in coding and front-end web development.	Anthropic	Feb 24, 2025	Proprietary	70.3%	-	-	-	-
Gemini 2.5 Pro Preview 06-05 The latest preview version of Google's most advanced reasoning Gemini model, capable of solving complex problems. Built for the agentic era with enhanced reasoning capabilities, multimodal understanding (text, image, video, audio), and a 1M token context window. Features thinking preview, code execution, grounding with Google Search, system instructions, function calling, and controlled generation. Supports up to 3,000 images per prompt, 45-60 minutes of video, and 8.4 hours of audio.	Google	Jun 5, 2025	Proprietary	67.2%	82.2%	-	69.0%	-
Gemini 2.5 Pro Our most intelligent AI model, built for the agentic era. Gemini 2.5 Pro leads on common benchmarks with enhanced reasoning, multimodal capabilities (text, image, video, audio input), and a 1M token context window.	Google	May 20, 2025	Proprietary	63.2%	76.5%	-	-	-
Gemini 2.5 Flash A thinking model designed for a balance between price and performance. It builds upon Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal capabilities (text, image, video, audio input), and a 1M token input context window.	Google	May 20, 2025	Proprietary	60.4%	61.9%	-	-	-
Claude 3.5 Sonnet Claude 3.5 Sonnet is a powerful AI model with industry-leading software engineering skills. It excels in coding, planning, and problem-solving, with significant improvements in agentic coding and tool use tasks. The model includes computer use capabilities in public beta, allowing it to interact with computer interfaces like a human user.	Anthropic	Oct 22, 2024	Proprietary	49.0%	-	93.7%	-	-
Claude 3.5 Haiku Claude 3.5 Haiku is Anthropic's fastest model, delivering advanced coding, tool use, and reasoning capabilities at an accessible price. It excels at user-facing products, specialized sub-agent tasks, and generating personalized experiences from large data volumes. The model is particularly well-suited for code completions, interactive chatbots, data extraction, and real-time content moderation.	Anthropic	Oct 22, 2024	Proprietary	40.6%	-	88.1%	-	-
Gemini 2.5 Flash-Lite Gemini 2.5 Flash-Lite is a model developed by Google DeepMind, designed to handle various tasks including reasoning, science, mathematics, code generation, and more. It features advanced capabilities in multilingual performance and long context understanding. It is optimized for low latency use cases, supporting multimodal input with a 1 million-token context length.	Google	Jun 17, 2025	Creative Commons Attribution 4.0 License	31.6%	26.7%	-	33.7%	-
Gemini 2.0 Flash-Lite A Gemini 2.0 Flash model optimized for cost efficiency and low latency	Google	Feb 5, 2025	Proprietary	-	-	-	-	-

Showing 1 to 10 of 25 models

Resources

Official Website