AI & Developer Tools

Gemini 2.5 Flash API Pricing

Gemini 2.5 Flash (Google) costs $0.30 per 1M input tokens and $2.50 per 1M output tokens on the standard tier, as of July 2026.

What that means in practice

Workload (per request)	Tokens in / out	Cost per request	Per 1,000 requests
Chat message	1,000 / 400	$0.0013	$1.30
Document summary	10,000 / 1,000	$0.0055	$5.50
Agent / long-context task	50,000 / 5,000	$0.0275	$27.50

Estimate your exact workload with the AI API cost calculator.

Ways to pay less for Gemini 2.5 Flash

Prompt caching — repeated input (system prompts, shared documents) is billed at roughly 10% of the input rate. For agents this often halves the bill.
Batch processing — non-urgent work submitted in batch runs at ~50% off.
Route easy tasks down-tier — for simple classification or extraction, Gemini 2.5 Flash-Lite or GPT-5.4 Nano cost a fraction as much.

How Gemini 2.5 Flash compares (July 2026)

Model	Provider	Input $/1M	Output $/1M
Gemini 2.5 Flash-Lite	Google	$0.10	$0.40
GPT-5.4 Nano	OpenAI	$0.20	$1.25
Gemini 2.5 Flash	Google	$0.30	$2.50
Claude Haiku 4.5	Anthropic	$1.00	$5.00
GPT-5.6 Luna	OpenAI	$1.00	$6.00
Gemini 3.5 Flash	Google	$1.50	$9.00
Gemini 3.1 Pro	Google	$2.00	$12.00
GPT-5.4	OpenAI	$2.50	$15.00
GPT-5.6 Terra	OpenAI	$2.50	$15.00
Claude Sonnet 5	Anthropic	$3.00	$15.00
Claude Opus 4.8	Anthropic	$5.00	$25.00
GPT-5.5	OpenAI	$5.00	$30.00
GPT-5.6 Sol	OpenAI	$5.00	$30.00
Claude Fable 5	Anthropic	$10.00	$50.00

Standard-tier, short-context list prices as of July 2026. Providers change pricing frequently — confirm on the official pricing page before committing. Long-context requests, priority tiers, and fine-tuned variants are priced differently.

Where Gemini 2.5 Flash fits

Gemini 2.5 Flash sits in the budget tier — built for high-volume, simple tasks like classification, extraction, tagging, and short answers, where per-token cost matters more than peak reasoning ability.

Other Google models: Gemini 2.5 Flash-Lite · Gemini 3.5 Flash · Gemini 3.1 Pro