Gemini 2.5 Flash API Pricing
Gemini 2.5 Flash (Google) costs $0.30 per 1M input tokens and $2.50 per 1M output tokens on the standard tier, as of July 2026.
What that means in practice
| Workload (per request) | Tokens in / out | Cost per request | Per 1,000 requests |
|---|---|---|---|
| Chat message | 1,000 / 400 | $0.0013 | $1.30 |
| Document summary | 10,000 / 1,000 | $0.0055 | $5.50 |
| Agent / long-context task | 50,000 / 5,000 | $0.0275 | $27.50 |
Estimate your exact workload with the AI API cost calculator.
Ways to pay less for Gemini 2.5 Flash
- Prompt caching — repeated input (system prompts, shared documents) is billed at roughly 10% of the input rate. For agents this often halves the bill.
- Batch processing — non-urgent work submitted in batch runs at ~50% off.
- Route easy tasks down-tier — for simple classification or extraction, Gemini 2.5 Flash-Lite or GPT-5.4 Nano cost a fraction as much.
How Gemini 2.5 Flash compares (July 2026)
| Model | Provider | Input $/1M | Output $/1M |
|---|---|---|---|
| Gemini 2.5 Flash-Lite | $0.10 | $0.40 | |
| GPT-5.4 Nano | OpenAI | $0.20 | $1.25 |
| Gemini 2.5 Flash | $0.30 | $2.50 | |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 |
| GPT-5.6 Luna | OpenAI | $1.00 | $6.00 |
| Gemini 3.5 Flash | $1.50 | $9.00 | |
| Gemini 3.1 Pro | $2.00 | $12.00 | |
| GPT-5.4 | OpenAI | $2.50 | $15.00 |
| GPT-5.6 Terra | OpenAI | $2.50 | $15.00 |
| Claude Sonnet 5 | Anthropic | $3.00 | $15.00 |
| Claude Opus 4.8 | Anthropic | $5.00 | $25.00 |
| GPT-5.5 | OpenAI | $5.00 | $30.00 |
| GPT-5.6 Sol | OpenAI | $5.00 | $30.00 |
| Claude Fable 5 | Anthropic | $10.00 | $50.00 |
Standard-tier, short-context list prices as of July 2026. Providers change pricing frequently — confirm on the official pricing page before committing. Long-context requests, priority tiers, and fine-tuned variants are priced differently.
Where Gemini 2.5 Flash fits
Gemini 2.5 Flash sits in the budget tier — built for high-volume, simple tasks like classification, extraction, tagging, and short answers, where per-token cost matters more than peak reasoning ability.
Other Google models: Gemini 2.5 Flash-Lite · Gemini 3.5 Flash · Gemini 3.1 Pro