#tHow Many Tokens?

← Back to counter

Gemini 2.5 Flash vs Pro

SpecGemini 2.5 FlashGemini 2.5 Pro
ProviderGoogleGoogle
Input price (per 1M)$0.30$1.25
Output price (per 1M)$2.50$10.00
Context window1,000,0002,000,000
Tokenizer accuracyexact (uses official tokenizer)exact (uses official tokenizer)

Cost per 1,000 calls across common workloads

Gemini 2.5 Flash is cheaper on 5 of 5 workloads against Gemini 2.5 Pro. Pricing as of the latest snapshot.
WorkloadGemini 2.5 FlashGemini 2.5 ProWinner
Short chat
(200 in / 100 out)
$310.00 $1,250.00 Gemini 2.5 Flash
75% cheaper
Medium chat
(1,000 in / 500 out)
$1,550.00 $6,250.00 Gemini 2.5 Flash
75% cheaper
Heavy generation
(1,000 in / 2,000 out)
$5,300.00 $21,250.00 Gemini 2.5 Flash
75% cheaper
Long context
(8,000 in / 500 out)
$3,650.00 $15,000.00 Gemini 2.5 Flash
76% cheaper
Code review
(3,000 in / 600 out)
$2,400.00 $9,750.00 Gemini 2.5 Flash
75% cheaper

Costs are per 1,000 API calls. Multiply by 1,000 for per-million-calls.

Verdict

Flash is the default for most Gemini workloads. Pro is for workloads where you specifically need either (a) frontier-tier reasoning quality, or (b) the 2M-token context window. Otherwise the 17× price gap on input doesn't pencil out.

Cost example

For a 1,000-token prompt with a 200-token reply:

Gemini Flash:  1000 × $0.075/M + 200 × $0.30/M = $0.000135 per call
Gemini Pro:    1000 × $1.25/M  + 200 × $10/M   = $0.00325 per call

Pro costs ~24× more per call at this typical ratio. For 1,000,000 calls per month: $135 vs $3,250, a $3,115/month difference.

For longer contexts, the gap widens further because input dominates. A 100k-token prompt with a 1k reply: Flash ~$7.80/call, Pro ~$135/call.

Capability comparison

SpecFlashPro
Context window1,000,0002,000,000
Multimodal intext, image, video, audiotext, image, video, audio
Reasoning tierMidFrontier
Function callingYesYes
LatencyFastModerate

Both share the same Gemini tokenizer, so token counts are identical between them, only the per-token rate differs.

When Pro is worth it

When Flash wins

How to decide

Default to Flash. Escalate to Pro only when you've measured Flash failing on your workload, or when you genuinely need >1M context. Most teams who reach for Pro discover Flash would have been fine.

Pro vs Flash isn't really a "frontier vs cheap" comparison. Flash is unusually capable for its price tier. The 17× cost gap is the gap between "good enough for most things" and "frontier-class on hard things."

More comparisons

Compare with your real prompt →