Gemini 3.1 Flash-Lite: token counter & pricing
Google · exact (uses official tokenizer) · pricing as of 2026-05-31.
- Provider
- API model ID
gemini-3.1-flash-lite-preview- Context window
- 1,000,000 tokens
- Input price
- $0.25 per 1M tokens
- Output price
- $1.50 per 1M tokens
- Tokenizer accuracy
- exact (uses official tokenizer)
- Pricing as of
- 2026-05-31
Open the counter to count tokens for Gemini 3.1 Flash-Lite in real time.
What is Gemini 3.1 Flash-Lite?
Gemini 3.1 Flash-Lite is the cheapest member of the Gemini 3 family, released as a Preview in April 2026 alongside 3.1 Pro and 3 Flash. Designed for high-volume agentic tasks, translation, and simple data processing.
$0.25 input / $1.50 output per 1M tokens, more expensive than the GA Gemini 2.5 Flash-Lite ($0.10/$0.40) but with Gemini 3.x reasoning improvements.
Currently in Preview. Pricing and behavior may change before GA.
How tokens are counted here
Google's official models.countTokens endpoint via our serverless proxy. Exact.
Pricing notes
$0.25 input / $1.50 output per 1M. Cached input $0.025/M (text/image/video).
For 1,000 input + 200 output: $0.00055 per call, $550 per 1M calls.
1M-token context window, same as the GA Gemini 2.5 Flash-Lite.
When to use Gemini 3.1 Flash-Lite
- Workloads currently on 2.5 Flash-Lite where you want to test the Gemini 3 reasoning quality lift.
- High-volume agentic loops that need stronger reasoning than 2.5 Flash-Lite at the lowest Gemini 3 tier.
- Translation and multilingual workloads. Gemini's tokenizer compresses non-English text efficiently.
When not to use it:
- Production-stable workloads. Preview status means pricing/behavior could shift.
- Pure cost extremes, Gemini 2.5 Flash-Lite ($0.10/$0.40) is 60% cheaper on input.
- Pure text reasoning where GPT-5 Nano ($0.05/$0.40) is dramatically cheaper.
Common questions
Gemini 3.1 Flash-Lite vs 2.5 Flash-Lite?
| Model | Input | Output | Status |
|---|---|---|---|
| 2.5 Flash-Lite | $0.10 | $0.40 | GA |
| 3.1 Flash-Lite Preview | $0.25 | $1.50 | Preview |
3.1 Flash-Lite is 2.5× more expensive on both axes for the Gemini 3 generation reasoning lift. For most cost-sensitive workloads, 2.5 Flash-Lite is the rational default until 3.1 Flash-Lite leaves Preview and pricing settles.
Gemini 3.1 Flash-Lite vs Gemini 3 Flash?
3 Flash: $0.50/$3. Pro-tier reasoning at Flash prices. 3.1 Flash-Lite: $0.25/$1.50, half the price of 3 Flash with less reasoning capacity. For most routing/classification workloads, Flash-Lite is enough.
When does it leave Preview?
No published date. Preview models can change pricing without notice, verify before invoicing.
Compare Gemini 3.1 Flash-Lite to other models
- Gemini 3.1 Pro (Google, $2.00/$12.00)
- Gemini 3 Flash (Google, $0.50/$3.00)
- Gemini 2.5 Pro (Google, $1.25/$10.00)
- Gemini 2.5 Flash (Google, $0.30/$2.50)
- Gemini 2.5 Flash-Lite (Google, $0.10/$0.40)
- GPT-5 Mini (OpenAI, $0.25/$2.00)
- DeepSeek V3 (DeepSeek, $0.27/$1.10)
- GPT-5.4 Nano (OpenAI, $0.20/$1.25)