How Many Tokens?


GPT-5.4 Mini: token counter & pricing

OpenAI · exact (uses official tokenizer) · pricing as of 2026-05-31.

Provider: OpenAI
API model ID: gpt-5.4-mini
Context window: 400,000 tokens
Input price: $0.75 per 1M tokens
Output price: $4.50 per 1M tokens
Tokenizer accuracy: exact (uses official tokenizer)
Pricing as of: 2026-05-31

Open the counter to count tokens for GPT-5.4 Mini in real time.

What is GPT-5.4 Mini?

GPT-5.4 Mini is the mid-tier model in the GPT-5.4 family. It is about 3.3× cheaper on input than full GPT-5.4 ($2.50 → $0.75 per 1M tokens) while keeping the same o200k_base tokenizer and 400K context window.

It is the sweet spot between GPT-5.4 Nano (too small for harder tasks) and full GPT-5.4 (overkill for routine work).

How tokens are counted here

Counts use OpenAI's o200k_base tokenizer, run in the browser via js-tiktoken, so they are exact.

Pricing notes

$0.75 input / $4.50 output per 1M tokens. Cached input is $0.075 per 1M tokens.

There is no long-context tier: a single price applies across the full 400K window. That makes Mini's per-token cost predictable regardless of prompt length.

For a 1,000-token prompt with a 200-token reply: $0.00165 per call, ~$1,650 per 1M calls.
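The arithmetic behind that figure can be sketched in a few lines of Python. This is only an illustration using the prices listed on this page; the function name call_cost is made up for the example and is not part of any OpenAI SDK.

```python
# Prices in USD per 1M tokens, copied from this page (as of 2026-05-31).
INPUT_PRICE_PER_M = 0.75
OUTPUT_PRICE_PER_M = 4.50


def call_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one call at the flat per-token rates."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


per_call = call_cost(1_000, 200)
print(f"${per_call:.5f} per call")              # $0.00165 per call
print(f"${per_call * 1_000_000:,.0f} per 1M calls")  # $1,650 per 1M calls
```

Because there is no long-context tier, the same two constants apply whether the prompt is 1,000 tokens or close to the 400K limit.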

When to use GPT-5.4 Mini

When not to use it:

Common questions

How does GPT-5.4 Mini compare to GPT-5 Mini?

GPT-5 Mini costs $0.25/$2 per 1M tokens, which makes it 3× cheaper on input than 5.4 Mini ($0.75/$4.50). The price gap reflects the reasoning improvement: 5.4 Mini benchmarks meaningfully better on multi-step tasks. For routine workloads GPT-5 Mini is the rational default; reach for 5.4 Mini when you've measured a quality win on your own tasks.

What about Gemini 3 Flash?

Gemini 3 Flash ($0.50/$3) is about 33% cheaper than 5.4 Mini on both input and output, with a 1M context window and multimodal input. What you give up is OpenAI's more mature tool use and function calling. Test both with your own prompts.
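To put the three price points side by side, here is a rough sketch that prices the same 1,000-token prompt and 200-token reply on each model. The prices are the ones quoted on this page; the dictionary keys other than gpt-5.4-mini are illustrative labels rather than confirmed API model IDs. It also assumes identical token counts across models, which will not hold exactly for Gemini because it uses a different tokenizer.

```python
# USD per 1M tokens as (input, output), as quoted on this page.
# Keys are labels for this example; only "gpt-5.4-mini" is a listed model ID.
PRICES = {
    "gpt-5-mini": (0.25, 2.00),
    "gemini-3-flash": (0.50, 3.00),
    "gpt-5.4-mini": (0.75, 4.50),
}


def cost_per_million_calls(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of 1M identical calls for the given model."""
    input_price, output_price = PRICES[model]
    # Multiplying a per-call cost by 1M calls cancels the per-1M-token divisor.
    return input_tokens * input_price + output_tokens * output_price


for model in PRICES:
    total = cost_per_million_calls(model, 1_000, 200)
    print(f"{model}: ${total:,.0f} per 1M calls")
```

With these inputs the totals come out to $650, $1,100 and $1,650 per 1M calls respectively, so the ranking matches the per-token prices; a workload with a very different input/output ratio would shift the gaps but not the order.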

Caching savings?

Cached input at $0.075/M is 10% of the standard input price. It pays off on agent loops with stable system prompts.
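As a rough illustration of the effect, the sketch below prices one call with and without a cached prefix. The 800/200 split between cached and fresh input tokens is a hypothetical example, and the code assumes cached tokens are billed at the cached rate while the remaining input is billed at the standard rate; check your actual usage reports before relying on it.

```python
# Prices in USD per 1M tokens, copied from this page (as of 2026-05-31).
INPUT_PRICE_PER_M = 0.75
CACHED_INPUT_PRICE_PER_M = 0.075
OUTPUT_PRICE_PER_M = 4.50


def call_cost_with_cache(cached_tokens: int, fresh_tokens: int, output_tokens: int) -> float:
    """USD cost of one call, assuming cached and fresh input are billed separately."""
    total = (
        cached_tokens * CACHED_INPUT_PRICE_PER_M
        + fresh_tokens * INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / 1_000_000


# Hypothetical agent step: 1,000 input tokens, 800 of them a stable system prompt.
no_cache = call_cost_with_cache(0, 1_000, 200)
with_cache = call_cost_with_cache(800, 200, 200)
print(f"no cache:   ${no_cache:.5f}")
print(f"with cache: ${with_cache:.5f}")
print(f"saving:     {1 - with_cache / no_cache:.0%}")
```

The saving is smaller than the 90% discount on cached tokens because output tokens, at $4.50/M, still dominate the bill in this example.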

Compare GPT-5.4 Mini to other models