How much does GPT-4o cost per million tokens?
The short answer
As of 2026-04-26, OpenAI charges:
- $2.50 per 1,000,000 input tokens
- $10.00 per 1,000,000 output tokens
Output is 4× the price of input. This is typical for OpenAI and Anthropic models, generation is more expensive than reading.
What that means in real money
For a typical chat exchange (1,000-token prompt, 200-token reply):
input: 1,000 / 1,000,000 × $2.50 = $0.0025
output: 200 / 1,000,000 × $10.00 = $0.002
total per call: $0.0045
total per 1 million calls: $4,500
For a longer RAG-style query (10,000 input, 500 output):
input: 10,000 / 1,000,000 × $2.50 = $0.025
output: 500 / 1,000,000 × $10.00 = $0.005
total per call: $0.03
total per 1 million calls: $30,000
The longer the input, the more dominant the input cost becomes, and the more your model choice matters for total spend.
Cheaper alternatives
If you're cost-sensitive, GPT-4o is mid-range:
| Model | Input ($/M) | Output ($/M) | When to consider |
|---|---|---|---|
| GPT-4o mini | $0.15 | $0.60 | Most workloads, 17× cheaper, smaller quality gap than you'd expect |
| Gemini 2.5 Flash | $0.075 | $0.30 | Cheapest exact-tokenizer option, 1M context |
| Claude Haiku 4.5 | $0.80 | $4.00 | When you want Claude's instruction-following at low cost |
| DeepSeek V3 | $0.27 | $1.10 | Cheapest frontier-tier model (subject to compliance fit) |
More expensive but stronger on hard prompts
| Model | Input | Output | When to consider |
|---|---|---|---|
| Claude Sonnet 4.6 | $3.00 | $15.00 | Better instruction-following on nuanced tasks |
| Claude Opus 4.8 | $15.00 | $75.00 | Frontier reasoning, when output quality justifies the bill |
| Gemini 2.5 Pro | $1.25 | $10.00 | Long-context (2M tokens), multimodal |
Get a live cost estimate
Paste your actual prompt into the counter. It will show exact token counts across every model and the per-call cost based on your expected output ratio.
Try this on every model
- Claude Opus 4.8 $5.00/$25.00
- Claude Opus 4.8 (Fast Mode) $10.00/$50.00
- Claude Sonnet 4.6 $3.00/$15.00
- Claude Haiku 4.5 $1.00/$5.00
- GPT-5.5 $5.00/$30.00
- GPT-5.5 Pro $30.00/$180.00
- GPT-5.4 $2.50/$15.00
- GPT-5.4 Mini $0.75/$4.50
- GPT-5.4 Nano $0.20/$1.25
- GPT-5.4 Pro $30.00/$180.00
- GPT-5.3 $1.75/$14.00
- GPT-5.2 $1.75/$14.00
- GPT-5.2 Pro $21.00/$168.00
- GPT-5.1 $1.25/$10.00
- GPT-5 $1.25/$10.00
- GPT-5 Mini $0.25/$2.00
- GPT-5 Nano $0.05/$0.40
- GPT-5 Pro $15.00/$120.00
- GPT-4.1 $2.00/$8.00
- GPT-4.1 Mini $0.40/$1.60
- GPT-4.1 Nano $0.10/$0.40
- o3 $2.00/$8.00
- o3-mini $1.10/$4.40
- o3-pro $20.00/$80.00
- o4-mini $1.10/$4.40
- GPT-4o $2.50/$10.00
- GPT-4o mini $0.15/$0.60
- GPT-4 Turbo $10.00/$30.00
- Gemini 3.1 Pro $2.00/$12.00
- Gemini 3 Flash $0.50/$3.00
- Gemini 3.1 Flash-Lite $0.25/$1.50
- Gemini 2.5 Pro $1.25/$10.00
- Gemini 2.5 Flash $0.30/$2.50
- Gemini 2.5 Flash-Lite $0.10/$0.40
- Llama 3.3 70B $0.88/$0.88
- Llama 3.1 405B $3.50/$3.50
- Llama 3.1 70B $0.59/$0.79
- Llama 3.1 8B $0.18/$0.18
- Mistral Large $2.00/$6.00
- DeepSeek V3 $0.27/$1.10
- DeepSeek V3.1 $0.60/$1.70
- DeepSeek R1 $3.00/$7.00
- Qwen 2.5 72B $0.90/$0.90
- Qwen 2.5 Coder 32B $0.80/$0.80
- Qwen3 Coder 480B $2.00/$2.00
- GLM-5.1 $1.40/$4.40