OpenAI API Pricing 2026 — GPT-5.4, O3, O1 & GPT-4o Cost Per Token
March 2026: GPT-5.4 is $2.50/M input tokens. o1-pro is $150/M. See all OpenAI API rates for GPT-5, o1, o3, GPT-4o, and GPT-4o Mini with free tier limits.
OpenAI API Pricing — March 2026
Last updated: March 5, 2026 — GPT-5.4 now rolling out
Latest: GPT-5.4 (March 5, 2026)
- GPT-5.4 brings advances in reasoning, coding, and agentic workflows
- Available in ChatGPT, API, and Codex
- Includes GPT-5.4 Thinking and GPT-5.4 Pro variants
TL;DR — OpenAI API Prices (March 2026)
- GPT-5.2: $1.75/$14.00 per 1M input/output tokens (flagship in the tables below)
- GPT-5: $1.25/$10.00 per 1M input/output tokens
- GPT-5 Nano: $0.05/$0.40 per 1M input/output tokens (cheapest)
- o4-mini: $1.10/$4.40 per 1M input/output tokens (best value reasoning)
- o3-pro: $20.00/$80.00 per 1M input/output tokens (strongest reasoning)
OpenAI's lineup runs from cheap, fast models (GPT-5 Nano at $0.05 per 1M input tokens) to expensive reasoning models (o1-pro at $150.00 per 1M input tokens), a 3,000x spread in input price.
GPT-5 Family (Flagship)
| Model | Context | Input / 1M tokens | Output / 1M tokens | Cached Input |
|---|---|---|---|---|
| GPT-5.2 Pro | 200K | $21.00 | $168.00 | $2.10 |
| GPT-5.2 | 200K | $1.75 | $14.00 | $0.175 |
| GPT-5.1 | 128K | $1.25 | $10.00 | $0.125 |
| GPT-5 | 128K | $1.25 | $10.00 | $0.125 |
| GPT-5 Mini | 200K | $0.25 | $2.00 | $0.025 |
| GPT-5 Nano | 128K | $0.05 | $0.40 | $0.005 |
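To turn the per-1M rates in this table into a per-request figure, multiply each token count by its rate and divide by one million. The sketch below is illustrative only: the dictionary keys are hypothetical labels chosen for this example (not necessarily the API's model identifiers), and the 10,000-input / 2,000-output request is an assumed workload.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the GPT-5 family table above.
# Keys are hypothetical labels for this example, not confirmed API model IDs.
PRICES = {
    "gpt-5.2": {"input": Decimal("1.75"), "output": Decimal("14.00")},
    "gpt-5": {"input": Decimal("1.25"), "output": Decimal("10.00")},
    "gpt-5-mini": {"input": Decimal("0.25"), "output": Decimal("2.00")},
    "gpt-5-nano": {"input": Decimal("0.05"), "output": Decimal("0.40")},
}

TOKENS_PER_UNIT = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request at the listed per-1M-token rates."""
    price = PRICES[model]
    total = (
        Decimal(input_tokens) * price["input"]
        + Decimal(output_tokens) * price["output"]
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Assumed workload: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

At these rates the assumed request costs $0.0455 on GPT-5.2 and $0.0013 on GPT-5 Nano, a 35x difference. `Decimal` is used so the small per-request amounts are not distorted by binary floating-point rounding.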
O-Series (Reasoning)
| Model | Context | Input / 1M tokens | Output / 1M tokens | Cached Input |
|---|---|---|---|---|
| o1-pro | 200K | $150.00 | $600.00 | — |
| o3-pro | 200K | $20.00 | $80.00 | — |
| o1 | 200K | $15.00 | $60.00 | $7.50 |
| o3 | 200K | $2.00 | $8.00 | $1.00 |
| o4-mini | 200K | $1.10 | $4.40 | $0.275 |
| o3-mini | 200K | $1.10 | $4.40 | $0.275 |
GPT-4 Family (Previous Gen)
| Model | Context | Input / 1M tokens | Output / 1M tokens | Cached Input |
|---|---|---|---|---|
| GPT-4.1 | 1M | $2.00 | $8.00 | $0.20 |
| GPT-4.1 Mini | 1M | $0.40 | $1.60 | $0.04 |
| GPT-4.1 Nano | 1M | $0.10 | $0.40 | $0.01 |
| GPT-4o | 128K | $2.50 | $10.00 | $1.25 |
| GPT-4o Mini | 128K | $0.15 | $0.60 | $0.075 |
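The Cached Input column does not represent the same discount for every model. A quick way to check is to compare each cached rate with its input rate; the sketch below does this for a handful of rows from the three tables. The function name is a hypothetical helper for this example.

```python
# (input price, cached input price) in USD per 1M tokens, from the tables above.
LISTED = {
    "GPT-5.2": (1.75, 0.175),
    "o3": (2.00, 1.00),
    "o4-mini": (1.10, 0.275),
    "GPT-4.1": (2.00, 0.20),
    "GPT-4o": (2.50, 1.25),
}


def cache_discount_percent(input_price: float, cached_price: float) -> int:
    """Percentage saved on a cached input token versus an uncached one."""
    return round(100 * (1 - cached_price / input_price))


if __name__ == "__main__":
    for model, (input_price, cached_price) in LISTED.items():
        pct = cache_discount_percent(input_price, cached_price)
        print(f"{model}: {pct}% off cached input")
```

For the rows sampled here the discount is 90% for GPT-5.2 and GPT-4.1, 75% for o4-mini, and 50% for o3 and GPT-4o, so it is worth checking the specific model before budgeting around cache hits.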
Cost Optimization Tips
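- Use Nano/Mini models for simple tasks: GPT-5 Nano costs 25x less than GPT-5 and 35x less than GPT-5.2; GPT-5 Mini costs 5x and 7x less
- Prompt caching: up to 90% off repeated context. In the tables above, cached input is 10% of the input price for the GPT-5 and GPT-4.1 families, 25% for o4-mini and o3-mini, and 50% for o1, o3, GPT-4o, and GPT-4o Mini; o1-pro and o3-pro list no cached rate
- Batch API: 50% off for asynchronous jobs completed within 24 hours
- Choose the right model: don't use o1-pro for simple queries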
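The tips above can be compared on a single workload. The following is a rough sketch, not a billing calculator: `workload_cost` is a hypothetical helper, the workload (100,000 requests of 4,000 input and 500 output tokens, 75% of input served from cache) is assumed, and the batch discount is applied as a flat 50% on the total. Because the source does not say whether batch and cache discounts combine, the function refuses to apply both at once.

```python
from typing import Optional

BATCH_DISCOUNT = 0.5  # "50% off" from the Batch API tip above


def workload_cost(
    requests: int,
    input_tokens: int,
    output_tokens: int,
    input_price: float,
    output_price: float,
    cached_price: Optional[float] = None,
    cached_fraction: float = 0.0,
    batch: bool = False,
) -> float:
    """Estimate USD cost for `requests` identical calls at per-1M-token prices."""
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    if batch and cached_fraction > 0:
        # Assumption: stacking the two discounts is not modelled in this sketch.
        raise ValueError("combining batch and cache discounts is not modelled")
    if cached_fraction > 0 and cached_price is None:
        raise ValueError("this model lists no cached input price")

    cached_tokens = input_tokens * cached_fraction
    fresh_tokens = input_tokens - cached_tokens
    per_request = (
        fresh_tokens * input_price
        + cached_tokens * (cached_price or 0.0)
        + output_tokens * output_price
    ) / 1_000_000
    total = requests * per_request
    return total * BATCH_DISCOUNT if batch else total


if __name__ == "__main__":
    # Assumed workload; GPT-5 and GPT-5 Nano rates come from the tables above.
    n, tok_in, tok_out = 100_000, 4_000, 500
    print("GPT-5 baseline:  ", workload_cost(n, tok_in, tok_out, 1.25, 10.00))
    print("GPT-5 75% cached:", workload_cost(n, tok_in, tok_out, 1.25, 10.00, 0.125, 0.75))
    print("GPT-5 batch:     ", workload_cost(n, tok_in, tok_out, 1.25, 10.00, batch=True))
    print("GPT-5 Nano:      ", workload_cost(n, tok_in, tok_out, 0.05, 0.40))
```

Under these assumptions the GPT-5 baseline is about $1,000, caching brings it to about $662.50, batching to about $500, and moving the same workload to GPT-5 Nano to about $40. On this workload, model choice is the largest lever by a wide margin.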