DeepSeek API Pricing 2026 — Cheapest LLM ($0.14/M tokens)
DeepSeek V3.2 API: $0.14/$0.28 per 1M tokens — cheapest major LLM. 90% cache discount. Free tier. Compare vs GPT-5, Claude, Gemini pricing.
··2 min read
Share
DeepSeek API Pricing — March 2026
Last updated: April 5, 2026 No pricing changes since February 2026.
TL;DR — DeepSeek API Prices (April 2026)
- DeepSeek V3.2 (Chat): $0.14/$0.28 per 1M input/output tokens
- DeepSeek V3.2 (Reasoner): $0.14/$0.28 per 1M tokens
- Cache Hit: $0.014 per 1M input tokens (90% off)
DeepSeek V3.2 replaced both V3 and R1 with a unified model that handles both chat and reasoning at the same price.
DeepSeek V3.2 (Current)
| Model | Context | Input / 1M tokens | Output / 1M tokens | Cached Input |
|---|---|---|---|---|
| deepseek-chat (V3.2) | 128K | $0.14 | $0.28 | $0.014 |
| deepseek-reasoner (V3.2) | 128K | $0.14 | $0.28 | $0.014 |
Max output: 8K tokens (chat), 64K tokens (reasoner).
Previous Models (Deprecated)
| Model | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| DeepSeek V3 | $0.14 | $0.28 |
| DeepSeek R1 | $0.55 | $2.19 |
Why DeepSeek is Popular
- Still very cheap — $0.28/$0.42 is a fraction of GPT-5 or Claude pricing
- Unified model — same price for chat and reasoning
- 90% cache discount — $0.028/M for repeated context
- Open weights — can self-host
Related
Related Resources
ResourceCheapest LLM API (March 2026) — DeepSeek $0.14 vs Gemini Flash $0.10 Per 1M TokensMarch 2026: Gemini 2.0 Flash-Lite at $0.075/M, DeepSeek V3.2 at $0.28/M. Compare cheapest LLM APIs with free tiers. Save up to 90% on your API costs now!ResourceClaude API Pricing (March 2026): Opus $5/M Tokens, Sonnet $3, Haiku $0.25Claude API pricing 2026: Opus $5.00/M input, $25.00/M output. Sonnet $3.00/$15.00, Haiku $0.25/$1.25. Compare all models — Updated March 2026.ResourceGemini API Pricing 2026 — Complete Cost Guide (2.5 Pro, Flash)Gemini API pricing 2026: 2.5 Pro $1.25/10M tokens, Flash $0.30. Free tier included. Compare with GPT-5 & Claude. Updated April.ResourceOpenAI API Pricing 2026 — GPT-5.4, O3, O1 & GPT-4o Cost Per TokenMarch 2026: GPT-5.4 is $2.50/M input tokens. O3 Pro is $150/M. See all OpenAI API rates for GPT-5, O1, O3, GPT-4o, GPT-4o-mini with free tier limits.
Want more resources?
Subscribe to get the latest AI tools, guides, and updates.
Newsletter
Stay ahead of the curve
Key insights from top tech podcasts, delivered daily. Join 10,000+ engineers, founders, and investors.
One email per day. Unsubscribe anytime.