AI Model Pricing Guide

Because tokens cost money and you're not made of it | Updated Mar 17, 2026

CHEAPEST: Gemini 2.0 Flash-Lite / Grok 4.1 Fast
BEST VALUE: Claude Sonnet 4.6 / Grok 4.1 Fast
SMARTEST: Claude Opus 4.6
NEW: GPT-5.4 / Grok 4.20 / Gemini 3 Flash

OpenAI

The OG of AI APIs. GPT kicked off the revolution and they're still leading.

BUDGET: GPT-5 mini (Fast & cheap)
In: $0.25 per 1M | Out: $2.00 per 1M
128K ctx | Fast
Lightweight champion. Surprisingly capable for simple tasks and high-volume apps.
Best: Chatbots, simple QA, data extraction
NEW: o4-mini (Reinforcement tuned)
In: $4.00 per 1M | Out: $16.00 per 1M
200K ctx | Fine-tuning
Optimized for reinforcement fine-tuning workflows. Create custom reasoning patterns.
Best: Fine-tuning, custom reasoning
POWER: GPT-5.2 (Reasoning beast)
In: $1.75 per 1M | Out: $14.00 per 1M
6.6h horizon | 200K ctx
Top 3 on METR. Excels at complex tasks, code, and multi-step reasoning.
Best: Code, analysis, agent workflows
FLAGSHIP: GPT-5.2 pro (Smartest model)
In: $21.00 per 1M | Out: $168.00 per 1M
200K ctx | Premium
OpenAI's most precise model. For when you need the absolute best reasoning.
Best: Hardest problems, precision work
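
As a rough sketch of how the per-1M prices above turn into a per-request cost: multiply each token count by its price and divide by one million. The function name and the example token counts are illustrative assumptions; the prices are the GPT-5 mini figures listed above.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 in_price_per_1m: float, out_price_per_1m: float) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    return (input_tokens * in_price_per_1m
            + output_tokens * out_price_per_1m) / 1_000_000


# GPT-5 mini prices from the card above: $0.25 in, $2.00 out per 1M tokens.
# The 10,000 / 1,000 token counts are made-up example values.
cost = request_cost(10_000, 1_000, in_price_per_1m=0.25, out_price_per_1m=2.00)
print(f"${cost:.4f}")  # prints $0.0045
```

Output tokens dominate the bill for most of these models, so a chatty response can cost more than a long prompt.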

Anthropic

Safety-first company. Claude is beloved by developers for being genuinely helpful.

FAST: Claude Haiku 4.5 (Speed demon)
In: $1.00 per 1M | Out: $5.00 per 1M
200K ctx | Fastest
Optimized for fast responses. Perfect for real-time apps and bulk processing.
Best: Real-time chat, bulk processing
FLAGSHIP: Claude Opus 4.6 (Current SOTA)
In: $5.00 per 1M | Out: $25.00 per 1M
14.5h horizon | 200K ctx
#1 on METR. Works autonomously on 14+ hour tasks. When you need the best.
Best: Hard problems, research, complex agents

xAI

Elon's AI. Grok has real-time X data access and absurdly low pricing.

NEW: Grok 4.20 Beta (Multi-agent)
In: $2.00 per 1M | Out: $6.00 per 1M
2M ctx | Multi-agent | Vision
Latest beta with multi-agent capabilities. Reasoning and non-reasoning variants available.
Best: Complex multi-agent workflows
CHEAPEST: Grok 4 / 4.1 Fast (Crazy cheap)
In: $0.20 per 1M | Out: $0.50 per 1M
2M ctx | Reasoning | Vision
Insane value. 2M context + real-time X data. No need to think twice at these prices.
Best: High-volume, X analysis, massive context
NEW: Grok Code Fast 1 (Coding specialist)
In: $0.20 per 1M | Out: $1.50 per 1M
256K ctx | Reasoning
Optimized for code generation. Same input price as Grok 4 Fast but better for programming tasks.
Best: Code generation, programming
BUDGET: Grok 3 Mini (Older gen cheap)
In: $0.30 per 1M | Out: $0.50 per 1M
131K ctx | Reasoning
Budget fallback if Grok 4's 2M context is overkill for your use case.
Best: Simple tasks, testing
POWER: Grok 4-0709 (Premium tier)
In: $3.00 per 1M | Out: $15.00 per 1M
256K ctx | Reasoning | Vision
Premium Grok. Smaller context but more reasoning power than Fast.
Best: Grok style with more smarts
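
To see how the xAI lineup compares on an actual workload rather than on input price alone, here is a minimal sketch that ranks the five models above by total cost. The prices come from the cards; the helper name and the workload (1M tokens in, 1M tokens out) are assumptions chosen for illustration.

```python
# (input, output) USD prices per 1M tokens, copied from the xAI cards above.
XAI_PRICES = {
    "Grok 4.20 Beta": (2.00, 6.00),
    "Grok 4 / 4.1 Fast": (0.20, 0.50),
    "Grok Code Fast 1": (0.20, 1.50),
    "Grok 3 Mini": (0.30, 0.50),
    "Grok 4-0709": (3.00, 15.00),
}


def rank_by_cost(prices, input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        model: (input_tokens * p_in + output_tokens * p_out) / 1_000_000
        for model, (p_in, p_out) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical workload: 1M input tokens and 1M output tokens.
for model, cost in rank_by_cost(XAI_PRICES, 1_000_000, 1_000_000):
    print(f"{model}: ${cost:.2f}")
```

On this workload Grok Code Fast 1 lands well above Grok 4 Fast despite the identical input price, because its output price is three times higher.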

Google DeepMind

Gemini has quietly become excellent. Massive context, strong multimodal, and a generous free tier.

NEW: Gemini 3 Flash (New budget)
In: $0.50 per 1M | Out: $3.00 per 1M
Preview | Fast
New Gemini 3 Flash preview. Balanced performance at budget pricing.
Best: Budget apps, prototyping
VALUE: Gemini 2.5 Flash (Best value)
In: $0.30 per 1M | Out: $2.50 per 1M
1M ctx | Multimodal | Free tier
Cheapest way to process 1M context. Free tier available. Multimodal - images, video, audio.
Best: High-volume, multimodal, prototypes
FLAGSHIP: Gemini 3.1 Pro (New flagship)
In: $1.25 per 1M | Out: $10.00 per 1M
4h horizon | 1M ctx | Video
77.1% ARC-AGI-2. Native video understanding. Matches GPT-5 pricing with 2.5x context.
Best: Video analysis, complex reasoning
LONG: Gemini 2.5 Pro (Long outputs)
In: $1.25 per 1M | Out: $10.00 per 1M
1M ctx | 64K output
Same price as 3.1 Pro but 64K max output vs 16K. Choose for long-form content generation.
Best: Long-form writing, large outputs
BUDGET: Gemini 2.0 Flash (Legacy cheap)
In: $0.10 per 1M | Out: $0.40 per 1M
1M ctx | 8K output
Lowest cost option. Older generation but still capable. 8K output limit.
Best: Budget apps, legacy projects
1M tokens = 750K words
That's roughly 1,500 pages. A whole novel. At the input prices listed above, processing it costs $0.10 with Gemini 2.0 Flash or $0.20 with Grok 4 Fast.
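
The conversion behind that callout can be sketched in a few lines. The 0.75 words-per-token and 500 words-per-page ratios are simply the ones implied by the figures above (1M tokens = 750K words = roughly 1,500 pages); real text varies by language and tokenizer, so treat them as rough assumptions. The helper name is made up for this example.

```python
TOKENS = 1_000_000
WORDS_PER_TOKEN = 0.75   # implied by "1M tokens = 750K words"
WORDS_PER_PAGE = 500     # implied by 750K words being roughly 1,500 pages

words = TOKENS * WORDS_PER_TOKEN
pages = words / WORDS_PER_PAGE


def input_cost(tokens: int, price_per_1m: float) -> float:
    """USD cost of sending `tokens` input tokens at a per-1M-token price."""
    return tokens * price_per_1m / 1_000_000


print(f"{words:,.0f} words, about {pages:,.0f} pages")
# Input price from the Grok 4 / 4.1 Fast card: $0.20 per 1M tokens.
print(f"Grok 4 Fast input: ${input_cost(TOKENS, 0.20):.2f}")
```

This counts input only; any summary or answer the model writes back is billed at the output rate on top.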
Grok vs Opus
Send 25x more input tokens through Grok 4 Fast for the same price as Opus 4.6 input ($0.20 vs $5.00 per 1M).
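
The size of the multiple depends on which prices are compared. A small sketch using the card prices for Grok 4 / 4.1 Fast and Claude Opus 4.6 shows that, comparing like with like, the 25x figure corresponds to input tokens, while the output gap is larger. The dictionary and function names are illustrative.

```python
# USD per 1M tokens, from the Grok 4 / 4.1 Fast and Claude Opus 4.6 cards.
GROK_FAST = {"in": 0.20, "out": 0.50}
OPUS = {"in": 5.00, "out": 25.00}


def price_ratio(expensive: float, cheap: float) -> float:
    """How many times more tokens the cheaper price buys for the same spend."""
    return expensive / cheap


print(round(price_ratio(OPUS["in"], GROK_FAST["in"])))    # input vs input
print(round(price_ratio(OPUS["out"], GROK_FAST["out"])))  # output vs output
```

Price is only one axis here; the two models sit at opposite ends of the capability range in this guide.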
Gemini 3.1 Pro = GPT-5 price
Same $1.25/$10 pricing but with 1M context (2.5x GPT-5's 400K). Plus free tier for prototyping.
Rule of thumb
Simple: Haiku/Flash. Medium: Sonnet/GPT-5.2. Hard: Opus. Budget: Grok. Huge context: Gemini.
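
If you want that rule of thumb in code, for example as a default in a model router, a minimal sketch could look like this. The mapping is copied straight from the rule above; the category keys and the function name are hypothetical labels, not part of any provider's API.

```python
# Mapping taken directly from the rule of thumb; keys are illustrative labels.
RULE_OF_THUMB = {
    "simple": ["Haiku", "Flash"],
    "medium": ["Sonnet", "GPT-5.2"],
    "hard": ["Opus"],
    "budget": ["Grok"],
    "huge_context": ["Gemini"],
}


def suggest_models(need: str) -> list[str]:
    """Return the model families the rule of thumb suggests for a need."""
    try:
        return RULE_OF_THUMB[need]
    except KeyError:
        options = sorted(RULE_OF_THUMB)
        raise ValueError(f"unknown need {need!r}; expected one of {options}") from None


print(suggest_models("medium"))  # ['Sonnet', 'GPT-5.2']
```

A real router would also weigh latency, context size, and the per-token prices listed in the cards.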