AI Model Pricing Guide

Because tokens cost money and you're not made of it | Updated Mar 17, 2026

CHEAPEST: Gemini 2.0 Flash-Lite / Grok 4.1 Fast BEST VALUE: Claude Sonnet 4.6 / Grok 4.1 Fast SMARTEST: Claude Opus 4.6 NEW: GPT-5.4 / Grok 4.20 / Gemini 3 Flash

OpenAI

The OG of AI APIs. GPT kicked off the revolution and they're still leading.

Pricing API Keys ChatGPT Playground

BUDGET

GPT-5 mini

Fast & cheap

$0.25

per 1M

Out

$2.00

per 1M

128K ctx Fast

Lightweight champion. Surprisingly capable for simple tasks and high-volume apps.

Best: Chatbots, simple QA, data extraction

NEW

o4-mini

Reinforcement tuned

$4.00

per 1M

Out

$16.00

per 1M

200K ctx Fine-tuning

Optimized for reinforcement fine-tuning workflows. Create custom reasoning patterns.

Best: Fine-tuning, custom reasoning

FLAGSHIP

GPT-5.4

Most capable

$2.50

per 1M

Out

$15.00

per 1M

270K ctx Reasoning

OpenAI's most capable model. Professional-grade for complex, multi-step problems.

Best: Hardest problems, professional work

POWER

GPT-5.2

Reasoning beast

$1.75

per 1M

Out

$14.00

per 1M

6.6h horizon 200K ctx

Top 3 on METR. Excels at complex tasks, code, and multi-step reasoning.

Best: Code, analysis, agent workflows

FLAGSHIP

GPT-5.2 pro

Smartest model

$21.00

per 1M

Out

$168.00

per 1M

200K ctx Premium

OpenAI's most precise model. For when you need the absolute best reasoning.

Best: Hardest problems, precision work

Anthropic

Safety-first company. Claude is beloved by developers for being genuinely helpful.

Pricing API Keys Claude Docs

FAST

Claude Haiku 4.5

Speed demon

$1.00

per 1M

Out

$5.00

per 1M

200K ctx Fastest

Optimized for fast responses. Perfect for real-time apps and bulk processing.

Best: Real-time chat, bulk processing

BEST

Claude Sonnet 4.6

The sweet spot

$3.00

per 1M

Out

$15.00

per 1M

Balanced 200K ctx

Best balance of speed and smarts. Most developers find this is all they need.

Best: Most tasks, code, writing, general use

FLAGSHIP

Claude Opus 4.6

Current SOTA

$5.00

per 1M

Out

$25.00

per 1M

14.5h horizon 200K ctx

#1 on METR. Works autonomously on 14+ hour tasks. When you need the best.

Best: Hard problems, research, complex agents

xAI

Elon's AI. Grok has real-time X data access and absurdly low pricing.

Pricing API Keys Grok Docs

NEW

Grok 4.20 Beta

Multi-agent

$2.00

per 1M

Out

$6.00

per 1M

2M ctx Multi-agent Vision

Latest beta with multi-agent capabilities. Reasoning and non-reasoning variants available.

Best: Complex multi-agent workflows

CHEAPEST

Grok 4 / 4.1 Fast

Crazy cheap

$0.20

per 1M

Out

$0.50

per 1M

2M ctx Reasoning Vision

Insane value. 2M context + real-time X data. Think twice at these prices.

Best: High-volume, X analysis, massive context

NEW

Grok Code Fast 1

Coding specialist

$0.20

per 1M

Out

$1.50

per 1M

256K ctx Reasoning

Optimized for code generation. Same input price as Grok 4 Fast but better for programming tasks.

Best: Code generation, programming

BUDGET

Grok 3 Mini

Older gen cheap

$0.30

per 1M

Out

$0.50

per 1M

131K ctx Reasoning

Budget fallback if Grok 4's 2M context is overkill for your use case.

Best: Simple tasks, testing

POWER

Grok 4-0709

Premium tier

$3.00

per 1M

Out

$15.00

per 1M

256K ctx Reasoning Vision

Premium Grok. Smaller context but more reasoning power than Fast.

Best: Grok style with more smarts

Google DeepMind

Gemini has quietly become excellent. Massive context, strong multimodal, and a generous free tier.

Pricing API Keys Gemini AI Studio

NEW

Gemini 3 Flash

New budget

$0.50

per 1M

Out

$3.00

per 1M

Preview Fast

New Gemini 3 Flash preview. Balanced performance at budget pricing.

Best: Budget apps, prototyping

VALUE

Gemini 2.5 Flash

Best value

$0.30

per 1M

Out

$2.50

per 1M

1M ctx Multimodal Free tier

Cheapest way to process 1M context. Free tier available. Multimodal - images, video, audio.

Best: High-volume, multimodal, prototypes

FLAGSHIP

Gemini 3.1 Pro

New flagship

$1.25

per 1M

Out

$10.00

per 1M

4h horizon 1M ctx Video

77.1% ARC-AGI-2. Native video understanding. Matches GPT-5 pricing with 2.5x context.

Best: Video analysis, complex reasoning

LONG

Gemini 2.5 Pro

Long outputs

$1.25

per 1M

Out

$10.00

per 1M

1M ctx 64K output

Same price as 3.1 Pro but 64K max output vs 16K. Choose for long-form content generation.

Best: Long-form writing, large outputs

BUDGET

Gemini 2.0 Flash

Legacy cheap

$0.10

per 1M

Out

$0.40

per 1M

1M ctx 8K output

Lowest cost option. Older generation but still capable. 8K output limit.

Best: Budget apps, legacy projects

1M tokens = 750K words

That's roughly 1,500 pages. A whole novel. Process it for $0.15 with Gemini Flash or $0.20 with Grok 4.

Grok vs Opus

Send 25x more tokens through Grok 4 Fast for the same price as Opus 4.6 output.

Gemini 3.1 Pro = GPT-5 price

Same $1.25/$10 pricing but with 1M context (2.5x GPT-5's 400K). Plus free tier for prototyping.

Rule of thumb

Simple: Haiku/Flash. Medium: Sonnet/GPT-5.2. Hard: Opus. Budget: Grok. Huge context: Gemini.

← Back to Ai-Edu