AI Model Pricing

Token pricing, context windows, and capabilities across major providers.

Prices in USD per million tokens. Providers update pricing without notice. Verify with the provider before estimating production costs. Open-source model prices reflect representative third-party inference rates and vary by provider.
Provider
Tier
Capabilities
Sort by
Claude Opus 4.7
Anthropic Flagship
Input
$15.0/1M
Output
$75.0
Context
200k
Released
2026-04
MULTI TOOLS BATCH

Highest capability model for complex tasks

Claude Sonnet 4.6
Anthropic Standard
Input
$3.0/1M
Output
$15.0
Context
200k
Released
2025-10
MULTI TOOLS BATCH

Best balance of speed and capability

Claude Haiku 4.5
Anthropic Economy
Input
$0.8/1M
Output
$4.0
Context
200k
Released
2025-07
MULTI TOOLS BATCH

Fastest Anthropic model for high-throughput tasks

GPT-4o
OpenAI Standard
Input
$5.0/1M
Output
$15.0
Context
128k
Released
2024-05
MULTI TOOLS BATCH
GPT-4o mini
OpenAI Economy
Input
$0.15/1M
Output
$0.6
Context
128k
Released
2024-07
MULTI TOOLS BATCH
o3
OpenAI Flagship
Input
$10.0/1M
Output
$40.0
Context
200k
Released
2025-04
MULTI TOOLS REASON

Extended reasoning — charges for thinking tokens

o4-mini
OpenAI Standard
Input
$1.1/1M
Output
$4.4
Context
200k
Released
2025-04
MULTI TOOLS REASON

Efficient reasoning model

o1
OpenAI Flagship
Input
$15.0/1M
Output
$60.0
Context
200k
Released
2024-12
MULTI TOOLS REASON

First-generation reasoning model

Gemini 2.5 Pro
Google Flagship
Input
$1.25/1M
Output
$10.0
Context
1.0M
Released
2025-03
MULTI TOOLS REASON

1M token context; thinking mode available

Gemini 2.5 Flash
Google Standard
Input
$0.075/1M
Output
$0.3
Context
1.0M
Released
2025-05
MULTI TOOLS REASON

High-throughput; 1M token context

Gemini 2.0 Flash
Google Economy
Input
$0.1/1M
Output
$0.4
Context
1.0M
Released
2025-02
MULTI TOOLS
Llama 4 Maverick
Meta Standard
Input
$0.27/1M
Output
$0.85
Context
1.0M
Released
2025-04
MULTI TOOLS OSS

Open-source MoE — price via third-party inference

Llama 4 Scout
Meta Economy
Input
$0.11/1M
Output
$0.34
Context
10.0M
Released
2025-04
MULTI TOOLS OSS

Open-source; 10M token context

Llama 3.3 70B
Meta Economy
Input
$0.59/1M
Output
$0.79
Context
128k
Released
2024-12
TOOLS OSS

Open-source — price via third-party inference

Mistral Large 2
Mistral Standard
Input
$2.0/1M
Output
$6.0
Context
128k
Released
2024-07
TOOLS
Mistral Small 3.1
Mistral Economy
Input
$0.1/1M
Output
$0.3
Context
128k
Released
2025-03
MULTI TOOLS OSS

Open-source

Codestral
Mistral Standard
Input
$0.2/1M
Output
$0.6
Context
256k
Released
2024-05
TOOLS

Optimised for code generation and completion