AI Model Pricing
Token pricing, context windows, and capabilities across major providers.
Updated 2026-05-03
About this data
Prices in USD per million tokens.
Providers update pricing without notice. Verify with the provider before estimating production costs.
Open-source model prices reflect representative third-party inference rates and vary by provider.
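Because every rate on this page is quoted per million tokens, a cost estimate is the token count times the rate, divided by one million. The sketch below illustrates that arithmetic with a few rates copied from this page. The names `Price`, `PRICES`, and `estimate_cost` are hypothetical helpers, not part of any provider SDK, and the rates should be re-checked with the provider before use.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Price:
    """USD per one million tokens, as quoted on this page."""
    input_per_million: float
    output_per_million: float


# Rates copied from this page; treat them as a snapshot, not a live feed.
PRICES = {
    "Claude Sonnet 4.6": Price(3.0, 15.0),
    "GPT-4o mini": Price(0.15, 0.6),
    "Gemini 2.5 Pro": Price(1.25, 10.0),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    price = PRICES[model]  # raises KeyError for a model not in the table
    total = (
        input_tokens * price.input_per_million
        + output_tokens * price.output_per_million
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 10,000 requests at 2,000 input and 500 output tokens each.
    cost = estimate_cost("Claude Sonnet 4.6", 20_000_000, 5_000_000)
    print(f"${cost:.2f}")  # prints $135.00
```

The estimate ignores anything this page does not quote, such as batch discounts, caching, or per-request minimums.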
| Model | Provider | Tier | Input (USD/1M) | Output (USD/1M) | Context | Released | Capabilities | Notes |
|---|---|---|---|---|---|---|---|---|
| Claude Opus 4.7 | Anthropic | Flagship | $15.0 | $75.0 | 200k | 2026-04 | MULTI, TOOLS, BATCH | Highest capability model for complex tasks |
| Claude Sonnet 4.6 | Anthropic | Standard | $3.0 | $15.0 | 200k | 2025-10 | MULTI, TOOLS, BATCH | Best balance of speed and capability |
| Claude Haiku 4.5 | Anthropic | Economy | $0.8 | $4.0 | 200k | 2025-07 | MULTI, TOOLS, BATCH | Fastest Anthropic model for high-throughput tasks |
| GPT-4o | OpenAI | Standard | $5.0 | $15.0 | 128k | 2024-05 | MULTI, TOOLS, BATCH | |
| GPT-4o mini | OpenAI | Economy | $0.15 | $0.6 | 128k | 2024-07 | MULTI, TOOLS, BATCH | |
| o3 | OpenAI | Flagship | $10.0 | $40.0 | 200k | 2025-04 | MULTI, TOOLS, REASON | Extended reasoning; charges for thinking tokens |
| o4-mini | OpenAI | Standard | $1.1 | $4.4 | 200k | 2025-04 | MULTI, TOOLS, REASON | Efficient reasoning model |
| o1 | OpenAI | Flagship | $15.0 | $60.0 | 200k | 2024-12 | MULTI, TOOLS, REASON | First-generation reasoning model |
| Gemini 2.5 Pro | Google | Flagship | $1.25 | $10.0 | 1.0M | 2025-03 | MULTI, TOOLS, REASON | 1M token context; thinking mode available |
| Gemini 2.5 Flash | Google | Standard | $0.075 | $0.3 | 1.0M | 2025-05 | MULTI, TOOLS, REASON | High-throughput; 1M token context |
| Gemini 2.0 Flash | Google | Economy | $0.1 | $0.4 | 1.0M | 2025-02 | MULTI, TOOLS | |
| Llama 4 Maverick | | Standard | $0.27 | $0.85 | 1.0M | 2025-04 | MULTI, TOOLS, OSS | Open-source MoE; price via third-party inference |
| Llama 4 Scout | | Economy | $0.11 | $0.34 | 10.0M | 2025-04 | MULTI, TOOLS, OSS | Open-source; 10M token context |
| Llama 3.3 70B | | Economy | $0.59 | $0.79 | 128k | 2024-12 | TOOLS, OSS | Open-source; price via third-party inference |
| Mistral Large 2 | Mistral | Standard | $2.0 | $6.0 | 128k | 2024-07 | TOOLS | |
| Mistral Small 3.1 | Mistral | Economy | $0.1 | $0.3 | 128k | 2025-03 | MULTI, TOOLS, OSS | Open-source |
| Codestral | Mistral | Standard | $0.2 | $0.6 | 256k | 2024-05 | TOOLS | Optimised for code generation and completion |
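The o3 entry notes that thinking tokens are charged, which can make a reasoning request cost far more than its visible output suggests. The sketch below shows the effect under one assumption that this page does not state: that thinking tokens are billed at the model's output rate. The function `reasoning_request_cost` and its parameters are hypothetical; check the provider's billing documentation for the actual rule.

```python
def reasoning_request_cost(
    input_tokens: int,
    visible_output_tokens: int,
    thinking_tokens: int,
    input_per_million: float,
    output_per_million: float,
) -> float:
    """Estimate the USD cost of one request to a reasoning model.

    Assumption (not stated on this page): thinking tokens are billed
    at the same per-million rate as visible output tokens.
    """
    billed_output = visible_output_tokens + thinking_tokens
    total = (
        input_tokens * input_per_million
        + billed_output * output_per_million
    )
    return total / 1_000_000


if __name__ == "__main__":
    # o3 rates from this page: $10.0 input, $40.0 output per million tokens.
    without_thinking = reasoning_request_cost(1_000, 500, 0, 10.0, 40.0)
    with_thinking = reasoning_request_cost(1_000, 500, 4_000, 10.0, 40.0)
    print(f"{without_thinking:.2f} vs {with_thinking:.2f}")  # prints 0.03 vs 0.19
```

In this illustrative case, 4,000 thinking tokens raise the request cost from about $0.03 to about $0.19, so the thinking budget dominates the estimate.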