AI Model Pricing

Token pricing, context windows, and capabilities across major providers.

Updated 2026-05-03

About this data
Prices in USD per million tokens. Providers update pricing without notice. Verify with the provider before estimating production costs. Open-source model prices reflect representative third-party inference rates and vary by provider.
Provider
Tier
Capabilities
Sort by
Claude Opus 4.7
Anthropic Flagship
Input
$15.0/1M
Output
$75.0
Context
200k
Released
2026-04
MULTI TOOLS BATCH

Highest capability model for complex tasks

Claude Sonnet 4.6
Anthropic Standard
Input
$3.0/1M
Output
$15.0
Context
200k
Released
2025-10
MULTI TOOLS BATCH

Best balance of speed and capability

Claude Haiku 4.5
Anthropic Economy
Input
$0.8/1M
Output
$4.0
Context
200k
Released
2025-07
MULTI TOOLS BATCH

Fastest Anthropic model for high-throughput tasks

GPT-4o
OpenAI Standard
Input
$5.0/1M
Output
$15.0
Context
128k
Released
2024-05
MULTI TOOLS BATCH
GPT-4o mini
OpenAI Economy
Input
$0.15/1M
Output
$0.6
Context
128k
Released
2024-07
MULTI TOOLS BATCH
o3
OpenAI Flagship
Input
$10.0/1M
Output
$40.0
Context
200k
Released
2025-04
MULTI TOOLS REASON

Extended reasoning — charges for thinking tokens

o4-mini
OpenAI Standard
Input
$1.1/1M
Output
$4.4
Context
200k
Released
2025-04
MULTI TOOLS REASON

Efficient reasoning model

o1
OpenAI Flagship
Input
$15.0/1M
Output
$60.0
Context
200k
Released
2024-12
MULTI TOOLS REASON

First-generation reasoning model

Gemini 2.5 Pro
Google Flagship
Input
$1.25/1M
Output
$10.0
Context
1.0M
Released
2025-03
MULTI TOOLS REASON

1M token context; thinking mode available

Gemini 2.5 Flash
Google Standard
Input
$0.075/1M
Output
$0.3
Context
1.0M
Released
2025-05
MULTI TOOLS REASON

High-throughput; 1M token context

Gemini 2.0 Flash
Google Economy
Input
$0.1/1M
Output
$0.4
Context
1.0M
Released
2025-02
MULTI TOOLS
Llama 4 Maverick
Meta Standard
Input
$0.27/1M
Output
$0.85
Context
1.0M
Released
2025-04
MULTI TOOLS OSS

Open-source MoE — price via third-party inference

Llama 4 Scout
Meta Economy
Input
$0.11/1M
Output
$0.34
Context
10.0M
Released
2025-04
MULTI TOOLS OSS

Open-source; 10M token context

Llama 3.3 70B
Meta Economy
Input
$0.59/1M
Output
$0.79
Context
128k
Released
2024-12
TOOLS OSS

Open-source — price via third-party inference

Mistral Large 2
Mistral Standard
Input
$2.0/1M
Output
$6.0
Context
128k
Released
2024-07
TOOLS
Mistral Small 3.1
Mistral Economy
Input
$0.1/1M
Output
$0.3
Context
128k
Released
2025-03
MULTI TOOLS OSS

Open-source

Codestral
Mistral Standard
Input
$0.2/1M
Output
$0.6
Context
256k
Released
2024-05
TOOLS

Optimised for code generation and completion