Models

Browse checked model pricing across providers

Each model card shows input, output, and cached-input pricing (in USD per 1M tokens), batch discount behavior, context window, maximum output, and the official source used during the latest check.
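To make the card fields concrete, here is a minimal sketch of how they might combine into a per-request cost estimate. The `ModelCard` class and `estimate_cost` function are hypothetical names, not part of any provider SDK. The sketch assumes that rates are USD per 1M tokens, that cached input tokens are billed at the cached-input rate instead of the input rate, and that the batch discount applies uniformly to every token type. Providers may apply these rules differently, so treat the result as a rough estimate. The figures come from the GPT-5.4 card.

```python
from dataclasses import dataclass
from typing import Optional

TOKENS_PER_UNIT = 1_000_000  # card rates are quoted per 1M tokens


@dataclass(frozen=True)
class ModelCard:
    """Hypothetical container mirroring the fields shown on a model card."""
    name: str
    input_rate: float                   # USD per 1M input tokens
    output_rate: float                  # USD per 1M output tokens
    cached_input_rate: Optional[float]  # None when the card says "Not published"
    batch_discount: float               # 0.5 for "50%", 0.0 for "No"


def estimate_cost(card, input_tokens, output_tokens, cached_tokens=0, batch=False):
    """Estimate the USD cost of one request under the assumptions stated above."""
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    # Assumption: with no published cached rate, cached tokens cost the full input rate.
    cached_rate = card.input_rate if card.cached_input_rate is None else card.cached_input_rate
    fresh_tokens = input_tokens - cached_tokens
    cost = (
        fresh_tokens * card.input_rate
        + cached_tokens * cached_rate
        + output_tokens * card.output_rate
    ) / TOKENS_PER_UNIT
    if batch:
        # Assumption: the discount applies uniformly to every token type.
        cost *= 1.0 - card.batch_discount
    return cost


gpt_5_4 = ModelCard(
    name="GPT-5.4",
    input_rate=2.50,
    output_rate=15.00,
    cached_input_rate=0.25,
    batch_discount=0.50,
)

# 100,000 input tokens (40,000 of them cached) and 10,000 output tokens.
print(round(estimate_cost(gpt_5_4, 100_000, 10_000, cached_tokens=40_000), 4))  # 0.31
```

Passing `batch=True` to the same call halves the estimate under the uniform-discount assumption.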

Provider index

OpenAI

Broad GPT and reasoning lineup for production assistants, multimodal flows, and coding tools.

Wide coverage across price tiers, plus cached-input and batch discounts on much of the catalog.

gpt

GPT-5.4

Checked 2026-04-19

Input rate

$2.50 / 1M

Cached-input rate

$0.25 / 1M

Output rate

$15.00 / 1M

Batch discount

50%

Context window 1,050,000

Max output 128,000

gpt

GPT-5.4 mini

Checked 2026-04-19

Input rate

$0.75 / 1M

Cached-input rate

$0.08 / 1M

Output rate

$4.50 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5.4 nano

Checked 2026-04-19

Input rate

$0.20 / 1M

Cached-input rate

$0.02 / 1M

Output rate

$1.25 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5.2

Checked 2026-04-19

Input rate

$1.75 / 1M

Cached-input rate

$0.18 / 1M

Output rate

$14.00 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5.2 pro

Checked 2026-04-19

Input rate

$21.00 / 1M

Cached-input rate

Not published

Output rate

$168.00 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5.1

Checked 2026-04-19

Input rate

$1.25 / 1M

Cached-input rate

$0.13 / 1M

Output rate

$10.00 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5

Checked 2026-04-19

Input rate

$1.25 / 1M

Cached-input rate

$0.13 / 1M

Output rate

$10.00 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5 mini

Checked 2026-04-19

Input rate

$0.25 / 1M

Cached-input rate

$0.03 / 1M

Output rate

$2.00 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5 nano

Checked 2026-04-19

Input rate

$0.05 / 1M

Cached-input rate

$0.01 / 1M

Output rate

$0.40 / 1M

Batch discount

50%

Context window 400,000

Max output 128,000

gpt

GPT-5 pro

Checked 2026-04-19

Input rate

$15.00 / 1M

Cached-input rate

Not published

Output rate

$120.00 / 1M

Batch discount

50%

Context window 400,000

Max output 272,000

gpt

GPT-4.1

Checked 2026-04-19

Input rate

$2.00 / 1M

Cached-input rate

$0.50 / 1M

Output rate

$8.00 / 1M

Batch discount

50%

Context window 1,047,576

Max output 32,768

gpt

GPT-4.1 mini

Checked 2026-04-19

Input rate

$0.40 / 1M

Cached-input rate

$0.10 / 1M

Output rate

$1.60 / 1M

Batch discount

50%

Context window 1,047,576

Max output 32,768

gpt

GPT-4.1 nano

Checked 2026-04-19

Input rate

$0.10 / 1M

Cached-input rate

$0.03 / 1M

Output rate

$0.40 / 1M

Batch discount

50%

Context window 1,047,576

Max output 32,768

gpt

GPT-4o

Checked 2026-04-19

Input rate

$2.50 / 1M

Cached-input rate

$1.25 / 1M

Output rate

$10.00 / 1M

Batch discount

50%

Context window 128,000

Max output 16,384

gpt

GPT-4o mini

Checked 2026-04-19

Input rate

$0.15 / 1M

Cached-input rate

$0.08 / 1M

Output rate

$0.60 / 1M

Batch discount

50%

Context window 128,000

Max output 16,384

Anthropic

Claude models built for long-context analysis, research, and assistant-style workflows.

Strong long-context capability, with premium pricing once you move up the model ladder.

claude

Claude Opus 4.7

Checked 2026-04-19

Input rate

$5.00 / 1M

Cached-input rate

$0.50 / 1M

Output rate

$25.00 / 1M

Batch discount

50%

Context window 1,000,000

Max output 128,000

claude

Claude Opus 4.6

Checked 2026-04-19

Input rate

$5.00 / 1M

Cached-input rate

$0.50 / 1M

Output rate

$25.00 / 1M

Batch discount

50%

Context window 1,000,000

Max output 128,000

claude

Claude Sonnet 4.6

Checked 2026-04-19

Input rate

$3.00 / 1M

Cached-input rate

$0.30 / 1M

Output rate

$15.00 / 1M

Batch discount

50%

Context window 1,000,000

Max output 64,000

claude

Claude Sonnet 4.5

Checked 2026-04-19

Input rate

$3.00 / 1M

Cached-input rate

$0.30 / 1M

Output rate

$15.00 / 1M

Batch discount

50%

Context window 200,000

Max output 64,000

claude

Claude Haiku 4.5

Checked 2026-04-19

Input rate

$1.00 / 1M

Cached-input rate

$0.10 / 1M

Output rate

$5.00 / 1M

Batch discount

50%

Context window 200,000

Max output 64,000

Google Gemini

Gemini API models with large context windows and strong throughput economics.

Often competitive on high-volume and multimodal workloads, especially when context size matters.

gemini

Gemini 2.5 Pro

Checked 2026-04-19

Input rate

$1.25 / 1M

Cached-input rate

$0.13 / 1M

Output rate

$10.00 / 1M

Batch discount

50%

Context window 1,000,000

Max output 65,536

gemini

Gemini 2.5 Flash

Checked 2026-04-19

Input rate

$0.30 / 1M

Cached-input rate

$0.03 / 1M

Output rate

$2.50 / 1M

Batch discount

50%

Context window 1,000,000

Max output 65,536

gemini

Gemini 2.5 Flash-Lite

Checked 2026-04-19

Input rate

$0.10 / 1M

Cached-input rate

$0.01 / 1M

Output rate

$0.40 / 1M

Batch discount

50%

Context window 1,000,000

Max output 65,536

gemini

Gemini 2.0 Flash

Checked 2026-04-19

Input rate

$0.10 / 1M

Cached-input rate

$0.03 / 1M

Output rate

$0.40 / 1M

Batch discount

50%

Context window 1,048,576

Max output 8,192

gemini

Gemini 2.0 Flash-Lite

Checked 2026-04-19

Input rate

$0.08 / 1M

Cached-input rate

Not published

Output rate

$0.30 / 1M

Batch discount

50%

Context window 1,048,576

Max output 8,192

Mistral

Commercial and open models positioned for teams that care about cost efficiency first.

Usually worth checking when budget discipline matters more than premium long-context features.

mistral

Mistral Large 3

Checked 2026-04-19

Input rate

$0.50 / 1M

Cached-input rate

Not published

Output rate

$1.50 / 1M

Batch discount

No

Context window 256,000

Max output 32,000

mistral

Mistral Small 4

Checked 2026-04-19

Input rate

$0.15 / 1M

Cached-input rate

Not published

Output rate

$0.60 / 1M

Batch discount

No

Context window 256,000

Max output 32,000

mistral

Mistral Medium 3

Checked 2026-04-19

Input rate

$0.40 / 1M

Cached-input rate

Not published

Output rate

$2.00 / 1M

Batch discount

No

Context window 128,000

Max output 32,000

mistral

Devstral 2

Checked 2026-04-19

Input rate

$0.40 / 1M

Cached-input rate

Not published

Output rate

$2.00 / 1M

Batch discount

No

Context window 256,000

Max output 32,000

mistral

Devstral Small 2

Checked 2026-04-19

Input rate

$0.10 / 1M

Cached-input rate

Not published

Output rate

$0.30 / 1M

Batch discount

No

Context window 256,000

Max output 32,000

mistral

Codestral

Checked 2026-04-19

Input rate

$0.30 / 1M

Cached-input rate

Not published

Output rate

$0.90 / 1M

Batch discount

No

Context window 128,000

Max output 32,000

DeepSeek

Reasoning and coding models with aggressively low pricing for high-usage teams.

Often lands among the lowest-cost options in this dataset for reasoning-heavy or coding-heavy work.
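One way to test a claim like this against the dataset is to price a fixed workload on several cards and sort the results. The sketch below does that for an output-heavy, reasoning-style workload. The workload size and the `workload_cost` helper are illustrative assumptions; the rates are copied from cards on this page, and caching and batch discounts are ignored.

```python
# (input rate, output rate) in USD per 1M tokens, copied from cards on this page.
RATES = {
    "DeepSeek Reasoner": (0.28, 0.42),
    "GPT-5 mini": (0.25, 2.00),
    "Gemini 2.5 Flash": (0.30, 2.50),
    "Claude Haiku 4.5": (1.00, 5.00),
}

# Hypothetical workload: 2M input tokens and 8M output tokens.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 8_000_000


def workload_cost(rates, input_tokens, output_tokens):
    """USD cost of a workload at list price, ignoring caching and batch discounts."""
    input_rate, output_rate = rates
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Sort model names from cheapest to most expensive for this workload.
ranked = sorted(
    RATES,
    key=lambda name: workload_cost(RATES[name], INPUT_TOKENS, OUTPUT_TOKENS),
)

for name in ranked:
    cost = workload_cost(RATES[name], INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{name}: ${cost:.2f}")
```

Changing the input-to-output ratio can reorder the list, so the ranking only holds for the workload shape you feed in.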

deepseek

DeepSeek Chat

Checked 2026-04-19

Input rate

$0.28 / 1M

Cached-input rate

$0.03 / 1M

Output rate

$0.42 / 1M

Batch discount

No

Context window 128,000

Max output 8,192

deepseek

DeepSeek Reasoner

Checked 2026-04-19

Input rate

$0.28 / 1M

Cached-input rate

$0.03 / 1M

Output rate

$0.42 / 1M

Batch discount

No

Context window 128,000

Max output 64,000

OpenRouter

A routing layer that exposes multiple providers behind one API surface.

Convenient for multi-provider routing: inference is billed at pass-through rates, with additional platform costs when you buy credits.
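As a rough illustration of how a credit-purchase cost could sit on top of pass-through rates, the sketch below computes the out-of-pocket amount needed to fund a given inference spend. The `effective_cost` function, the `credit_fee_rate` parameter, and the 5% value are all hypothetical; this page does not state the fee or how it is charged, so substitute the currently published terms.

```python
def effective_cost(inference_cost_usd, credit_fee_rate):
    """Out-of-pocket USD needed to fund a given amount of pass-through inference.

    Assumption: the fee is a percentage charged on top of the credit purchase,
    so buying C dollars of credits costs C * (1 + credit_fee_rate).
    """
    if inference_cost_usd < 0:
        raise ValueError("inference_cost_usd must be non-negative")
    if credit_fee_rate < 0:
        raise ValueError("credit_fee_rate must be non-negative")
    return inference_cost_usd * (1.0 + credit_fee_rate)


# Placeholder fee of 5%; not a published figure.
PLACEHOLDER_FEE = 0.05

# Funding $100.00 of inference at pass-through rates under the placeholder fee.
print(f"${effective_cost(100.00, PLACEHOLDER_FEE):.2f}")
```

Because the token rates are passed through unchanged, any difference from buying direct would come from this fee term and from the missing batch discount shown on the cards.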

gpt

OpenRouter GPT-5.4

Checked 2026-04-19

Input rate

$2.50 / 1M

Cached-input rate

$0.25 / 1M

Output rate

$15.00 / 1M

Batch discount

No

Context window 1,050,000

Max output 128,000

gpt

OpenRouter GPT-5.4 mini

Checked 2026-04-19

Input rate

$0.75 / 1M

Cached-input rate

$0.08 / 1M

Output rate

$4.50 / 1M

Batch discount

No

Context window 400,000

Max output 128,000

claude

OpenRouter Claude Opus 4.7

Checked 2026-04-19

Input rate

$5.00 / 1M

Cached-input rate

$0.50 / 1M

Output rate

$25.00 / 1M

Batch discount

No

Context window 1,000,000

Max output 128,000