Live pricing — refreshed daily

LLM Token Cost Calculator

Compare API pricing across 202+ language models from OpenAI, Anthropic, Google, Meta, Mistral, xAI, DeepSeek and Cohere. Includes cached-input pricing, batch-API discounts and reasoning-token math. Prices auto-refresh every 24 hours via OpenRouter.

Featured model pricing (per 1M tokens)

ModelProviderInputOutputCached inputContext
Anthropic: Claude Haiku 4.5Anthropic$1.00$5.00$0.10200K
Anthropic: Claude Opus 4.7Anthropic$5.00$25.00$0.501000K
Anthropic: Claude Sonnet 4.6Anthropic$3.00$15.00$0.301000K
Cohere: Command R (08-2024)Cohere$0.15$0.60128K
Cohere: Command R+ (08-2024)Cohere$2.50$10.00128K
Cohere: Command R7B (12-2024)Cohere$0.04$0.15128K
DeepSeek: DeepSeek V3.1 TerminusDeepSeek$0.21$0.79$0.13164K
DeepSeek: DeepSeek V3.2DeepSeek$0.25$0.38$0.03131K
DeepSeek: DeepSeek V3.2 ExpDeepSeek$0.27$0.41164K
DeepSeek: DeepSeek V3.2 SpecialeDeepSeek$0.40$1.20$0.20164K
DeepSeek: R1DeepSeek$0.70$2.5064K
DeepSeek: R1 0528DeepSeek$0.50$2.15$0.35164K
DeepSeek: R1 Distill Llama 70BDeepSeek$0.70$0.80131K
DeepSeek: R1 Distill Qwen 32BDeepSeek$0.29$0.2933K
Google: Gemini 2.5 ProGoogle$1.25$10.00$0.131049K
Google: Gemini 2.5 Pro Preview 05-06Google$1.25$10.00$0.131049K
Google: Gemini 2.5 Pro Preview 06-05Google$1.25$10.00$0.131049K
Google: Gemini 3 Flash PreviewGoogle$0.50$3.00$0.051049K
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)Google$2.00$12.00$0.2066K
Meta: Llama 3.3 70B InstructMeta$0.12$0.38131K
Meta: Llama 4 MaverickMeta$0.15$0.601049K
Meta: Llama 4 ScoutMeta$0.08$0.30328K
Mistral LargeMistral$2.00$6.00$0.20128K
Mistral Large 2407Mistral$2.00$6.00$0.20131K
Mistral Large 2411Mistral$2.00$6.00$0.20131K
Mistral: Mistral Large 3 2512Mistral$0.50$1.50$0.05262K
OpenAI: GPT-4oOpenAI$2.50$10.00128K
OpenAI: GPT-4o (2024-05-13)OpenAI$5.00$15.00128K
OpenAI: GPT-4o (2024-08-06)OpenAI$2.50$10.00$1.25128K
OpenAI: GPT-4o (2024-11-20)OpenAI$2.50$10.00$1.25128K
OpenAI: GPT-4o AudioOpenAI$2.50$10.00128K
OpenAI: GPT-4o Search PreviewOpenAI$2.50$10.00128K
OpenAI: GPT-4o-miniOpenAI$0.15$0.60$0.07128K
OpenAI: GPT-4o-mini (2024-07-18)OpenAI$0.15$0.60$0.07128K
OpenAI: GPT-4o-mini Search PreviewOpenAI$0.15$0.60128K
OpenAI: GPT-5OpenAI$1.25$10.00$0.13400K
OpenAI: GPT-5 ChatOpenAI$1.25$10.00$0.13128K
OpenAI: GPT-5 CodexOpenAI$1.25$10.00$0.13400K
OpenAI: GPT-5 ImageOpenAI$10.00$10.00$1.25400K
OpenAI: GPT-5 Image MiniOpenAI$2.50$2.00$0.25400K
OpenAI: GPT-5 MiniOpenAI$0.25$2.00$0.02400K
OpenAI: GPT-5 NanoOpenAI$0.05$0.40$0.01400K
OpenAI: GPT-5 ProOpenAI$15.00$120.00400K
OpenAI: GPT-5.1OpenAI$1.25$10.00$0.13400K
OpenAI: GPT-5.1 ChatOpenAI$1.25$10.00$0.13128K
OpenAI: GPT-5.1-CodexOpenAI$1.25$10.00$0.13400K
OpenAI: GPT-5.1-Codex-MaxOpenAI$1.25$10.00$0.13400K
OpenAI: GPT-5.1-Codex-MiniOpenAI$0.25$2.00$0.03400K
OpenAI: GPT-5.2OpenAI$1.75$14.00$0.17400K
OpenAI: GPT-5.2 ChatOpenAI$1.75$14.00$0.17128K
OpenAI: GPT-5.2 ProOpenAI$21.00$168.00400K
OpenAI: GPT-5.2-CodexOpenAI$1.75$14.00$0.17400K
OpenAI: GPT-5.3 ChatOpenAI$1.75$14.00$0.17128K
OpenAI: GPT-5.3-CodexOpenAI$1.75$14.00$0.17400K
OpenAI: GPT-5.4OpenAI$2.50$15.00$0.251050K
OpenAI: GPT-5.4 MiniOpenAI$0.75$4.50$0.07400K
OpenAI: GPT-5.4 NanoOpenAI$0.20$1.25$0.02400K
OpenAI: GPT-5.4 ProOpenAI$30.00$180.001050K
OpenAI: o3OpenAI$2.00$8.00$0.50200K
OpenAI: o3 Deep ResearchOpenAI$10.00$40.00$2.50200K
OpenAI: o3 MiniOpenAI$1.10$4.40$0.55200K
OpenAI: o3 Mini HighOpenAI$1.10$4.40$0.55200K
OpenAI: o3 ProOpenAI$20.00$80.00200K
OpenAI: o4 MiniOpenAI$1.10$4.40$0.28200K
OpenAI: o4 Mini Deep ResearchOpenAI$2.00$8.00$0.50200K
OpenAI: o4 Mini HighOpenAI$1.10$4.40$0.28200K
Qwen: Qwen3 14BQwen$0.06$0.2441K
Qwen: Qwen3 235B A22BQwen$0.45$1.82131K
Qwen: Qwen3 235B A22B Instruct 2507Qwen$0.07$0.10262K
Qwen: Qwen3 235B A22B Thinking 2507Qwen$0.13$0.60262K
Qwen: Qwen3 30B A3BQwen$0.08$0.2841K
Qwen: Qwen3 30B A3B Instruct 2507Qwen$0.09$0.30262K
Qwen: Qwen3 30B A3B Thinking 2507Qwen$0.08$0.40$0.08131K
Qwen: Qwen3 32BQwen$0.08$0.24$0.0441K
Qwen: Qwen3 8BQwen$0.05$0.40$0.0541K
Qwen: Qwen3 Coder 30B A3B InstructQwen$0.07$0.27160K
Qwen: Qwen3 Coder 480B A35BQwen$0.22$1.00$0.02262K
Qwen: Qwen3 Coder FlashQwen$0.20$0.97$0.041000K
Qwen: Qwen3 Coder NextQwen$0.15$0.80$0.12262K
Qwen: Qwen3 Coder PlusQwen$0.65$3.25$0.131000K
Qwen: Qwen3 MaxQwen$0.78$3.90$0.16262K
Qwen: Qwen3 Max ThinkingQwen$0.78$3.90262K
Qwen: Qwen3 Next 80B A3B InstructQwen$0.09$1.10262K
Qwen: Qwen3 Next 80B A3B ThinkingQwen$0.10$0.78131K
Qwen: Qwen3 VL 235B A22B InstructQwen$0.20$0.88$0.11262K
Qwen: Qwen3 VL 235B A22B ThinkingQwen$0.26$2.60131K
Qwen: Qwen3 VL 30B A3B InstructQwen$0.13$0.52131K
Qwen: Qwen3 VL 30B A3B ThinkingQwen$0.13$1.56131K
Qwen: Qwen3 VL 32B InstructQwen$0.10$0.42131K
Qwen: Qwen3 VL 8B InstructQwen$0.08$0.50131K
Qwen: Qwen3 VL 8B ThinkingQwen$0.12$1.36131K
Qwen: Qwen3.5 397B A17BQwen$0.39$2.34$0.20262K
Qwen: Qwen3.5 Plus 2026-02-15Qwen$0.26$1.561000K
Qwen: Qwen3.5-122B-A10BQwen$0.26$2.08262K
Qwen: Qwen3.5-27BQwen$0.20$1.56262K
Qwen: Qwen3.5-35B-A3BQwen$0.16$1.30262K
Qwen: Qwen3.5-9BQwen$0.10$0.15262K
Qwen: Qwen3.5-FlashQwen$0.07$0.261000K
Qwen: Qwen3.6 PlusQwen$0.33$1.951000K
xAI: Grok 4xAI$3.00$15.00$0.75256K
xAI: Grok 4 FastxAI$0.20$0.50$0.052000K
xAI: Grok 4.1 FastxAI$0.20$0.50$0.052000K
xAI: Grok 4.20xAI$2.00$6.00$0.202000K
xAI: Grok 4.20 Multi-AgentxAI$2.00$6.00$0.202000K

Frequently asked questions

How are these LLM prices kept up to date?

Prices are pulled directly from OpenRouter’s public models API once every 24 hours via a Convex cron job, then normalized to per-1M-token figures. Each row displays the timestamp of the last successful refresh.

What are cached-input and batch-API discounts?

Cached input applies when you send the same system prompt or context repeatedly — providers like OpenAI and Anthropic charge 50–90% less for cached tokens. Batch mode processes requests asynchronously within 24 hours for a flat 50% discount on both input and output. Toggle either in the calculator above to model real-world cost.

How accurate are the token estimates?

The paste-text estimator uses a rough chars÷4 heuristic. For mission-critical billing forecasts, run your prompts through the official tokenizer for your provider (tiktoken for OpenAI, Anthropic’s count endpoint, Gemini’scount_tokens API).

Which model is cheapest for high-volume production use?

For raw price-per-token, the small-model tier (GPT-5 nano, Claude Haiku 4.5, Gemini 3 Flash, DeepSeek V3) is typically 10–50× cheaper than the flagship reasoning models. Use the calculator to compare your specific input/output ratio — output tokens are usually 3–5× more expensive than input tokens, so verbose models can lose to less-verbose ones at scale.