Qwen
Updated 2h ago

Qwen2.5 72B Instruct API Pricing

Live token cost for Qwen2.5 72B Instruct from Qwen. Use the figures below for budgeting, then tune your exact request mix in the interactive calculator. Prices refresh every 24 hours from OpenRouter.

Input: $0.120 / 1M tokens
Output: $0.390 / 1M tokens
Cached input: Not supported

Capabilities

33K context, 16K max output

Qwen2.5 72B Instruct cost at scale

Estimated monthly cost across common production volumes. Assumes 30-day months and the request shapes shown.

| Tier | Requests / day | In / out tokens | $ / month |
| --- | --- | --- | --- |
| Hobby | 1,000 | 500 / 200 | $4.14 |
| Startup | 10,000 | 1,500 / 500 | $112.50 |
| Growth | 100,000 | 3,000 / 800 | $2,016.00 |
| Enterprise | 1,000,000 | 8,000 / 2,000 | $52,200.00 |
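
The monthly figures above follow from the two per-token rates and the stated 30-day month. A minimal sketch of that arithmetic is below; the rates and tier shapes come from this page, while the function and variable names are illustrative and not part of any Qwen or OpenRouter API.

```python
# Rates are taken from this page; all names here are illustrative.
INPUT_USD_PER_MTOK = 0.12   # USD per 1M input tokens
OUTPUT_USD_PER_MTOK = 0.39  # USD per 1M output tokens
DAYS_PER_MONTH = 30         # the table assumes 30-day months


def monthly_cost(requests_per_day: int, in_tokens: int, out_tokens: int) -> float:
    """Estimated monthly spend in USD for a fixed request shape."""
    per_request = (
        in_tokens * INPUT_USD_PER_MTOK + out_tokens * OUTPUT_USD_PER_MTOK
    ) / 1_000_000
    return round(per_request * requests_per_day * DAYS_PER_MONTH, 2)


# (requests per day, input tokens, output tokens) for each tier in the table
TIERS = {
    "Hobby": (1_000, 500, 200),
    "Startup": (10_000, 1_500, 500),
    "Growth": (100_000, 3_000, 800),
    "Enterprise": (1_000_000, 8_000, 2_000),
}

for name, shape in TIERS.items():
    print(f"{name}: ${monthly_cost(*shape):,.2f}")
```

Running this reproduces the four monthly totals listed for the tiers. Because cached input is not supported, the sketch bills every input token at the full rate.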


Frequently asked questions

How much does Qwen2.5 72B Instruct cost?

Qwen2.5 72B Instruct costs $0.12 per 1M input tokens and $0.39 per 1M output tokens. A typical request with 1,500 input tokens and 500 output tokens costs about $0.00038.
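
Worked out from the two rates, the per-request figure for that 1,500 / 500 shape is:

```latex
\[
\text{cost}
  = \frac{1500 \times 0.12 + 500 \times 0.39}{10^{6}}
  = \frac{180 + 195}{10^{6}}
  = 0.000375\ \text{USD}
  \approx \$0.00038
\]
```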

Does Qwen2.5 72B Instruct support cached input?

No. Qwen2.5 72B Instruct does not currently expose cached-input pricing through Qwen. Every input token is billed at the full rate.

What is the Qwen2.5 72B Instruct context window?

Qwen2.5 72B Instruct supports a context window of 32,768 tokens (33K). Max output per response is 16,384 tokens.
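
As a rough planning aid, the two limits can be combined to estimate how much output room a given prompt leaves. The sketch below assumes that prompt and completion tokens share the 32,768-token window, which is how most hosted chat APIs count context but is not stated on this page. The function name is hypothetical.

```python
CONTEXT_WINDOW = 32_768  # tokens, from this page
MAX_OUTPUT = 16_384      # tokens per response, from this page


def output_budget(prompt_tokens: int) -> int:
    """Largest completion that fits.

    ASSUMPTION: prompt and output tokens count against the same window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Capped by the per-response limit, and never negative.
    return max(0, min(MAX_OUTPUT, remaining))
```

Under that assumption, an 8,000-token prompt still allows the full 16,384-token response, while a 20,000-token prompt leaves room for 12,768 output tokens.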

What is Qwen2.5 72B Instruct good for?

Qwen2.5 72B Instruct is a good fit for general-purpose LLM tasks via the Qwen API: chat, code, writing, and summarization. For other use cases, run your specific input/output mix through the interactive calculator to compare against alternative models.