Qwen · Updated 2h ago

Qwen: Qwen3 30B A3B Thinking 2507 API Pricing

Live token cost for Qwen: Qwen3 30B A3B Thinking 2507 from Qwen. Use the figures below for budgeting, then tune your exact request mix in the interactive calculator. Prices refresh every 24 hours from OpenRouter.

Input: $0.080 / 1M tokens
Output: $0.400 / 1M tokens
Cached input: $0.080 / 1M tokens

Capabilities

131K context · 131K max output · Prompt caching · Reasoning tokens

Qwen: Qwen3 30B A3B Thinking 2507 cost at scale

Estimated monthly cost across common production volumes. Assumes 30-day months and the request shapes shown.

| Tier | Requests / day | In / out tokens per request | $ / month |
|---|---|---|---|
| Hobby | 1,000 | 500 / 200 | $3.60 |
| Startup | 10,000 | 1,500 / 500 | $96.00 |
| Growth | 100,000 | 3,000 / 800 | $1,680.00 |
| Enterprise | 1,000,000 | 8,000 / 2,000 | $43,200.00 |
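
The monthly figures follow from simple arithmetic: the per-request cost at the listed rates, times requests per day, times 30 days. Below is a minimal Python sketch that reproduces the tiers. The function and constant names are illustrative and not part of any API. It assumes flat per-token pricing and that the output token count already includes any reasoning tokens.

```python
# Listed rates in USD per 1M tokens, copied from the pricing figures on this page.
INPUT_USD_PER_M = 0.08
OUTPUT_USD_PER_M = 0.40
DAYS_PER_MONTH = 30  # the table assumes 30-day months


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at the listed rates (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def monthly_cost(requests_per_day: int, input_tokens: int, output_tokens: int) -> float:
    """USD cost of a 30-day month at a fixed request shape."""
    return request_cost(input_tokens, output_tokens) * requests_per_day * DAYS_PER_MONTH


# Tier name -> (requests per day, input tokens, output tokens), as in the table.
TIERS = {
    "Hobby": (1_000, 500, 200),
    "Startup": (10_000, 1_500, 500),
    "Growth": (100_000, 3_000, 800),
    "Enterprise": (1_000_000, 8_000, 2_000),
}

for name, shape in TIERS.items():
    print(f"{name:<11}${monthly_cost(*shape):,.2f}")
```

To budget for a different workload, call `monthly_cost` with your own request volume and token counts.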

Frequently asked questions

How much does Qwen: Qwen3 30B A3B Thinking 2507 cost?

Qwen: Qwen3 30B A3B Thinking 2507 costs $0.08 per 1M input tokens and $0.40 per 1M output tokens, with cached input at $0.08 per 1M tokens. A typical request with 1,500 input tokens and 500 output tokens costs $0.00032.
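
The per-request figure can be checked by hand from the listed rates (amounts in USD, with no caching discount applied):

```latex
\text{cost}
  = \frac{1500 \times 0.08 + 500 \times 0.40}{10^{6}}
  = \frac{120 + 200}{10^{6}}
  = 0.00032
```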

Does Qwen: Qwen3 30B A3B Thinking 2507 support cached input?

Yes. Qwen: Qwen3 30B A3B Thinking 2507 supports prompt caching at $0.08 per 1M cached input tokens. That matches the standard input rate of $0.08 per 1M tokens, so at the listed prices reusing the same system prompt or context across requests does not lower input cost.
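
One way to check what caching saves is to price the cached and uncached parts of the input separately. The sketch below does that with the rates listed on this page. The helper name and the 1,200-token cached prefix are hypothetical, chosen only for illustration. Because the two listed rates are equal, both totals come out the same.

```python
# Listed rates in USD per 1M tokens, copied from this page.
INPUT_USD_PER_M = 0.08
CACHED_INPUT_USD_PER_M = 0.08


def input_cost(total_input_tokens: int, cached_tokens: int = 0) -> float:
    """USD input cost when `cached_tokens` of the input are billed at the cached rate."""
    if not 0 <= cached_tokens <= total_input_tokens:
        raise ValueError("cached_tokens must be between 0 and total_input_tokens")
    uncached_tokens = total_input_tokens - cached_tokens
    total = uncached_tokens * INPUT_USD_PER_M + cached_tokens * CACHED_INPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical shape: a 1,500-token prompt whose 1,200-token prefix is served from cache.
without_cache = input_cost(1_500)
with_cache = input_cost(1_500, cached_tokens=1_200)
print(f"without cache: ${without_cache:.6f}")
print(f"with cache:    ${with_cache:.6f}")
```

If the cached rate drops below the input rate after a price refresh, updating `CACHED_INPUT_USD_PER_M` shows the resulting saving.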

What is the Qwen: Qwen3 30B A3B Thinking 2507 context window?

Qwen: Qwen3 30B A3B Thinking 2507 supports a context window of 131,072 tokens (131K). Max output per response is 131,072 tokens.
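
When sizing requests, it can help to check how much room a prompt leaves for the response. The sketch below assumes the prompt and the output (including any reasoning tokens) share the 131,072-token window. That is a common arrangement, but this page does not state it, so treat it as an assumption. The helper name is hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 131_072  # context window listed on this page
MAX_OUTPUT_TOKENS = 131_072      # max output per response listed on this page


def max_output_budget(prompt_tokens: int) -> int:
    """Largest output allowance left for a prompt, assuming a shared window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    # Cap at the per-response limit and never go below zero.
    return max(0, min(remaining, MAX_OUTPUT_TOKENS))


print(max_output_budget(8_000))  # 131072 - 8000 = 123072
```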

What is Qwen: Qwen3 30B A3B Thinking 2507 good for?

Qwen: Qwen3 30B A3B Thinking 2507 is a good fit for general-purpose LLM tasks via the Qwen API: chat, code, writing, and summarization. For other use cases, run your specific input/output mix through the interactive calculator to compare against alternative models.