QwenUpdated 14h ago

Qwen: Qwen3 VL 235B A22B Instruct API Pricing

Live token cost for Qwen: Qwen3 VL 235B A22B Instruct from Qwen. Use the figures below for budgeting, then tune your exact request mix in the interactive calculator. Prices refresh every 24 hours from OpenRouter.

Input

$0.210

/ 1M tokens

Output

$1.90

/ 1M tokens

Cached input

$0.100

/ 1M tokens

Capabilities

262K context33K max outputVisionPrompt caching

Qwen: Qwen3 VL 235B A22B Instruct cost at scale

Estimated monthly cost across common production volumes. Assumes 30-day months and the request shapes shown.

Tier	Requests / day	In / out tokens	$ / month
Hobby	1,000	500 / 200	$14.55
Startup	10,000	1,500 / 500	$379.50
Growth	100,000	3,000 / 800	$6,450.00
Enterprise	1,000,000	8,000 / 2,000	$164,400.00

Open Qwen: Qwen3 VL 235B A22B Instruct in interactive calculator →

Compare Qwen: Qwen3 VL 235B A22B Instruct vs.

OpenAI

OpenAI: GPT-4o

$2.50 in · $10.00 out

Compare side-by-side →

Anthropic

Anthropic: Claude Haiku 4.5

$1.00 in · $5.00 out

Compare side-by-side →

Google

Google: Gemini 2.5 Pro

$1.25 in · $10.00 out

Compare side-by-side →

Frequently asked questions

How much does Qwen: Qwen3 VL 235B A22B Instruct cost?

Qwen: Qwen3 VL 235B A22B Instruct costs $0.21 per 1M input tokens and $1.90 per 1M output tokens, with cached input at $0.10 per 1M tokens. A typical 1,500-token in / 500-token out request costs $0.00127.

Does Qwen: Qwen3 VL 235B A22B Instruct support cached input?

Yes. Qwen: Qwen3 VL 235B A22B Instruct supports prompt caching at $0.10 per 1M cached input tokens. Reuse the same system prompt or context across requests to cut input cost dramatically.

What is the Qwen: Qwen3 VL 235B A22B Instruct context window?

Qwen: Qwen3 VL 235B A22B Instruct supports a context window of 262,144 tokens (262K). Max output per response is 32,768 tokens.

What is Qwen: Qwen3 VL 235B A22B Instruct good for?

Qwen: Qwen3 VL 235B A22B Instruct is a good fit for general-purpose LLM tasks via the Qwen API — chat, code, writing, summarization. For other use cases, run your specific input/output mix through the interactive calculator to compare against alternative models.