Qwen: Qwen3 VL 30B A3B Instruct API Pricing
Live token cost for Qwen: Qwen3 VL 30B A3B Instruct from Qwen. Use the figures below for budgeting, then tune your exact request mix in the interactive calculator. Prices refresh every 24 hours from OpenRouter.
Capabilities
Qwen: Qwen3 VL 30B A3B Instruct cost at scale
Estimated monthly cost across common production volumes. Assumes 30-day months and the request shapes shown.
| Tier | Requests / day | In / out tokens | $ / month |
|---|---|---|---|
| Hobby | 1,000 | 500 / 200 | $5.07 |
| Startup | 10,000 | 1,500 / 500 | $136.50 |
| Growth | 100,000 | 3,000 / 800 | $2,418.00 |
| Enterprise | 1,000,000 | 8,000 / 2,000 | $62,400.00 |
Compare Qwen: Qwen3 VL 30B A3B Instruct vs.
Frequently asked questions
How much does Qwen: Qwen3 VL 30B A3B Instruct cost?
Qwen: Qwen3 VL 30B A3B Instruct costs $0.13 per 1M input tokens and $0.52 per 1M output tokens. A typical 1,500-token in / 500-token out request costs $0.00046.
Does Qwen: Qwen3 VL 30B A3B Instruct support cached input?
No. Qwen: Qwen3 VL 30B A3B Instruct does not currently expose cached-input pricing through Qwen. Every input token is billed at the full rate.
What is the Qwen: Qwen3 VL 30B A3B Instruct context window?
Qwen: Qwen3 VL 30B A3B Instruct supports a context window of 131,072 tokens (131K). Max output per response is 32,768 tokens.
What is Qwen: Qwen3 VL 30B A3B Instruct good for?
Qwen: Qwen3 VL 30B A3B Instruct is a good fit for general-purpose LLM tasks via the Qwen API — chat, code, writing, summarization. For other use cases, run your specific input/output mix through the interactive calculator to compare against alternative models.