Meta: Llama 3.3 70B Instruct API Pricing
Live token cost for Meta: Llama 3.3 70B Instruct from Meta. Use the figures below for budgeting, then tune your exact request mix in the interactive calculator. Prices refresh every 24 hours from OpenRouter.
Capabilities
Meta: Llama 3.3 70B Instruct cost at scale
Estimated monthly cost across common production volumes. Assumes 30-day months and the request shapes shown.
| Tier | Requests / day | In / out tokens | $ / month |
|---|---|---|---|
| Hobby | 1,000 | 500 / 200 | $4.08 |
| Startup | 10,000 | 1,500 / 500 | $111.00 |
| Growth | 100,000 | 3,000 / 800 | $1,992.00 |
| Enterprise | 1,000,000 | 8,000 / 2,000 | $51,600.00 |
Compare Meta: Llama 3.3 70B Instruct vs.
Frequently asked questions
How much does Meta: Llama 3.3 70B Instruct cost?
Meta: Llama 3.3 70B Instruct costs $0.12 per 1M input tokens and $0.38 per 1M output tokens. A typical 1,500-token in / 500-token out request costs $0.00037.
Does Meta: Llama 3.3 70B Instruct support cached input?
No. Meta: Llama 3.3 70B Instruct does not currently expose cached-input pricing through Meta. Every input token is billed at the full rate.
What is the Meta: Llama 3.3 70B Instruct context window?
Meta: Llama 3.3 70B Instruct supports a context window of 131,072 tokens (131K). Max output per response is 131,072 tokens.
What is Meta: Llama 3.3 70B Instruct good for?
Meta: Llama 3.3 70B Instruct is a good fit for open-source baseline, fine-tuning research, edge inference. For other use cases, run your specific input/output mix through the interactive calculator to compare against alternative models.