Meta · Updated 2h ago

Llama Guard 3 8B API Pricing

Live token cost for Llama Guard 3 8B from Meta. Use the figures below for budgeting, then tune your exact request mix in the interactive calculator. Prices refresh every 24 hours from OpenRouter.

Input: $0.480 / 1M tokens
Output: $0.030 / 1M tokens
Cached input: Not supported

Capabilities

131K context

Llama Guard 3 8B cost at scale

Estimated monthly cost across common production volumes. Assumes 30-day months and the request shapes shown.

Tier         Requests / day   In / out tokens per request   $ / month
Hobby        1,000            500 / 200                     $7.38
Startup      10,000           1,500 / 500                   $220.50
Growth       100,000          3,000 / 800                   $4,392.00
Enterprise   1,000,000        8,000 / 2,000                 $117,000.00
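
The monthly estimates follow from the two per-token rates, the request shape, and the 30-day month. The sketch below reproduces them; the function and constant names are illustrative and not part of any provider SDK, and the rates are hard-coded from this page rather than fetched live.

```python
from decimal import Decimal

# Rates quoted on this page, in USD per 1M tokens (hard-coded assumption).
INPUT_USD_PER_MTOK = Decimal("0.48")
OUTPUT_USD_PER_MTOK = Decimal("0.03")
TOKENS_PER_MTOK = Decimal(1_000_000)
DAYS_PER_MONTH = 30  # the table assumes 30-day months


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD of one request; no cached-input discount is applied."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / TOKENS_PER_MTOK


def monthly_cost(requests_per_day: int, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated USD per month for a fixed request shape."""
    return request_cost(input_tokens, output_tokens) * requests_per_day * DAYS_PER_MONTH


# Request shapes from the table: (requests per day, input tokens, output tokens).
TIERS = {
    "Hobby": (1_000, 500, 200),
    "Startup": (10_000, 1_500, 500),
    "Growth": (100_000, 3_000, 800),
    "Enterprise": (1_000_000, 8_000, 2_000),
}

for name, shape in TIERS.items():
    print(f"{name:<10} ${monthly_cost(*shape):,.2f}")
```

Decimal arithmetic is used here so the totals match the table to the cent; swap in your own request shapes to estimate a different workload.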


Frequently asked questions

How much does Llama Guard 3 8B cost?

Llama Guard 3 8B costs $0.48 per 1M input tokens and $0.03 per 1M output tokens. A typical request with 1,500 input tokens and 500 output tokens costs $0.000735.
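
As a worked check, the per-request figure for the 1,500 / 500 shape can be derived from the two rates alone, assuming no cached-input discount:

```latex
\[
\text{cost}
  = \frac{1{,}500 \times \$0.48 + 500 \times \$0.03}{1{,}000{,}000}
  = \frac{\$720 + \$15}{1{,}000{,}000}
  = \$0.000735
\]
```

Input tokens account for almost all of the cost at these rates, so trimming prompt length lowers the bill far more than shortening outputs.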

Does Llama Guard 3 8B support cached input?

No. Llama Guard 3 8B does not currently expose cached-input pricing through Meta. Every input token is billed at the full rate.

What is the Llama Guard 3 8B context window?

Llama Guard 3 8B supports a context window of 131,072 tokens (131K).

What is Llama Guard 3 8B good for?

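Llama Guard 3 8B is a safety classifier, not a general-purpose chat model. It labels prompts and model responses as safe or unsafe, which makes it a good fit for content moderation and guardrail pipelines rather than chat, code, writing, or summarization. For other use cases, run your specific input/output mix through the interactive calculator to compare against alternative models.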