← All models
★ FeaturedFlash — fastest and cheapest, lighter reasoningOpen Weights
Llama 3.3 70B Pricing
Meta · llama-3-3-70b
Input / 1M tokens
$0.100
Output / 1M tokens
$0.320
Context window
131,072 tokens
≈ 175 pages of text
Max output
16,384 tokens
What it costs in practice
Typical request (1,200 in + 400 out tokens)$0.0002
1,000 requests / month$0.248/mo10,000 requests / month$2.48/mo100,000 requests / month$24.80/moEstimate your own workloadOpens the calculator with Llama 3.3 70B preloaded — adjust volume and token counts there.
Price history
Only one price point recorded so far.
The staircase chart appears once a price change is detected.
What it excels at
Mature, well-supported open-weight model for general chat and lightweight agents — wide availability across inference providers keeps pricing competitive.
The business tradeoff
Falls behind newer flagship models on complex reasoning and coding — best suited to well-scoped, repetitive tasks.
Cheaper from Meta
Head-to-head
Vendor list rates, as of Jun 15, 2026 · source: openrouter · per-request examples assume 1,200 input + 400 output tokens.