chooseaimodel
← All models
★ FeaturedFlashfastest and cheapest, lighter reasoningOpen Weights

Llama 3.3 70B Pricing

Meta · llama-3-3-70b

Input / 1M tokens
$0.100
Output / 1M tokens
$0.320
Context window
131,072 tokens
175 pages of text
Max output
16,384 tokens

What it costs in practice

Estimate your own workloadOpens the calculator with Llama 3.3 70B preloaded — adjust volume and token counts there.

Price history

Only one price point recorded so far.

The staircase chart appears once a price change is detected.

What it excels at

Mature, well-supported open-weight model for general chat and lightweight agents — wide availability across inference providers keeps pricing competitive.

The business tradeoff

Falls behind newer flagship models on complex reasoning and coding — best suited to well-scoped, repetitive tasks.

Cheaper from Meta

Same tier from other providers

Head-to-head

Vendor list rates, as of Jun 15, 2026 · source: openrouter · per-request examples assume 1,200 input + 400 output tokens.