← Compare models
Open all in the calculator — adjust volume and token counts →
Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) vs Llama 3.3 70B
List-price comparison ·Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) details ·Llama 3.3 70B details
For a typical workload (100,000 requests / mo), Llama 3.3 70B is the cheapest — 72% less than the priciest here ($24.80/mo vs $90.00/mo).
| Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) | Llama 3.3 70B | |
|---|---|---|
| Input / 1M tokens | $0.250 | $0.100 |
| Output / 1M tokens | $1.50 | $0.320 |
| Typical request (1,200 in + 400 out) | $0.0009 | $0.0002 |
| Context window | 65,536 tokens | 131,072 tokens |
| Max output | 66k tokens | 16,384 tokens |
| Quality score | 58/100 | 60/100 |
| Tier | Flash | Flash |
| Provider | Meta |
Projected monthly cost
| Requests / mo | Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) | Llama 3.3 70B |
|---|---|---|
| 1,000 | $0.9 | $0.248 |
| 10,000 | $9.00 | $2.48 |
| 100,000 | $90.00 | $24.80 |
Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)
Auto-synced from OpenRouter — no editorial write-up yet.
Llama 3.3 70B
Mature, well-supported open-weight model for general chat and lightweight agents — wide availability across inference providers keeps pricing competitive.
Falls behind newer flagship models on complex reasoning and coding — best suited to well-scoped, repetitive tasks.
More comparisons
Vendor list rates · per-request examples assume 1,200 input + 400 output tokens.