← All models
Flash — fastest and cheapest, lighter reasoningOpen Weights
GLM 5V Turbo Pricing
Z.ai · z-ai-glm-5v-turbo
Input / 1M tokens
$1.20
Output / 1M tokens
$4.00
Context window
202,752 tokens
≈ 270 pages of text
Max output
131,072 tokens
What it costs in practice
Typical request (1,200 in + 400 out tokens)$0.003
1,000 requests / month$3.04/mo10,000 requests / month$30.40/mo100,000 requests / month$304.00/moEstimate your own workloadOpens the calculator with GLM 5V Turbo preloaded — adjust volume and token counts there.
Price history
Only one price point recorded so far.
The staircase chart appears once a price change is detected.
What it excels at
Supports extended multimodal inputs with a 202k-token context window, suitable for document-level vision-language processing.
The business tradeoff
Open-weights model requires user-managed hosting, resulting in variable latency and quality depending on deployment configuration.
Head-to-head
Vendor list rates, as of Jun 22, 2026 · source: openrouter · per-request examples assume 1,200 input + 400 output tokens.