Open Weights

GLM 5V Turbo Pricing

Z.ai · z-ai-glm-5v-turbo

Input / 1M tokens

$1.20

Output / 1M tokens

$4.00

Context window

202,752 tokens

≈ 270 pages of text

Max output

131,072 tokens

What it costs in practice

Typical request (1,200 in + 400 out tokens): $0.003
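The per-request figure follows directly from the two list rates above. Here is a minimal sketch of that arithmetic. The helper name `request_cost_usd` and the constant names are illustrative assumptions, not part of any vendor SDK. It also assumes flat per-token billing with no caching discounts or image surcharges, which the page does not describe.

```python
# List rates from this page, in USD per 1M tokens.
INPUT_USD_PER_MTOK = 1.20
OUTPUT_USD_PER_MTOK = 4.00


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed rates.

    Hypothetical helper: assumes simple linear per-token pricing.
    """
    total_micro_usd = (
        input_tokens * INPUT_USD_PER_MTOK
        + output_tokens * OUTPUT_USD_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# The page's "typical request": 1,200 input + 400 output tokens.
typical = request_cost_usd(1_200, 400)
print(f"${typical:.5f}")
```

Under these assumptions the typical request comes to about $0.00304 (1,200 × $1.20 / 1M + 400 × $4.00 / 1M), which the page rounds to $0.003.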


Only one price point recorded so far.


Supports extended multimodal inputs with a 202k-token context window, suitable for document-level vision-language processing.

Open-weights model: when self-hosted rather than accessed at the vendor rates listed here, latency and quality vary with the deployment configuration.

Vendor list rates, as of Jun 22, 2026 · source: openrouter · per-request examples assume 1,200 input + 400 output tokens.