R1 Distill Qwen 32B vs DeepSeek R1

For a typical workload (100,000 requests / mo), R1 Distill Qwen 32B costs 68% less — $46.40/mo vs $146.00/mo.

Projected monthly cost

Rows open both models in the calculator — adjust volume and token counts there.

Auto-synced from OpenRouter — no editorial write-up yet.

Chain-of-thought reasoning competitive with closed flagship models on math and logic benchmarks, at open-weight pricing.

Emits long reasoning traces that inflate output token counts and cost — cap reasoning length or use a distilled variant for latency-sensitive paths.

Vendor list rates · per-request examples assume 1,200 input + 400 output tokens.