chooseaimodel
← All models

Gemini 3.1 Pro vs Qwen3 Max

List-price comparison · Gemini 3.1 Pro details · Qwen3 Max details

For a typical workload (100,000 requests / mo), Qwen3 Max costs 65% less $249.60/mo vs $720.00/mo.

 Gemini 3.1 ProQwen3 Max
Input / 1M tokens$2.00$0.780
Output / 1M tokens$12.00$3.90
Typical request (1,200 in + 400 out)$0.0072$0.0025
Context window1,048,576 tokens262,144 tokens
Max output65,536 tokens32,768 tokens
TierFlagshipBalanced
ProviderGoogleQwen

Projected monthly cost

Rows open both models in the calculator — adjust volume and token counts there.

Gemini 3.1 Pro

Native audio, video, and document understanding with a ~1M token context window — strong fit for multi-modal RAG and large-document analysis.

Output token cost is steep relative to input, and the largest context windows can be slow to fill — chunk large documents where possible.

Qwen3 Max

Strong multilingual reasoning (especially Chinese/English) and competitive coding benchmarks at open-weight pricing.

Cache-write pricing is unusually high relative to input cost — workloads that rely heavily on prompt caching should benchmark actual savings.

More comparisons

Vendor list rates · per-request examples assume 1,200 input + 400 output tokens.