0 providers50 models

Model crosswalk

Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.

DeepSeek V3.2
vs
Qwen 3 72B Instruct
DeepSeek V3.2A

DeepSeek V3.2

671B params · 131K context · deepseek

Cheapest providertogether-ai
$/1M input$270000.00
$/1M output$1100000.00
Qwen 3 72B InstructB

Qwen 3 72B Instruct

72B params · 131K context · qwen

Cheapest providerfireworks-ai
$/1M input$220000.00
$/1M output$880000.00
Specs and cheapest providers
SpecDeepSeek V3.2Qwen 3 72B Instruct
Parameters671B72B
Context window131K tokens131K tokens
Licensedeepseekqwen
Released2025-05-072025-04-28
Cheapest provider
Providertogether-aifireworks-ai
Input / 1M tokens$270000.00$220000.00🏆
Output / 1M tokens$1100000.00$880000.00🏆

Add a third model to compare

Benchmark comparison

No benchmark data available for either model yet.

Sample workload — 5M in + 2M out per month

using each model's cheapest provider
DeepSeek V3.2
$3550000.00 /mo
Qwen 3 72B Instruct
$2860000.00 /mo

What changes at scale

Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.

1M in · 250K out$545000.00 · $440000.00
5M in · 2M out$3550000.00 · $2860000.00
20M in · 10M out$16400000.00 · $13200000.00
100M in · 60M out$93000000.00 · $74800000.00

Capability vs price

scatter
// scatter: benchmark × $/1M out
Calculate cost for your workload

Compare total monthly cost across providers for DeepSeek V3.2 and Qwen 3 72B Instruct using your own input/output token mix.

Open workload calculator →
Editor's take
[DeepSeek V3.2](/models/deepseek--deepseek-v3.2) is a 671B sparse MoE with ~37B active parameters per token, trained primarily on English and Chinese data with strong code and reasoning emphasis. [Qwen 3 72B Instruct](/models/alibaba--qwen-3-72b-instruct) is a dense 72B model from Alibaba's Qwen 3 series, with notably broad multilingual coverage across 100+ languages and an architecture tuned for instruction following at the mid-size model tier. Pricing is close at the low end: Qwen 3 72B frequently appears under $0.20/M input tokens on providers that have prioritized it, while DeepSeek V3.2 runs $0.28–0.50/M at comparable tiers. For pure token economics at moderate quality requirements, Qwen 3 72B can undercut V3.2. On reasoning benchmarks, DeepSeek V3.2 holds a clear lead — MATH, competition-level coding, and multi-step logical reasoning tasks show 10–20 point gaps favoring V3.2's larger total capacity. Qwen 3 72B narrows this gap on tasks where the reasoning chain is short and instruction-following precision matters more than depth. Qwen 3 72B is the stronger choice for multilingual workloads: Alibaba's investment in Qwen 3's language coverage is substantial, with documented strong performance on Chinese, Arabic, Japanese, Korean, and European languages. For applications serving non-English users at scale, Qwen 3 72B's multilingual alignment is a structural advantage over V3.2. Pick DeepSeek V3.2 if complex reasoning, code generation, or long-context synthesis drives your workload. Pick Qwen 3 72B Instruct if multilingual coverage, lower per-token cost at moderate quality, or Alibaba's open license terms are the deciding factors.
Related comparisons
Full model details