Model crosswalk
Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.
DeepSeek V3.2
vs
Qwen 3 72B Instruct
DeepSeek V3.2A
DeepSeek V3.2
671B params · 131K context · deepseek
Cheapest providertogether-ai
$/1M input$270000.00
$/1M output$1100000.00
Qwen 3 72B InstructB
Qwen 3 72B Instruct
72B params · 131K context · qwen
Cheapest providerfireworks-ai
$/1M input$220000.00
$/1M output$880000.00
Specs and cheapest providers
| Spec | DeepSeek V3.2 | Qwen 3 72B Instruct |
|---|---|---|
| Parameters | 671B | 72B |
| Context window | 131K tokens | 131K tokens |
| License | deepseek | qwen |
| Released | 2025-05-07 | 2025-04-28 |
| Cheapest provider | ||
| Provider | together-ai | fireworks-ai |
| Input / 1M tokens | $270000.00 | $220000.00🏆 |
| Output / 1M tokens | $1100000.00 | $880000.00🏆 |
Add a third model to compare
Benchmark comparison
No benchmark data available for either model yet.
Sample workload — 5M in + 2M out per month
using each model's cheapest providerWhat changes at scale
Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.
1M in · 250K out$545000.00 · $440000.00
5M in · 2M out$3550000.00 · $2860000.00
20M in · 10M out$16400000.00 · $13200000.00
100M in · 60M out$93000000.00 · $74800000.00
Capability vs price
scatter// scatter: benchmark × $/1M out
Calculate cost for your workload
Compare total monthly cost across providers for DeepSeek V3.2 and Qwen 3 72B Instruct using your own input/output token mix.
Open workload calculator →Editor's take
[DeepSeek V3.2](/models/deepseek--deepseek-v3.2) is a 671B sparse MoE with ~37B active parameters per token, trained primarily on English and Chinese data with strong code and reasoning emphasis. [Qwen 3 72B Instruct](/models/alibaba--qwen-3-72b-instruct) is a dense 72B model from Alibaba's Qwen 3 series, with notably broad multilingual coverage across 100+ languages and an architecture tuned for instruction following at the mid-size model tier.
Pricing is close at the low end: Qwen 3 72B frequently appears under $0.20/M input tokens on providers that have prioritized it, while DeepSeek V3.2 runs $0.28–0.50/M at comparable tiers. For pure token economics at moderate quality requirements, Qwen 3 72B can undercut V3.2.
On reasoning benchmarks, DeepSeek V3.2 holds a clear lead — MATH, competition-level coding, and multi-step logical reasoning tasks show 10–20 point gaps favoring V3.2's larger total capacity. Qwen 3 72B narrows this gap on tasks where the reasoning chain is short and instruction-following precision matters more than depth.
Qwen 3 72B is the stronger choice for multilingual workloads: Alibaba's investment in Qwen 3's language coverage is substantial, with documented strong performance on Chinese, Arabic, Japanese, Korean, and European languages. For applications serving non-English users at scale, Qwen 3 72B's multilingual alignment is a structural advantage over V3.2.
Pick DeepSeek V3.2 if complex reasoning, code generation, or long-context synthesis drives your workload. Pick Qwen 3 72B Instruct if multilingual coverage, lower per-token cost at moderate quality, or Alibaba's open license terms are the deciding factors.
Related comparisons
Full model details