Model crosswalk
Side-by-side on price, capability and workload — three-way comparison.
Mistral Large 2
vs
Qwen 2.5 72B Instruct
vs
Qwen 3 32B Instruct
Mistral Large 2A
Mistral Large 2
123B params · 131K context · mistral-research
Cheapest provideropenrouter
$/1M input$1800000.00
$/1M output$5400000.00
Qwen 2.5 72B InstructB
Qwen 2.5 72B Instruct
72B params · 131K context · qwen
Cheapest providerdeepinfra
$/1M input$180000.00
$/1M output$350000.00
Qwen 3 32B InstructC
Qwen 3 32B Instruct
32B params · 131K context · qwen
Cheapest provideropenrouter
$/1M input$140000.00
$/1M output$550000.00
Specs and cheapest providers
| Spec | Mistral Large 2 | Qwen 2.5 72B Instruct | Qwen 3 32B Instruct |
|---|---|---|---|
| Parameters | 123B | 72B | 32B |
| Context window | 131K tokens | 131K tokens | 131K tokens |
| License | mistral-research | qwen | qwen |
| Released | 2024-07-24 | 2024-09-19 | 2025-04-28 |
| Cheapest provider | |||
| Provider | openrouter | deepinfra | openrouter |
| Input / 1M tokens | $1800000.00 | $180000.00 | $140000.00🏆 |
| Output / 1M tokens | $5400000.00 | $350000.00🏆 | $550000.00 |
Benchmark comparison
No benchmark data available yet.
Editor's take
This comparison is partly about parameter tiers and partly about licensing philosophy. Mistral Large 2 is the largest and most expensive of the three — a 123B dense model from Mistral AI with a 128K context window, strong function-calling support, and European multilingual quality. Its Research License channels most production traffic through Mistral's own managed API, meaning you get polished tooling and reliability guarantees but give up self-hosting flexibility without an enterprise agreement.
Qwen 2.5 72B Instruct is Alibaba's September 2024 release, still widely deployed a year later because inference pipelines pinned to a specific checkpoint rarely migrate quickly. At 72 billion parameters with a 131K context window, it remains competitive with Mistral Large 2 on English-language benchmarks while exceeding it on multilingual evaluations, particularly CJK and Arabic. The Qwen commercial license is permissive. Provider coverage is broad, and per-token rates have softened as Qwen 3 capacity has expanded.
Qwen 3 32B Instruct is the practical middle-ground in Alibaba's current generation — 32 billion parameters, 131K context, approximately 85 percent of Qwen 3 72B benchmark quality at roughly half the per-token cost. It handles multilingual coding tasks and mixed-language documents well. For most workloads that currently run on Qwen 2.5 72B, the 3 32B offers better quality-per-dollar without requiring a wholesale migration to the 72B tier.
Pick Mistral Large 2 when managed API support, European language quality, and Mistral's function-calling ecosystem are worth the premium. Pick Qwen 2.5 72B when you are pinning a stable checkpoint for reproducibility and need multilingual breadth at competitive cost. Pick Qwen 3 32B for new deployments where cost-adjusted quality matters more than hitting the 72B ceiling.
Compare two at a time
Frequently asked questions
- How does Mistral Large 2 compare to Qwen 2.5 72B Instruct and Qwen 3 32B Instruct on price?
- Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
- Which model is best for coding: Mistral Large 2, Qwen 2.5 72B Instruct, or Qwen 3 32B Instruct?
- HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
- What is the context window for Mistral Large 2, Qwen 2.5 72B Instruct, and Qwen 3 32B Instruct?
- Context window sizes are listed in the Specs row of the comparison table above.
Full model details