0 providers50 models

Model crosswalk

Side-by-side on price, capability and workload — three-way comparison.

Mistral Large 2
vs
Qwen 2.5 72B Instruct
vs
Qwen 3 32B Instruct
Mistral Large 2A

Mistral Large 2

123B params · 131K context · mistral-research

Cheapest provideropenrouter
$/1M input$1800000.00
$/1M output$5400000.00
Qwen 2.5 72B InstructB

Qwen 2.5 72B Instruct

72B params · 131K context · qwen

Cheapest providerdeepinfra
$/1M input$180000.00
$/1M output$350000.00
Qwen 3 32B InstructC

Qwen 3 32B Instruct

32B params · 131K context · qwen

Cheapest provideropenrouter
$/1M input$140000.00
$/1M output$550000.00
Specs and cheapest providers
SpecMistral Large 2Qwen 2.5 72B InstructQwen 3 32B Instruct
Parameters123B72B32B
Context window131K tokens131K tokens131K tokens
Licensemistral-researchqwenqwen
Released2024-07-242024-09-192025-04-28
Cheapest provider
Provideropenrouterdeepinfraopenrouter
Input / 1M tokens$1800000.00$180000.00$140000.00🏆
Output / 1M tokens$5400000.00$350000.00🏆$550000.00
Benchmark comparison

No benchmark data available yet.

Editor's take
This comparison is partly about parameter tiers and partly about licensing philosophy. Mistral Large 2 is the largest and most expensive of the three — a 123B dense model from Mistral AI with a 128K context window, strong function-calling support, and European multilingual quality. Its Research License channels most production traffic through Mistral's own managed API, meaning you get polished tooling and reliability guarantees but give up self-hosting flexibility without an enterprise agreement. Qwen 2.5 72B Instruct is Alibaba's September 2024 release, still widely deployed a year later because inference pipelines pinned to a specific checkpoint rarely migrate quickly. At 72 billion parameters with a 131K context window, it remains competitive with Mistral Large 2 on English-language benchmarks while exceeding it on multilingual evaluations, particularly CJK and Arabic. The Qwen commercial license is permissive. Provider coverage is broad, and per-token rates have softened as Qwen 3 capacity has expanded. Qwen 3 32B Instruct is the practical middle-ground in Alibaba's current generation — 32 billion parameters, 131K context, approximately 85 percent of Qwen 3 72B benchmark quality at roughly half the per-token cost. It handles multilingual coding tasks and mixed-language documents well. For most workloads that currently run on Qwen 2.5 72B, the 3 32B offers better quality-per-dollar without requiring a wholesale migration to the 72B tier. Pick Mistral Large 2 when managed API support, European language quality, and Mistral's function-calling ecosystem are worth the premium. Pick Qwen 2.5 72B when you are pinning a stable checkpoint for reproducibility and need multilingual breadth at competitive cost. Pick Qwen 3 32B for new deployments where cost-adjusted quality matters more than hitting the 72B ceiling.
Compare two at a time
Frequently asked questions
How does Mistral Large 2 compare to Qwen 2.5 72B Instruct and Qwen 3 32B Instruct on price?
Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
Which model is best for coding: Mistral Large 2, Qwen 2.5 72B Instruct, or Qwen 3 32B Instruct?
HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
What is the context window for Mistral Large 2, Qwen 2.5 72B Instruct, and Qwen 3 32B Instruct?
Context window sizes are listed in the Specs row of the comparison table above.
Full model details