How does Mistral Large 2 compare to Qwen 2.5 72B Instruct and Qwen 3 32B Instruct on price?

Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.

Which model is best for coding: Mistral Large 2, Qwen 2.5 72B Instruct, or Qwen 3 32B Instruct?

HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.

What is the context window for Mistral Large 2, Qwen 2.5 72B Instruct, and Qwen 3 32B Instruct?

Context window sizes are listed in the Specs row of the comparison table above.

Mistral Large 2 vs Qwen 2.5 72b Instruct vs Qwen 3 32b Instruct (2026) — 3-way comparison

Model crosswalk

Side-by-side on price, capability and workload — three-way comparison.

Mistral Large 2

Qwen 2.5 72B Instruct

Qwen 3 32B Instruct

Mistral Large 2A

Mistral Large 2

123B params · 131K context · mistral-research

Cheapest provideropenrouter

$/1M input$1800000.00

$/1M output$5400000.00

Qwen 2.5 72B InstructB

Qwen 2.5 72B Instruct

72B params · 131K context · qwen

Cheapest providerdeepinfra

$/1M input$180000.00

$/1M output$350000.00

Qwen 3 32B InstructC

Qwen 3 32B Instruct

32B params · 131K context · qwen

Cheapest provideropenrouter

$/1M input$140000.00

$/1M output$550000.00

Specs and cheapest providers

Spec	Mistral Large 2	Qwen 2.5 72B Instruct	Qwen 3 32B Instruct
Parameters	123B	72B	32B
Context window	131K tokens	131K tokens	131K tokens
License	mistral-research	qwen	qwen
Released	2024-07-24	2024-09-19	2025-04-28
Cheapest provider
Provider	openrouter	deepinfra	openrouter
Input / 1M tokens	$1800000.00	$180000.00	$140000.00🏆
Output / 1M tokens	$5400000.00	$350000.00🏆	$550000.00

Benchmark comparison

No benchmark data available yet.

Editor's take

This comparison is partly about parameter tiers and partly about licensing philosophy. Mistral Large 2 is the largest and most expensive of the three — a 123B dense model from Mistral AI with a 128K context window, strong function-calling support, and European multilingual quality. Its Research License channels most production traffic through Mistral's own managed API, meaning you get polished tooling and reliability guarantees but give up self-hosting flexibility without an enterprise agreement. Qwen 2.5 72B Instruct is Alibaba's September 2024 release, still widely deployed a year later because inference pipelines pinned to a specific checkpoint rarely migrate quickly. At 72 billion parameters with a 131K context window, it remains competitive with Mistral Large 2 on English-language benchmarks while exceeding it on multilingual evaluations, particularly CJK and Arabic. The Qwen commercial license is permissive. Provider coverage is broad, and per-token rates have softened as Qwen 3 capacity has expanded. Qwen 3 32B Instruct is the practical middle-ground in Alibaba's current generation — 32 billion parameters, 131K context, approximately 85 percent of Qwen 3 72B benchmark quality at roughly half the per-token cost. It handles multilingual coding tasks and mixed-language documents well. For most workloads that currently run on Qwen 2.5 72B, the 3 32B offers better quality-per-dollar without requiring a wholesale migration to the 72B tier. Pick Mistral Large 2 when managed API support, European language quality, and Mistral's function-calling ecosystem are worth the premium. Pick Qwen 2.5 72B when you are pinning a stable checkpoint for reproducibility and need multilingual breadth at competitive cost. Pick Qwen 3 32B for new deployments where cost-adjusted quality matters more than hitting the 72B ceiling.

Compare two at a time

Mistral Large 2 vs Qwen 2.5 72B Instruct Mistral Large 2 vs Qwen 3 32B Instruct Qwen 2.5 72B Instruct vs Qwen 3 32B Instruct

Frequently asked questions

How does Mistral Large 2 compare to Qwen 2.5 72B Instruct and Qwen 3 32B Instruct on price?: Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
Which model is best for coding: Mistral Large 2, Qwen 2.5 72B Instruct, or Qwen 3 32B Instruct?: HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
What is the context window for Mistral Large 2, Qwen 2.5 72B Instruct, and Qwen 3 32B Instruct?: Context window sizes are listed in the Specs row of the comparison table above.

Full model details

All providers for Mistral Large 2 →All providers for Qwen 2.5 72B Instruct →All providers for Qwen 3 32B Instruct →