Mistral Large 2 vs Qwen 2.5 Coder 32b Instruct vs Qwen 3 72b Instruct (2026) — 3-way comparison

Model crosswalk

Side-by-side, three-way comparison on price, capability and workload.

A. Mistral Large 2
Cheapest provider: not listed
$/1M input: not listed
$/1M output: not listed

B. Qwen 2.5 Coder 32b Instruct
Cheapest provider: not listed
$/1M input: not listed
$/1M output: not listed

C. Qwen 3 72b Instruct
Cheapest provider: not listed
$/1M input: not listed
$/1M output: not listed

Specs and cheapest providers

Spec	Mistral Large 2	Qwen 2.5 Coder 32b Instruct	Qwen 3 72b Instruct
Parameters	—	—	—
Context window	—	—	—
License	—	—	—
Released	—	—	—
Cheapest provider
Provider	—	—	—
Input / 1M tokens	—	—	—
Output / 1M tokens	—	—	—
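
Once input and output prices per 1M tokens are known, the per-request cost is simple arithmetic. The sketch below shows one way to work it out in Python. The names `Pricing`, `request_cost`, `cheapest`, `model_a` and `model_b` are hypothetical, and the prices are placeholders rather than real quotes, since the table above lists no pricing data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Prices in US dollars per 1M tokens (hypothetical container)."""
    input_per_million: float
    output_per_million: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one request at the given token counts."""
    return (
        input_tokens / 1_000_000 * pricing.input_per_million
        + output_tokens / 1_000_000 * pricing.output_per_million
    )


def cheapest(prices: dict[str, Pricing], input_tokens: int, output_tokens: int) -> str:
    """Return the name of the entry with the lowest cost for this workload."""
    return min(
        prices,
        key=lambda name: request_cost(prices[name], input_tokens, output_tokens),
    )


# Placeholder prices for illustration only. They are NOT real provider quotes.
example_prices = {
    "model_a": Pricing(input_per_million=2.00, output_per_million=6.00),
    "model_b": Pricing(input_per_million=0.50, output_per_million=1.50),
}

if __name__ == "__main__":
    # A workload of 10,000 input tokens and 2,000 output tokens per request.
    for name, pricing in example_prices.items():
        print(name, round(request_cost(pricing, 10_000, 2_000), 6))
    print("cheapest:", cheapest(example_prices, 10_000, 2_000))
```

Because output tokens are usually priced higher than input tokens, the ranking can change with the input-to-output ratio of the workload, so it is worth running the comparison with token counts that match real traffic.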

Benchmark comparison

No benchmark data available yet.

Editor's take

A frontier-class generalist, a code-benchmark leader at 32B, and a multilingual 72B instruction model: three models with overlapping capabilities but distinct optimization targets.

Mistral Large 2 is Mistral AI's flagship dense model, a strong generalist that handles complex instruction-following, coding, summarization, and multilingual tasks, with performance competitive with models in the 70–100B parameter range. Released in 2024 under Mistral's commercial license, it is available primarily through Mistral's own API (La Plateforme) and selected cloud partnerships. For teams already using the Mistral ecosystem, or those that want a balanced large generalist, it is a natural anchor. Its coding performance is solid for a generalist, but it is not tuned as a code specialist.

Qwen 2.5 Coder 32B Instruct, from Alibaba's November 2024 release, is a purpose-built 32-billion-parameter code model with support for 92 programming languages and a 131K context window. On LiveCodeBench and MultiPL-E it ranks alongside DeepSeek Coder V2, which gives it meaningful credibility for production code generation. The smaller parameter count translates to lower inference cost than 70B-class models, while the coding specialization means it outperforms generalist 70B models on HumanEval-style benchmarks. Its license permits commercial deployment.

Qwen 3 72B Instruct, the top tier of Alibaba's Qwen 3 family, delivers strong performance across coding, multilingual generation (CJK languages, Arabic), and instruction-following tasks. The 131K context window handles long-document pipelines. Its coding benchmarks exceed those of generalist 70B peers, though dedicated code fine-tunes may still edge it out on narrow HumanEval tasks. The Qwen license covers commercial use, and the model is hosted across major inference providers.

Pick Mistral Large 2 for Mistral-ecosystem workloads and strong general-purpose tasks. Pick Qwen 2.5 Coder 32B when code generation quality is the primary axis and you want lower inference costs than a 70B model. Pick Qwen 3 72B when you need multilingual breadth plus strong coding from a single model without sacrificing general-purpose capability.

Compare two at a time

Mistral Large 2 vs Qwen 2.5 Coder 32b Instruct
Mistral Large 2 vs Qwen 3 72b Instruct
Qwen 2.5 Coder 32b Instruct vs Qwen 3 72b Instruct

Frequently asked questions

How does Mistral Large 2 compare to Qwen 2.5 Coder 32b Instruct and Qwen 3 72b Instruct on price?
Use the Specs and cheapest providers table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.

Which model is best for coding: Mistral Large 2, Qwen 2.5 Coder 32b Instruct, or Qwen 3 72b Instruct?
HumanEval and other code benchmarks appear in the Benchmark comparison section above when data is available. For production code tasks, also consider context window size and provider latency.

What is the context window for Mistral Large 2, Qwen 2.5 Coder 32b Instruct, and Qwen 3 72b Instruct?
Context window sizes are listed in the Context window row of the Specs and cheapest providers table above.

Full model details

All providers for Mistral Large 2 →
All providers for Qwen 2.5 Coder 32b Instruct →
All providers for Qwen 3 72b Instruct →