Model crosswalk
Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.
Codestral 22b
vs
Qwen 2.5 Coder 32b Instruct
Codestral 22bA
Codestral 22b
Cheapest provider—
$/1M input—
$/1M output—
Qwen 2.5 Coder 32b InstructB
Qwen 2.5 Coder 32b Instruct
Cheapest provider—
$/1M input—
$/1M output—
Specs and cheapest providers
| Spec | Codestral 22b | Qwen 2.5 Coder 32b Instruct |
|---|---|---|
| Parameters | — | — |
| Context window | — | — |
| License | — | — |
| Released | — | — |
| Cheapest provider | ||
| Provider | — | — |
| Input / 1M tokens | — | — |
| Output / 1M tokens | — | — |
Add a third model to compare
Benchmark comparison
No benchmark data available for either model yet.
Sample workload — 5M in + 2M out per month
using each model's cheapest providerWhat changes at scale
Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.
1M in · 250K out$0.00 · $0.00
5M in · 2M out$0.00 · $0.00
20M in · 10M out$0.00 · $0.00
100M in · 60M out$0.00 · $0.00
Capability vs price
scatter// scatter: benchmark × $/1M out
Calculate cost for your workload
Compare total monthly cost across providers for Codestral 22b and Qwen 2.5 Coder 32b Instruct using your own input/output token mix.
Open workload calculator →Editor's take
Qwen 2.5 Coder 32B outweighs Codestral 22B by 10B parameters and shows it in code benchmarks: on HumanEval it scores around 92%, compared to Codestral's strong but lower ~81%. The parameter gap translates to better multi-file reasoning and more reliable function signature generation — but you're also paying for the extra compute. At the time of writing, Qwen 2.5 Coder 32B runs roughly 1.5–2× more expensive per token than Codestral 22B on shared-inference providers.
Codestral 22B's context window sits at 32K tokens; Qwen 2.5 Coder 32B supports up to 128K. That 4× context advantage is decisive for certain workloads. Check live pricing on [Codestral 22B's model page](/models/mistralai--codestral-22b) to see how current rates compare.
For agentic coding workflows — where an AI coding assistant iterates over a large repo, ingests multiple files, and generates cross-file diffs — Qwen 2.5 Coder 32B's extended context and higher benchmark accuracy make it the clearer choice. The extra cost is likely absorbed by fewer retries and hallucinated imports.
Codestral 22B is the better call for IDE inline completion and single-function generation where latency matters more than deep-context reasoning. Its smaller footprint means providers can run it at lower cost with faster TTFT, and Mistral's fill-in-the-middle training data makes it particularly good at cursor-position completions. See the full provider list on [Qwen 2.5 Coder 32B's model page](/models/alibaba--qwen-2.5-coder-32b-instruct).
**Pick Codestral 22B** for low-latency completions and cost-sensitive inline suggestions. **Pick Qwen 2.5 Coder 32B** for agentic tasks, long-context repo analysis, and higher accuracy on complex generation.
Related comparisons
Full model details