0 providers0 models

Model crosswalk

Side-by-side on price, capability and workload — three-way comparison.

Qwen 2.5 Coder 32b Instruct
vs
Stable Code Instruct 3b
vs
Starcoder2 15b Instruct
Qwen 2.5 Coder 32b InstructA

Qwen 2.5 Coder 32b Instruct

Cheapest provider
$/1M input
$/1M output
Stable Code Instruct 3bB

Stable Code Instruct 3b

Cheapest provider
$/1M input
$/1M output
Starcoder2 15b InstructC

Starcoder2 15b Instruct

Cheapest provider
$/1M input
$/1M output
Specs and cheapest providers
SpecQwen 2.5 Coder 32b InstructStable Code Instruct 3bStarcoder2 15b Instruct
Parameters
Context window
License
Released
Cheapest provider
Provider
Input / 1M tokens
Output / 1M tokens
Benchmark comparison

No benchmark data available yet.

Editor's take
Three code-generation models spanning a wide size and capability range, with notably different stories on long-term production viability. Qwen 2.5 Coder 32B Instruct, released by Alibaba in November 2024, is the strongest performer in this group. At 32 billion parameters with a 131K context window and support for 92 programming languages, it benchmarks alongside DeepSeek Coder V2 on LiveCodeBench and MultiPL-E — making it one of the more capable sub-frontier code models available through hosted APIs. The Qwen license permits commercial deployment, and several inference providers offer competitive per-token pricing. For teams running CI-integrated code generation or agentic coding pipelines at scale, this is a credible backbone. StarCoder2 15B Instruct, from the BigCode collaboration of HuggingFace and ServiceNow, was released September 2024 and trained on The Stack v2 — a corpus restricted to permissively licensed open-source code. At 15 billion parameters with a 16K context window, it trails Qwen 2.5 Coder on benchmark leaderboards. That tradeoff is deliberate for certain teams: regulated industries and enterprise environments with IP audit requirements often value training-data chain of custody above raw HumanEval numbers. BigCode OpenRAIL-M is commercially usable with narrow application restrictions. Stable Code Instruct 3B was Stability AI's entry into the small code-completion space, released January 2024 at 3 billion parameters with a 16K context window. By mid-2026, it has been effectively displaced: Qwen 2.5 Coder and DeepSeek Coder variants deliver materially stronger scores at comparable hosted prices, and Stability AI's non-commercial community license requires a paid Stability membership for production use — a friction cost that few teams accept when Apache-licensed alternatives exist at the same size tier. Pick Qwen 2.5 Coder 32B for production code generation. Pick StarCoder2 15B when training-data provenance is a compliance requirement. Skip Stable Code 3B for new production deployments.
Compare two at a time
Frequently asked questions
How does Qwen 2.5 Coder 32b Instruct compare to Stable Code Instruct 3b and Starcoder2 15b Instruct on price?
Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
Which model is best for coding: Qwen 2.5 Coder 32b Instruct, Stable Code Instruct 3b, or Starcoder2 15b Instruct?
HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
What is the context window for Qwen 2.5 Coder 32b Instruct, Stable Code Instruct 3b, and Starcoder2 15b Instruct?
Context window sizes are listed in the Specs row of the comparison table above.
Full model details