Model crosswalk
Side-by-side on price, capability and workload — three-way comparison.
Gemma 2 27b It
vs
Mistral Small 3
vs
Qwen 3 32b Instruct
Gemma 2 27b ItA
Gemma 2 27b It
Cheapest provider—
$/1M input—
$/1M output—
Mistral Small 3B
Mistral Small 3
Cheapest provider—
$/1M input—
$/1M output—
Qwen 3 32b InstructC
Qwen 3 32b Instruct
Cheapest provider—
$/1M input—
$/1M output—
Specs and cheapest providers
| Spec | Gemma 2 27b It | Mistral Small 3 | Qwen 3 32b Instruct |
|---|---|---|---|
| Parameters | — | — | — |
| Context window | — | — | — |
| License | — | — | — |
| Released | — | — | — |
| Cheapest provider | |||
| Provider | — | — | — |
| Input / 1M tokens | — | — | — |
| Output / 1M tokens | — | — | — |
Benchmark comparison
No benchmark data available yet.
Editor's take
Gemma 2 27B IT, Mistral Small 3, and Qwen 3 32B Instruct occupy the 22–32B parameter range where teams increasingly land when they want genuine capability without paying frontier model rates. The key differentiators are context window, multilingual depth, and licensing terms.
Gemma 2 27B IT is Google DeepMind's 27-billion-parameter instruction-tuned model, released July 2024 and distilled from Gemini Ultra training data. It posts competitive MMLU and MT-Bench scores for its parameter class, but the 8K context ceiling is a real constraint — RAG pipelines, long document analysis, or multi-turn sessions that accumulate history will hit this limit quickly. The Gemma license is permissive for commercial use but not OSI-approved; confirm with your legal team before deploying in regulated environments.
Mistral Small 3 is a compact dense model from Mistral AI that targets the sub-30B inference sweet spot. It handles general instruction-following and coding tasks with a strong quality-per-token profile and benefits from Mistral's broad hosted provider coverage. Context varies by deployment configuration; it is typically in the 32K range, a meaningful step up from Gemma 2 27B's 8K.
Qwen 3 32B Instruct is Alibaba's 32-billion-parameter model with a 131K context window and genuine multilingual breadth across CJK and Arabic. It scores roughly 85 percent of Qwen 3 72B benchmark results while costing around half as much across providers like Together, Fireworks, Groq, and DeepInfra. Released under the Qwen license with commercial terms. For teams with East Asian language requirements or long-document workloads, it is the strongest choice in this size bracket.
Pick Gemma 2 27B for short-context tasks in Google ecosystem contexts. Pick Mistral Small 3 for balanced English instruction-following with wide provider support. Pick Qwen 3 32B if you need long context, multilingual support, or the best benchmark-per-dollar ratio in the 30B class.
Compare two at a time
Frequently asked questions
- How does Gemma 2 27b It compare to Mistral Small 3 and Qwen 3 32b Instruct on price?
- Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
- Which model is best for coding: Gemma 2 27b It, Mistral Small 3, or Qwen 3 32b Instruct?
- HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
- What is the context window for Gemma 2 27b It, Mistral Small 3, and Qwen 3 32b Instruct?
- Context window sizes are listed in the Specs row of the comparison table above.
Full model details