Model crosswalk
Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.
Gemma 2 27B IT
vs
Qwen 3 32B Instruct
Gemma 2 27B ITA
Gemma 2 27B IT
27B params · 8K context · gemma
Cheapest provider—
$/1M input—
$/1M output—
Qwen 3 32B InstructB
Qwen 3 32B Instruct
32B params · 131K context · qwen
Cheapest provideropenrouter
$/1M input$140000.00
$/1M output$550000.00
Specs and cheapest providers
| Spec | Gemma 2 27B IT | Qwen 3 32B Instruct |
|---|---|---|
| Parameters | 27B | 32B |
| Context window | 8K tokens | 131K tokens🏆 |
| License | gemma | qwen |
| Released | 2024-07-31 | 2025-04-28 |
| Cheapest provider | ||
| Provider | — | openrouter |
| Input / 1M tokens | — | $140000.00 |
| Output / 1M tokens | — | $550000.00 |
Add a third model to compare
Benchmark comparison
No benchmark data available for either model yet.
Sample workload — 5M in + 2M out per month
using each model's cheapest providerWhat changes at scale
Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.
1M in · 250K out$0.00 · $277500.00
5M in · 2M out$0.00 · $1800000.00
20M in · 10M out$0.00 · $8300000.00
100M in · 60M out$0.00 · $47000000.00
Capability vs price
scatter// scatter: benchmark × $/1M out
Calculate cost for your workload
Compare total monthly cost across providers for Gemma 2 27B IT and Qwen 3 32B Instruct using your own input/output token mix.
Open workload calculator →Editor's take
Gemma 2 27B IT and Qwen 3 32B Instruct sit close in parameter count but diverge sharply on context window and multilingual capability. The most important architectural gap: Gemma 2 27B IT is capped at **8K tokens** while [Qwen 3 32B Instruct](/models/alibaba--qwen-3-32b-instruct) supports **131K tokens** — a 16× difference that determines which model is even viable for long-document work.
On pricing, Qwen 3 32B typically runs $0.10–$0.20/M input tokens at competitive providers, roughly on par with Gemma 2 27B at $0.08–$0.18/M. Neither commands a premium for the parameter range, so cost is largely a wash unless you're pushing very high volume.
Benchmark-wise, Qwen 3 32B scores higher on MMLU (~82 vs ~74 for Gemma 2 27B) and shows substantially stronger performance on Chinese, Japanese, and Korean text — relevant if your user base is multilingual. Gemma 2 27B has a cleaner safety profile and tighter latency at p50 for short completions, owing partly to its smaller effective compute footprint.
**Gemma 2 27B IT** fits English-only classification, structured extraction, or RAG pipelines where retrieved chunks stay well under 8K and you want fast, predictable latency. Its Google provenance also means good availability across hosted inference providers.
**Qwen 3 32B Instruct** is the pick for long-context summarization (legal, financial, technical docs), agentic loops that accumulate multi-turn context, or any pipeline with Asian-language content.
Pick [Gemma 2 27B IT](/models/google--gemma-2-27b-it) if your context fits 8K and you need reliable English output at low latency. Pick Qwen 3 32B if you need 131K context or multilingual coverage.
Related comparisons
Full model details