0 providers50 models

Model crosswalk

Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.

Gemma 2 27B IT
vs
Qwen 3 32B Instruct
Gemma 2 27B ITA

Gemma 2 27B IT

27B params · 8K context · gemma

Cheapest provider
$/1M input
$/1M output
Qwen 3 32B InstructB

Qwen 3 32B Instruct

32B params · 131K context · qwen

Cheapest provideropenrouter
$/1M input$140000.00
$/1M output$550000.00
Specs and cheapest providers
SpecGemma 2 27B ITQwen 3 32B Instruct
Parameters27B32B
Context window8K tokens131K tokens🏆
Licensegemmaqwen
Released2024-07-312025-04-28
Cheapest provider
Provideropenrouter
Input / 1M tokens$140000.00
Output / 1M tokens$550000.00

Add a third model to compare

Benchmark comparison

No benchmark data available for either model yet.

Sample workload — 5M in + 2M out per month

using each model's cheapest provider
Gemma 2 27B IT
$0.00 /mo
Qwen 3 32B Instruct
$1800000.00 /mo

What changes at scale

Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.

1M in · 250K out$0.00 · $277500.00
5M in · 2M out$0.00 · $1800000.00
20M in · 10M out$0.00 · $8300000.00
100M in · 60M out$0.00 · $47000000.00

Capability vs price

scatter
// scatter: benchmark × $/1M out
Calculate cost for your workload

Compare total monthly cost across providers for Gemma 2 27B IT and Qwen 3 32B Instruct using your own input/output token mix.

Open workload calculator →
Editor's take
Gemma 2 27B IT and Qwen 3 32B Instruct sit close in parameter count but diverge sharply on context window and multilingual capability. The most important architectural gap: Gemma 2 27B IT is capped at **8K tokens** while [Qwen 3 32B Instruct](/models/alibaba--qwen-3-32b-instruct) supports **131K tokens** — a 16× difference that determines which model is even viable for long-document work. On pricing, Qwen 3 32B typically runs $0.10–$0.20/M input tokens at competitive providers, roughly on par with Gemma 2 27B at $0.08–$0.18/M. Neither commands a premium for the parameter range, so cost is largely a wash unless you're pushing very high volume. Benchmark-wise, Qwen 3 32B scores higher on MMLU (~82 vs ~74 for Gemma 2 27B) and shows substantially stronger performance on Chinese, Japanese, and Korean text — relevant if your user base is multilingual. Gemma 2 27B has a cleaner safety profile and tighter latency at p50 for short completions, owing partly to its smaller effective compute footprint. **Gemma 2 27B IT** fits English-only classification, structured extraction, or RAG pipelines where retrieved chunks stay well under 8K and you want fast, predictable latency. Its Google provenance also means good availability across hosted inference providers. **Qwen 3 32B Instruct** is the pick for long-context summarization (legal, financial, technical docs), agentic loops that accumulate multi-turn context, or any pipeline with Asian-language content. Pick [Gemma 2 27B IT](/models/google--gemma-2-27b-it) if your context fits 8K and you need reliable English output at low latency. Pick Qwen 3 32B if you need 131K context or multilingual coverage.
Related comparisons
Full model details