Gemma 2 27B IT vs Yi 1.5 34B Chat (2026) — pricing, benchmarks, cheapest providers

Model crosswalk

Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.

Gemma 2 27B IT

27B params · 8K context · gemma

Cheapest provider: n/a

$/1M input: n/a

$/1M output: n/a

Yi 1.5 34B Chat

34B params · 4K context · apache-2.0

Cheapest provider: n/a

$/1M input: n/a

$/1M output: n/a

Specs and cheapest providers

Spec	Gemma 2 27B IT	Yi 1.5 34B Chat
Parameters	27B	34B
Context window	8K tokens🏆	4K tokens
License	gemma	apache-2.0
Released	2024-06-27	2024-05-13
Cheapest provider
Provider	—	—
Input / 1M tokens	—	—
Output / 1M tokens	—	—

Benchmark comparison

No benchmark data available for either model yet.

Sample workload — 5M in + 2M out per month

using each model's cheapest provider

Gemma 2 27B IT

n/a (no provider pricing listed)

Yi 1.5 34B Chat

n/a (no provider pricing listed)

What changes at scale

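Which side of the bill dominates depends on both the token mix and the price ratio: output dominates whenever output tokens × output price exceeds input tokens × input price. Providers usually charge more per output token than per input token, so output-heavy mixes (roughly 1:3 input to output and beyond) are driven by output price. In input-heavy mixes, input cost carries more weight and providers with cheaper input pricing can win regardless of headline price.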

1M in · 250K out: n/a · n/a

5M in · 2M out: n/a · n/a

20M in · 10M out: n/a · n/a

100M in · 60M out: n/a · n/a
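
Since no provider pricing is listed for either model, the cost arithmetic behind these workload rows can only be sketched with placeholder numbers. The snippet below is a minimal illustration, assuming a hypothetical price of $0.20 per 1M input tokens and $0.60 per 1M output tokens; the `Pricing` class, the function names, and the prices are illustrative assumptions, not figures from any provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """USD per 1M tokens. The values used below are placeholders, not quotes."""
    input_per_m: float
    output_per_m: float


def monthly_cost(p: Pricing, tokens_in: int, tokens_out: int) -> float:
    """Total monthly cost in USD for a given input/output token mix."""
    return (tokens_in * p.input_per_m + tokens_out * p.output_per_m) / 1_000_000


def output_share(p: Pricing, tokens_in: int, tokens_out: int) -> float:
    """Fraction of the monthly bill that comes from output tokens."""
    total = monthly_cost(p, tokens_in, tokens_out)
    if total == 0:
        return 0.0
    return (tokens_out * p.output_per_m / 1_000_000) / total


def break_even_out_per_in(p: Pricing) -> float:
    """Output tokens per input token at which both sides cost the same."""
    if p.output_per_m == 0:
        raise ValueError("output price must be non-zero")
    return p.input_per_m / p.output_per_m


# Hypothetical pricing, chosen only to make the arithmetic visible.
EXAMPLE = Pricing(input_per_m=0.20, output_per_m=0.60)

# The four workload rows from this section: (input tokens, output tokens).
WORKLOADS = [
    (1_000_000, 250_000),
    (5_000_000, 2_000_000),
    (20_000_000, 10_000_000),
    (100_000_000, 60_000_000),
]

for tokens_in, tokens_out in WORKLOADS:
    cost = monthly_cost(EXAMPLE, tokens_in, tokens_out)
    share = output_share(EXAMPLE, tokens_in, tokens_out)
    print(f"{tokens_in:>11,} in / {tokens_out:>10,} out: "
          f"${cost:,.2f}/mo, output share {share:.0%}")
```

With real per-provider prices substituted for `EXAMPLE`, `break_even_out_per_in` gives the output-to-input token ratio above which output cost becomes the larger part of the bill for that provider.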

Capability vs price

[Scatter plot: benchmark score vs. $/1M output tokens]

Editor's take

Per the specs above, [Gemma 2 27B IT](/models/google--gemma-2-27b-it) has the larger context window: **8K tokens** against **4K tokens** for Yi 1.5 34B Chat. Neither model is suited to long-document workloads, but Gemma 2 accepts prompts twice as long.

Parameter count favors Yi 1.5 34B at 34B vs 27B, and the gap shows modestly on benchmarks: reported MMLU scores are roughly 76–77 for Yi 1.5 34B and about 75 for Gemma 2 27B. The difference is small in English, but Yi 1.5 was trained heavily on both Chinese and English corpora, which gives it a clear lead on Chinese-language tasks, often by 10 points or more on C-Eval.

On pricing, both models tend to sit in the $0.10–$0.25 per 1M input token range depending on provider, with Gemma 2 27B often slightly cheaper thanks to its broader availability. Yi 1.5 34B's provider set is narrower, which limits price competition. No provider pricing is currently listed on this page for either model.

**Gemma 2 27B IT** is the right call for short-context English workloads: structured extraction, classification, and function-calling pipelines where the input reliably fits in 8K. Its smaller parameter count generally means lower latency on comparable hardware, and its broad provider availability makes it easy to deploy cost-effectively.

**Yi 1.5 34B Chat** is the clear choice for Chinese-language and bilingual inference, provided prompts fit within its 4K window.

Pick [Yi 1.5 34B Chat](/models/01-ai--yi-1.5-34b-chat) if you need Chinese-language quality and your prompts fit in 4K. Pick Gemma 2 27B IT if you need up to 8K of context and want the widest provider selection.
