Model crosswalk
Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.
Gemma 2 27B IT vs Yi 1.5 34B Chat
Specs and cheapest providers
| Spec | Gemma 2 27B IT | Yi 1.5 34B Chat |
|---|---|---|
| Parameters | 27B | 34B |
| Context window | 8K tokens 🏆 | 4K tokens |
| License | gemma | apache-2.0 |
| Released | 2024-07-31 | 2024-05-13 |
| Cheapest provider | — | — |
| Input price ($ / 1M tokens) | — | — |
| Output price ($ / 1M tokens) | — | — |
Benchmark comparison
No benchmark data available for either model yet.
Sample workload: 5M in + 2M out per month (using each model's cheapest provider)
What changes at scale
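As a rule of thumb, output tokens dominate cost when output volume is at least three times input volume (a 1:3 input/output ratio). When input volume exceeds output volume, input tends to dominate and providers with cheaper input pricing win regardless of headline price. The exact crossover depends on each provider's output-to-input price ratio.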
1M in · 250K out: $0.00 · $0.00
5M in · 2M out: $0.00 · $0.00
20M in · 10M out: $0.00 · $0.00
100M in · 60M out: $0.00 · $0.00
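The interaction between token mix and price can be sketched in a few lines of Python. This is a minimal illustration, not the site's calculator: the `Price` class, the function names, and the $0.20 / $0.40 per-million figures are hypothetical placeholders, since no provider prices are listed for either model. The workload tiers are the four shown above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """Per-million-token prices in USD. Values used below are placeholders."""
    input_per_m: float
    output_per_m: float


def monthly_cost(price: Price, input_tokens: int, output_tokens: int):
    """Return (input_cost, output_cost, total_cost) in USD for one month."""
    input_cost = input_tokens / 1_000_000 * price.input_per_m
    output_cost = output_tokens / 1_000_000 * price.output_per_m
    return input_cost, output_cost, input_cost + output_cost


def output_share(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Fraction of the total bill that comes from output tokens."""
    _, output_cost, total = monthly_cost(price, input_tokens, output_tokens)
    return output_cost / total if total else 0.0


# The four workload tiers from the page: (input tokens, output tokens).
WORKLOADS = [
    (1_000_000, 250_000),
    (5_000_000, 2_000_000),
    (20_000_000, 10_000_000),
    (100_000_000, 60_000_000),
]

# Hypothetical price; swap in a real provider quote before relying on it.
EXAMPLE_PRICE = Price(input_per_m=0.20, output_per_m=0.40)

if __name__ == "__main__":
    for tokens_in, tokens_out in WORKLOADS:
        _, _, total = monthly_cost(EXAMPLE_PRICE, tokens_in, tokens_out)
        share = output_share(EXAMPLE_PRICE, tokens_in, tokens_out)
        print(f"{tokens_in:>11,} in / {tokens_out:>10,} out: "
              f"${total:,.2f} total, {share:.0%} from output")
```

Output cost exceeds input cost exactly when `output_tokens * output_per_m > input_tokens * input_per_m`, so a provider that charges more for output than for input shifts the crossover toward input-heavy workloads.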
Capability vs price
[Scatter plot: benchmark score × $/1M output tokens]
Calculate cost for your workload
Compare total monthly cost across providers for Gemma 2 27B IT and Yi 1.5 34B Chat using your own input/output token mix.
Editor's take
The headline difference between [Gemma 2 27B IT](/models/google--gemma-2-27b-it) and Yi 1.5 34B Chat is context length: Gemma 2 tops out at **8K tokens**, while Yi 1.5 34B Chat is limited to **4K tokens**, as listed in the spec table. Prompts between 4K and 8K tokens therefore fit only Gemma, and neither model is suited to long-document workloads without chunking.
Parameter count favors Yi 1.5 34B at 34B vs 27B, and reported benchmarks show a small gap in the same direction. Yi 1.5 34B scores approximately 76–77 on MMLU compared to roughly 74 for Gemma 2 27B. The difference is modest in English, but Yi 1.5 was trained heavily on both Chinese and English corpora, which gives it a clear lead on Chinese-language tasks, often 10+ points on C-Eval.
On pricing, both models sit in the $0.10–$0.25 per 1M input tokens range depending on provider, with Gemma 2 27B often slightly cheaper because it is more widely available. Yi 1.5 34B is offered by fewer providers, which limits price competition.
**Gemma 2 27B IT** is the right call for short-context English workloads: structured extraction, classification, and function-calling pipelines where the input reliably fits 8K. Its tighter latency profile and broad provider availability make it easy to deploy cost-effectively.
**Yi 1.5 34B Chat** is the clear choice for Chinese-language inference, provided prompts fit its 4K window. Long contracts, multi-document RAG, and extended agentic sessions exceed both models' windows and need chunking or a longer-context model.
Pick [Yi 1.5 34B Chat](/models/01-ai--yi-1.5-34b-chat) if you need Chinese-language quality and your context fits 4K. Pick Gemma 2 27B IT if you need up to 8K context and want the widest provider selection.