Model crosswalk
Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.
Qwen 3 32b Instruct
vs
Yi 1.5 34b Chat
Qwen 3 32b InstructA
Qwen 3 32b Instruct
Cheapest provider—
$/1M input—
$/1M output—
Yi 1.5 34b ChatB
Yi 1.5 34b Chat
Cheapest provider—
$/1M input—
$/1M output—
Specs and cheapest providers
| Spec | Qwen 3 32b Instruct | Yi 1.5 34b Chat |
|---|---|---|
| Parameters | — | — |
| Context window | — | — |
| License | — | — |
| Released | — | — |
| Cheapest provider | ||
| Provider | — | — |
| Input / 1M tokens | — | — |
| Output / 1M tokens | — | — |
Benchmark comparison
No benchmark data available for either model yet.
Sample workload — 5M in + 2M out per month
using each model's cheapest providerWhat changes at scale
Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.
1M in · 250K out$0.00 · $0.00
5M in · 2M out$0.00 · $0.00
20M in · 10M out$0.00 · $0.00
100M in · 60M out$0.00 · $0.00
Capability vs price
scatter// scatter: benchmark × $/1M out
Calculate cost for your workload
Compare total monthly cost across providers for Qwen 3 32b Instruct and Yi 1.5 34b Chat using your own input/output token mix.
Open workload calculator →Editor's take
[Qwen 3 32B Instruct](/models/alibaba--qwen-3-32b-instruct) and [Yi 1.5 34B Chat](/models/01-ai--yi-1.5-34b-chat) occupy the same parameter tier, but Qwen 3 is a newer generation model with measurably stronger benchmark scores. On MMLU, Qwen 3 32B scores approximately 82–85% versus Yi 1.5 34B Chat's 76–78% — a 6–7 point gap that reflects Alibaba's more recent training run. Pricing is similar: both land in the $0.20–0.40/M token range, though Yi 1.5 34B Chat is often cheaper on commodity providers by $0.05–0.10/M.
Yi 1.5 34B Chat has strong Chinese-language performance — 01.AI's training corpus prioritizes Mandarin alongside English, making it a solid option for bilingual Chinese/English applications. It also has a large community of fine-tunes and adapters, particularly for roleplay and document summarization tasks. If you're building on provider ecosystems with strong Yi support (several Asian cloud providers have optimized Yi serving), you may see better operational SLAs.
Qwen 3 32B Instruct outperforms on reasoning chains, math, and code generation. Its instruction-following is tighter, with fewer off-format responses on structured output tasks. For agentic systems where the model needs to follow multi-step tool-use protocols reliably, the generation quality difference is measurable in production.
Pick Yi 1.5 34B Chat if Chinese-language quality, cost minimization, or an existing Yi-optimized provider is the driver. Pick Qwen 3 32B Instruct if you need the higher benchmark ceiling for reasoning-heavy or structured-output workloads.
Related comparisons
Full model details