Model crosswalk
Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.
Qwen 3 8b Instruct
vs
Yi 1.5 9b Chat
Qwen 3 8b InstructA
Qwen 3 8b Instruct
Cheapest provider—
$/1M input—
$/1M output—
Yi 1.5 9b ChatB
Yi 1.5 9b Chat
Cheapest provider—
$/1M input—
$/1M output—
Specs and cheapest providers
| Spec | Qwen 3 8b Instruct | Yi 1.5 9b Chat |
|---|---|---|
| Parameters | — | — |
| Context window | — | — |
| License | — | — |
| Released | — | — |
| Cheapest provider | ||
| Provider | — | — |
| Input / 1M tokens | — | — |
| Output / 1M tokens | — | — |
Benchmark comparison
No benchmark data available for either model yet.
Sample workload — 5M in + 2M out per month
using each model's cheapest providerWhat changes at scale
Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.
1M in · 250K out$0.00 · $0.00
5M in · 2M out$0.00 · $0.00
20M in · 10M out$0.00 · $0.00
100M in · 60M out$0.00 · $0.00
Capability vs price
scatter// scatter: benchmark × $/1M out
Calculate cost for your workload
Compare total monthly cost across providers for Qwen 3 8b Instruct and Yi 1.5 9b Chat using your own input/output token mix.
Open workload calculator →Editor's take
[Qwen 3 8B Instruct](/models/alibaba--qwen-3-8b-instruct) and [Yi 1.5 9B Chat](/models/01-ai--yi-1.5-9b-chat) are similar in size but differ in training generation — Qwen 3 is the newer architecture with updated instruction tuning, while Yi 1.5 is a well-established model with a large community footprint. Both price in the $0.03–0.07/M token range, making cost roughly equivalent across providers. The differentiation is quality and language coverage.
Qwen 3 8B Instruct scores approximately 72–74% on MMLU versus Yi 1.5 9B Chat's 65–68%, a 5–8 point gap that reflects the training generation difference. Qwen 3 8B also has tighter instruction-following — structured output tasks like JSON extraction and step-by-step reasoning with explicit formatting constraints have a higher success rate out of the box, reducing the need for few-shot examples.
Yi 1.5 9B Chat has strong Mandarin Chinese performance relative to its parameter count, making it a practical choice for Chinese-language applications where 01.AI's training priorities pay off. The model also has a wider selection of community fine-tunes for domain-specific use cases (legal, medical, customer support), which matters if you're building on a fine-tuning workflow.
Pick Yi 1.5 9B Chat if Chinese-language quality is the primary requirement or if you're building on an existing Yi-based fine-tune ecosystem. Pick Qwen 3 8B Instruct for English-dominant workloads where MMLU ceiling, instruction-following reliability, or structured output fidelity drives model selection.
Related comparisons
Full model details