Qwen 3 8B Instruct vs Yi 1.5 9B Chat (2026) — pricing, benchmarks, cheapest providers

Model crosswalk

Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.

Qwen 3 8b Instruct

Yi 1.5 9b Chat

Qwen 3 8b InstructA

Qwen 3 8b Instruct

Cheapest provider—

$/1M input—

$/1M output—

Yi 1.5 9b ChatB

Yi 1.5 9b Chat

Cheapest provider—

$/1M input—

$/1M output—

Specs and cheapest providers

Spec	Qwen 3 8b Instruct	Yi 1.5 9b Chat
Parameters	—	—
Context window	—	—
License	—	—
Released	—	—
Cheapest provider
Provider	—	—
Input / 1M tokens	—	—
Output / 1M tokens	—	—

Benchmark comparison

No benchmark data available for either model yet.

Sample workload — 5M in + 2M out per month

using each model's cheapest provider

Qwen 3 8b Instruct

$0.00 /mo

Yi 1.5 9b Chat

$0.00 /mo

What changes at scale

Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.

1M in · 250K out$0.00 · $0.00

5M in · 2M out$0.00 · $0.00

20M in · 10M out$0.00 · $0.00

100M in · 60M out$0.00 · $0.00

Capability vs price

scatter

// scatter: benchmark × $/1M out

Calculate cost for your workload

Compare total monthly cost across providers for Qwen 3 8b Instruct and Yi 1.5 9b Chat using your own input/output token mix.

Open workload calculator →

Editor's take

[Qwen 3 8B Instruct](/models/alibaba--qwen-3-8b-instruct) and [Yi 1.5 9B Chat](/models/01-ai--yi-1.5-9b-chat) are similar in size but differ in training generation — Qwen 3 is the newer architecture with updated instruction tuning, while Yi 1.5 is a well-established model with a large community footprint. Both price in the $0.03–0.07/M token range, making cost roughly equivalent across providers. The differentiation is quality and language coverage. Qwen 3 8B Instruct scores approximately 72–74% on MMLU versus Yi 1.5 9B Chat's 65–68%, a 5–8 point gap that reflects the training generation difference. Qwen 3 8B also has tighter instruction-following — structured output tasks like JSON extraction and step-by-step reasoning with explicit formatting constraints have a higher success rate out of the box, reducing the need for few-shot examples. Yi 1.5 9B Chat has strong Mandarin Chinese performance relative to its parameter count, making it a practical choice for Chinese-language applications where 01.AI's training priorities pay off. The model also has a wider selection of community fine-tunes for domain-specific use cases (legal, medical, customer support), which matters if you're building on a fine-tuning workflow. Pick Yi 1.5 9B Chat if Chinese-language quality is the primary requirement or if you're building on an existing Yi-based fine-tune ecosystem. Pick Qwen 3 8B Instruct for English-dominant workloads where MMLU ceiling, instruction-following reliability, or structured output fidelity drives model selection.

Related comparisons

Qwen 3 8b Instruct vs Qwen 3 14b Instruct →Qwen 3 8b Instruct vs Llama 3.1 8b Instruct →Yi 1.5 9b Chat vs Llama 3.1 8b Instruct →Qwen 3 8b Instruct vs Gemma 2 9b It →

Full model details

All providers for Qwen 3 8b Instruct →All providers for Yi 1.5 9b Chat →