0 providers0 models

Model crosswalk

Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.

Qwen 3 8b Instruct
vs
Yi 1.5 9b Chat
Qwen 3 8b InstructA

Qwen 3 8b Instruct

Cheapest provider
$/1M input
$/1M output
Yi 1.5 9b ChatB

Yi 1.5 9b Chat

Cheapest provider
$/1M input
$/1M output
Specs and cheapest providers
SpecQwen 3 8b InstructYi 1.5 9b Chat
Parameters
Context window
License
Released
Cheapest provider
Provider
Input / 1M tokens
Output / 1M tokens
Benchmark comparison

No benchmark data available for either model yet.

Sample workload — 5M in + 2M out per month

using each model's cheapest provider
Qwen 3 8b Instruct
$0.00 /mo
Yi 1.5 9b Chat
$0.00 /mo

What changes at scale

Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.

1M in · 250K out$0.00 · $0.00
5M in · 2M out$0.00 · $0.00
20M in · 10M out$0.00 · $0.00
100M in · 60M out$0.00 · $0.00

Capability vs price

scatter
// scatter: benchmark × $/1M out
Calculate cost for your workload

Compare total monthly cost across providers for Qwen 3 8b Instruct and Yi 1.5 9b Chat using your own input/output token mix.

Open workload calculator →
Editor's take
[Qwen 3 8B Instruct](/models/alibaba--qwen-3-8b-instruct) and [Yi 1.5 9B Chat](/models/01-ai--yi-1.5-9b-chat) are similar in size but differ in training generation — Qwen 3 is the newer architecture with updated instruction tuning, while Yi 1.5 is a well-established model with a large community footprint. Both price in the $0.03–0.07/M token range, making cost roughly equivalent across providers. The differentiation is quality and language coverage. Qwen 3 8B Instruct scores approximately 72–74% on MMLU versus Yi 1.5 9B Chat's 65–68%, a 5–8 point gap that reflects the training generation difference. Qwen 3 8B also has tighter instruction-following — structured output tasks like JSON extraction and step-by-step reasoning with explicit formatting constraints have a higher success rate out of the box, reducing the need for few-shot examples. Yi 1.5 9B Chat has strong Mandarin Chinese performance relative to its parameter count, making it a practical choice for Chinese-language applications where 01.AI's training priorities pay off. The model also has a wider selection of community fine-tunes for domain-specific use cases (legal, medical, customer support), which matters if you're building on a fine-tuning workflow. Pick Yi 1.5 9B Chat if Chinese-language quality is the primary requirement or if you're building on an existing Yi-based fine-tune ecosystem. Pick Qwen 3 8B Instruct for English-dominant workloads where MMLU ceiling, instruction-following reliability, or structured output fidelity drives model selection.
Related comparisons
Full model details