0 providers50 models

Model crosswalk

Side-by-side on price, capability and workload — three-way comparison.

Gemma 2 9B IT
vs
Mistral 7B Instruct v0.3
vs
Qwen 3 8B Instruct
Gemma 2 9B ITA

Gemma 2 9B IT

9B params · 8K context · gemma

Cheapest providerdeepinfra
$/1M input$50000.00
$/1M output$60000.00
Mistral 7B Instruct v0.3B

Mistral 7B Instruct v0.3

7B params · 33K context · apache-2.0

Cheapest provider
$/1M input
$/1M output
Qwen 3 8B InstructC

Qwen 3 8B Instruct

8B params · 131K context · qwen

Cheapest provider
$/1M input
$/1M output
Specs and cheapest providers
SpecGemma 2 9B ITMistral 7B Instruct v0.3Qwen 3 8B Instruct
Parameters9B7B8B
Context window8K tokens33K tokens131K tokens🏆
Licensegemmaapache-2.0qwen
Released2024-07-312024-05-222025-04-28
Cheapest provider
Providerdeepinfra
Input / 1M tokens$50000.00
Output / 1M tokens$60000.00
Benchmark comparison

No benchmark data available yet.

Editor's take
Three distinct philosophies packed into roughly the same parameter budget. Gemma 2 9B IT is Google DeepMind's mid-tier entry from July 2024 — nine billion parameters, sub-$0.20-per-million-token pricing, and Groq-class latency for classification and extraction pipelines. The 8K context window is the genuine constraint: anything that needs to process more than a few thousand tokens in a single pass is going to struggle. Mistral 7B Instruct v0.3, the final iteration of Mistral's original 7B series, ships with a 32K context window and native function-calling support added in May 2024. It consistently lands below $0.10 per million tokens on hosted providers, making it the price-floor option in this group. General-chat benchmarks show it trailing the other two by a modest but real margin, and the function-calling addition is what keeps it relevant for lightweight agentic pipelines. Qwen 3 8B Instruct is the most capable of the three on current benchmark suites, particularly in multilingual evaluation sets covering CJK and Arabic. The 131K context window is a structural advantage the other two cannot match — Qwen 3 8B can process entire documents where Gemma 2 9B and Mistral 7B both need chunking strategies. At below $0.10 per million tokens on most platforms, it undercuts Gemma 2 9B on cost too. Pick Qwen 3 8B when multilingual coverage or long-context document processing matters. Pick Mistral 7B v0.3 when you need a battle-tested function-calling backbone at the absolute lowest cost tier and already have adapters tuned to its tokenizer. Pick Gemma 2 9B when Groq-hosted sub-10B latency is the primary constraint and 8K context is enough.
Compare two at a time
Frequently asked questions
How does Gemma 2 9B IT compare to Mistral 7B Instruct v0.3 and Qwen 3 8B Instruct on price?
Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
Which model is best for coding: Gemma 2 9B IT, Mistral 7B Instruct v0.3, or Qwen 3 8B Instruct?
HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
What is the context window for Gemma 2 9B IT, Mistral 7B Instruct v0.3, and Qwen 3 8B Instruct?
Context window sizes are listed in the Specs row of the comparison table above.
Full model details