Gemma 2 9B IT — pricing, providers, and benchmarks
Parameters
9B
Context window
8k tokens
License
gemma
Released
2024-07-31
Provider pricing
Sorted by total monthly cost for 100M input + 10M output tokens.
| Provider | Input / 1M | Output / 1M | Monthly cost | Context |
|---|---|---|---|---|
| DeepInfra | $0.0500 | $0.0600 | $5.60 | 8k |
| Fireworks AI | $0.0800 | $0.0800 | $8.80 | 8k |
| Groq | $0.0800 | $0.0800 | $8.80 | 8k |
Frequently asked questions
How much does it cost to run Gemma 2 9B IT for 100M tokens?▾
Running Gemma 2 9B IT with 100M input and 10M output tokens per month costs approximately $5.60 on DeepInfra, the cheapest available provider as of the latest pricing data. Costs vary significantly depending on your input/output ratio and whether you use prompt caching.
What is the cheapest provider for Gemma 2 9B IT?▾
DeepInfra currently offers Gemma 2 9B IT at the lowest total cost for a standard workload. Prices change frequently — check the table above for the latest data.
What context window does Gemma 2 9B IT support?▾
Gemma 2 9B IT supports a context window of 8,192 tokens. Individual providers may cap this lower — see the pricing table for per-provider context limits.