Qwen 2.5 72B Instruct — pricing, providers, and benchmarks

Parameters: 72B
Context window: 131k tokens
License: qwen
Released: 2024-09-19

Provider pricing

Sorted by total monthly cost for 100M input + 10M output tokens.

Provider      Input / 1M  Output / 1M  Monthly cost  Context
DeepInfra     $0.1800     $0.3500      $21.50        131k
Fireworks AI  $0.2000     $0.8000      $28.00        131k
OpenRouter    $0.2200     $0.7500      $29.50        131k
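
The monthly cost figures can be reproduced from the per-1M-token prices. The sketch below is a minimal illustration, assuming the prices are in USD and the workload is given in millions of tokens; the names PRICES, monthly_cost, and rank_providers are hypothetical helpers, not part of any provider's API.

```python
# Per-1M-token prices in USD, copied from the pricing table: (input, output).
PRICES = {
    "DeepInfra": (0.18, 0.35),
    "Fireworks AI": (0.20, 0.80),
    "OpenRouter": (0.22, 0.75),
}


def monthly_cost(input_price, output_price, input_mtok=100, output_mtok=10):
    """Return the USD cost of a workload given in millions of tokens."""
    return input_price * input_mtok + output_price * output_mtok


def rank_providers(prices, input_mtok=100, output_mtok=10):
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: round(monthly_cost(inp, out, input_mtok, output_mtok), 2)
        for name, (inp, out) in prices.items()
    }
    return sorted(costs.items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, cost in rank_providers(PRICES):
        print(f"{name}: ${cost:.2f}")
```

For the default workload of 100M input + 10M output tokens, this yields the same ordering and totals as the table: $21.50, $28.00, and $29.50.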

Frequently asked questions

How much does it cost to run Qwen 2.5 72B Instruct for 100M tokens?

Running Qwen 2.5 72B Instruct with 100M input and 10M output tokens per month costs approximately $21.50 on DeepInfra (100 × $0.18 + 10 × $0.35), the cheapest provider listed as of the latest pricing data. Costs vary with your input/output ratio and with whether you use prompt caching.
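
To illustrate how the input/output ratio affects cost, the sketch below compares the reference workload with an output-heavy one. It is a rough example that assumes the listed per-1M prices in USD and ignores prompt caching, for which no prices are listed here; TABLE_PRICES and cost_for_split are hypothetical names.

```python
# Per-1M-token prices in USD from the pricing table: (input, output).
TABLE_PRICES = {
    "DeepInfra": (0.18, 0.35),
    "Fireworks AI": (0.20, 0.80),
    "OpenRouter": (0.22, 0.75),
}


def cost_for_split(input_mtok, output_mtok, prices=TABLE_PRICES):
    """Return {provider: USD cost} for a workload in millions of tokens."""
    return {
        name: round(inp * input_mtok + out * output_mtok, 2)
        for name, (inp, out) in prices.items()
    }


if __name__ == "__main__":
    # Reference workload (input-heavy) versus the reverse split (output-heavy).
    for label, split in [("100M in / 10M out", (100, 10)),
                         ("10M in / 100M out", (10, 100))]:
        print(label)
        for name, cost in sorted(cost_for_split(*split).items(),
                                 key=lambda pair: pair[1]):
            print(f"  {name}: ${cost:.2f}")
```

With these prices, DeepInfra stays cheapest in both cases, but the output-heavy split reverses the order of the other two: OpenRouter comes to about $77.20 and Fireworks AI to about $82.00, because Fireworks AI charges more per output token.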

What is the cheapest provider for Qwen 2.5 72B Instruct?

DeepInfra currently offers Qwen 2.5 72B Instruct at the lowest total cost for the reference workload of 100M input + 10M output tokens per month. Prices change frequently, so check the table above for the latest data.

What context window does Qwen 2.5 72B Instruct support?

Qwen 2.5 72B Instruct supports a context window of 131,072 tokens (shown as 131k in the pricing table). Individual providers may cap this lower; see the pricing table for per-provider context limits.
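
A request can be checked against the window before it is sent. The sketch below is only an illustration: it assumes prompt tokens and generated tokens share the same 131,072-token window, which may not hold for every provider, and fits_in_context and max_prompt_tokens are hypothetical helpers.

```python
CONTEXT_WINDOW = 131_072  # tokens, the model's stated context window


def fits_in_context(prompt_tokens, max_output_tokens, limit=CONTEXT_WINDOW):
    """Return True if prompt plus requested output fit within the window.

    Assumption: prompt and completion tokens count against one shared limit.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= limit


def max_prompt_tokens(max_output_tokens, limit=CONTEXT_WINDOW):
    """Return the largest prompt size that leaves room for the output."""
    return max(limit - max_output_tokens, 0)


if __name__ == "__main__":
    print(fits_in_context(120_000, 8_000))
    print(max_prompt_tokens(4_096))
```

If a provider caps the window lower, pass that cap as limit instead of the default.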