DeepInfra — models and pricing

Model catalog (12 models)

ModelInput / 1M tokensOutput / 1M tokensContext
Qwen 2.5 72B Instruct$0.1800$0.3500131k
Qwen 2.5 Coder 32B Instruct$0.1200$0.2500131k
Qwen 3 72B Instruct$0.2300$0.4500131k
DeepSeek R1$0.4000$2.0000131k
DeepSeek R1 Distill Llama 70B$0.2800$0.5500131k
DeepSeek V3$0.2000$0.8500131k
Gemma 2 9B IT$0.0500$0.06008k
Llama 3.1 405B Instruct$2.7000$8.0000131k
Llama 3.1 70B Instruct$0.2300$0.4000131k
Llama 3.1 8B Instruct$0.0600$0.0600131k
Llama 3.3 70B Instruct$0.2300$0.4000131k
Mixtral 8x22B Instruct$0.6000$0.650066k