
Model crosswalk

Side-by-side on price, capability and workload — three-way comparison.

Command R+ vs DBRX Instruct vs Llama 3.1 405B Instruct
Command R+

104B params · 131K context · cohere-cc-by-nc

Cheapest provider: openrouter
$/1M input: $2500000.00
$/1M output: $10000000.00

DBRX Instruct

132B params · 33K context · databricks-open-model

Cheapest provider: not listed
$/1M input: not listed
$/1M output: not listed

Llama 3.1 405B Instruct

405B params · 131K context · llama-3

Cheapest provider: deepinfra
$/1M input: $2700000.00
$/1M output: $8000000.00

Specs and cheapest providers

Spec | Command R+ | DBRX Instruct | Llama 3.1 405B Instruct
Parameters | 104B | 132B | 405B
Context window | 131K tokens | 33K tokens | 131K tokens
License | cohere-cc-by-nc | databricks-open-model | llama-3
Released | 2024-04-04 | 2024-03-27 | 2024-07-23

Cheapest provider

Price | Command R+ | DBRX Instruct | Llama 3.1 405B Instruct
Provider | openrouter | not listed | deepinfra
Input / 1M tokens | $2500000.00 🏆 | not listed | $2700000.00
Output / 1M tokens | $10000000.00 | not listed | $8000000.00 🏆

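The two price rows have different winners, so which of the two priced models is cheaper depends on how many input tokens a workload sends per output token. The following is a minimal sketch of that arithmetic, not part of the site's tooling: `Pricing` and `break_even_ratio` are hypothetical names, and the figures are copied as listed in the table. The break-even ratio depends only on the proportion between the price gaps, so it holds regardless of the unit scale the prices are quoted in.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-1M-token prices for one model (hypothetical helper type)."""
    input_per_million: float
    output_per_million: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        """Total cost of a workload with the given token counts."""
        return (input_tokens * self.input_per_million
                + output_tokens * self.output_per_million) / 1_000_000


def break_even_ratio(a: Pricing, b: Pricing) -> float:
    """Input tokens per output token at which `a` and `b` cost the same.

    Assumes `a` is cheaper on input and `b` is cheaper on output.
    Above the returned ratio `a` is cheaper; below it `b` is cheaper.
    """
    input_gap = b.input_per_million - a.input_per_million
    output_gap = a.output_per_million - b.output_per_million
    if input_gap <= 0 or output_gap <= 0:
        raise ValueError("a must win on input and b must win on output")
    return output_gap / input_gap


# Figures as listed in the table above.
command_r_plus = Pricing(2_500_000.00, 10_000_000.00)
llama_405b = Pricing(2_700_000.00, 8_000_000.00)

ratio = break_even_ratio(command_r_plus, llama_405b)
print(ratio)  # 10.0
```

With the listed figures, Command R+ comes out cheaper only for workloads that send more than 10 input tokens per output token, which is typical of retrieval-heavy prompts; generation-heavy workloads favor Llama 3.1 405B Instruct.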
Benchmark comparison

No benchmark data available yet.

Editor's take
Three large open-weights models from 2024, each positioned for a different enterprise buyer.

Command R+ from Cohere targets enterprise teams building RAG and tool-use pipelines. At 104B parameters and 131K context, it performs reliably on function-calling evaluations and was designed from the ground up around grounding and retrieval workflows. The CC-BY-NC license means commercial workloads run through Cohere's managed API; self-hosting is not a production option without a separate agreement. If you are already budgeting for Cohere's API and want enterprise support contracts, this is a coherent choice.

DBRX Instruct is Databricks' March 2024 mixture-of-experts (MoE) model: 132B total parameters with 36B active per token and a 32,768-token context window (listed as 33K above). It outperformed Mixtral 8x7B at launch on MMLU and HumanEval, but the context ceiling and the narrower provider ecosystem have limited its appeal outside the Databricks platform. The Databricks Open Model License covers most commercial use. Models such as Mixtral 8x22B (April 2024) and DeepSeek V3 (December 2024) have since overtaken it on cost-adjusted quality, leaving Databricks ecosystem integration as the main remaining reason to choose it.

Llama 3.1 405B Instruct is Meta's July 2024 maximum-scale open release: 405 billion dense parameters, 131K context, and the Llama 3.1 Community License. It represents the quality ceiling for openly licensed models from Meta and is the right choice when raw capability at maximum parameter scale is the requirement and you have the infrastructure to run it. Lambda Labs and a handful of GPU-heavy hosts carry it. Per-token costs are significantly higher than for 70B alternatives.

Pick Command R+ for managed enterprise RAG with Cohere tooling. Pick DBRX when you are already in the Databricks ecosystem. Pick Llama 3.1 405B when you need the highest parameter count under a license that permits most commercial use and have the infrastructure to justify it.
Frequently asked questions
How does Command R+ compare to DBRX Instruct and Llama 3.1 405B Instruct on price?
Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
Which model is best for coding: Command R+, DBRX Instruct, or Llama 3.1 405B Instruct?
No HumanEval or other code benchmark data is available for these models on this page yet, so the comparison cannot settle this on scores. For production code tasks, consider context window size and provider latency.
What is the context window for Command R+, DBRX Instruct, and Llama 3.1 405B Instruct?
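Command R+ and Llama 3.1 405B Instruct each have a 131K-token context window; DBRX Instruct has a 33K-token window. These figures appear in the Context window row of the specs table above.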