How does Command R+ compare to DeepSeek V3.2 and Llama 3.3 70B Instruct on price?

Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.

Which model is best for coding: Command R+, DeepSeek V3.2, or Llama 3.3 70B Instruct?

HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.

What is the context window for Command R+, DeepSeek V3.2, and Llama 3.3 70B Instruct?

Context window sizes are listed in the Specs row of the comparison table above.

Command R Plus vs Deepseek V3.2 vs Llama 3.3 70b Instruct (2026) — 3-way comparison

Model crosswalk

Side-by-side on price, capability and workload — three-way comparison.

Command R+

DeepSeek V3.2

Llama 3.3 70B Instruct

Command R+A

Command R+

104B params · 131K context · cohere-cc-by-nc

Cheapest provideropenrouter

$/1M input$2500000.00

$/1M output$10000000.00

DeepSeek V3.2B

DeepSeek V3.2

671B params · 131K context · deepseek

Cheapest providertogether-ai

$/1M input$270000.00

$/1M output$1100000.00

Llama 3.3 70B InstructC

Llama 3.3 70B Instruct

70B params · 131K context · llama-3

Cheapest providerfireworks-ai

$/1M input$220000.00

$/1M output$880000.00

Specs and cheapest providers

Spec	Command R+	DeepSeek V3.2	Llama 3.3 70B Instruct
Parameters	104B	671B	70B
Context window	131K tokens	131K tokens	131K tokens
License	cohere-cc-by-nc	deepseek	llama-3
Released	2024-04-04	2025-05-07	2024-12-06
Cheapest provider
Provider	openrouter	together-ai	fireworks-ai
Input / 1M tokens	$2500000.00	$270000.00	$220000.00🏆
Output / 1M tokens	$10000000.00	$1100000.00	$880000.00🏆

Benchmark comparison

No benchmark data available yet.

Editor's take

Three models that sell to different buyers. Command R+ is Cohere's 104B-parameter enterprise model, released April 2024 with a 131K context window and explicit design investment in retrieval-augmented generation and multi-step tool use. Cohere built proprietary RAG evaluation suites into the development process, and the function-calling benchmark performance reflects that investment. The catch: the CC-BY-NC license prohibits commercial deployment through third-party hosts, which means production workloads run through Cohere's own API. If you are buying the RAG tooling and the managed enterprise service, the model price makes more sense. If you want a self-hostable alternative, the license rules it out. DeepSeek V3.2 is a 671B MoE from May 2025 with roughly 37B active parameters per forward pass and a 128K context window. Its benchmark numbers on code, math, and general reasoning exceed Command R+ across most evaluations. DeepSeek's own commercial license is permissive for most use cases, and per-token costs across providers like DeepInfra and Fireworks are significantly lower than Command R+ via Cohere's API. Llama 3.3 70B Instruct is the most permissive option — Meta's December 2024 dense 70B with 131K context and the Llama 3 community license. It lacks Command R+'s dedicated RAG investment but works well with standard retrieval pipelines and runs cheaply across dozens of hosted providers. Pick Command R+ when you need Cohere's enterprise RAG ecosystem and are willing to work within their managed API. Pick DeepSeek V3.2 when quality ceiling and cost-efficiency are paramount and you can manage the licensing review. Pick Llama 3.3 70B for flexible, permissively licensed general deployment with broad provider choice.

Compare two at a time

Command R+ vs DeepSeek V3.2 Command R+ vs Llama 3.3 70B Instruct DeepSeek V3.2 vs Llama 3.3 70B Instruct

Frequently asked questions

How does Command R+ compare to DeepSeek V3.2 and Llama 3.3 70B Instruct on price?: Use the table above to compare input and output prices per 1M tokens across the cheapest available providers for each model.
Which model is best for coding: Command R+, DeepSeek V3.2, or Llama 3.3 70B Instruct?: HumanEval and other code benchmarks are shown in the table. For production code tasks, also consider context window size and provider latency.
What is the context window for Command R+, DeepSeek V3.2, and Llama 3.3 70B Instruct?: Context window sizes are listed in the Specs row of the comparison table above.

Full model details

All providers for Command R+ →All providers for DeepSeek V3.2 →All providers for Llama 3.3 70B Instruct →