0 providers50 models

Model crosswalk

Side-by-side on price, capability and workload. Both columns use the cheapest provider for that model.

OLMo 2 13B Instruct
vs
Phi-3 Medium 128K
OLMo 2 13B InstructA

OLMo 2 13B Instruct

13B params · 4K context · apache-2.0

Cheapest provider
$/1M input
$/1M output
Phi-3 Medium 128KB

Phi-3 Medium 128K

14B params · 131K context · mit

Cheapest provider
$/1M input
$/1M output
Specs and cheapest providers
SpecOLMo 2 13B InstructPhi-3 Medium 128K
Parameters13B14B
Context window4K tokens131K tokens🏆
Licenseapache-2.0mit
Released2024-11-212024-05-21
Cheapest provider
Provider
Input / 1M tokens
Output / 1M tokens

Add a third model to compare

Benchmark comparison

No benchmark data available for either model yet.

Sample workload — 5M in + 2M out per month

using each model's cheapest provider
OLMo 2 13B Instruct
$0.00 /mo
Phi-3 Medium 128K
$0.00 /mo

What changes at scale

Output tokens dominate cost above a 1:3 input/output ratio. Below 1:1, input dominates and cheaper-input providers win regardless of headline price.

1M in · 250K out$0.00 · $0.00
5M in · 2M out$0.00 · $0.00
20M in · 10M out$0.00 · $0.00
100M in · 60M out$0.00 · $0.00

Capability vs price

scatter
// scatter: benchmark × $/1M out
Calculate cost for your workload

Compare total monthly cost across providers for OLMo 2 13B Instruct and Phi-3 Medium 128K using your own input/output token mix.

Open workload calculator →
Editor's take
OLMo 2 13B and Phi-3 Medium 128K are both ~13–14B dense models, but they represent different design philosophies. Phi-3 Medium was trained on a heavily curated "textbook-quality" dataset, yielding strong MMLU scores (~78) and coding performance that punches well above its parameter count. OLMo 2 13B prioritizes full transparency — Apache 2.0 weights, fully documented training data — with MMLU around 63. On price, both models occupy a similar band: $0.18–$0.35/M tokens depending on provider, though Phi-3 Medium's 128K context window can trigger premium pricing at long context on some platforms. The 128K context is Phi-3 Medium's defining advantage. For workloads that involve long documents, multi-turn chat histories, or large codebases passed in-context, this removes the chunking overhead that OLMo 2 13B's shorter context (typically 4K–8K effective) forces on you. **Where OLMo 2 13B wins:** scenarios requiring full model transparency, on-prem deployment with zero license restrictions, or research pipelines where auditable training data matters. The Apache 2.0 license has no commercial restrictions whatsoever. **Where Phi-3 Medium 128K wins:** long-document summarization, retrieval-free Q&A over large corpora, or coding tasks where quality-per-parameter efficiency matters. The curated training data consistently surfaces better reasoning on structured tasks. Pick [OLMo 2 13B Instruct](/models/allenai--olmo-2-13b-instruct) when openness, reproducibility, or on-prem licensing are requirements. Pick [Phi-3 Medium 128K](/models/microsoft--phi-3-medium-128k) when you need a long context window or better benchmark quality at comparable cost.
Related comparisons
Full model details