Hardware match · 16 GB usable
Best open-weight LLMs for the CPU only (no GPU)
With ~16 GB available for model weights, 0 open-weight models run locally on a CPU only (no GPU) — each at the best quantization that fits. The 5 that need more memory are listed below with the cheapest provider to rent them instead. VRAM figures are estimates.Updated May 2026.
Ranked for CPU only (no GPU)
5 modelsDifferent machine? Pick another rig or enter your VRAM →
Your hardwareor
VRAM ≈ parameters × bytes-per-param × 1.2 overhead, at the best available quantization — see the methodology docs. Estimates, not a guarantee.