Compare open-weights LLM inference
across 5 providers

Real pricing data, updated daily. Find the cheapest or fastest provider for your exact workload in seconds. No sign-up required.

Data last verified: May 17, 2026

Workload calculator

Enter your monthly token volumes and constraints. The calculator ranks every provider by cost and flags rate-limit or latency mismatches before you commit.

Try the calculator →
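
For readers curious how a ranking like this could work, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the calculator's actual implementation: the Provider fields, the provider names, and every price and limit below are hypothetical placeholders, and a real comparison would likely account for more factors.

```python
from dataclasses import dataclass


@dataclass
class Provider:
    # All fields are hypothetical; real listings may expose different data.
    name: str
    input_usd_per_mtok: float    # price per million input tokens
    output_usd_per_mtok: float   # price per million output tokens
    max_requests_per_min: int    # published rate limit
    typical_latency_ms: float    # typical response latency


def monthly_cost(p, input_tokens, output_tokens):
    """Estimated monthly spend in USD for the given token volumes."""
    return ((input_tokens / 1e6) * p.input_usd_per_mtok
            + (output_tokens / 1e6) * p.output_usd_per_mtok)


def rank_providers(providers, input_tokens, output_tokens,
                   needed_rpm, max_latency_ms):
    """Return (name, cost, flags) tuples sorted from cheapest to priciest.

    Providers that miss a constraint are kept in the list and flagged,
    not dropped, so the trade-off stays visible.
    """
    rows = []
    for p in providers:
        flags = []
        if p.max_requests_per_min < needed_rpm:
            flags.append("rate limit")
        if p.typical_latency_ms > max_latency_ms:
            flags.append("latency")
        rows.append((p.name, monthly_cost(p, input_tokens, output_tokens), flags))
    return sorted(rows, key=lambda row: row[1])


# Made-up example data: 200M input and 50M output tokens per month,
# needing 300 requests per minute and at most 500 ms latency.
providers = [
    Provider("provider-a", 0.50, 1.50, 600, 400.0),
    Provider("provider-b", 0.25, 1.00, 60, 900.0),
]
ranking = rank_providers(providers, 200_000_000, 50_000_000, 300, 500.0)

for name, cost, flags in ranking:
    note = ", ".join(flags) if flags else "ok"
    print(f"{name}: ${cost:,.2f}/month ({note})")
```

In this made-up example the cheaper provider is ranked first but carries both flags, which is the kind of mismatch worth catching before committing.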

Top models by parameter count

Tracked providers