LLM API Pricing Calculator

Compare real-time API costs across Claude, GPT, Gemini, DeepSeek, and more.

Usage Calculator

| Model | Provider | Context | Input cost | Output cost | Total / call | Monthly | vs. most expensive | Notes |
|---|---|---|---|---|---|---|---|---|
| MiMo V2 | Xiaomi | 256K | Free | Free | Free | Free | | Free coding model; 256K context; open weights |
| Devstral 2 (cheapest) | Mistral | 256K | $0.000050 | $0.000110 | $0.000160 | $0.4800 | -99% | Cheapest agentic coding model; 256K context |
| Llama 4 Scout | Meta | 10M | $0.000080 | $0.000150 | $0.000230 | $0.6900 | -99% | 10M context industry record; 109B MoE (17B active) |
| Gemini 3.1 Flash Lite | Google | 1M | $0.000100 | $0.000200 | $0.000300 | $0.9000 | -99% | #3 Arena overall; #1 creative writing; ultra-fast |
| Llama 4 Maverick | Meta | 1M | $0.000150 | $0.000300 | $0.000450 | $1.35 | -99% | 400B MoE (17B active); strong multimodal; open weights |
| DeepSeek V3.2 | DeepSeek | 128K | $0.000280 | $0.000210 | $0.000490 | $1.47 | -98% | ~90% GPT-5.4 quality at 1/50th cost; best value model |
| Mistral Large 3 | Mistral | 256K | $0.000500 | $0.000750 | $0.001250 | $3.75 | -96% | 675B MoE (41B active); Apache 2.0; best cost-efficiency frontier |
| Gemini 2.5 Flash | Google | 1M | $0.000300 | $0.001250 | $0.001550 | $4.65 | -95% | Cheapest frontier model at scale |
| DeepSeek R1 | DeepSeek | 128K | $0.000550 | $0.001095 | $0.001645 | $4.94 | -95% | 671B MoE (37B active); MIT license; distilled variants available |
| Kimi K2 | Moonshot | 128K | $0.000550 | $0.001100 | $0.001650 | $4.95 | -95% | 1T params; Agent Swarm (100 agents); Modified MIT |
| Qwen 3 235B | Alibaba | 128K | $0.000860 | $0.001000 | $0.001860 | $5.58 | -94% | 235B MoE (22B active); Apache 2.0; strongest OSS competitive programming |
| Claude Haiku 4.5 | Anthropic | 200K | $0.000800 | $0.002000 | $0.002800 | $8.40 | -91% | Fastest Claude, cheapest tier |
| Gemini 2.5 Pro | Google | 1M | $0.001250 | $0.005000 | $0.006250 | $18.75 | -79% | Thinking model; top WebDev Arena 1415; native multimodal |
| GPT-4o | OpenAI | 128K | $0.002500 | $0.005000 | $0.007500 | $22.50 | -75% | Legacy but still available; superseded by GPT-5 family |
| GPT-5.4 | OpenAI | 1M | $0.003000 | $0.007500 | $0.0105 | $31.50 | -65% | Unifies Codex + GPT; 1M context; built-in computer use |
| Claude Sonnet 4.6 | Anthropic | 1M | $0.003000 | $0.007500 | $0.0105 | $31.50 | -65% | Best value frontier; beats Opus 4.5 in 59% head-to-head |
| Grok 3 | xAI | 131K | $0.003000 | $0.007500 | $0.0105 | $31.50 | -65% | Strong math/science; now legacy (Grok 4 series launched) |
| Claude Opus 4.6 | Anthropic | 1M | $0.005000 | $0.0125 | $0.0175 | $52.50 | -42% | #1 Arena Hard Prompts & Coding; 128K max output |
| Claude Opus 4.5 | Anthropic | 200K | $0.005000 | $0.0125 | $0.0175 | $52.50 | -42% | Major price cut from Opus 4; strong agentic coding |
| Grok 4 | xAI | 256K | $0.005000 | $0.0125 | $0.0175 | $52.50 | -42% | Top-5 Arena; strong reasoning & real-time X data |
| o3 | OpenAI | 200K | $0.0100 | $0.0200 | $0.0300 | $90.00 | | Strongest OpenAI reasoning model |
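
The percentage column appears to compare each model's monthly cost with the most expensive model in the table (o3 at $90.00 per month). The sketch below reproduces that comparison under two assumptions that the page does not state: the baseline is the highest monthly cost, and the result is rounded to the nearest whole percent. The function name `savings_vs_most_expensive` is hypothetical.

```python
def savings_vs_most_expensive(monthly_cost: float, most_expensive: float) -> int:
    """Return the signed percentage difference from the most expensive monthly cost.

    Assumption: the page rounds to the nearest whole percent. Python's round()
    resolves exact halves to the nearest even integer, which may differ from
    the page's own rounding for borderline values.
    """
    if most_expensive <= 0:
        raise ValueError("most_expensive must be positive")
    saving = (1 - monthly_cost / most_expensive) * 100
    return -round(saving)


# Monthly figures taken from the table above; o3 is the baseline.
baseline = 90.00
for model, monthly in [("GPT-4o", 22.50), ("Claude Opus 4.6", 52.50), ("Devstral 2", 0.48)]:
    print(f"{model}: {savings_vs_most_expensive(monthly, baseline)}%")
```

Checking a few rows this way is a quick sanity test that the monthly figures and the percentages in the table agree with each other.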

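Pricing is sourced from official provider documentation, where rates are quoted per 1 million tokens; the table converts them to a cost per call. Self-hosted models (for example, Llama 4) incur only compute costs. Monthly estimate = total cost per call × daily calls × 30 days. Prices are subject to change.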
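
The arithmetic behind the per-call and monthly columns can be sketched as follows. The token counts (1,000 input, 500 output), the per-million prices ($3.00 and $15.00), and the volume of 100 calls per day are illustrative assumptions, not values stated on the page. They were chosen because they reproduce the $0.0105 per-call and $31.50 monthly rows, and because every monthly figure in the table is 3,000 times its per-call total. The function names are hypothetical.

```python
TOKENS_PER_MILLION = 1_000_000


def cost_per_call(input_tokens: int, output_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Convert per-1M-token prices into the cost of a single call."""
    input_cost = input_tokens / TOKENS_PER_MILLION * input_price_per_m
    output_cost = output_tokens / TOKENS_PER_MILLION * output_price_per_m
    return input_cost + output_cost


def monthly_estimate(total_per_call: float, daily_calls: int, days: int = 30) -> float:
    """Monthly estimate = total cost per call x daily calls x 30 days."""
    return total_per_call * daily_calls * days


# Assumed workload: 1,000 input tokens and 500 output tokens per call,
# 100 calls per day, at hypothetical rates of $3.00 / $15.00 per 1M tokens.
per_call = cost_per_call(1_000, 500, input_price_per_m=3.00, output_price_per_m=15.00)
monthly = monthly_estimate(per_call, daily_calls=100)
print(f"per call: ${per_call:.4f}, monthly: ${monthly:.2f}")
```

Keeping the two steps in separate functions makes it easy to change the workload (tokens per call, calls per day) without touching the price conversion.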