LLM API Pricing Calculator
Compare real-time API costs across Claude, GPT, Gemini, DeepSeek, and more.
Usage Calculator
| Model | Provider | Context | Input cost | Output cost | Total / call | Monthly | vs. expensive |
|---|---|---|---|---|---|---|---|
MiMo V2 Free coding model; 256K context; open weights | Xiaomi | 256K | Free | Free | Free | Free | — |
Devstral 2cheapest Cheapest agentic coding model; 256K context | Mistral | 256K | $0.000050 | $0.000110 | $0.000160 | $0.4800 | -99% |
Llama 4 Scout 10M context industry record; 109B MoE (17B active) | Meta | 10M | $0.000080 | $0.000150 | $0.000230 | $0.6900 | -99% |
Gemini 3.1 Flash Lite #3 Arena overall; #1 creative writing; ultra-fast | 1M | $0.000100 | $0.000200 | $0.000300 | $0.9000 | -99% | |
Llama 4 Maverick 400B MoE (17B active); strong multimodal; open weights | Meta | 1M | $0.000150 | $0.000300 | $0.000450 | $1.35 | -99% |
DeepSeek V3.2 ~90% GPT-5.4 quality at 1/50th cost; best value model | DeepSeek | 128K | $0.000280 | $0.000210 | $0.000490 | $1.47 | -98% |
Mistral Large 3 675B MoE (41B active); Apache 2.0; best cost-efficiency frontier | Mistral | 256K | $0.000500 | $0.000750 | $0.001250 | $3.75 | -96% |
Gemini 2.5 Flash Cheapest frontier model at scale | 1M | $0.000300 | $0.001250 | $0.001550 | $4.65 | -95% | |
DeepSeek R1 671B MoE (37B active); MIT license; distilled variants available | DeepSeek | 128K | $0.000550 | $0.001095 | $0.001645 | $4.94 | -95% |
Kimi K2 1T params; Agent Swarm (100 agents); Modified MIT | Moonshot | 128K | $0.000550 | $0.001100 | $0.001650 | $4.95 | -95% |
Qwen 3 235B 235B MoE (22B active); Apache 2.0; strongest OSS competitive programming | Alibaba | 128K | $0.000860 | $0.001000 | $0.001860 | $5.58 | -94% |
Claude Haiku 4.5 Fastest Claude, cheapest tier | Anthropic | 200K | $0.000800 | $0.002000 | $0.002800 | $8.40 | -91% |
Gemini 2.5 Pro Thinking model; top WebDev Arena 1415; native multimodal | 1M | $0.001250 | $0.005000 | $0.006250 | $18.75 | -79% | |
GPT-4o Legacy but still available; superseded by GPT-5 family | OpenAI | 128K | $0.002500 | $0.005000 | $0.007500 | $22.50 | -75% |
GPT-5.4 Unifies Codex + GPT; 1M context; built-in computer use | OpenAI | 1M | $0.003000 | $0.007500 | $0.0105 | $31.50 | -65% |
Claude Sonnet 4.6 Best value frontier; beats Opus 4.5 in 59% head-to-head | Anthropic | 1M | $0.003000 | $0.007500 | $0.0105 | $31.50 | -65% |
Grok 3 Strong math/science; now legacy (Grok 4 series launched) | xAI | 131K | $0.003000 | $0.007500 | $0.0105 | $31.50 | -65% |
Claude Opus 4.6 #1 Arena Hard Prompts & Coding; 128K max output | Anthropic | 1M | $0.005000 | $0.0125 | $0.0175 | $52.50 | -42% |
Claude Opus 4.5 Major price cut from Opus 4; strong agentic coding | Anthropic | 200K | $0.005000 | $0.0125 | $0.0175 | $52.50 | -42% |
Grok 4 Top-5 Arena; strong reasoning & real-time X data | xAI | 256K | $0.005000 | $0.0125 | $0.0175 | $52.50 | -42% |
o3 Strongest OpenAI reasoning model | OpenAI | 200K | $0.0100 | $0.0200 | $0.0300 | $90.00 | — |
Pricing sourced from official provider documentation. Figures are per 1 million tokens. Self-hosted models (Llama 4) cost only compute. Monthly estimate = daily calls × 30 days. Prices subject to change.