Question 1

What is a token in LLM pricing?

Accepted Answer

A token is a sub-word unit of text. Roughly 1 token = 0.75 English words. Prices are quoted per million tokens (Mtok).

Question 2

Why are output tokens more expensive than input?

Accepted Answer

Generating output is sequential and uses more GPU time per token, while input can be processed in parallel. That's why output prices are typically 3–10× input prices.

Question 3

Which is the cheapest LLM in 2026?

Accepted Answer

For frontier-class quality, Llama 4 Scout and Gemini 2.5 Flash are the cheapest mainstream options at ~$0.10–$0.30 per million input tokens on most hosted providers.

Question 4

Which is the most expensive LLM?

Accepted Answer

Claude Opus 4 is the most expensive frontier model at roughly $15 in / $75 out per million tokens, followed by GPT-5 and Grok 4 at ~$5 in / $15 out.

					Modality
Meta (hosted)	Llama 4 Scout	$0.11	$0.34	1M	Text, Image
OpenAI	GPT-5 mini	$0.25	$2.00	400k	Text, Image
Meta (hosted)	Llama 4 Maverick	$0.27	$0.85	1M	Text, Image
DeepSeek	DeepSeek V3	$0.27	$1.10	128k	Text
Google	Gemini 2.5 Flash	$0.30	$2.50	1M	Text, Image, Audio, Video
Mistral	Codestral	$0.30	$0.90	256k	Text (code)
Alibaba	Qwen 3 Max	$0.40	$1.60	256k	Text, Image
DeepSeek	DeepSeek-R1	$0.55	$2.19	128k	Text
Anthropic	Claude Haiku 4	$0.80	$4.00	200k	Text, Image
Google	Gemini 2.5 Pro	$1.25	$10.00	1M	Text, Image, Audio, Video
Mistral	Mistral Large 2	$2.00	$6.00	128k	Text
Cohere	Command A	$2.50	$10.00	256k	Text
Anthropic	Claude Sonnet 4	$3.00	$15.00	200k	Text, Image
OpenAI	GPT-5	$5.00	$15.00	400k	Text, Image
xAI	Grok 4	$5.00	$15.00	256k	Text, Image
Anthropic	Claude Opus 4	$15.00	$75.00	200k	Text, Image

LLM pricing 2026

FAQ

What is a token?

Why is output more expensive than input?

Which LLM is cheapest?

Which LLM is most expensive?

Go deeper