Which LLM has the largest context window?

As of July 2026, Meta's open-weights Llama 5 leads with a 5 million token context window. Among proprietary models, GPT-5.5, GPT-5.4, Claude Fable 5, Claude Opus 4.8, Claude Sonnet 5, and the Gemini 3 family all offer roughly 1 million tokens. Note that some providers bill long prompts at premium rates — GPT-5.x doubles input pricing past 272K tokens, and Gemini 3.1 Pro steps up past 200K.

What does input vs output pricing mean?

LLM APIs charge separately for prompt tokens (input) and generated tokens (output). Output typically costs 3-5x more than input because generation is computationally heavier. Pricing is quoted per 1 million tokens.

What is the difference between open-source and closed-source LLMs?

Open-weights LLMs (Llama 5, Mistral Large 3, DeepSeek V4) publish their weights publicly — you can run them locally or fine-tune them without per-token API costs. Licenses vary: DeepSeek V4 is MIT, Mistral Large 3 is Apache 2.0, and Llama 5 uses Meta's community license. Closed-source models (GPT-5.x, Claude, Gemini) are accessed only through paid APIs.

What are modalities in the context of LLMs?

Modalities refer to the types of input a model supports: text (all models), image input (GPT-5.x, Claude, Gemini, Llama 5), audio (GPT-5.5, Gemini 3), and video (Gemini 3 family only among the compared models). Multimodal models process more than one type; DeepSeek V4 remains text-and-code only.

LLM Model Comparison

13 models · Updated 2025 · Prices per 1M tokens

13 models

Model	Context	Input $/1M	Output $/1M	Released	Modalities	License	Strengths
Claude Fable 5 Anthropic · Claude 5	1M	$10.00	$50.00	2026	textimagecode	Prop.	Deepest reasoningLong-horizon agentsWriting
Claude Haiku 4.5 Anthropic · Claude 4	200K	$1.00	$5.00	Oct 2025	textimagecode	Prop.	Fastest ClaudeLow costCoding
Claude Opus 4.8 Anthropic · Claude 4	1M	$5.00	$25.00	2026	textimagecode	Prop.	Long-horizon agentsCodingKnowledge work
Claude Sonnet 5 Anthropic · Claude 5	1M	$3.00	$15.00	2026	textimagecode	Prop.	CodingAgentic work1M context
DeepSeek V4 Flash DeepSeek · DeepSeek V4	1M	$0.14	$0.28	Apr 2026	textcode	Open	284B MoE (13B active)High volumeSpeed
DeepSeek V4 Pro DeepSeek · DeepSeek V4	1M	$0.44	$0.87	Apr 2026	textcode	Open	1.6T MoE (49B active)Code & mathUltra-low cost
GPT-5.4 OpenAI · GPT-5	1M	$2.50	$15.00	2026	textimagecode	Prop.	CodingTool searchStructured output
GPT-5.5 OpenAI · GPT-5	1M	$5.00	$30.00	2026	textimageaudiocode	Prop.	Flagship reasoningAgentic tool useMultimodal
Gemini 3.1 Pro Google · Gemini 3	1M	$2.00	$12.00	2026	textimageaudiovideocode	Prop.	Multimodal reasoningComputer useVideo input
Gemini 3.5 Flash Google · Gemini 3	1M	$1.50	$9.00	2026	textimageaudiovideocode	Prop.	Agentic codingSpeedLong context
Llama 5 Meta · Llama 5	5M	Free / Self-host	Free / Self-host	Apr 2026	textimagecode	Open	5M contextOpen weights600B params
Mistral Large 3 Mistral · Mistral	256K	Free / Self-host	Free / Self-host	Dec 2025	textimagecode	Open	Multilingual (200+ languages)675B MoE (41B active)Apache 2.0
o3 OpenAI · o-series	200K	$2.00	$8.00	Apr 2025	textimagecode	Prop.	Deep reasoningMathScience

Prices shown are standard API rates as of July 2026. Batch/cached rates may be lower; some providers charge premium rates above a long-context threshold. Open-source models shown as "Free / Self-host" — inference costs vary by provider.

Head-to-head comparisons

Detailed side-by-side pages with pricing math, verdicts, and use-case guidance.

GPT-5.5 vs Claude Opus 4.8 Claude Sonnet 5 vs GPT-5.4 Gemini 3.1 Pro vs GPT-5.5 Claude Opus 4.8 vs Gemini 3.1 Pro Claude Fable 5 vs GPT-5.5 DeepSeek V4 Pro vs GPT-5.5 DeepSeek V4 Pro vs Claude Sonnet 5 Gemini 3.5 Flash vs Claude Haiku 4.5 GPT-5.4 vs Gemini 3.5 Flash Llama 5 vs DeepSeek V4 Pro Mistral Large 3 vs Llama 5 o3 vs DeepSeek V4 Pro

LLM Model Comparison

Head-to-head comparisons

About

How to use