Anthropic's Claude model family spans the full spectrum of AI capability and efficiency: the lightning-fast Haiku for high-volume tasks, the balanced Sonnet for everyday intelligence, and the frontier-setting Opus for the most demanding reasoning and analysis. Here's everything you need to know to choose the right model.
## Current Models

### Benchmark Comparison
| Model | GPQA Diamond | HumanEval | MATH | SWE-bench |
|---|---|---|---|---|
| Claude Opus 4 | 94.2% | 91.3% | 88.4% | 48.9% |
| Claude Sonnet 4.5 | 81.7% | 85.6% | 79.2% | 38.4% |
| Claude Haiku 4.5 | 63.4% | 71.8% | 64.1% | 18.2% |
| GPT-4o (competitor) | 83.1% | 87.2% | 76.6% | 33.2% |
*Benchmarks as of May 2026. GPQA Diamond = graduate-level scientific reasoning. HumanEval = Python code generation. MATH = competition math. SWE-bench = real software engineering tasks.*
## Extended Thinking

### How Extended Thinking Works
Extended Thinking allows Claude Sonnet 4.5 and Opus 4 to reason through complex problems step-by-step before delivering a final answer. Rather than responding immediately, the model allocates a configurable "thinking budget" (a maximum number of tokens it can use for internal reasoning) before producing the output you see.
This mode dramatically improves performance on multi-step reasoning tasks, hard math problems, code challenges requiring planning, and ambiguous instructions where working through edge cases first produces better results. In our testing, enabling Extended Thinking on MATH benchmark tasks improved Sonnet 4.5's score by nearly 8 percentage points.
You control the thinking budget via the API. A budget of 10,000 tokens is sufficient for most tasks; complex research or architectural decisions may benefit from 32,000 or more. Thinking tokens are billed as output tokens.
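As a minimal sketch, the request below shows how a thinking budget might be set with the Anthropic Python SDK's `thinking` parameter. The model ID string and the prompt are assumptions for illustration; check the current API reference for the exact identifiers available to your account.

```python
# Sketch: enabling Extended Thinking on a Messages API request.
# The model ID below is an assumption for illustration.
request = {
    "model": "claude-sonnet-4-5",      # hypothetical model ID
    "max_tokens": 16000,               # must exceed the thinking budget
    "thinking": {
        "type": "enabled",
        "budget_tokens": 10000,        # the 10k budget suggested above
    },
    "messages": [
        {"role": "user", "content": "Plan a migration from REST to gRPC."}
    ],
}

# With the SDK installed and an API key configured, you would send it as:
# import anthropic
# client = anthropic.Anthropic()
# response = client.messages.create(**request)
```

Note that the thinking budget counts against `max_tokens`, so `max_tokens` must be set higher than `budget_tokens` to leave room for the visible answer.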
## Start building with Claude today
Access all models, prompt caching, and the full Anthropic API from the developer console.