Question 1

How much does Claude Sonnet cost per token?

Accepted Answer

Claude Sonnet 4.6 costs $3.00 per million input tokens and $15.00 per million output tokens at standard rates. With prompt caching enabled, cache write tokens cost $3.75/MTok (1.25× the input rate) and cache read tokens cost $0.30/MTok — a 90% discount vs the standard input price. This makes Claude Sonnet cost-competitive with GPT-4o for repeated-context workloads.

Question 2

Is Claude Sonnet cheaper than GPT-4o?

Accepted Answer

Claude Sonnet 4.6 ($3.00/MTok input) is slightly more expensive than GPT-4o ($2.50/MTok) at standard prices. However, Claude Sonnet's prompt caching provides a 90% discount on cache reads ($0.30/MTok), versus GPT-4o's 50% discount ($1.25/MTok). For applications with repeated system prompts or document context, Claude Sonnet can end up significantly cheaper total cost of ownership.

Question 3

What is Claude Sonnet best used for?

Accepted Answer

Claude Sonnet 4.6 is Anthropic's flagship production model — best for: complex reasoning and analysis, advanced code generation and review, nuanced instruction following, multi-step agents requiring reliability, RAG applications, content generation at quality, and any task where Claude Haiku's output isn't sufficient but Opus's cost isn't justified. It's the default choice for most serious production AI applications.

Question 4

How does Claude Sonnet compare to Claude Opus?

Accepted Answer

Claude Sonnet 4.6 costs $3.00/MTok vs Claude Opus 4.7 at $15.00/MTok — 80% cheaper on input. For the vast majority of production use cases, Sonnet delivers near-Opus quality at a fraction of the cost. Use Opus only when you need maximum reasoning depth, complex multi-step planning, or the highest quality creative output — and even then, consider using Opus for final verification while Sonnet handles most of the workload.

Question 5

What is the Claude Sonnet prompt caching discount?

Accepted Answer

Claude Sonnet 4.6 offers a 90% discount on cached prompt reads: $0.30/MTok cache read rate vs $3.00/MTok standard input. Cache writes cost $3.75/MTok (one-time per 5-minute TTL window). For a RAG app with a 5,000-token document context queried 50,000 times/day, caching saves ~$600/day ($18,000/month) vs standard pricing. This is Claude's most powerful cost optimization — especially for high-volume production apps.

Token Type	Price (per 1M tokens)	Notes
Input (standard)	$3.00	Your prompt + context
Output	$15.00	Model-generated response
Cache write	$3.75	1.25× input price, one-time per TTL window
Cache read 90% off	$0.30	10% of standard input price — the key saving

Model	Input	Output	Cache Read	Context
Gemini 2.0 Flash	$0.10	$0.40	$0.025	1M tokens
GPT-4o-mini	$0.15	$0.60	$0.075	128k tokens
GPT-4o	$2.50	$10.00	$1.25 (50% off)	128k tokens
Claude Sonnet 4.6 Best cache savings	$3.00	$15.00	$0.30 (90% off)	200k tokens
Gemini 1.5 Pro	$3.50	$10.50	—	2M tokens
Claude Opus 4.7	$15.00	$75.00	$1.50 (90% off)	200k tokens

Model	System Prompt Cost/day (cached)	User input + output/day	Monthly Total
GPT-4o (50% cache)	$18.75 (cache read)	$225 input + $150 output	~$11,800
Claude Sonnet 4.6 (90% cache) Winner	$4.50 (cache read)	$90 input + $225 output	~$9,600

Scenario	Input Tokens	Output Tokens	Cost/Call (standard)	Cost/Call (cached input)
Code review	1,500	600	$0.0135	$0.0045 (cached sys prompt)
Document Q&A	4,000	400	$0.018	$0.0072
Agent tool call	2,000	200	$0.009	$0.003
Long-form writing	500	2,000	$0.0315	$0.0315

Claude Sonnet Pricing 2026

Claude Sonnet 4.6 — Full Pricing Breakdown

Claude Sonnet vs Flagship Model Alternatives

When Claude Sonnet Beats GPT-4o on Total Cost

Best Use Cases for Claude Sonnet 4.6

Production AI applications

Complex code generation

RAG and document analysis

Multi-step agents

Claude Sonnet Cost Calculator — Quick Reference

Calculate Your Actual Claude Sonnet Cost

Frequently Asked Questions

How much does Claude Sonnet cost per token?

Is Claude Sonnet the same as Claude 3.5 Sonnet?

Should I use Claude Sonnet or Claude Haiku?

Does Claude Sonnet support extended thinking?

What is Claude Sonnet's context window?