Question 1

What does Claude 4 cost per token?

Accepted Answer

Claude 4 family pricing (per million tokens): Claude Sonnet 4.6 — $3.00 input / $15.00 output. Claude Haiku 4.5 — $0.80 input / $4.00 output. Claude Opus 4.7 — $15.00 input / $75.00 output. With prompt caching, cache reads cost 10% of the standard input price — giving 90% savings on repeated context. Cache writes cost 125% of the input rate (one-time cost per TTL window, typically 5 minutes).

Question 2

Is Claude 4 more expensive than GPT-4o?

Accepted Answer

Claude Sonnet 4.6 ($3.00/MTok input) is slightly more expensive than GPT-4o ($2.50/MTok input) at standard rates. However, Claude Sonnet's prompt caching (cache reads at $0.30/MTok) outperforms GPT-4o's cached rate for long-context applications. Claude Haiku 4.5 ($0.80/MTok) is significantly more expensive than GPT-4o-mini ($0.15/MTok) without caching, but competitive with caching on (cache reads drop to $0.08/MTok vs GPT-4o-mini's $0.075/MTok).

Question 3

Which Claude 4 model should I use?

Accepted Answer

Use Claude Haiku 4.5 for: classification, extraction, high-volume chatbots, simple Q&A, lightweight agents. Cost: $0.80/MTok input. Use Claude Sonnet 4.6 for: most production tasks, code generation, complex reasoning, agentic workflows, content creation. Cost: $3.00/MTok input. Use Claude Opus 4.7 for: the hardest tasks — advanced research, complex multi-step reasoning, frontier coding benchmarks. Cost: $15.00/MTok input. Rule of thumb: start with Haiku, move to Sonnet when quality isn't sufficient, reach for Opus only when Sonnet plateaus.

Question 4

How much does the Claude 4 batch API discount?

Accepted Answer

Anthropic's Batch API gives a 50% discount on both input and output tokens for all Claude 4 models. Batch requests are processed asynchronously (typically within 24 hours). Discounted rates: Claude Sonnet 4.6 — $1.50/MTok input, $7.50/MTok output. Claude Haiku 4.5 — $0.40/MTok input, $2.00/MTok output. Claude Opus 4.7 — $7.50/MTok input, $37.50/MTok output. Batch API is ideal for data processing pipelines, document classification at scale, and any workload that doesn't require real-time responses.

Question 5

How can I estimate my Claude 4 API costs?

Accepted Answer

Use prompt-pricing.vercel.app to paste any prompt and instantly see token counts plus cost across all Claude 4 models (and GPT-4o, Gemini). For back-of-envelope estimates: 1 million tokens ≈ 750,000 words ≈ ~5,000 typical chat turns (200 tokens avg). A chatbot handling 10,000 conversations/day with 1,500 avg tokens each = 15M tokens/day = $4.50/day on Haiku 4.5 or $45/day on Sonnet 4.6 before caching. With prompt caching (1,000-token system prompt cached), Haiku drops to ~$1.05/day.

Model	Input ($/MTok)	Output ($/MTok)	Cache Write ($/MTok)	Cache Read ($/MTok)	Context Window
Claude Haiku 4.5BUDGET	$0.80	$4.00	$1.00	$0.08	200K tokens
Claude Sonnet 4.6BEST VALUE	$3.00	$15.00	$3.75	$0.30	200K tokens
Claude Opus 4.7FRONTIER	$15.00	$75.00	$18.75	$1.50	200K tokens

Task	Approx Tokens	Haiku 4.5	Sonnet 4.6	Opus 4.7
Short classification (100 in / 20 out)	120 tokens	$0.000096	$0.00036	$0.0018
Chat turn (300 in / 200 out)	500 tokens	$0.0010	$0.0039	$0.0195
Code review (2K in / 500 out)	2,500 tokens	$0.0036	$0.0135	$0.0675
Document summary (10K in / 1K out)	11,000 tokens	$0.013	$0.045	$0.225
Long analysis (50K in / 2K out)	52,000 tokens	$0.048	$0.180	$0.900

Model	Cost Without Caching/day	Cost With Caching/day	Savings/day
Haiku 4.5	$20.80	$3.68	$17.12 (82%)
Sonnet 4.6	$78.00	$13.80	$64.20 (82%)
Opus 4.7	$390.00	$69.00	$321.00 (82%)

Use Case	Recommended Model	Why
Classification / extraction	Haiku 4.5	Fast, cheap, sufficient quality
Customer support chatbot	Haiku 4.5 (cached)	Repeated system prompt = 82% cost savings
Production code generation	Sonnet 4.6	Best quality/cost balance for coding
Complex reasoning / research	Sonnet 4.6 or Opus 4.7	Depends on task complexity
Large document analysis	Sonnet 4.6 (batch)	200K context + 50% batch discount
Frontier performance required	Opus 4.7	Top benchmark scores, 5× Sonnet cost
Data pipeline at scale	Haiku 4.5 (batch)	$0.40/MTok — lowest available rate

Claude 4 API Pricing 2026

Standard API Pricing

Batch API Pricing (50% Discount)

Real-World Cost Estimates

Prompt Caching Savings Example

Which Claude 4 Model to Use

Calculate your exact Claude 4 costs

Frequently Asked Questions

What does Claude 4 cost per token?

Is Claude 4 more expensive than GPT-4o?

How do I enable prompt caching for Claude 4?

Does Claude 4 have a free tier?

What is the cheapest way to use Claude 4?

Model	Batch Input ($/MTok)	Batch Output ($/MTok)	Savings vs Standard
Claude Haiku 4.5	$0.40	$2.00	50% off
Claude Sonnet 4.6	$1.50	$7.50	50% off
Claude Opus 4.7	$7.50	$37.50	50% off