Anthropic API Pricing
Claude series developer, known for safety, long context, and coding excellence
14 paid models · 24 free · Price range: $0.25 - $15.00 /1M
About Anthropic
Anthropic was founded by former OpenAI researchers with a focus on AI safety. Their Claude models feature industry-leading 200K context windows, exceptional coding abilities, and the Constitutional AI safety framework. Claude 3.5 Sonnet is widely regarded as one of the best models for coding tasks.
Key Highlights
- 200K ultra-long context window
- Industry-leading code generation quality
- Constitutional AI safety framework
- Native PDF and image understanding
- Excellent instruction following
Pricing Features
- Pay-per-token billing
Supports Prompt Caching (up to 90% discount on cached tokens) and Batch API (50% off). Message Batches API for async processing.
API Features
Common Use Cases
- • Enterprise Chat Applications
- • Code Generation & Review
- • Long Document Analysis
- • Research Assistance
- • Content Writing
Anthropic API pricing guide
Focused notes for developers comparing official pricing, API docs, token billing, and model fit.
Anthropic API pricing for Claude Sonnet, Opus, and Haiku
Use this page to compare Anthropic API pricing across Claude models by input price, output price, context length, and supported capabilities. Claude 3.5 Sonnet and newer Sonnet-family models are usually the first place developers look for coding, agent workflows, and long document analysis, while Haiku-family models are better candidates when latency and budget matter more than peak reasoning quality.
- Compare input and output token prices separately before choosing a Claude model.
- Use Sonnet models for code generation, software agents, and complex analysis.
- Use Haiku models for high-volume chat, classification, and lightweight extraction tasks.
- Check Opus-family models when maximum reasoning quality matters more than unit cost.
How to reduce Anthropic Claude API costs
Anthropic pricing can change materially when you use prompt caching or batch processing. Prompt caching is useful when the same system prompt, policy document, or knowledge base context is reused across many requests. Batch processing is better for offline workloads that can wait for asynchronous completion.
- Use prompt caching for repeated instructions, long reference documents, and stable agent context.
- Use batch requests for offline analysis, enrichment, and non-real-time content workflows.
- Estimate real spend with your own input/output token ratio instead of comparing only headline prices.
Related search terms
Useful next steps
📊 Anthropic Model Comparison
Compare all models side by side. Sorted by total price (input + output).
| Model | Tier | Input /1M | Output /1M | Total /1M | Context | Best For |
|---|---|---|---|---|---|---|
| Claude 3 Haiku | Budget | $0.25 | $1.25 | $1.50 | 200k | Image analysis |
| Claude 3 Haiku | Budget | $0.25 | $1.25 | $1.50 | 200k | Image analysis |
| Claude 3.5 Haiku | Budget | $0.80 | $4.00 | $4.80 | 200k | Image analysis |
| Claude Haiku 4.5 | Budget | $1.00 | $5.00 | $6.00 | 200k | Complex reasoning, math |
| Claude 3.7 Sonnet | Budget | $3.00 | $15.00 | $18.00 | 200k | Complex reasoning, math |
| Claude Sonnet 4 | Budget | $3.00 | $15.00 | $18.00 | 200k | Complex reasoning, math |
🎯 Which Anthropic Model Should You Choose?
Quick recommendations based on your use case.
💰 Anthropic Monthly Cost Examples
Estimated monthly costs for common use cases.
| Use Case | Monthly Usage | Claude 3 Haiku (Budget) |
Claude Sonnet 4.6 (Flagship) |
|---|---|---|---|
|
Customer Service Bot
1000 conversations/day
|
500k input 200k output |
$0.38/mo | $0.00/mo |
|
Code Assistant
200 requests/day
|
1.0M input 500k output |
$0.88/mo | $0.00/mo |
|
Data Analysis
500 analyses/day
|
2.0M input 300k output |
$0.88/mo | $0.00/mo |
⚔️ Anthropic vs Competitors
How does {brand} compare to other major AI providers?
| Brand | Model | Input /1M | Output /1M | Total /1M | Context | vs {brand} |
|---|---|---|---|---|---|---|
Anthropic
|
Claude Sonnet 4.6 Current | - | - | Free | 1.0M | — |
OpenAI
|
Codex Mini | $1.50 | $6.00 | $7.50 | 200k | Infinity% more |
OpenAI
|
GPT-5.2 Pro | $21.00 | $168.00 | $189.00 | 400k | Infinity% more |
OpenAI
|
GPT-5.2 | $1.75 | $14.00 | $15.75 | 400k | Infinity% more |
OpenAI
|
GPT-5.1-Codex-Max | $1.25 | $10.00 | $11.25 | 400k | Infinity% more |
OpenAI
|
GPT-5.1 | $1.25 | $10.00 | $11.25 | 400k | Infinity% more |
OpenAI
|
GPT-5.1-Codex | $1.25 | $10.00 | $11.25 | 400k | Infinity% more |
All Models
❓ Anthropic Pricing FAQ
What is the cheapest Anthropic model?
The cheapest Anthropic model is Claude 3 Haiku at $1.50 per 1M tokens (input + output combined).
What is the maximum context length for Anthropic models?
Anthropic models support up to 1.0M context length, allowing you to process large documents and maintain long conversations.
How do I choose between Anthropic models?
For budget projects, choose the cheapest model. For code generation, prioritize low output price. For complex reasoning, choose models with reasoning capability. Use our scenario guide above.
What is the best Anthropic model for Claude 3.5 Sonnet pricing searches?
Start with the Sonnet-family rows in the model table, then compare input price, output price, and context length. Sonnet is commonly chosen for coding and agent tasks because quality often matters more than the lowest possible token price.
Does Anthropic API pricing include separate input and output token prices?
Yes. For budgeting, treat input and output tokens separately. Long prompts or documents increase input cost, while code generation, summaries, and agent traces usually increase output cost.
When should I use Anthropic prompt caching?
Use prompt caching when many requests share the same long prompt, policy text, examples, or retrieval context. It can reduce repeated-context cost, but it is less useful for one-off prompts.