Cohere

Cohere API Pricing

Enterprise-focused AI with industry-leading RAG and embedding models

4 paid models · 5 free · Price range: $0.04 - $2.50 /1M

About Cohere

Cohere specializes in enterprise AI solutions with a focus on retrieval-augmented generation (RAG) and embeddings. Their Command models excel at following complex instructions, while Embed models are among the best for semantic search. Cohere offers strong data privacy guarantees for enterprise customers.

Key Highlights

  • Industry-leading embedding models
  • Excellent RAG capabilities
  • Enterprise-grade security and privacy
  • Rerank models for search optimization
  • Multilingual support (100+ languages)
Why Choose: Best-in-class for RAG and enterprise search applications. Strong choice for companies prioritizing data privacy.
9
Total Models
$0.04
Lowest Input
256k
Max Context
2
Capabilities

Pricing Features

  • Pay-per-token billing
Pricing Notes:

Enterprise pricing with volume discounts. Embed models offer excellent value for search applications.

API Features

StreamingFunction CallingRAGEmbeddingsRerankMultilingual

Common Use Cases

  • • Enterprise Search
  • • RAG Applications
  • • Semantic Search
  • • Document Classification
  • • Multilingual Support

📊 Cohere Model Comparison

Compare all models side by side. Sorted by total price (input + output).

Model Tier Input /1M Output /1M Total /1M Context Best For
Command R7B (12-2024) Budget $0.04 $0.15 $0.19 128k General tasks
Command R (08-2024) Budget $0.15 $0.60 $0.75 128k General tasks
Command R+ (08-2024) Flagship $2.50 $10.00 $12.50 128k General tasks
Command A Flagship $2.50 $10.00 $12.50 256k General tasks

🎯 Which Cohere Model Should You Choose?

Quick recommendations based on your use case.

💰

Lowest Cost

Best value for budget-conscious projects.

💬

Chat / Customer Service

High volume, short responses.

📄

Long Documents

Process large files and contexts.

Command A
256k context

💰 Cohere Monthly Cost Examples

Estimated monthly costs for common use cases.

Use Case Monthly Usage Command R7B (12-2024)
(Budget)
Command R (03-2024)
(Flagship)
Customer Service Bot
1000 conversations/day
500k input
200k output
$0.05/mo $0.00/mo
Code Assistant
200 requests/day
1.0M input
500k output
$0.11/mo $0.00/mo
Data Analysis
500 analyses/day
2.0M input
300k output
$0.12/mo $0.00/mo

⚔️ Cohere vs Competitors

How does {brand} compare to other major AI providers?

Brand Model Input /1M Output /1M Total /1M Context vs {brand}
Cohere Cohere Command R (03-2024) Current Free Free Free 128k
OpenAI OpenAI GPT-5.2 Chat $1.75 $14.00 $15.75 128k Infinity% more
OpenAI OpenAI GPT-5.2 Pro $21.00 $168.00 $189.00 400k Infinity% more
OpenAI OpenAI GPT-5.2 $1.75 $14.00 $15.75 400k Infinity% more
OpenAI OpenAI GPT-5.1-Codex-Max $1.25 $10.00 $11.25 400k Infinity% more
OpenAI OpenAI GPT-5.1 $1.25 $10.00 $11.25 400k Infinity% more
OpenAI OpenAI GPT-5.1 Chat $1.25 $10.00 $11.25 128k Infinity% more

All Models

❓ Cohere Pricing FAQ

What is the cheapest Cohere model?

The cheapest Cohere model is Command R7B (12-2024) at $0.19 per 1M tokens (input + output combined).

What is the maximum context length for Cohere models?

Cohere models support up to 256k context length, allowing you to process large documents and maintain long conversations.

How do I choose between Cohere models?

For budget projects, choose the cheapest model. For code generation, prioritize low output price. For complex reasoning, choose models with reasoning capability. Use our scenario guide above.