AI API Pricing Hub: Compare Token Costs Across Providers

Compare pricing for OpenAI, Anthropic, Google, DeepSeek, xAI, Groq, and more in one place. Filter by brand, price, context length, and capabilities.

๐Ÿ”ฅ Model Pricing Overview

Click price to open calculator, check models to compare ยท 100 models

Clear Filters
Brand Model Input /1M Output /1M Context Capabilities
OpenRouter OpenRouter Free Models Router Free Long Free Free 200k
chatvisionreasoning
OpenRouter OpenRouter Pony Alpha Free Long Free Free 200k
chatreasoning
OpenRouter OpenRouter Aurora Alpha Free Free Free 128k
chatreasoning
OpenAI OpenAI o1-mini Free Free Free 128k
chatreasoning
OpenAI OpenAI o1-mini Free Free Free 128k
chatreasoning
Perplexity Perplexity Sonar Reasoning Free Free Free 127k
chatreasoning
Perplexity Perplexity Sonar Reasoning Free Free Free 127k
chatreasoning
DeepSeek DeepSeek R1 Distill Qwen 14B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 1.5B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 1.5B Free Free Free 131k
chatreasoningcode
Perplexity Perplexity R1 1776 Free Free Free 128k
chatreasoning
Perplexity Perplexity R1 1776 Free Free Free 128k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 Zero Free Free Free 164k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 Zero Free Free Free 164k
chatreasoning
Other Other L3.3 Electra R1 70B Free Free Free 128k
chatreasoning
Other Other L3.3 Electra R1 70B Free Free Free 128k
chatreasoning
NVIDIA NVIDIA Llama 3.1 Nemotron Ultra 253B v1 Free Free Free 131k
chatreasoning
Moonshot AI Moonshot AI Kimi VL A3B Thinking Free Free Free 131k
chatvisionreasoning
Moonshot AI Moonshot AI Kimi VL A3B Thinking Free Free Free 131k
chatvisionreasoning
Microsoft Microsoft MAI DS R1 Free Free Free 164k
chatreasoning
Microsoft Microsoft MAI DS R1 Free Free Free 164k
chatreasoning
Other Other DeepSeek R1T Chimera Free Free Free 164k
chatreasoning
Other Other DeepSeek R1T Chimera Free Free Free 164k
chatreasoning
OpenAI OpenAI Codex Mini Free Long Free Free 200k
chatvisionreasoningcode
Other Other Valkyrie 49B V1 Free Free Free 131k
chatreasoning
Other Other Valkyrie 49B V1 Free Free Free 131k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 0528 Qwen3 8B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 7B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 7B Free Free Free 131k
chatreasoningcode
Moonshot AI Moonshot AI Kimi Dev 72B Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Cypher Alpha Free Long Free Free 1.0M
chatreasoning
OpenRouter OpenRouter Cypher Alpha Free Long Free Free 1.0M
chatreasoning
Other Other Cogito V2 Preview Deepseek 671B Free Free Free 131k
chatreasoning
Other Other Cogito V2 Preview Deepseek 671B Free Free Free 131k
chatreasoning
Other Other Cogito V2 Preview Llama 109B Free Free Free 131k
chatvisionreasoning
Other Other Cogito V2 Preview Llama 70B Free Free Free 131k
chatreasoning
ByteDance ByteDance Seed OSS 36B Instruct Free Free Free 131k
chatreasoning
ByteDance ByteDance Seed OSS 36B Instruct Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Sonoma Sky Alpha Free Long Free Free 2.0M
chatvisionreasoning
OpenRouter OpenRouter Sonoma Sky Alpha Free Long Free Free 2.0M
chatvisionreasoning
Google Google Gemini 2.5 Flash Preview 09-2025 Free Long Free Free 1.0M
chatvisionvideoaudio +1
Other Other Cogito V2 Preview Llama 405B Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Andromeda Alpha Free Free Free 128k
chatvisionreasoning
OpenRouter OpenRouter Andromeda Alpha Free Free Free 128k
chatvisionreasoning
OpenRouter OpenRouter Sherlock Think Alpha Free Long Free Free 1.8M
chatvisionreasoning
OpenRouter OpenRouter Sherlock Think Alpha Free Long Free Free 1.8M
chatvisionreasoning
Other Other R1T Chimera Free Free Free 164k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 0528 Qwen3 8B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 14B Free Free Free 131k
chatreasoningcode
Other Other Cogito V2 Preview Llama 109B Free Free Free 131k
chatvisionreasoning
Other Other R1T Chimera Free Free Free 164k
chatreasoning
Moonshot AI Moonshot AI Kimi Dev 72B Free Free Free 131k
chatreasoning
Other Other DeepSeek R1T Chimera Free Free Free 164k
chatreasoning
Google Google Gemini 2.5 Flash Preview 09-2025 Free Long Free Free 1.0M
chatvisionvideoaudio +1
NVIDIA NVIDIA Llama 3.1 Nemotron Ultra 253B v1 Free Free Free 131k
chatreasoning
Other Other Cogito V2 Preview Llama 70B Free Free Free 131k
chatreasoning
OpenAI OpenAI Codex Mini Free Long Free Free 200k
chatvisionreasoningcode
Other Other Cogito V2 Preview Llama 405B Free Free Free 131k
chatreasoning
Other Other LFM2.5-1.2B-Thinking (free) Free Free Free 33k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen QwQ 32B Preview Free Free Free 33k
chatreasoningcode
Alibaba Qwen Alibaba Qwen QwQ 32B Preview Free Free Free 33k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Llama 8B Free Free Free -
chatreasoning
DeepSeek DeepSeek R1 Distill Llama 8B Free Free Free -
chatreasoning
Other Other Dolphin3.0 R1 Mistral 24B Free Free Free 33k
chatreasoning
Other Other Dolphin3.0 R1 Mistral 24B Free Free Free 33k
chatreasoning
Other Other Flash 3 Free Free Free 32k
chatreasoning
Other Other Flash 3 Free Free Free 32k
chatreasoning
Other Other OlympicCoder 32B Free Free Free 33k
chatreasoningcode
Other Other OlympicCoder 32B Free Free Free 33k
chatreasoningcode
Other Other Deepcoder 14B Preview Free Free Free 96k
chatreasoningcode
Other Other Deepcoder 14B Preview Free Free Free 96k
chatreasoningcode
Other Other QwQ 32B RpR v1 Free Free Free 33k
chatreasoning
Other Other QwQ 32B RpR v1 Free Free Free 33k
chatreasoning
Zhipu AI Zhipu AI GLM Z1 32B Free Free Free 33k
chatreasoning
Zhipu AI Zhipu AI GLM Z1 32B Free Free Free 33k
chatreasoning
Zhipu AI Zhipu AI GLM Z1 9B Free Free Free 32k
chatreasoning
Zhipu AI Zhipu AI GLM Z1 9B Free Free Free 32k
chatreasoning
Zhipu AI Zhipu AI GLM Z1 Rumination 32B Free Free Free 32k
chatreasoning
Zhipu AI Zhipu AI GLM Z1 Rumination 32B Free Free Free 32k
chatreasoning
Alibaba Qwen Alibaba Qwen Qwen3 4B (free) Free Free Free 41k
chatreasoningtool_usecode
Alibaba Qwen Alibaba Qwen Qwen3 1.7B Free Free Free 32k
chatreasoningcode
Alibaba Qwen Alibaba Qwen Qwen3 1.7B Free Free Free 32k
chatreasoningcode
Alibaba Qwen Alibaba Qwen Qwen3 0.6B Free Free Free 32k
chatreasoningcode
Alibaba Qwen Alibaba Qwen Qwen3 0.6B Free Free Free 32k
chatreasoningcode
Microsoft Microsoft Phi 4 Reasoning Free Free Free 33k
chatreasoning
Microsoft Microsoft Phi 4 Reasoning Free Free Free 33k
chatreasoning
Microsoft Microsoft Phi 4 Reasoning Plus Free Free Free 33k
chatreasoning
Nous Research Nous Research DeepHermes 3 Mistral 24B Preview Free Free Free 33k
chatreasoning
Mistral Mistral Magistral Medium 2506 Free Free Free 41k
chatreasoning
Mistral Mistral Magistral Medium 2506 Free Free Free 41k
chatreasoning
Mistral Mistral Magistral Small 2506 Free Free Free 40k
chatreasoning
Mistral Mistral Magistral Small 2506 Free Free Free 40k
chatreasoning
Zhipu AI Zhipu AI GLM 4.1V 9B Thinking Free Free Free 66k
chatvisionreasoning
Other Other Step3 Free Free Free 66k
chatvisionreasoning
Nous Research Nous Research DeepHermes 3 Mistral 24B Preview Free Free Free 33k
chatreasoning
Zhipu AI Zhipu AI GLM 4.1V 9B Thinking Free Free Free 66k
chatvisionreasoning
Microsoft Microsoft Phi 4 Reasoning Plus Free Free Free 33k
chatreasoning
Other Other Step3 Free Free Free 66k
chatvisionreasoning
NVIDIA NVIDIA Llama Nemotron Embed VL 1B V2 (free) Free Free Free 131k
chatvisiontool_use
Other Other Trinity Large Preview (free) Free Free Free 131k
chattool_use
๐Ÿ“ฆ

Browse by Brand

OpenAI / Anthropic / Google / DeepSeek / xAI / Groq / Mistral

View All Brands โ†’
๐Ÿ’ฐ

Find Cheapest

Loading...

Compare More โ†’
๐Ÿงฎ

Cost Calculator

Enter your token usage, get instant monthly cost & cheaper alternatives

Open Calculator โ†’

Find the Cheapest for Your Use Case

Different scenarios need different metrics

๐Ÿ’ฌ

Scenario A: Chat / Customer Service

High concurrency, short output. Prioritize low input + output price.

๐Ÿ’ป

Scenario B: Code / Agent

Medium-long output. Focus on output price + code quality.

๐Ÿง 

Scenario C: Reasoning / Math

Long chain reasoning. Look for reasoning capability + long context.

๐Ÿ“Š Price Visualization

Visual comparison of model costs

Cost per 1M Tokens (Input + Output)
Price vs Context Length

If you only need short conversations, you don't need the most powerful model. If you frequently generate long outputs, output price is your main cost driver.

๐Ÿ’ก Understanding Token Costs

  • Tokens are billing units, not equal to words or characters
  • English: 1 token โ‰ˆ 4 characters โ‰ˆ 0.75 words
  • Chinese: 1 token โ‰ˆ 1-2 characters (varies by model)
  • API Cost = Input Tokens ร— Input Price + Output Tokens ร— Output Price

Frequently Asked Questions

It depends on your use case. For short conversations, look at input price. For long outputs, focus on output price. Currently, DeepSeek V3 and Gemini Flash series offer the best value for most use cases.

Use the same metric: $/1M tokens. Compare input and output prices separately, then use the calculator with your actual input/output ratio to estimate real costs.

Use the calculator's text estimation mode. As a rough guide: English 1 token โ‰ˆ 4 characters โ‰ˆ 0.75 words. Chinese 1 token โ‰ˆ 1-2 characters.

Most coding tasks generate longer outputs, so output price usually has more impact on total cost. Consider weighting output price higher in your comparison.

Reasoning models (like o1, DeepSeek R1) typically generate longer thinking chains and output tokens. Consider controlling output length or using cheaper draft models before switching to powerful ones.

We regularly update pricing data from official sources. Check the 'Updated' timestamp at the bottom of each page. For critical decisions, always verify with official pricing pages.

Enter your token usage in the calculator. The system will suggest models with similar context length and capabilities at lower prices.

Different channels (official API vs cloud providers) may have different pricing, billing methods, or regional fees. We label the provider_channel field to distinguish these.