AI API Pricing Hub: Compare Token Costs Across Providers

Compare pricing for OpenAI, Anthropic, Google, DeepSeek, xAI, Groq, and more in one place. Filter by brand, price, context length, and capabilities.

๐Ÿ”ฅ Model Pricing Overview

Click price to open calculator, check models to compare ยท 100 models

Brand Model Input /1M Output /1M Context Capabilities
OpenRouter OpenRouter Auto Router Free Long Free Free 2.0M
chatreasoning
Anthropic Anthropic Claude 3.5 Sonnet (2024-06-20) Free Long Free Free 200k
chatvisionreasoning
OpenAI OpenAI o1-mini Free Free Free 128k
chatreasoning
Perplexity Perplexity Sonar Reasoning Free Free Free 127k
chatreasoning
DeepSeek DeepSeek R1 Distill Qwen 1.5B Free Free Free 131k
chatreasoningcode
Perplexity Perplexity R1 1776 Free Free Free 128k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 Zero Free Free Free 164k
chatreasoning
Other Other L3.3 Electra R1 70B Free Free Free 128k
chatreasoning
Google Google Gemini 2.5 Pro Experimental Free Long Free Free 1.0M
chatvisionreasoning
Moonshot AI Moonshot AI Kimi VL A3B Thinking Free Free Free 131k
chatvisionreasoning
Microsoft Microsoft MAI DS R1 Free Free Free 164k
chatreasoning
Other Other Valkyrie 49B V1 Free Free Free 131k
chatreasoning
DeepSeek DeepSeek R1 Distill Qwen 7B Free Free Free 131k
chatreasoningcode
OpenRouter OpenRouter Cypher Alpha Free Long Free Free 1.0M
chatreasoning
OpenRouter OpenRouter Horizon Alpha Free Long Free Free 256k
chatvisionreasoning
OpenRouter OpenRouter Horizon Beta Free Long Free Free 256k
chatvisionreasoning
DeepSeek DeepSeek DeepSeek V3.1 Base Free Free Free 164k
chatreasoning
Other Other Cogito V2 Preview Deepseek 671B Free Free Free 131k
chatreasoning
ByteDance ByteDance Seed OSS 36B Instruct Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Sonoma Sky Alpha Free Long Free Free 2.0M
chatvisionreasoning
OpenRouter OpenRouter Sonoma Dusk Alpha Free Long Free Free 2.0M
chatvisionreasoning
OpenRouter OpenRouter Andromeda Alpha Free Free Free 128k
chatvisionreasoning
OpenRouter OpenRouter Polaris Alpha Free Long Free Free 256k
chatvisionreasoning
OpenRouter OpenRouter Sherlock Think Alpha Free Long Free Free 1.8M
chatvisionreasoning
OpenRouter OpenRouter Sherlock Dash Alpha Free Long Free Free 1.8M
chatvisionreasoning
OpenRouter OpenRouter Bert-Nebulon Alpha Free Long Free Free 256k
chatvisionreasoning
OpenRouter OpenRouter Body Builder (beta) Free Free Free 128k
chatreasoning
Other Other MiMo-V2-Flash (free) Free Long Free Free 262k
chatreasoningtool_use
Mistral Mistral Mistral Small 3.1 24B Free Free 131k
chatvisionreasoningtool_use
Other Other DeepSeek R1T Chimera Free Free 164k
chatreasoningtool_use
DeepSeek DeepSeek R1 0528 Free Free 164k
chatreasoningtool_use
Other Other DeepSeek R1T2 Chimera Free Free 164k
chatreasoningtool_use
Moonshot AI Moonshot AI Kimi K2 0711 Free Free 131k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 Coder 480B A35B (exacto) Long Free Free 262k
chatreasoningtool_usecode
Zhipu AI Zhipu AI GLM 4.5 Air Free Free 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-20b Free Free 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-120b (exacto) Free Free 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 9B V2 Free Free 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 12B 2 VL Free Free 131k
chatvisionvideoreasoning +1
Other Other Trinity Mini Free Free 131k
chatreasoningtool_use
Mistral Mistral Devstral 2 2512 Long Free Free 262k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron 3 Nano 30B A3B Long Free Free 262k
chatreasoningtool_use
IBM Granite IBM Granite Granite 4.0 Micro Cheapest $0.02 $0.11 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-20b $0.02 $0.10 131k
chatreasoningtool_use
DeepSeek DeepSeek R1 Distill Llama 70B $0.03 $0.11 131k
chatreasoningtool_use
Mistral Mistral Mistral Small 3.1 24B $0.03 $0.11 131k
chatvisionreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 8B $0.04 $0.14 128k
chatreasoningtool_usecode
OpenAI OpenAI gpt-oss-120b (exacto) $0.04 $0.19 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 9B V2 $0.04 $0.16 131k
chatreasoningtool_use
Other Other Trinity Mini $0.04 $0.15 131k
chatreasoningtool_use
Zhipu AI Zhipu AI GLM 4.5 Air $0.05 $0.22 131k
chatreasoningtool_use
OpenAI OpenAI GPT-5 Nano Long $0.05 $0.40 400k
chatvisionreasoningtool_use
Mistral Mistral Devstral 2 2512 Long $0.05 $0.22 262k
chatreasoningtool_use
Mistral Mistral Devstral Small 2505 $0.06 $0.12 128k
chatreasoningtool_use
DeepSeek DeepSeek DeepSeek R1 0528 Qwen3 8B $0.06 $0.09 128k
chatreasoningtool_usecode
NVIDIA NVIDIA Nemotron 3 Nano 30B A3B Long $0.06 $0.24 262k
chatreasoningtool_use
Mistral Mistral Devstral Small 1.1 $0.07 $0.28 128k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 Coder 30B A3B Instruct $0.07 $0.27 160k
chatreasoningtool_usecode
Baidu Baidu ERNIE 4.5 21B A3B $0.07 $0.28 120k
chatreasoningtool_use
Baidu Baidu ERNIE 4.5 21B A3B Thinking $0.07 $0.28 131k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 235B A22B Instruct 2507 Long $0.07 $0.46 262k
chatreasoningtool_usecode
Google Google Gemini 2.0 Flash Lite Long $0.07 $0.30 1.0M
chatvisionvideoaudio +2
OpenAI OpenAI gpt-oss-safeguard-20b $0.07 $0.30 131k
chatreasoningtool_use
ByteDance ByteDance Seed 1.6 Flash Long $0.07 $0.30 262k
chatvisionvideoreasoning +1
Alibaba Qwen Alibaba Qwen Qwen3 30B A3B Instruct 2507 Long $0.08 $0.33 262k
chatreasoningtool_usecode
Alibaba Qwen Alibaba Qwen Qwen3 VL 8B Instruct $0.08 $0.50 131k
chatvisionreasoningtool_use +1
Alibaba Qwen Alibaba Qwen Qwen3 Next 80B A3B Instruct Long $0.09 $1.10 262k
chatreasoningtool_usecode
Alibaba Qwen Alibaba Qwen Tongyi DeepResearch 30B A3B $0.09 $0.40 131k
chatreasoningtool_use
Google Google Gemini 2.0 Flash Long $0.10 $0.40 1.0M
chatvisionvideoaudio +2
Google Google Gemini 2.5 Flash Lite Long $0.10 $0.40 1.0M
chatvisionvideoaudio +2
ByteDance ByteDance UI-TARS 7B $0.10 $0.20 128k
chatvisionreasoningtool_use
Google Google Gemini 2.5 Flash Lite Preview 09-2025 Long $0.10 $0.40 1.0M
chatvisionvideoaudio +2
NVIDIA NVIDIA Llama 3.3 Nemotron Super 49B V1.5 $0.10 $0.40 131k
chatreasoningtool_use
Mistral Mistral Ministral 3 3B 2512 $0.10 $0.10 131k
chatvisionreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 235B A22B Thinking 2507 Long $0.11 $0.60 262k
chatreasoningtool_usecode
Nous Research Nous Research Hermes 4 70B $0.11 $0.38 131k
chatreasoningtool_use
Tencent Tencent Hunyuan A13B Instruct $0.14 $0.57 131k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 Next 80B A3B Thinking Long $0.15 $1.20 262k
chatreasoningtool_usecode
Alibaba Qwen Alibaba Qwen Qwen3 VL 30B A3B Instruct Long $0.15 $0.60 262k
chatvisionreasoningtool_use +1
Mistral Mistral Ministral 3 8B 2512 Long $0.15 $0.15 262k
chatvisionreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 VL 8B Thinking Long $0.18 $2.10 256k
chatvisionreasoningtool_use +1
Other Other Jamba Mini 1.7 Long $0.20 $0.40 256k
chatreasoningtool_use
xAI xAI Grok Code Fast 1 Long $0.20 $1.50 256k
chatreasoningtool_usecode
Other Other LongCat Flash Chat $0.20 $0.80 131k
chatreasoningtool_use
xAI xAI Grok 4 Fast Long $0.20 $0.50 2.0M
chatvisionreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 VL 235B A22B Instruct Long $0.20 $1.20 262k
chatvisionreasoningtool_use +1
Alibaba Qwen Alibaba Qwen Qwen3 VL 30B A3B Thinking $0.20 $1.00 131k
chatvisionreasoningtool_use +1
MiniMax MiniMax MiniMax M2 $0.20 $1.00 197k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 12B 2 VL $0.20 $0.60 131k
chatvisionvideoreasoning +1
xAI xAI Grok 4.1 Fast Long $0.20 $0.50 2.0M
chatvisionreasoningtool_use
Other Other INTELLECT-3 $0.20 $1.10 131k
chatreasoningtool_use
Mistral Mistral Ministral 3 14B 2512 Long $0.20 $0.20 262k
chatvisionreasoningtool_use
Other Other KAT-Coder-Pro V1 Long $0.21 $0.83 256k
chatreasoningtool_usecode
DeepSeek DeepSeek DeepSeek V3.1 Terminus $0.21 $0.79 164k
chatreasoningtool_use
DeepSeek DeepSeek DeepSeek V3.2 Exp $0.21 $0.32 164k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen Qwen3 Coder 480B A35B (exacto) Long $0.22 $0.95 262k
chatreasoningtool_usecode
Alibaba Qwen Alibaba Qwen Qwen3 Coder 480B A35B (exacto) Long $0.22 $1.80 262k
chatreasoningtool_usecode
Other Other Mercury Coder $0.25 $1.00 128k
chatreasoningtool_usecode
Other Other Mercury $0.25 $1.00 128k
chatreasoningtool_use
Other Other DeepSeek R1T2 Chimera $0.25 $0.85 164k
chatreasoningtool_use
๐Ÿ“ฆ

Browse by Brand

OpenAI / Anthropic / Google / DeepSeek / xAI / Groq / Mistral

View All Brands โ†’
๐Ÿ’ฐ

Find Cheapest

Chat: gpt-oss-20b $0.02/1M

Compare More โ†’
๐Ÿงฎ

Cost Calculator

Enter your token usage, get instant monthly cost & cheaper alternatives

Open Calculator โ†’

Find the Cheapest for Your Use Case

Different scenarios need different metrics

๐Ÿ’ฌ

Scenario A: Chat / Customer Service

High concurrency, short output. Prioritize low input + output price.

Recommended: gpt-oss-20b $0.02 / $0.10
๐Ÿ’ป

Scenario B: Code / Agent

Medium-long output. Focus on output price + code quality.

Recommended: DeepSeek R1 0528 Qwen3 8B $0.06 / $0.09
๐Ÿง 

Scenario C: Reasoning / Math

Long chain reasoning. Look for reasoning capability + long context.

Recommended: gpt-oss-20b $0.02 / $0.10

๐Ÿ“Š Price Visualization

Visual comparison of model costs

Cost per 1M Tokens (Input + Output)
Price vs Context Length

If you only need short conversations, you don't need the most powerful model. If you frequently generate long outputs, output price is your main cost driver.

๐Ÿ’ก Understanding Token Costs

  • Tokens are billing units, not equal to words or characters
  • English: 1 token โ‰ˆ 4 characters โ‰ˆ 0.75 words
  • Chinese: 1 token โ‰ˆ 1-2 characters (varies by model)
  • API Cost = Input Tokens ร— Input Price + Output Tokens ร— Output Price

Frequently Asked Questions

It depends on your use case. For short conversations, look at input price. For long outputs, focus on output price. Currently, DeepSeek V3 and Gemini Flash series offer the best value for most use cases.

Use the same metric: $/1M tokens. Compare input and output prices separately, then use the calculator with your actual input/output ratio to estimate real costs.

Use the calculator's text estimation mode. As a rough guide: English 1 token โ‰ˆ 4 characters โ‰ˆ 0.75 words. Chinese 1 token โ‰ˆ 1-2 characters.

Most coding tasks generate longer outputs, so output price usually has more impact on total cost. Consider weighting output price higher in your comparison.

Reasoning models (like o1, DeepSeek R1) typically generate longer thinking chains and output tokens. Consider controlling output length or using cheaper draft models before switching to powerful ones.

We regularly update pricing data from official sources. Check the 'Updated' timestamp at the bottom of each page. For critical decisions, always verify with official pricing pages.

Enter your token usage in the calculator. The system will suggest models with similar context length and capabilities at lower prices.

Different channels (official API vs cloud providers) may have different pricing, billing methods, or regional fees. We label the provider_channel field to distinguish these.