AI API Pricing Hub: Compare Token Costs Across Providers

Compare pricing for OpenAI, Anthropic, Google, DeepSeek, xAI, Groq, and more in one place. Filter by brand, price, context length, and capabilities.

🔥 Model Pricing Overview

Click price to open calculator, check models to compare · 100 models

Brand Model Input /1M Output /1M Context Capabilities
OpenRouter OpenRouter Free Models Router Free Long Free Free 200k
chatvisionreasoning
OpenRouter OpenRouter Pony Alpha Free Long Free Free 200k
chatreasoning
OpenRouter OpenRouter Aurora Alpha Free Free Free 128k
chatreasoning
OpenAI OpenAI o1-mini Free Free Free 128k
chatreasoning
OpenAI OpenAI o1-mini Free Free Free 128k
chatreasoning
Perplexity Perplexity Sonar Reasoning Free Free Free 127k
chatreasoning
Perplexity Perplexity Sonar Reasoning Free Free Free 127k
chatreasoning
DeepSeek DeepSeek R1 Distill Qwen 14B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 1.5B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 1.5B Free Free Free 131k
chatreasoningcode
Perplexity Perplexity R1 1776 Free Free Free 128k
chatreasoning
Perplexity Perplexity R1 1776 Free Free Free 128k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 Zero Free Free Free 164k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 Zero Free Free Free 164k
chatreasoning
Other Other L3.3 Electra R1 70B Free Free Free 128k
chatreasoning
Other Other L3.3 Electra R1 70B Free Free Free 128k
chatreasoning
NVIDIA NVIDIA Llama 3.1 Nemotron Ultra 253B v1 Free Free Free 131k
chatreasoning
Moonshot AI Moonshot AI Kimi VL A3B Thinking Free Free Free 131k
chatvisionreasoning
Moonshot AI Moonshot AI Kimi VL A3B Thinking Free Free Free 131k
chatvisionreasoning
Microsoft Microsoft MAI DS R1 Free Free Free 164k
chatreasoning
Microsoft Microsoft MAI DS R1 Free Free Free 164k
chatreasoning
Other Other DeepSeek R1T Chimera Free Free Free 164k
chatreasoning
Other Other DeepSeek R1T Chimera Free Free Free 164k
chatreasoning
OpenAI OpenAI Codex Mini Free Long Free Free 200k
chatvisionreasoningcode
Other Other Valkyrie 49B V1 Free Free Free 131k
chatreasoning
Other Other Valkyrie 49B V1 Free Free Free 131k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 0528 Qwen3 8B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 7B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 7B Free Free Free 131k
chatreasoningcode
Moonshot AI Moonshot AI Kimi Dev 72B Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Cypher Alpha Free Long Free Free 1.0M
chatreasoning
OpenRouter OpenRouter Cypher Alpha Free Long Free Free 1.0M
chatreasoning
Other Other Cogito V2 Preview Deepseek 671B Free Free Free 131k
chatreasoning
Other Other Cogito V2 Preview Deepseek 671B Free Free Free 131k
chatreasoning
Other Other Cogito V2 Preview Llama 109B Free Free Free 131k
chatvisionreasoning
Other Other Cogito V2 Preview Llama 70B Free Free Free 131k
chatreasoning
ByteDance ByteDance Seed OSS 36B Instruct Free Free Free 131k
chatreasoning
ByteDance ByteDance Seed OSS 36B Instruct Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Sonoma Sky Alpha Free Long Free Free 2.0M
chatvisionreasoning
OpenRouter OpenRouter Sonoma Sky Alpha Free Long Free Free 2.0M
chatvisionreasoning
Google Google Gemini 2.5 Flash Preview 09-2025 Free Long Free Free 1.0M
chatvisionvideoaudio +1
Other Other Cogito V2 Preview Llama 405B Free Free Free 131k
chatreasoning
OpenRouter OpenRouter Andromeda Alpha Free Free Free 128k
chatvisionreasoning
OpenRouter OpenRouter Andromeda Alpha Free Free Free 128k
chatvisionreasoning
OpenRouter OpenRouter Sherlock Think Alpha Free Long Free Free 1.8M
chatvisionreasoning
OpenRouter OpenRouter Sherlock Think Alpha Free Long Free Free 1.8M
chatvisionreasoning
Other Other R1T Chimera Free Free Free 164k
chatreasoning
DeepSeek DeepSeek DeepSeek R1 0528 Qwen3 8B Free Free Free 131k
chatreasoningcode
DeepSeek DeepSeek R1 Distill Qwen 14B Free Free Free 131k
chatreasoningcode
Other Other Cogito V2 Preview Llama 109B Free Free Free 131k
chatvisionreasoning
Other Other R1T Chimera Free Free Free 164k
chatreasoning
Moonshot AI Moonshot AI Kimi Dev 72B Free Free Free 131k
chatreasoning
Other Other DeepSeek R1T Chimera Free Free Free 164k
chatreasoning
Google Google Gemini 2.5 Flash Preview 09-2025 Free Long Free Free 1.0M
chatvisionvideoaudio +1
NVIDIA NVIDIA Llama 3.1 Nemotron Ultra 253B v1 Free Free Free 131k
chatreasoning
Other Other Cogito V2 Preview Llama 70B Free Free Free 131k
chatreasoning
OpenAI OpenAI Codex Mini Free Long Free Free 200k
chatvisionreasoningcode
Other Other Cogito V2 Preview Llama 405B Free Free Free 131k
chatreasoning
Alibaba Qwen Alibaba Qwen Qwen3.5 397B A17B Long Free Free 262k
chatvisionvideoreasoning +2
Alibaba Qwen Alibaba Qwen Qwen3.5 Plus 2026-02-15 Long Free Free 1.0M
chatvisionvideoreasoning +2
Anthropic Anthropic Claude Sonnet 4.6 Long Free Free 1.0M
chatvisionreasoningtool_use
Google Google Gemini 3.1 Pro Preview Long Free Free 1.0M
chatvisionvideoaudio +2
Other Other Aion-2.0 Free Free 131k
chatreasoningtool_use
OpenAI OpenAI GPT-5.3-Codex Long Free Free 400k
chatvisionreasoningtool_use +1
Google Google Gemini 3.1 Pro Preview Custom Tools Long Free Free 1.0M
chatvisionvideoaudio +2
Alibaba Qwen Alibaba Qwen Qwen3.5-Flash Long Free Free 1.0M
chatvisionvideoreasoning +2
Alibaba Qwen Alibaba Qwen Qwen3.5-122B-A10B Long Free Free 262k
chatvisionvideoreasoning +2
Alibaba Qwen Alibaba Qwen Qwen3.5-27B Long Free Free 262k
chatvisionvideoreasoning +2
Alibaba Qwen Alibaba Qwen Qwen3.5-35B-A3B Long Free Free 262k
chatvisionvideoreasoning +2
ByteDance ByteDance Seed-2.0-Mini Long Free Free 262k
chatvisionvideoreasoning +1
DeepSeek DeepSeek R1 0528 Free Free 164k
chatreasoningtool_use
Other Other DeepSeek R1T2 Chimera Free Free 164k
chatreasoningtool_use
Zhipu AI Zhipu AI GLM 4.5 Air Free Free 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-20b Free Free 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-120b (exacto) Free Free 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 9B V2 Free Free 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 12B 2 VL Free Free 131k
chatvisionvideoreasoning +1
Other Other Trinity Mini Free Free 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron 3 Nano 30B A3B Long Free Free 262k
chatreasoningtool_use
Other Other MiMo-V2-Flash (free) Long Free Free 262k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-20b Cheapest $0.02 $0.10 131k
chatreasoningtool_use
DeepSeek DeepSeek R1 Distill Llama 70B $0.03 $0.11 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-20b $0.03 $0.14 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-120b (exacto) $0.04 $0.19 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-120b (exacto) $0.04 $0.19 131k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron Nano 9B V2 $0.04 $0.16 131k
chatreasoningtool_use
Other Other Trinity Mini $0.04 $0.15 131k
chatreasoningtool_use
Zhipu AI Zhipu AI GLM 4.5 Air $0.05 $0.22 131k
chatreasoningtool_use
OpenAI OpenAI GPT-5 Nano Long $0.05 $0.40 400k
chatvisionreasoningtool_use
NVIDIA NVIDIA Nemotron 3 Nano 30B A3B Long $0.05 $0.20 262k
chatreasoningtool_use
Other Other GLM 4.7 Flash Long $0.06 $0.40 203k
chatreasoningtool_use
NVIDIA NVIDIA Nemotron 3 Nano 30B A3B Long $0.06 $0.24 262k
chatreasoningtool_use
Baidu Baidu ERNIE 4.5 21B A3B $0.07 $0.28 120k
chatreasoningtool_use
Baidu Baidu ERNIE 4.5 21B A3B Thinking $0.07 $0.28 131k
chatreasoningtool_use
OpenAI OpenAI gpt-oss-safeguard-20b $0.07 $0.30 131k
chatreasoningtool_use
ByteDance ByteDance Seed 1.6 Flash Long $0.07 $0.30 262k
chatvisionvideoreasoning +1
ByteDance ByteDance Seed 1.6 Flash Long $0.07 $0.30 262k
chatvisionvideoreasoning +1
Alibaba Qwen Alibaba Qwen Tongyi DeepResearch 30B A3B $0.09 $0.40 131k
chatreasoningtool_use
Alibaba Qwen Alibaba Qwen Tongyi DeepResearch 30B A3B $0.09 $0.45 131k
chatreasoningtool_use
Other Other MiMo-V2-Flash (free) Long $0.09 $0.29 262k
chatreasoningtool_use
📦

Browse by Brand

OpenAI / Anthropic / Google / DeepSeek / xAI / Groq / Mistral

View All Brands →
💰

Find Cheapest

Chat: gpt-oss-20b $0.02/1M

Compare More →
🧮

Cost Calculator

Enter your token usage, get instant monthly cost & cheaper alternatives

Open Calculator →

Find the Cheapest for Your Use Case

Different scenarios need different metrics

💬

Scenario A: Chat / Customer Service

High concurrency, short output. Prioritize low input + output price.

Recommended: gpt-oss-20b $0.02 / $0.10
💻

Scenario B: Code / Agent

Medium-long output. Focus on output price + code quality.

🧠

Scenario C: Reasoning / Math

Long chain reasoning. Look for reasoning capability + long context.

Recommended: gpt-oss-20b $0.02 / $0.10

📊 Price Visualization

Visual comparison of model costs

Cost per 1M Tokens (Input + Output)
Price vs Context Length

If you only need short conversations, you don't need the most powerful model. If you frequently generate long outputs, output price is your main cost driver.

💡 Understanding Token Costs

  • Tokens are billing units, not equal to words or characters
  • English: 1 token ≈ 4 characters ≈ 0.75 words
  • Chinese: 1 token ≈ 1-2 characters (varies by model)
  • API Cost = Input Tokens × Input Price + Output Tokens × Output Price

Frequently Asked Questions

It depends on your use case. For short conversations, look at input price. For long outputs, focus on output price. Currently, DeepSeek V3 and Gemini Flash series offer the best value for most use cases.

Use the same metric: $/1M tokens. Compare input and output prices separately, then use the calculator with your actual input/output ratio to estimate real costs.

Use the calculator's text estimation mode. As a rough guide: English 1 token ≈ 4 characters ≈ 0.75 words. Chinese 1 token ≈ 1-2 characters.

Most coding tasks generate longer outputs, so output price usually has more impact on total cost. Consider weighting output price higher in your comparison.

Reasoning models (like o1, DeepSeek R1) typically generate longer thinking chains and output tokens. Consider controlling output length or using cheaper draft models before switching to powerful ones.

We regularly update pricing data from official sources. Check the 'Updated' timestamp at the bottom of each page. For critical decisions, always verify with official pricing pages.

Enter your token usage in the calculator. The system will suggest models with similar context length and capabilities at lower prices.

Different channels (official API vs cloud providers) may have different pricing, billing methods, or regional fees. We label the provider_channel field to distinguish these.