AI API Pricing Hub: Compare Token Costs Across Providers

Compare pricing for OpenAI, Anthropic, Google, DeepSeek, xAI, Groq, and more in one place. Filter by brand, price, context length, and capabilities.

🔥 Model Pricing Overview

100 models

Brand	Model	Tags	Input ($/1M tokens)	Output ($/1M tokens)	Context (tokens)	Capabilities
OpenRouter	Free Models Router	Free, Long	Free	Free	200k	chat, vision, reasoning
OpenRouter	Pony Alpha	Free, Long	Free	Free	200k	chat, reasoning
OpenRouter	Aurora Alpha	Free	Free	Free	128k	chat, reasoning
OpenAI	o1-mini	Free	Free	Free	128k	chat, reasoning
OpenAI	o1-mini	Free	Free	Free	128k	chat, reasoning
Perplexity	Sonar Reasoning	Free	Free	Free	127k	chat, reasoning
Perplexity	Sonar Reasoning	Free	Free	Free	127k	chat, reasoning
DeepSeek	R1 Distill Qwen 14B	Free	Free	Free	131k	chat, reasoning, code
DeepSeek	R1 Distill Qwen 1.5B	Free	Free	Free	131k	chat, reasoning, code
DeepSeek	R1 Distill Qwen 1.5B	Free	Free	Free	131k	chat, reasoning, code
Perplexity	R1 1776	Free	Free	Free	128k	chat, reasoning
Perplexity	R1 1776	Free	Free	Free	128k	chat, reasoning
DeepSeek	DeepSeek R1 Zero	Free	Free	Free	164k	chat, reasoning
DeepSeek	DeepSeek R1 Zero	Free	Free	Free	164k	chat, reasoning
Other	L3.3 Electra R1 70B	Free	Free	Free	128k	chat, reasoning
Other	L3.3 Electra R1 70B	Free	Free	Free	128k	chat, reasoning
NVIDIA	Llama 3.1 Nemotron Ultra 253B v1	Free	Free	Free	131k	chat, reasoning
Moonshot AI	Kimi VL A3B Thinking	Free	Free	Free	131k	chat, vision, reasoning
Moonshot AI	Kimi VL A3B Thinking	Free	Free	Free	131k	chat, vision, reasoning
Microsoft	MAI DS R1	Free	Free	Free	164k	chat, reasoning
Microsoft	MAI DS R1	Free	Free	Free	164k	chat, reasoning
Other	DeepSeek R1T Chimera	Free	Free	Free	164k	chat, reasoning
Other	DeepSeek R1T Chimera	Free	Free	Free	164k	chat, reasoning
OpenAI	Codex Mini	Free, Long	Free	Free	200k	chat, vision, reasoning, code
Other	Valkyrie 49B V1	Free	Free	Free	131k	chat, reasoning
Other	Valkyrie 49B V1	Free	Free	Free	131k	chat, reasoning
DeepSeek	DeepSeek R1 0528 Qwen3 8B	Free	Free	Free	131k	chat, reasoning, code
DeepSeek	R1 Distill Qwen 7B	Free	Free	Free	131k	chat, reasoning, code
DeepSeek	R1 Distill Qwen 7B	Free	Free	Free	131k	chat, reasoning, code
Moonshot AI	Kimi Dev 72B	Free	Free	Free	131k	chat, reasoning
OpenRouter	Cypher Alpha	Free, Long	Free	Free	1.0M	chat, reasoning
OpenRouter	Cypher Alpha	Free, Long	Free	Free	1.0M	chat, reasoning
Other	Cogito V2 Preview Deepseek 671B	Free	Free	Free	131k	chat, reasoning
Other	Cogito V2 Preview Deepseek 671B	Free	Free	Free	131k	chat, reasoning
Other	Cogito V2 Preview Llama 109B	Free	Free	Free	131k	chat, vision, reasoning
Other	Cogito V2 Preview Llama 70B	Free	Free	Free	131k	chat, reasoning
ByteDance	Seed OSS 36B Instruct	Free	Free	Free	131k	chat, reasoning
ByteDance	Seed OSS 36B Instruct	Free	Free	Free	131k	chat, reasoning
OpenRouter	Sonoma Sky Alpha	Free, Long	Free	Free	2.0M	chat, vision, reasoning
OpenRouter	Sonoma Sky Alpha	Free, Long	Free	Free	2.0M	chat, vision, reasoning
Google	Gemini 2.5 Flash Preview 09-2025	Free, Long	Free	Free	1.0M	chat, vision, video, audio (+1 more)
Other	Cogito V2 Preview Llama 405B	Free	Free	Free	131k	chat, reasoning
OpenRouter	Andromeda Alpha	Free	Free	Free	128k	chat, vision, reasoning
OpenRouter	Andromeda Alpha	Free	Free	Free	128k	chat, vision, reasoning
OpenRouter	Sherlock Think Alpha	Free, Long	Free	Free	1.8M	chat, vision, reasoning
OpenRouter	Sherlock Think Alpha	Free, Long	Free	Free	1.8M	chat, vision, reasoning
Other	R1T Chimera	Free	Free	Free	164k	chat, reasoning
DeepSeek	DeepSeek R1 0528 Qwen3 8B	Free	Free	Free	131k	chat, reasoning, code
DeepSeek	R1 Distill Qwen 14B	Free	Free	Free	131k	chat, reasoning, code
Other	Cogito V2 Preview Llama 109B	Free	Free	Free	131k	chat, vision, reasoning
Other	R1T Chimera	Free	Free	Free	164k	chat, reasoning
Moonshot AI	Kimi Dev 72B	Free	Free	Free	131k	chat, reasoning
Other	DeepSeek R1T Chimera	Free	Free	Free	164k	chat, reasoning
Google	Gemini 2.5 Flash Preview 09-2025	Free, Long	Free	Free	1.0M	chat, vision, video, audio (+1 more)
NVIDIA	Llama 3.1 Nemotron Ultra 253B v1	Free	Free	Free	131k	chat, reasoning
Other	Cogito V2 Preview Llama 70B	Free	Free	Free	131k	chat, reasoning
OpenAI	Codex Mini	Free, Long	Free	Free	200k	chat, vision, reasoning, code
Other	Cogito V2 Preview Llama 405B	Free	Free	Free	131k	chat, reasoning
Alibaba Qwen	Qwen3.5 397B A17B	Long	Free	Free	262k	chat, vision, video, reasoning (+2 more)
Alibaba Qwen	Qwen3.5 Plus 2026-02-15	Long	Free	Free	1.0M	chat, vision, video, reasoning (+2 more)
Anthropic	Claude Sonnet 4.6	Long	Free	Free	1.0M	chat, vision, reasoning, tool_use
Google	Gemini 3.1 Pro Preview	Long	Free	Free	1.0M	chat, vision, video, audio (+2 more)
Other	Aion-2.0	-	Free	Free	131k	chat, reasoning, tool_use
OpenAI	GPT-5.3-Codex	Long	Free	Free	400k	chat, vision, reasoning, tool_use (+1 more)
Google	Gemini 3.1 Pro Preview Custom Tools	Long	Free	Free	1.0M	chat, vision, video, audio (+2 more)
Alibaba Qwen	Qwen3.5-Flash	Long	Free	Free	1.0M	chat, vision, video, reasoning (+2 more)
Alibaba Qwen	Qwen3.5-122B-A10B	Long	Free	Free	262k	chat, vision, video, reasoning (+2 more)
Alibaba Qwen	Qwen3.5-27B	Long	Free	Free	262k	chat, vision, video, reasoning (+2 more)
Alibaba Qwen	Qwen3.5-35B-A3B	Long	Free	Free	262k	chat, vision, video, reasoning (+2 more)
ByteDance	Seed-2.0-Mini	Long	Free	Free	262k	chat, vision, video, reasoning (+1 more)
DeepSeek	R1 0528	-	Free	Free	164k	chat, reasoning, tool_use
Other	DeepSeek R1T2 Chimera	-	Free	Free	164k	chat, reasoning, tool_use
Zhipu AI	GLM 4.5 Air	-	Free	Free	131k	chat, reasoning, tool_use
OpenAI	gpt-oss-20b	-	Free	Free	131k	chat, reasoning, tool_use
OpenAI	gpt-oss-120b (exacto)	-	Free	Free	131k	chat, reasoning, tool_use
NVIDIA	Nemotron Nano 9B V2	-	Free	Free	131k	chat, reasoning, tool_use
NVIDIA	Nemotron Nano 12B 2 VL	-	Free	Free	131k	chat, vision, video, reasoning (+1 more)
Other	Trinity Mini	-	Free	Free	131k	chat, reasoning, tool_use
NVIDIA	Nemotron 3 Nano 30B A3B	Long	Free	Free	262k	chat, reasoning, tool_use
Other	MiMo-V2-Flash (free)	Long	Free	Free	262k	chat, reasoning, tool_use
OpenAI	gpt-oss-20b	Cheapest	$0.02	$0.10	131k	chat, reasoning, tool_use
DeepSeek	R1 Distill Llama 70B	-	$0.03	$0.11	131k	chat, reasoning, tool_use
OpenAI	gpt-oss-20b	-	$0.03	$0.14	131k	chat, reasoning, tool_use
OpenAI	gpt-oss-120b (exacto)	-	$0.04	$0.19	131k	chat, reasoning, tool_use
OpenAI	gpt-oss-120b (exacto)	-	$0.04	$0.19	131k	chat, reasoning, tool_use
NVIDIA	Nemotron Nano 9B V2	-	$0.04	$0.16	131k	chat, reasoning, tool_use
Other	Trinity Mini	-	$0.04	$0.15	131k	chat, reasoning, tool_use
Zhipu AI	GLM 4.5 Air	-	$0.05	$0.22	131k	chat, reasoning, tool_use
OpenAI	GPT-5 Nano	Long	$0.05	$0.40	400k	chat, vision, reasoning, tool_use
NVIDIA	Nemotron 3 Nano 30B A3B	Long	$0.05	$0.20	262k	chat, reasoning, tool_use
Other	GLM 4.7 Flash	Long	$0.06	$0.40	203k	chat, reasoning, tool_use
NVIDIA	Nemotron 3 Nano 30B A3B	Long	$0.06	$0.24	262k	chat, reasoning, tool_use
Baidu	ERNIE 4.5 21B A3B	-	$0.07	$0.28	120k	chat, reasoning, tool_use
Baidu	ERNIE 4.5 21B A3B Thinking	-	$0.07	$0.28	131k	chat, reasoning, tool_use
OpenAI	gpt-oss-safeguard-20b	-	$0.07	$0.30	131k	chat, reasoning, tool_use
ByteDance	Seed 1.6 Flash	Long	$0.07	$0.30	262k	chat, vision, video, reasoning (+1 more)
ByteDance	Seed 1.6 Flash	Long	$0.07	$0.30	262k	chat, vision, video, reasoning (+1 more)
Alibaba Qwen	Tongyi DeepResearch 30B A3B	-	$0.09	$0.40	131k	chat, reasoning, tool_use
Alibaba Qwen	Tongyi DeepResearch 30B A3B	-	$0.09	$0.45	131k	chat, reasoning, tool_use
Other	MiMo-V2-Flash (free)	Long	$0.09	$0.29	262k	chat, reasoning, tool_use
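
The Context column abbreviates token counts (131k, 1.0M). Here is a minimal sketch for turning those labels into integers so rows can be sorted or filtered by context length. It assumes decimal suffixes (k = 1,000, M = 1,000,000), which the table does not state, and `parse_context` is an illustrative name rather than part of any tool on this page.

```python
def parse_context(label: str) -> int:
    """Convert a context label such as '131k' or '1.0M' to a token count.

    Assumption: suffixes are decimal (k = 1,000; M = 1,000,000).
    """
    multipliers = {"k": 1_000, "m": 1_000_000}
    text = label.strip().lower()
    if text and text[-1] in multipliers:
        # round() guards against float artifacts such as 1.8 * 1e6.
        return round(float(text[:-1]) * multipliers[text[-1]])
    return int(text)


# Sort a few labels from the table by their numeric context length.
contexts = ["1.0M", "131k", "1.8M", "200k"]
print(sorted(contexts, key=parse_context))
```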

📦 Browse by Brand

OpenAI / Anthropic / Google / DeepSeek / xAI / Groq / Mistral

💰 Find Cheapest

Chat: gpt-oss-20b at $0.02 per 1M input tokens

🧮 Cost Calculator

Enter your token usage to get an instant monthly cost estimate and cheaper alternatives.

Find the Cheapest for Your Use Case

Different scenarios need different metrics

💬 Scenario A: Chat / Customer Service

High concurrency, short outputs. Prioritize a low combined input and output price.

Recommended: gpt-oss-20b ($0.02 input / $0.10 output per 1M tokens)

💻 Scenario B: Code / Agent

Medium to long outputs. Focus on output price and code quality.

🧠 Scenario C: Reasoning / Math

Long reasoning chains. Look for reasoning capability and a long context window.

Recommended: gpt-oss-20b ($0.02 input / $0.10 output per 1M tokens)

📊 Price Visualization

Visual comparison of model costs

[Chart: Cost per 1M Tokens (Input + Output)]

[Chart: Price vs Context Length]

If you only need short conversations, you don't need the most powerful model. If you frequently generate long outputs, output price is your main cost driver.

💡 Understanding Token Costs

Tokens are the billing unit; a token is not the same as a word or a character
English: 1 token ≈ 4 characters ≈ 0.75 words
Chinese: 1 token ≈ 1-2 characters (varies by model)
API Cost = Input Tokens × Input Price + Output Tokens × Output Price
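
As a minimal sketch of the formula above, assuming prices are quoted in dollars per 1M tokens as in the pricing table (the function name `api_cost` is illustrative and not part of any provider SDK):

```python
def api_cost(input_tokens: int, output_tokens: int,
             input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the cost in dollars for a given token volume.

    Prices are dollars per 1M tokens, so the weighted sum is divided by 1e6.
    """
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Example: 10M input and 2M output tokens at $0.02 / $0.10 per 1M tokens
# (the cheapest gpt-oss-20b row in the table).
monthly = api_cost(10_000_000, 2_000_000, 0.02, 0.10)
print(f"${monthly:.2f}")  # prints $0.40
```

Input and output volumes are priced separately, so the same total token count can cost very different amounts depending on the split.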

Frequently Asked Questions

It depends on your use case. For short conversations, look at input price. For long outputs, focus on output price. Currently, DeepSeek V3 and Gemini Flash series offer the best value for most use cases.

Use the same metric: $/1M tokens. Compare input and output prices separately, then use the calculator with your actual input/output ratio to estimate real costs.
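
One way to fold your input/output ratio into a single comparable number is a blended price per 1M tokens. The sketch below is an illustration, not the calculator's actual method; `blended_price` and `input_share` are assumed names, and the price pairs are copied from the table.

```python
def blended_price(input_price: float, output_price: float,
                  input_share: float) -> float:
    """Average $/1M tokens for a workload where `input_share` of all
    tokens are input tokens and the rest are output tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_price * input_share + output_price * (1.0 - input_share)


# (input $/1M, output $/1M) pairs taken from the pricing table.
models = {
    "gpt-oss-20b": (0.02, 0.10),
    "R1 Distill Llama 70B": (0.03, 0.11),
    "GPT-5 Nano": (0.05, 0.40),
}

# Compare an input-heavy workload (90% input) with a balanced one (50%).
for share in (0.9, 0.5):
    print(f"input share {share:.0%}")
    for name, (inp, out) in models.items():
        print(f"  {name}: ${blended_price(inp, out, share):.3f} per 1M tokens")
```

The gap between models widens as the output share grows, because output prices differ more than input prices in these rows.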

Use the calculator's text estimation mode. As a rough guide: English 1 token ≈ 4 characters ≈ 0.75 words. Chinese 1 token ≈ 1-2 characters.
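
The rules of thumb above can be sketched as follows. This is only a rough estimator built from those ratios, not a real tokenizer; actual counts vary by model, and the function names are illustrative.

```python
import math


def estimate_english_tokens(text: str) -> int:
    """Estimate tokens from characters, using 1 token ≈ 4 characters."""
    return math.ceil(len(text) / 4)


def estimate_english_tokens_by_words(text: str) -> int:
    """Estimate tokens from words, using 1 token ≈ 0.75 words."""
    return math.ceil(len(text.split()) / 0.75)


def estimate_chinese_tokens(text: str) -> tuple[int, int]:
    """Return a (low, high) range, using 1 token ≈ 1-2 characters."""
    n = len(text)
    return math.ceil(n / 2), n


sample = "Compare input and output prices separately."
print(estimate_english_tokens(sample), estimate_english_tokens_by_words(sample))
print(estimate_chinese_tokens("你好世界"))
```

For billing-critical estimates, count tokens with the provider's own tokenizer instead of these ratios.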

Most coding tasks generate longer outputs, so output price usually has more impact on total cost. Consider weighting output price higher in your comparison.

Reasoning models (such as o1 and DeepSeek R1) typically generate long reasoning chains and therefore more output tokens. Consider capping output length, or drafting with a cheaper model and switching to a more powerful one only when needed.
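
To see what the draft-then-escalate approach might save, here is a sketch of the expected cost per request. It assumes every request goes to the draft model first and a fixed fraction is then re-run on the stronger model. The token counts and the 20% escalation rate are made-up illustration values, and the two price pairs are table rows used only as stand-ins for a "cheap" and a "stronger" model.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Dollar cost of one request; prices are $ per 1M tokens."""
    return (input_tokens * input_price
            + output_tokens * output_price) / 1_000_000


def cascade_cost(draft_cost: float, strong_cost: float,
                 escalation_rate: float) -> float:
    """Expected cost per request when all requests hit the draft model
    and `escalation_rate` of them are re-run on the strong model."""
    return draft_cost + escalation_rate * strong_cost


# Hypothetical workload: the stronger model is assumed to emit 4x the output.
draft = request_cost(2_000, 1_000, 0.02, 0.10)   # gpt-oss-20b prices
strong = request_cost(2_000, 4_000, 0.05, 0.40)  # GPT-5 Nano prices
expected = cascade_cost(draft, strong, escalation_rate=0.2)

print(f"always strong: ${strong:.5f} per request")
print(f"cascade:       ${expected:.5f} per request")
```

The cascade only pays off while the escalation rate stays low; at a rate of 1.0 it costs more than calling the stronger model directly.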

We regularly update pricing data from official sources. Check the 'Updated' timestamp at the bottom of each page. For critical decisions, always verify with official pricing pages.

Enter your token usage in the calculator. The system will suggest models with similar context length and capabilities at lower prices.
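
A suggestion step like this could be sketched as a filter over the pricing table. The logic below is an assumption about how such a feature might work, not the site's implementation: "similar" is interpreted as a context window at least as large as the current model's and a capability set that covers it. The `Model` class and `cheaper_alternatives` function are hypothetical, and context sizes use decimal k (131k = 131,000).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_price: float    # $ per 1M input tokens
    output_price: float   # $ per 1M output tokens
    context: int          # context window in tokens
    capabilities: frozenset


def cost(model: Model, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of the given usage on one model."""
    return (input_tokens * model.input_price
            + output_tokens * model.output_price) / 1_000_000


def cheaper_alternatives(current, catalog, input_tokens, output_tokens):
    """Models that cover the current model's context and capabilities
    at a lower cost for this usage, cheapest first."""
    baseline = cost(current, input_tokens, output_tokens)
    candidates = [
        m for m in catalog
        if m.name != current.name
        and m.context >= current.context
        and m.capabilities >= current.capabilities
        and cost(m, input_tokens, output_tokens) < baseline
    ]
    return sorted(candidates, key=lambda m: cost(m, input_tokens, output_tokens))


BASE = frozenset({"chat", "reasoning", "tool_use"})
catalog = [  # a few paid rows from the pricing table
    Model("gpt-oss-20b", 0.02, 0.10, 131_000, BASE),
    Model("R1 Distill Llama 70B", 0.03, 0.11, 131_000, BASE),
    Model("GLM 4.5 Air", 0.05, 0.22, 131_000, BASE),
    Model("GPT-5 Nano", 0.05, 0.40, 400_000, BASE | {"vision"}),
    Model("ERNIE 4.5 21B A3B", 0.07, 0.28, 120_000, BASE),
]
current = catalog[2]  # GLM 4.5 Air

# Usage: 5M input and 1M output tokens per month.
for m in cheaper_alternatives(current, catalog, 5_000_000, 1_000_000):
    print(m.name, f"${cost(m, 5_000_000, 1_000_000):.2f}")
```

With this usage, GPT-5 Nano is excluded because it costs more, and ERNIE 4.5 21B A3B is excluded because its context window is smaller.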

Different channels (official API vs. cloud providers) may differ in pricing, billing methods, or regional fees. We use the provider_channel field to distinguish them.