AI Models 2026Complete Spec Sheets

Technical specifications, pricing, context window, and capabilities of 660 AI models from 104 companies. Sheets updated weekly with manufacturer data.

For performance ranking, visit the AI Benchmark.

660

Models

104

Companies

109

Open Source

155

Multimodal

AI21 Labs

7 models

Model	Context	Input Price	Output Price	OS	MM	API
AI21: Jamba Large 1.7 256K tokens·$2.00/1M	256K tokens	$2.00	$8.00	✅	—	✅
Jamba 1.5 Large $2.00/1M	—	$2.00	$8.00	—	—	✅
Jamba 1.5 Mini $0.20/1M	—	$0.20	$0.40	—	—	✅
Jamba 1.6 Large $2.00/1M	—	$2.00	$8.00	—	—	✅
Jamba 1.6 Mini $0.20/1M	—	$0.20	$0.40	—	—	✅
Jamba 1.7 Mini 0	—	—	—	—	—	✅
Jamba Reasoning 3B 0	—	—	—	—	—	✅

Adobe

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Adobe Image5	—	—	—	—	✅	✅

AionLabs

3 models

Model	Context	Input Price	Output Price	OS	MM	API
AionLabs: Aion-1.0 131K tokens·$4.00/1M	131K tokens	$4.00	$8.00	—	—	✅
AionLabs: Aion-2.0 131K tokens·$0.80/1M	131K tokens	$0.80	$1.60	—	—	✅
AionLabs: Aion-RP 1.0 (8B) 33K tokens·$0.80/1M	33K tokens	$0.80	$1.60	—	—	✅

AlfredPros

1 model

Model	Context	Input Price	Output Price	OS	MM	API
AlfredPros: CodeLLaMa 7B Instruct Solidity 4K tokens·$0.80/1M	4K tokens	$0.80	$1.20	✅	—	✅

Alibaba

65 models

Model	Context	Input Price	Output Price	OS	MM	API
Qwen Chat 14B 0	—	—	—	—	—	✅
Qwen Chat 72B 0	—	—	—	—	—	✅
Qwen: Qwen2.5 7B Instruct 33K tokens·$0.04/1M	33K tokens	$0.04	$0.10	✅	—	✅
Qwen: Qwen2.5 VL 72B Instruct 32K tokens·$0.25/1M	32K tokens	$0.25	$0.75	✅	✅	✅
Qwen: Qwen3 235B A22B Instruct 2507 262K tokens·$0.70/1M	262K tokens	$0.70	$2.80	✅	—	✅
Qwen: Qwen3 235B A22B Thinking 2507 131K tokens·$0.15/1M	131K tokens	$0.15	$1.50	✅	—	✅
Qwen: Qwen3 30B A3B Instruct 2507 262K tokens·$0.20/1M	262K tokens	$0.20	$0.80	✅	—	✅
Qwen: Qwen3 30B A3B Thinking 2507 131K tokens·$0.08/1M	131K tokens	$0.08	$0.40	✅	—	✅
Qwen: Qwen3 Coder 30B A3B Instruct 160K tokens·$0.45/1M	160K tokens	$0.45	$2.25	✅	—	✅
Qwen: Qwen3 Next 80B A3B Instruct 262K tokens·$0.50/1M	262K tokens	$0.50	$2.00	✅	—	✅
Qwen: Qwen3 VL 235B A22B Instruct 262K tokens·$0.70/1M	262K tokens	$0.70	$2.80	✅	✅	✅
Qwen: Qwen3 VL 30B A3B Instruct 131K tokens·$0.20/1M	131K tokens	$0.20	$0.80	✅	✅	✅
Qwen: Qwen3 VL 32B Instruct 131K tokens·$0.70/1M	131K tokens	$0.70	$2.80	✅	✅	✅
Qwen: Qwen3 VL 8B Instruct 131K tokens·$0.18/1M	131K tokens	$0.18	$0.70	✅	✅	✅
Qwen1.5 Chat 110B 0	—	—	—	—	—	✅
Qwen2 Instruct 72B 0	—	—	—	—	—	✅
Qwen2.5 72B Instruct 33K tokens·$0.47/1M	33K tokens	$0.47	$0.49	✅	—	✅
Qwen2.5 Coder 32B Instruct 33K tokens00	33K tokens	—	—	✅	—	✅
Qwen2.5 Coder Instruct 7B 0	—	—	—	—	—	✅
Qwen2.5 Instruct 32B 0	—	—	—	—	—	✅
Qwen2.5 Max 0	—	—	—	—	—	✅
Qwen3 0.6B (Non-reasoning) 0	—	—	—	—	—	✅
Qwen3 0.6B (Reasoning) 0	—	—	—	—	—	✅
Qwen3 1.7B (Non-reasoning) 0	—	—	—	—	—	✅
Qwen3 1.7B (Reasoning) 0	—	—	—	—	—	✅
Qwen3 14B (Non-reasoning) $0.35/1M	—	$0.35	$1.40	—	—	✅
Qwen3 14B (Reasoning) $0.35/1M	—	$0.35	$4.20	—	—	✅
Qwen3 235B A22B (Reasoning) $0.70/1M	—	$0.70	$8.40	—	—	✅
Qwen3 30B A3B (Reasoning) $0.20/1M	—	$0.20	$2.40	—	—	✅
Qwen3 30B A3B 2507 (Reasoning) $0.20/1M	—	$0.20	$2.40	—	—	✅
Qwen3 30B A3B 2507 Instruct $0.20/1M	—	$0.20	$0.80	—	—	✅
Qwen3 32B (Non-reasoning) $0.70/1M	—	$0.70	$2.80	—	—	✅
Qwen3 32B (Reasoning) $0.70/1M	—	$0.70	$8.40	—	—	✅
Qwen3 4B (Non-reasoning) 0	—	—	—	—	—	✅
Qwen3 4B (Reasoning) 0	—	—	—	—	—	✅
Qwen3 4B 2507 (Reasoning) 0	—	—	—	—	—	✅
Qwen3 4B 2507 Instruct 0	—	—	—	—	—	✅
Qwen3 8B (Non-reasoning) $0.18/1M	—	$0.18	$0.70	—	—	✅
Qwen3 8B (Reasoning) $0.18/1M	—	$0.18	$2.10	—	—	✅
Qwen3 Coder 480B A35B Instruct $1.50/1M	—	$1.50	$7.50	—	—	✅
Qwen3 Max (Preview) $1.20/1M	—	$1.20	$6.00	—	—	✅
Qwen3 Max Thinking (Preview) $1.20/1M	—	$1.20	$6.00	—	—	✅
Qwen3 Next 80B A3B (Reasoning) $0.50/1M	—	$0.50	$6.00	—	—	✅
Qwen3 Omni 30B A3B (Reasoning) $0.25/1M	—	$0.25	$0.97	—	—	✅
Qwen3 Omni 30B A3B Instruct $0.25/1M	—	$0.25	$0.97	—	—	✅
Qwen3 VL 235B A22B (Reasoning) $0.70/1M	—	$0.70	$8.40	—	—	✅
Qwen3 VL 30B A3B (Reasoning) $0.20/1M	—	$0.20	$2.40	—	—	✅
Qwen3 VL 32B (Reasoning) $0.70/1M	—	$0.70	$8.40	—	—	✅
Qwen3 VL 4B (Reasoning) 0	—	—	—	—	—	✅
Qwen3 VL 4B Instruct 0	—	—	—	—	—	✅
Qwen3 VL 8B (Reasoning) $0.18/1M	—	$0.18	$2.10	—	—	✅
Qwen3.5 0.8B (Non-reasoning) 0	—	—	—	—	—	✅
Qwen3.5 0.8B (Reasoning) 0	—	—	—	—	—	✅
Qwen3.5 2B (Reasoning) 0	—	—	—	—	—	✅
Qwen3.5 4B (Non-reasoning) $0.03/1M	—	$0.03	$0.15	—	—	✅
Qwen3.5 4B (Reasoning) $0.03/1M	—	$0.03	$0.15	—	—	✅
Qwen3.5 9B (Reasoning) 0	—	—	—	—	—	✅
Qwen3.5 Omni Flash $0.10/1M	—	$0.10	$0.80	—	—	✅
Qwen3.5 Omni Plus $0.40/1M	—	$0.40	$4.80	—	—	✅
Qwen3.6 Max Preview $1.30/1M	—	$1.30	$7.80	—	—	✅
Qwen3.7 Max $2.50/1M	—	$2.50	$7.50	—	—	✅
Qwen3.7 Plus $0.40/1M	—	$0.40	$1.60	—	—	✅
QwQ 32B $0.66/1M	—	$0.66	$1.00	—	—	✅
QwQ 32B-Preview 0	—	—	—	—	—	✅
Wan 2.1	—	—	—	✅	—	✅

Allen Institute for AI

8 models

Model	Context	Input Price	Output Price	OS	MM	API
Llama 3.1 Tulu3 405B 0	—	—	—	—	—	✅
Molmo 7B-D 0	—	—	—	—	—	✅
Molmo2-8B 0	—	—	—	—	—	✅
OLMo 2 32B 0	—	—	—	—	—	✅
OLMo 2 7B 0	—	—	—	—	—	✅
Olmo 3 7B Instruct $0.10/1M	—	$0.10	$0.20	—	—	✅
Olmo 3 7B Think 0	—	—	—	—	—	✅
Olmo 3.1 32B Think 0	—	—	—	—	—	✅

AllenAI

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Olmo 3 32B Think 66K tokens00	66K tokens	—	—	✅	—	✅
Olmo 3.1 32B Instruct 66K tokens00	66K tokens	—	—	✅	—	✅

Amazon

13 models

Model	Context	Input Price	Output Price	OS	MM	API
Amazon: Nova 2 Lite 1.0M tokens·$0.30/1M	1.0M tokens	$0.30	$2.50	—	✅	✅
Amazon: Nova Lite 1.0 300K tokens·$0.06/1M	300K tokens	$0.06	$0.24	—	✅	✅
Amazon: Nova Micro 1.0 128K tokens·$0.04/1M	128K tokens	$0.04	$0.14	—	—	✅
Amazon: Nova Premier 1.0 1.0M tokens·$2.50/1M	1.0M tokens	$2.50	$12.50	—	✅	✅
Amazon: Nova Pro 1.0 300K tokens·$0.80/1M	300K tokens	$0.80	$3.20	—	✅	✅
Nova 2.0 Lite (high) $0.30/1M	—	$0.30	$2.50	—	—	✅
Nova 2.0 Omni (low) $0.30/1M	—	$0.30	$2.50	—	—	✅
Nova 2.0 Omni (medium) $0.30/1M	—	$0.30	$2.50	—	—	✅
Nova 2.0 Omni (Non-reasoning) $0.30/1M	—	$0.30	$2.50	—	—	✅
Nova 2.0 Pro Preview (medium) $1.25/1M	—	$1.25	$10.00	—	—	✅
Nova Lite $0.06/1M	—	$0.06	$0.24	—	—	✅
Nova Micro $0.04/1M	—	$0.04	$0.14	—	—	✅
Nova Pro $0.80/1M	—	$0.80	$3.20	—	—	✅

Anthropic

38 models

Model	Context	Input Price	Output Price	OS	MM	API
Anthropic: Claude 3 Haiku 200K tokens·$0.25/1M	200K tokens	$0.25	$1.25	—	✅	✅
Claude 2.0 0	—	—	—	—	—	✅
Claude 2.1 0	—	—	—	—	—	✅
Claude 3 Opus $15.00/1M	—	$15.00	$75.00	—	—	✅
Claude 3 Sonnet $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude 3.5	—	—	—	—	—	✅
Claude 3.5 Haiku 200K tokens·$0.80/1M	200K tokens	$0.80	$4.00	—	✅	✅
Claude 3.5 Sonnet (June '24) $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude 3.5 Sonnet (Oct '24) $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude 3.7 Sonnet 200K tokens·$3.00/1M	200K tokens	$3.00	$15.00	—	✅	✅
Claude 3.7 Sonnet (thinking) 200K tokens00	200K tokens	—	—	—	✅	✅
Claude 4 Opus (Reasoning) $15.00/1M	—	$15.00	$75.00	—	—	✅
Claude 4 Sonnet (Reasoning) $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude 4.1 Opus (Non-reasoning) $15.00/1M	—	$15.00	$75.00	—	—	✅
Claude 4.1 Opus (Reasoning) $15.00/1M	—	$15.00	$75.00	—	—	✅
Claude 4.5 Haiku (Reasoning) $1.00/1M	—	$1.00	$5.00	—	—	✅
Claude 4.5 Sonnet (Non-reasoning) $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude 4.5 Sonnet (Reasoning) $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude Code	—	—	—	—	—	✅
Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) 1.0M tokens·$10.00/1M	1.0M tokens	$10.00	$50.00	—	—	✅
Claude Haiku 4.5 200K tokens·$1.00/1M	200K tokens	$1.00	$5.00	—	✅	✅
Claude Instant 0	—	—	—	—	—	✅
Claude Opus 4 200K tokens·$15.00/1M	200K tokens	$15.00	$75.00	—	✅	✅
Claude Opus 4.1 200K tokens·$15.00/1M	200K tokens	$15.00	$75.00	—	✅	✅
Claude Opus 4.5 200K tokens·$5.00/1M	200K tokens	$5.00	$25.00	—	✅	✅
Claude Opus 4.5 (Reasoning) $5.00/1M	—	$5.00	$25.00	—	—	✅
Claude Opus 4.6 1.0M tokens·$5.00/1M	1.0M tokens	$5.00	$25.00	—	✅	✅
Claude Opus 4.6 (Adaptive Reasoning, Max Effort) $5.00/1M	—	$5.00	$25.00	—	—	✅
Claude Opus 4.6 (Fast) 1.0M tokens·$30.00/1M	1.0M tokens	$30.00	$150.00	—	✅	✅
Claude Opus 4.7 1.0M tokens·$5.00/1M	1.0M tokens	$5.00	$25.00	—	✅	✅
Claude Opus 4.7 (Fast) 1.0M tokens·$30.00/1M	1.0M tokens	$30.00	$150.00	—	✅	✅
Claude Opus 4.8 (Adaptive Reasoning, Max Effort) 1.0M tokens·$5.00/1M	1.0M tokens	$5.00	$25.00	—	—	✅
Claude Sonnet 4 1.0M tokens·$3.00/1M	1.0M tokens	$3.00	$15.00	—	✅	✅
Claude Sonnet 4.5 1.0M tokens·$3.00/1M	1.0M tokens	$3.00	$15.00	—	✅	✅
Claude Sonnet 4.6 1.0M tokens·$3.00/1M	1.0M tokens	$3.00	$15.00	—	✅	✅
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) $3.00/1M	—	$3.00	$15.00	—	—	✅
Claude Sonnet 4.6 (Non-reasoning, Low Effort) $3.00/1M	—	$3.00	$15.00	—	—	✅
Opus 4.7	—	—	—	—	—	✅

Arcee AI

7 models

Model	Context	Input Price	Output Price	OS	MM	API
Arcee AI: Coder Large 33K tokens·$0.50/1M	33K tokens	$0.50	$0.80	—	—	✅
Arcee AI: Maestro Reasoning 131K tokens·$0.90/1M	131K tokens	$0.90	$3.30	—	—	✅
Arcee AI: Spotlight 131K tokens·$0.18/1M	131K tokens	$0.18	$0.18	—	✅	✅
Arcee AI: Trinity Large Thinking 262K tokens·$0.22/1M	262K tokens	$0.22	$0.85	✅	—	✅
Arcee AI: Trinity Mini 131K tokens·$0.04/1M	131K tokens	$0.04	$0.15	✅	—	✅
Arcee AI: Virtuoso Large 131K tokens·$0.75/1M	131K tokens	$0.75	$1.20	—	—	✅
Trinity Large Thinking $0.23/1M	—	$0.23	$0.88	—	—	✅

Baidu

5 models

Model	Context	Input Price	Output Price	OS	MM	API
Baidu: ERNIE 4.5 21B A3B Thinking 131K tokens·$0.07/1M	131K tokens	$0.07	$0.28	✅	—	✅
Baidu: ERNIE 4.5 300B A47B 123K tokens·$0.28/1M	123K tokens	$0.28	$1.10	✅	—	✅
Baidu: ERNIE 4.5 VL 28B A3B 30K tokens·$0.14/1M	30K tokens	$0.14	$0.56	✅	✅	✅
Baidu: ERNIE 4.5 VL 424B A47B 123K tokens·$0.42/1M	123K tokens	$0.42	$1.25	✅	✅	✅
ERNIE 5.0 Thinking Preview 0	—	—	—	—	—	✅

Black Forest Labs

1 model

Model	Context	Input Price	Output Price	OS	MM	API
FLUX1.1 [pro]	—	—	—	—	✅	✅

ByteDance

1 model

Model	Context	Input Price	Output Price	OS	MM	API
ByteDance: UI-TARS 7B 128K tokens·$0.10/1M	128K tokens	$0.10	$0.20	✅	✅	✅

ByteDance Seed

4 models

Model	Context	Input Price	Output Price	OS	MM	API
ByteDance Seed: Seed 1.6 Flash 262K tokens·$0.07/1M	262K tokens	$0.07	$0.30	—	✅	✅
ByteDance Seed: Seed-2.0-Lite 262K tokens·$0.25/1M	262K tokens	$0.25	$2.00	—	✅	✅
Doubao Seed Code 0	—	—	—	—	—	✅
Seed-OSS-36B-Instruct $0.21/1M	—	$0.21	$0.57	—	—	✅

China Mobile

3 models

Model	Context	Input Price	Output Price	OS	MM	API
JT-35B-Flash 0	—	—	—	—	—	✅
JT-4.1 Flash 236B A21B 0	—	—	—	—	—	✅
JT-MINI 0	—	—	—	—	—	✅

Cognition

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Devin	—	—	—	—	—	✅

Cohere

7 models

Model	Context	Input Price	Output Price	OS	MM	API
Cohere: Command R+ (08-2024) 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	—	✅
Cohere: Command R7B (12-2024) 128K tokens·$0.04/1M	128K tokens	$0.04	$0.15	—	—	✅
Command A+ 0	—	—	—	—	—	✅
Command-R (Mar '24) $0.50/1M	—	$0.50	$1.50	—	—	✅
Command-R+ (Apr '24) $3.00/1M	—	$3.00	$15.00	—	—	✅
North Mini Code 0	—	—	—	—	—	✅
Tiny Aya Global 0	—	—	—	—	—	✅

Databricks

1 model

Model	Context	Input Price	Output Price	OS	MM	API
DBRX Instruct 0	—	—	—	—	—	✅

Deep Cogito

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Cogito v2.1 (Reasoning) $1.25/1M	—	$1.25	$1.25	—	—	✅
Deep Cogito: Cogito v2.1 671B 128K tokens·$1.25/1M	128K tokens	$1.25	$1.25	—	—	✅

DeepSeek

25 models

Model	Context	Input Price	Output Price	OS	MM	API
DeepSeek Coder V2 Lite Instruct 0	—	—	—	—	—	✅
DeepSeek LLM 67B Chat (V1) 0	—	—	—	—	—	✅
DeepSeek R1 (Jan '25) $1.35/1M	—	$1.35	$4.20	—	—	✅
DeepSeek R1 0528 Qwen3 8B 0	—	—	—	—	—	✅
DeepSeek R1 Distill Llama 8B 0	—	—	—	—	—	✅
DeepSeek R1 Distill Qwen 1.5B 0	—	—	—	—	—	✅
DeepSeek R1 Distill Qwen 14B 0	—	—	—	—	—	✅
DeepSeek V3 131K tokens·$0.36/1M	131K tokens	$0.36	$0.89	✅	—	✅
DeepSeek V3 0324 $1.14/1M	—	$1.14	$1.25	—	—	✅
DeepSeek V3.1 164K tokens·$0.59/1M	164K tokens	$0.59	$1.69	✅	—	✅
DeepSeek V3.1 Terminus 131K tokens·$1.64/1M	131K tokens	$1.64	$2.75	✅	—	✅
DeepSeek V3.2 164K tokens·$0.28/1M	164K tokens	$0.28	$0.42	✅	—	✅
DeepSeek V3.2 Exp 164K tokens·$0.27/1M	164K tokens	$0.27	$0.41	✅	—	✅
DeepSeek V3.2 Exp (Non-reasoning) $0.28/1M	—	$0.28	$0.42	—	—	✅
DeepSeek V3.2 Exp (Reasoning) $0.28/1M	—	$0.28	$0.42	—	—	✅
DeepSeek V3.2 Speciale 164K tokens00	164K tokens	—	—	✅	—	✅
DeepSeek V4 Flash 1.0M tokens·$0.14/1M	1.0M tokens	$0.14	$0.28	✅	—	✅
DeepSeek V4 Pro 1.0M tokens·$0.43/1M	1.0M tokens	$0.43	$0.87	✅	—	✅
DeepSeek-Coder-V2 0	—	—	—	—	—	✅
DeepSeek-V2-Chat 0	—	—	—	—	—	✅
DeepSeek-V2.5 0	—	—	—	—	—	✅
DeepSeek-V2.5 (Dec '24) 0	—	—	—	—	—	✅
DeepSeek: R1 164K tokens·$0.70/1M	164K tokens	$0.70	$2.50	✅	—	✅
DeepSeek: R1 Distill Qwen 32B 128K tokens00	128K tokens	—	—	✅	—	✅
R1 Distill Llama 70B 128K tokens·$0.70/1M	128K tokens	$0.70	$1.05	✅	—	✅

ElevenLabs

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Eleven v3	—	—	—	—	✅	✅
ElevenAgents	—	—	—	—	✅	✅

EssentialAI

1 model

Model	Context	Input Price	Output Price	OS	MM	API
EssentialAI: Rnj 1 Instruct 33K tokens·$0.15/1M	33K tokens	$0.15	$0.15	✅	—	✅

Goliath 120B

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Goliath 120B 6K tokens·$3.75/1M	6K tokens	$3.75	$7.50	✅	—	✅

Google

68 models

Model	Context	Input Price	Output Price	OS	MM	API
DiffusionGemma 26B A4B 0	—	—	—	—	—	✅
Gemini 1.0 Pro 0	—	—	—	—	—	✅
Gemini 1.0 Ultra 0	—	—	—	—	—	✅
Gemini 1.5 Flash (May '24) 0	—	—	—	—	—	✅
Gemini 1.5 Flash (Sep '24) 0	—	—	—	—	—	✅
Gemini 1.5 Flash-8B 0	—	—	—	—	—	✅
Gemini 1.5 Pro (May '24) 0	—	—	—	—	—	✅
Gemini 1.5 Pro (Sep '24) 0	—	—	—	—	—	✅
Gemini 2.0 Flash 1.0M tokens·$0.15/1M	1.0M tokens	$0.15	$0.60	—	✅	✅
Gemini 2.0 Flash (experimental) 0	—	—	—	—	—	✅
Gemini 2.0 Flash Lite 1.0M tokens·$0.07/1M	1.0M tokens	$0.07	$0.30	—	✅	✅
Gemini 2.0 Flash Thinking Experimental (Dec '24) 0	—	—	—	—	—	✅
Gemini 2.0 Flash Thinking Experimental (Jan '25) 0	—	—	—	—	—	✅
Gemini 2.0 Flash-Lite (Feb '25) 0	—	—	—	—	—	✅
Gemini 2.0 Flash-Lite (Preview) 0	—	—	—	—	—	✅
Gemini 2.0 Pro Experimental (Feb '25) 0	—	—	—	—	—	✅
Gemini 2.5	—	—	—	—	—	✅
Gemini 2.5 Flash 1.0M tokens·$0.30/1M	1.0M tokens	$0.30	$2.50	—	✅	✅
Gemini 2.5 Flash Lite 1.0M tokens·$0.10/1M	1.0M tokens	$0.10	$0.40	—	✅	✅
Gemini 2.5 Flash Preview (Non-reasoning) 0	—	—	—	—	—	✅
Gemini 2.5 Flash Preview (Reasoning) $0.30/1M	—	$0.30	$2.50	—	—	✅
Gemini 2.5 Flash Preview (Sep '25) (Reasoning) 0	—	—	—	—	—	✅
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) $0.10/1M	—	$0.10	$0.40	—	—	✅
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) $0.10/1M	—	$0.10	$0.40	—	—	✅
Gemini 2.5 Pro 1.0M tokens·$1.25/1M	1.0M tokens	$1.25	$10.00	—	✅	✅
Gemini 2.5 Pro Preview (Mar' 25) 0	—	—	—	—	—	✅
Gemini 2.5 Pro Preview (May' 25) $1.25/1M	—	$1.25	$10.00	—	—	✅
Gemini 2.5 Pro Preview 05-06 1.0M tokens·$1.25/1M	1.0M tokens	$1.25	$10.00	—	✅	✅
Gemini 2.5 Pro Preview 06-05 1.0M tokens·$1.25/1M	1.0M tokens	$1.25	$10.00	—	✅	✅
Gemini 3 Deep Think 0	—	—	—	—	—	✅
Gemini 3 Flash Preview (Non-reasoning) $0.50/1M	—	$0.50	$3.00	—	—	✅
Gemini 3 Flash Preview (Reasoning) $0.50/1M	—	$0.50	$3.00	—	—	✅
Gemini 3 Pro Preview (high) $2.00/1M	—	$2.00	$12.00	—	—	✅
Gemini 3 Pro Preview (low) $2.00/1M	—	$2.00	$12.00	—	—	✅
Gemini 3.1 Flash Lite 1.0M tokens·$0.25/1M	1.0M tokens	$0.25	$1.50	—	✅	✅
Gemini 3.1 Flash Lite Preview 1.0M tokens·$0.25/1M	1.0M tokens	$0.25	$1.50	—	✅	✅
Gemini 3.1 Pro Preview 1.0M tokens·$2.00/1M	1.0M tokens	$2.00	$12.00	—	✅	✅
Gemini 3.1 Pro Preview Custom Tools 1.0M tokens·$2.00/1M	1.0M tokens	$2.00	$12.00	—	✅	✅
Gemini 3.5	—	—	—	—	—	✅
Gemini 3.5 Flash (minimal) $1.50/1M	—	$1.50	$9.00	—	—	✅
Gemma 2 27B 8K tokens·$0.65/1M	8K tokens	$0.65	$0.65	—	—	✅
Gemma 3 12B 131K tokens00	131K tokens	—	—	—	✅	✅
Gemma 3 1B Instruct 0	—	—	—	—	—	✅
Gemma 3 270M 0	—	—	—	—	—	✅
Gemma 3 27B 131K tokens00	131K tokens	—	—	—	✅	✅
Gemma 3 4B 131K tokens00	131K tokens	—	—	—	✅	✅
Gemma 3n 4B 33K tokens·$0.06/1M	33K tokens	$0.06	$0.12	—	—	✅
Gemma 3n E2B Instruct 0	—	—	—	—	—	✅
Gemma 3n E4B Instruct $0.02/1M	—	$0.02	$0.04	—	—	✅
Gemma 3n E4B Instruct Preview (May '25) 0	—	—	—	—	—	✅
Gemma 4 12B (Reasoning) $0.10/1M	—	$0.10	$0.30	—	—	✅
Gemma 4 26B A4B 262K tokens·$0.13/1M	262K tokens	$0.13	$0.40	—	✅	✅
Gemma 4 31B 262K tokens·$0.14/1M	262K tokens	$0.14	$0.40	—	✅	✅
Gemma 4 E2B (Non-reasoning) 0	—	—	—	—	—	✅
Gemma 4 E2B (Reasoning) 0	—	—	—	—	—	✅
Gemma 4 E4B (Non-reasoning) 0	—	—	—	—	—	✅
Gemma 4 E4B (Reasoning) 0	—	—	—	—	—	✅
Google: Gemini 3.5 Flash 1.0M tokens·$1.50/1M	1.0M tokens	$1.50	$9.00	—	✅	✅
Google: Nano Banana 2 (Gemini 3.1 Flash Image) 131K tokens·$0.50/1M	131K tokens	$0.50	$3.00	—	✅	✅
Google: Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) 66K tokens·$0.25/1M	66K tokens	$0.25	$1.50	—	✅	✅
Google: Nano Banana Pro (Gemini 3 Pro Image) 66K tokens·$2.00/1M	66K tokens	$2.00	$12.00	—	✅	✅
Imagen 4	—	—	—	—	✅	✅
Lyria 3 Clip Preview 1.0M tokens	1.0M tokens	—	—	—	✅	✅
Lyria 3 Pro Preview 1.0M tokens	1.0M tokens	—	—	—	✅	✅
Nano Banana (Gemini 2.5 Flash Image) 33K tokens·$0.30/1M	33K tokens	$0.30	$2.50	—	✅	✅
Nano Banana 2 (Gemini 3.1 Flash Image Preview) 131K tokens·$0.50/1M	131K tokens	$0.50	$3.00	—	✅	✅
Nano Banana Pro (Gemini 3 Pro Image Preview) 66K tokens·$2.00/1M	66K tokens	$2.00	$12.00	—	✅	✅
PALM-2 0	—	—	—	—	—	✅

IBM

10 models

Model	Context	Input Price	Output Price	OS	MM	API
Granite 3.3 8B (Non-reasoning) $0.03/1M	—	$0.03	$0.25	—	—	✅
Granite 4.0 1B 0	—	—	—	—	—	✅
Granite 4.0 350M 0	—	—	—	—	—	✅
Granite 4.0 H 1B 0	—	—	—	—	—	✅
Granite 4.0 H 350M 0	—	—	—	—	—	✅
Granite 4.0 H Small $0.06/1M	—	$0.06	$0.25	—	—	✅
Granite 4.0 Micro 131K tokens00	131K tokens	—	—	✅	—	✅
Granite 4.1 30B 0	—	—	—	—	—	✅
Granite 4.1 3B 0	—	—	—	—	—	✅
Granite 4.1 8B $0.05/1M	—	$0.05	$0.10	—	—	✅

Ideogram

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Ideogram 4.0	—	—	—	✅	✅	✅

Inception

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Inception: Mercury 2 128K tokens·$0.25/1M	128K tokens	$0.25	$0.75	—	—	✅

Inclusion AI

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Ling 2.6 Flash $0.10/1M	—	$0.10	$0.30	—	—	✅
Ling-2.6-1T $0.30/1M	—	$0.30	$2.50	—	—	✅

InclusionAI

6 models

Model	Context	Input Price	Output Price	OS	MM	API
Ling-1T 0	—	—	—	—	—	✅
Ling-flash-2.0 $0.14/1M	—	$0.14	$0.57	—	—	✅
Ling-mini-2.0 0	—	—	—	—	—	✅
Ring-1T 0	—	—	—	—	—	✅
Ring-2.6-1T $0.30/1M	—	$0.30	$2.50	—	—	✅
Ring-flash-2.0 $0.14/1M	—	$0.14	$0.57	—	—	✅

Inflection

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Inflection: Inflection 3 Pi 8K tokens·$2.50/1M	8K tokens	$2.50	$10.00	—	—	✅
Inflection: Inflection 3 Productivity 8K tokens·$2.50/1M	8K tokens	$2.50	$10.00	—	—	✅

Kimi

4 models

Model	Context	Input Price	Output Price	OS	MM	API
Kimi K2 Thinking 262K tokens·$0.60/1M	262K tokens	$0.60	$2.50	—	—	✅
Kimi K2.7 Code $0.95/1M	—	$0.95	$4.00	—	—	✅
Kimi K3 $3.00/1M	—	$3.00	$15.00	—	—	✅
Kimi Linear 48B A3B Instruct 0	—	—	—	—	—	✅

Kling AI

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Kling 2.1	—	—	—	—	✅	—

Korea Telecom

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Mi:dm K 2.5 Pro 0	—	—	—	—	—	✅
Mi:dm K 2.5 Pro Preview 0	—	—	—	—	—	✅

Kuaishou

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Kling AI 2.0	—	—	—	—	—	✅

KwaiKAT

1 model

Model	Context	Input Price	Output Price	OS	MM	API
KAT-Coder-Pro V1 0	—	—	—	—	—	✅

Kwaipilot

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Kwaipilot: KAT-Coder-Pro V2 256K tokens·$0.30/1M	256K tokens	$0.30	$1.20	—	—	✅

LG AI

2 models

Model	Context	Input Price	Output Price	OS	MM	API
EXAONE 4.5 33B 0	—	—	—	—	—	✅
K-EXAONE (Reasoning) 0	—	—	—	—	—	✅

LG AI Research

3 models

Model	Context	Input Price	Output Price	OS	MM	API
Exaone 4.0 1.2B (Non-reasoning) 0	—	—	—	—	—	✅
EXAONE 4.0 32B (Non-reasoning) 0	—	—	—	—	—	✅
EXAONE 4.0 32B (Reasoning) 0	—	—	—	—	—	✅

Liquid AI

8 models

Model	Context	Input Price	Output Price	OS	MM	API
LFM 40B 0	—	—	—	—	—	✅
LFM2 1.2B 0	—	—	—	—	—	✅
LFM2 2.6B 0	—	—	—	—	—	✅
LFM2 8B A1B 0	—	—	—	—	—	✅
LFM2.5-1.2B-Instruct 0	—	—	—	—	—	✅
LFM2.5-1.2B-Thinking 0	—	—	—	—	—	✅
LFM2.5-8B-A1B 0	—	—	—	—	—	✅
LFM2.5-VL-1.6B 0	—	—	—	—	—	✅

LiquidAI

1 model

Model	Context	Input Price	Output Price	OS	MM	API
LFM2-24B-A2B 33K tokens00	33K tokens	—	—	✅	—	✅

LongCat

2 models

Model	Context	Input Price	Output Price	OS	MM	API
LongCat 2.0 0	—	—	—	—	—	✅
LongCat Flash Lite 0	—	—	—	—	—	✅

Luma AI

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Luma Dream Machine 1.6	—	—	—	—	—	✅
Luma Ray3.2	—	—	—	—	✅	✅

MBZUAI Institute of Foundation Models

3 models

Model	Context	Input Price	Output Price	OS	MM	API
K2 Think V2 0	—	—	—	—	—	✅
K2-V2 (high) 0	—	—	—	—	—	✅
K2-V2 (medium) 0	—	—	—	—	—	✅

Magnum v4 72B

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Magnum v4 72B 16K tokens·$3.00/1M	16K tokens	$3.00	$5.00	✅	—	✅

Mancer

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Mancer: Weaver (alpha) 8K tokens·$0.75/1M	8K tokens	$0.75	$1.00	—	—	✅

Model	Context	Input Price	Output Price	OS	MM	API
Llama 2 Chat 13B 0	—	—	—	—	—	✅
Llama 2 Chat 70B 0	—	—	—	—	—	✅
Llama 2 Chat 7B $0.05/1M	—	$0.05	$0.25	—	—	✅
Llama 3 70B Instruct 8K tokens·$0.65/1M	8K tokens	$0.65	$2.75	✅	—	✅
Llama 3 8B Instruct 8K tokens·$0.04/1M	8K tokens	$0.04	$0.14	✅	—	✅
Llama 3.1 70B Instruct 131K tokens·$0.56/1M	131K tokens	$0.56	$0.56	✅	—	✅
Llama 3.1 8B Instruct 16K tokens·$0.10/1M	16K tokens	$0.10	$0.10	✅	—	✅
Llama 3.1 Instruct 405B $2.75/1M	—	$2.75	$6.50	—	—	✅
Llama 3.2 11B Vision Instruct 131K tokens·$0.34/1M	131K tokens	$0.34	$0.34	✅	✅	✅
Llama 3.2 1B Instruct 60K tokens·$0.10/1M	60K tokens	$0.10	$0.10	✅	—	✅
Llama 3.2 3B Instruct 80K tokens·$0.15/1M	80K tokens	$0.15	$0.15	✅	—	✅
Llama 3.2 Instruct 90B (Vision) $1.38/1M	—	$1.38	$1.38	—	—	✅
Llama 3.3 70B Instruct 131K tokens·$0.58/1M	131K tokens	$0.58	$0.71	✅	—	✅
Llama 4 Maverick 1.0M tokens·$0.35/1M	1.0M tokens	$0.35	$0.85	✅	✅	✅
Llama 4 Scout 10.0M tokens·$0.17/1M	10.0M tokens	$0.17	$0.63	✅	✅	✅
Llama 65B 0	—	—	—	—	—	✅
Llama Guard 3 8B 131K tokens·$0.48/1M	131K tokens	$0.48	$0.03	✅	—	✅
Llama Guard 4 12B 164K tokens·$0.18/1M	164K tokens	$0.18	$0.18	✅	✅	✅
Muse Spark 0	—	—	—	—	—	✅
Muse Spark 1.1 (xhigh) $1.25/1M	—	$1.25	$4.25	—	—	✅

Microsoft

5 models

Model	Context	Input Price	Output Price	OS	MM	API
Microsoft: Phi 4 16K tokens·$0.13/1M	16K tokens	$0.13	$0.50	✅	—	✅
Phi-3 Mini Instruct 3.8B 0	—	—	—	—	—	✅
Phi-4 Mini Instruct 0	—	—	—	—	—	✅
Phi-4 Multimodal Instruct 0	—	—	—	—	—	✅
WizardLM-2 8x22B 66K tokens·$0.62/1M	66K tokens	$0.62	$0.62	✅	—	✅

Midjourney

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Midjourney V8.1	—	—	—	—	—	—

MiniMax

14 models

Model	Context	Input Price	Output Price	OS	MM	API
Hailuo 2.3	—	—	—	—	✅	✅
Hailuo MiniMax Video-01	—	—	—	—	—	✅
MiniMax M1 40k 0	—	—	—	—	—	✅
MiniMax M1 80k $0.55/1M	—	$0.55	$2.20	—	—	✅
MiniMax-M2 205K tokens·$0.30/1M	205K tokens	$0.30	$1.20	—	—	✅
MiniMax-M3 1.0M tokens·$0.30/1M	1.0M tokens	$0.30	$1.20	—	—	✅
MiniMax: MiniMax M1 1.0M tokens·$0.40/1M	1.0M tokens	$0.40	$2.20	—	—	✅
MiniMax: MiniMax M2-her 66K tokens·$0.30/1M	66K tokens	$0.30	$1.20	—	—	✅
MiniMax: MiniMax M2.1 197K tokens·$0.30/1M	197K tokens	$0.30	$1.20	✅	—	✅
MiniMax: MiniMax M2.5 197K tokens·$0.30/1M	197K tokens	$0.30	$1.20	✅	—	✅
MiniMax: MiniMax M2.7 197K tokens·$0.30/1M	197K tokens	$0.30	$1.20	✅	—	✅
MiniMax: MiniMax-01 1.0M tokens·$0.20/1M	1.0M tokens	$0.20	$1.10	✅	✅	✅
Music 2.6	—	—	—	—	✅	✅
Speech 2.8	—	—	—	—	✅	✅

Mistral

21 models

Model	Context	Input Price	Output Price	OS	MM	API
Devstral 2 0	—	—	—	—	—	✅
Devstral Small (Jul '25) 131K tokens·$0.10/1M	131K tokens	$0.10	$0.30	—	—	✅
Devstral Small (May '25) 0	—	—	—	—	—	✅
Devstral Small 2 0	—	—	—	—	—	✅
Magistral Medium 1 0	—	—	—	—	—	✅
Magistral Small 1 0	—	—	—	—	—	✅
Magistral Small 1.2 $0.50/1M	—	$0.50	$1.50	—	—	✅
Ministral 3 14B $0.20/1M	—	$0.20	$0.20	—	—	✅
Ministral 3 3B $0.10/1M	—	$0.10	$0.10	—	—	✅
Ministral 3 8B $0.15/1M	—	$0.15	$0.15	—	—	✅
Mistral 7B Instruct $0.25/1M	—	$0.25	$0.25	—	—	✅
Mistral Large 2 (Jul '24) 131K tokens·$2.00/1M	131K tokens	$2.00	$6.00	—	—	✅
Mistral Large 2 (Nov '24) $2.00/1M	—	$2.00	$6.00	—	—	✅
Mistral Large 3 $4.00/1M	—	$4.00	$12.00	—	—	✅
Mistral Medium $2.75/1M	—	$2.75	$8.10	—	—	✅
Mistral Small (Feb '24) $1.00/1M	—	$1.00	$3.00	—	—	✅
Mistral Small (Sep '24) $0.20/1M	—	$0.20	$0.60	—	—	✅
Mistral Small 3 $0.10/1M	—	$0.10	$0.30	—	—	✅
Mistral Small 3.1 $0.10/1M	—	$0.10	$0.30	—	—	✅
Mistral Small 3.2 $0.10/1M	—	$0.10	$0.30	—	—	✅
Mixtral 8x22B Instruct 0	—	—	—	—	—	✅

Mistral AI

22 models

Model	Context	Input Price	Output Price	OS	MM	API
Magistral Medium 1.2 $2.00/1M	—	$2.00	$5.00	—	—	✅
Mistral	—	—	—	—	—	✅
Mistral Large 128K tokens·$2.00/1M	128K tokens	$2.00	$6.00	✅	—	✅
Mistral: Codestral 2508 256K tokens·$0.30/1M	256K tokens	$0.30	$0.90	—	—	✅
Mistral: Devstral 2 2512 262K tokens·$0.40/1M	262K tokens	$0.40	$2.00	✅	—	✅
Mistral: Devstral Medium 131K tokens·$0.40/1M	131K tokens	$0.40	$2.00	✅	—	✅
Mistral: Ministral 3 14B 2512 262K tokens·$0.20/1M	262K tokens	$0.20	$0.20	✅	✅	✅
Mistral: Ministral 3 8B 2512 262K tokens·$0.15/1M	262K tokens	$0.15	$0.15	✅	✅	✅
Mistral: Mistral 7B Instruct v0.1 3K tokens·$0.11/1M	3K tokens	$0.11	$0.19	✅	—	✅
Mistral: Mistral Medium 3 131K tokens·$0.40/1M	131K tokens	$0.40	$2.00	✅	✅	✅
Mistral: Mistral Medium 3.1 131K tokens·$0.40/1M	131K tokens	$0.40	$2.00	✅	✅	✅
Mistral: Mistral Medium 3.5 262K tokens·$1.50/1M	262K tokens	$1.50	$7.50	✅	✅	✅
Mistral: Mistral Nemo 131K tokens·$0.02/1M	131K tokens	$0.02	$0.03	✅	—	✅
Mistral: Mistral Small 3.1 24B 128K tokens·$0.35/1M	128K tokens	$0.35	$0.56	✅	✅	✅
Mistral: Mistral Small 3.2 24B 128K tokens·$0.07/1M	128K tokens	$0.07	$0.20	✅	✅	✅
Mistral: Mistral Small 4 262K tokens·$0.15/1M	262K tokens	$0.15	$0.60	✅	✅	✅
Mistral: Mistral Small Creative 33K tokens·$0.10/1M	33K tokens	$0.10	$0.30	✅	—	✅
Mistral: Mixtral 8x22B Instruct 66K tokens·$2.00/1M	66K tokens	$2.00	$6.00	✅	—	✅
Mistral: Mixtral 8x7B Instruct 33K tokens·$0.45/1M	33K tokens	$0.45	$0.70	✅	—	✅
Mistral: Pixtral Large 2411 131K tokens·$2.00/1M	131K tokens	$2.00	$6.00	—	✅	✅
Mistral: Saba 33K tokens00	33K tokens	—	—	✅	—	✅
Mistral: Voxtral Small 24B 2507 32K tokens·$0.10/1M	32K tokens	$0.10	$0.30	✅	✅	✅

Moonshot AI

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Kimi K2 131K tokens·$0.58/1M	131K tokens	$0.58	$2.40	—	—	✅

MoonshotAI

4 models

Model	Context	Input Price	Output Price	OS	MM	API
MoonshotAI: Kimi K2 0711 131K tokens·$0.57/1M	131K tokens	$0.57	$2.30	✅	—	✅
MoonshotAI: Kimi K2 0905 262K tokens·$0.60/1M	262K tokens	$0.60	$2.50	✅	—	✅
MoonshotAI: Kimi K2.5 262K tokens·$0.60/1M	262K tokens	$0.60	$3.00	✅	✅	✅
MoonshotAI: Kimi K2.6 262K tokens·$0.95/1M	262K tokens	$0.95	$4.00	✅	✅	✅

Morph

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Morph: Morph V3 Fast 82K tokens·$0.80/1M	82K tokens	$0.80	$1.20	—	—	✅
Morph: Morph V3 Large 262K tokens·$0.90/1M	262K tokens	$0.90	$1.90	—	—	✅

Motif Technologies

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Motif-2-12.7B-Reasoning 0	—	—	—	—	—	✅

Multiverse Computing

1 model

Model	Context	Input Price	Output Price	OS	MM	API
HyperNova 60B 2605 $0.04/1M	—	$0.04	$0.14	—	—	✅

MythoMax 13B

1 model

Model	Context	Input Price	Output Price	OS	MM	API
MythoMax 13B 4K tokens·$0.06/1M	4K tokens	$0.06	$0.06	✅	—	✅

NVIDIA

18 models

Model	Context	Input Price	Output Price	OS	MM	API
Llama 3.1 Nemotron 70B Instruct 131K tokens·$1.20/1M	131K tokens	$1.20	$1.20	✅	—	✅
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) 0	—	—	—	—	—	✅
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) $0.60/1M	—	$0.60	$1.80	—	—	✅
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) 0	—	—	—	—	—	✅
Llama 3.3 Nemotron Super 49B v1 (Reasoning) 0	—	—	—	—	—	✅
Llama Nemotron Super 49B v1.5 (Non-reasoning) $0.40/1M	—	$0.40	$0.40	—	—	✅
Llama Nemotron Super 49B v1.5 (Reasoning) $0.40/1M	—	$0.40	$0.40	—	—	✅
Nemotron 3 Nano Omni 30B A3B Reasoning $0.07/1M	—	$0.07	$0.30	—	—	✅
Nemotron 3 Ultra 550B A55B (Reasoning) 1.0M tokens·$0.68/1M	1.0M tokens	$0.68	$2.67	—	—	✅
Nemotron Cascade 2 30B A3B 0	—	—	—	—	—	✅
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) 262K tokens·$0.05/1M	262K tokens	$0.05	$0.20	—	—	✅
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) $0.05/1M	—	$0.05	$0.20	—	—	✅
NVIDIA Nemotron 3 Nano 4B 0	—	—	—	—	—	✅
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) 1.0M tokens·$0.25/1M	1.0M tokens	$0.25	$0.78	—	—	✅
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) $0.20/1M	—	$0.20	$0.60	—	—	✅
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) $0.20/1M	—	$0.20	$0.60	—	—	✅
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) 131K tokens·$0.05/1M	131K tokens	$0.05	$0.20	—	—	✅
NVIDIA Nemotron Nano 9B V2 (Reasoning) $0.04/1M	—	$0.04	$0.16	—	—	✅

Nanbeige

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Nanbeige4.1-3B 0	—	—	—	—	—	✅

Naver

1 model

Model	Context	Input Price	Output Price	OS	MM	API
HyperCLOVA X SEED Think (32B) 0	—	—	—	—	—	✅

Nex AGI

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Nex AGI: DeepSeek V3.1 Nex N1 131K tokens·$0.14/1M	131K tokens	$0.14	$0.50	✅	—	✅
Nex-N2-Pro 262K tokens·$0.50/1M	262K tokens	$0.50	$2.50	—	—	✅

Nous

4 models

Model	Context	Input Price	Output Price	OS	MM	API
Nous: Hermes 3 405B Instruct 131K tokens·$1.00/1M	131K tokens	$1.00	$1.00	✅	—	✅
Nous: Hermes 3 70B Instruct 131K tokens·$0.30/1M	131K tokens	$0.30	$0.30	✅	—	✅
Nous: Hermes 4 405B 131K tokens·$1.00/1M	131K tokens	$1.00	$3.00	✅	—	✅
Nous: Hermes 4 70B 131K tokens·$0.13/1M	131K tokens	$0.13	$0.40	✅	—	✅

Nous Research

7 models

Model	Context	Input Price	Output Price	OS	MM	API
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) 0	—	—	—	—	—	✅
DeepHermes 3 - Mistral 24B Preview (Non-reasoning) 0	—	—	—	—	—	✅
Hermes 3 - Llama-3.1 70B $0.70/1M	—	$0.70	$0.70	—	—	✅
Hermes 4 - Llama-3.1 405B (Non-reasoning) $1.00/1M	—	$1.00	$3.00	—	—	✅
Hermes 4 - Llama-3.1 405B (Reasoning) $1.00/1M	—	$1.00	$3.00	—	—	✅
Hermes 4 - Llama-3.1 70B (Non-reasoning) $0.13/1M	—	$0.13	$0.40	—	—	✅
Hermes 4 - Llama-3.1 70B (Reasoning) $0.13/1M	—	$0.13	$0.40	—	—	✅

NousResearch

1 model

Model	Context	Input Price	Output Price	OS	MM	API
NousResearch: Hermes 2 Pro - Llama-3 8B 8K tokens·$0.14/1M	8K tokens	$0.14	$0.14	✅	—	✅

OpenAI

102 models

Model	Context	Input Price	Output Price	OS	MM	API
Codex	—	—	—	—	—	✅
Deep Research	—	—	—	—	—	✅
GPT Audio 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	✅	✅
GPT Audio Mini 128K tokens·$0.60/1M	128K tokens	$0.60	$2.40	—	✅	✅
GPT Chat Latest 400K tokens·$5.00/1M	400K tokens	$5.00	$30.00	—	✅	✅
GPT Image 2	—	—	—	—	✅	✅
GPT-3.5 Turbo 16K tokens·$0.50/1M	16K tokens	$0.50	$1.50	—	—	✅
GPT-3.5 Turbo $0.50/1M	—	$0.50	$1.50	—	—	✅
GPT-3.5 Turbo (0613) 0	—	—	—	—	—	✅
GPT-4 Turbo 128K tokens·$10.00/1M	128K tokens	$10.00	$30.00	—	✅	✅
GPT-4 Turbo Preview 128K tokens·$10.00/1M	128K tokens	$10.00	$30.00	—	—	✅
GPT-4.1 1.0M tokens·$2.00/1M	1.0M tokens	$2.00	$8.00	—	✅	✅
GPT-4.1 Mini 1.0M tokens·$0.40/1M	1.0M tokens	$0.40	$1.60	—	✅	✅
GPT-4.1 Nano 1.0M tokens·$0.10/1M	1.0M tokens	$0.10	$0.40	—	✅	✅
GPT-4.5 (Preview) 0	—	—	—	—	—	✅
GPT-4o (2024-08-06) 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	✅	✅
GPT-4o (2024-11-20) 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	✅	✅
GPT-4o (ChatGPT) 0	—	—	—	—	—	✅
GPT-4o (March 2025, chatgpt-4o-latest) 0	—	—	—	—	—	✅
GPT-4o Audio 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	✅	✅
GPT-4o mini Realtime (Dec '24) 0	—	—	—	—	—	✅
GPT-4o Realtime (Dec '24) 0	—	—	—	—	—	✅
GPT-4o Search Preview 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	—	✅
GPT-4o Transcribe	—	—	—	—	✅	✅
GPT-4o-mini (2024-07-18) 128K tokens·$0.15/1M	128K tokens	$0.15	$0.60	—	✅	✅
GPT-4o-mini Search Preview 128K tokens·$0.15/1M	128K tokens	$0.15	$0.60	—	—	✅
GPT-5 400K tokens·$1.25/1M	400K tokens	$1.25	$10.00	—	✅	✅
GPT-5 (ChatGPT) $1.25/1M	—	$1.25	$10.00	—	—	✅
GPT-5 (minimal) $1.25/1M	—	$1.25	$10.00	—	—	✅
GPT-5 Chat 128K tokens·$1.25/1M	128K tokens	$1.25	$10.00	—	✅	✅
GPT-5 Codex 400K tokens·$1.25/1M	400K tokens	$1.25	$10.00	—	✅	✅
GPT-5 Image 400K tokens·$10.00/1M	400K tokens	$10.00	$10.00	—	✅	✅
GPT-5 Image Mini 400K tokens·$2.50/1M	400K tokens	$2.50	$2.00	—	✅	✅
GPT-5 Mini 400K tokens·$0.25/1M	400K tokens	$0.25	$2.00	—	✅	✅
GPT-5 mini (minimal) $0.25/1M	—	$0.25	$2.00	—	—	✅
GPT-5 Nano 400K tokens·$0.05/1M	400K tokens	$0.05	$0.40	—	✅	✅
GPT-5 nano (minimal) $0.05/1M	—	$0.05	$0.40	—	—	✅
GPT-5 Pro 400K tokens·$15.00/1M	400K tokens	$15.00	$120.00	—	✅	✅
GPT-5.1 400K tokens·$1.25/1M	400K tokens	$1.25	$10.00	—	✅	✅
GPT-5.1 Chat 128K tokens·$1.25/1M	128K tokens	$1.25	$10.00	—	✅	✅
GPT-5.1-Codex 400K tokens·$1.25/1M	400K tokens	$1.25	$10.00	—	✅	✅
GPT-5.1-Codex-Max 400K tokens·$1.25/1M	400K tokens	$1.25	$10.00	—	✅	✅
GPT-5.1-Codex-Mini 400K tokens·$0.25/1M	400K tokens	$0.25	$2.00	—	✅	✅
GPT-5.2 400K tokens·$1.75/1M	400K tokens	$1.75	$14.00	—	✅	✅
GPT-5.2 Pro 400K tokens·$21.00/1M	400K tokens	$21.00	$168.00	—	✅	✅
GPT-5.2-Codex 400K tokens·$1.75/1M	400K tokens	$1.75	$14.00	—	✅	✅
GPT-5.3 Chat 128K tokens·$1.75/1M	128K tokens	$1.75	$14.00	—	✅	✅
GPT-5.3-Codex 400K tokens·$1.75/1M	400K tokens	$1.75	$14.00	—	✅	✅
GPT-5.4 1.1M tokens·$2.50/1M	1.1M tokens	$2.50	$15.00	—	✅	✅
GPT-5.4 Image 2 272K tokens·$8.00/1M	272K tokens	$8.00	$15.00	—	✅	✅
GPT-5.4 Mini 400K tokens·$0.75/1M	400K tokens	$0.75	$4.50	—	✅	✅
GPT-5.4 Nano 400K tokens·$0.20/1M	400K tokens	$0.20	$1.25	—	✅	✅
GPT-5.4 Pro 1.1M tokens·$30.00/1M	1.1M tokens	$30.00	$180.00	—	✅	✅
GPT-5.5 1.1M tokens·$5.00/1M	1.1M tokens	$5.00	$30.00	—	✅	✅
GPT-5.5 Instant (June 2026) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.5 Instant (May 2026) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.5 Pro 1.1M tokens00	1.1M tokens	—	—	—	✅	✅
GPT-5.6 Luna (high) $1.00/1M	—	$1.00	$6.00	—	—	✅
GPT-5.6 Luna (low) $1.00/1M	—	$1.00	$6.00	—	—	✅
GPT-5.6 Luna (max) 1.1M tokens·$1.00/1M	1.1M tokens	$1.00	$6.00	—	—	✅
GPT-5.6 Luna (medium) $1.00/1M	—	$1.00	$6.00	—	—	✅
GPT-5.6 Luna (Non-reasoning) $1.00/1M	—	$1.00	$6.00	—	—	✅
GPT-5.6 Luna (xhigh) $1.00/1M	—	$1.00	$6.00	—	—	✅
GPT-5.6 Sol (high) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.6 Sol (low) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.6 Sol (max) 1.1M tokens·$5.00/1M	1.1M tokens	$5.00	$30.00	—	—	✅
GPT-5.6 Sol (medium) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.6 Sol (Non-reasoning) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.6 Sol (xhigh) $5.00/1M	—	$5.00	$30.00	—	—	✅
GPT-5.6 Terra (high) $2.50/1M	—	$2.50	$15.00	—	—	✅
GPT-5.6 Terra (low) $2.50/1M	—	$2.50	$15.00	—	—	✅
GPT-5.6 Terra (max) 1.1M tokens·$2.50/1M	1.1M tokens	$2.50	$15.00	—	—	✅
GPT-5.6 Terra (medium) $2.50/1M	—	$2.50	$15.00	—	—	✅
GPT-5.6 Terra (Non-reasoning) $2.50/1M	—	$2.50	$15.00	—	—	✅
GPT-5.6 Terra (xhigh) $2.50/1M	—	$2.50	$15.00	—	—	✅
gpt-oss-120b 131K tokens·$0.15/1M	131K tokens	$0.15	$0.60	—	—	✅
gpt-oss-20b 131K tokens·$0.05/1M	131K tokens	$0.05	$0.20	—	—	✅
gpt-oss-safeguard-20b 131K tokens·$0.07/1M	131K tokens	$0.07	$0.30	—	—	✅
GPT-Realtime-Whisper	—	—	—	—	✅	✅
o1 200K tokens·$15.00/1M	200K tokens	$15.00	$60.00	—	✅	✅
o1-mini 0	—	—	—	—	—	✅
o1-preview $16.50/1M	—	$16.50	$66.00	—	—	✅
o1-pro 200K tokens·$150.00/1M	200K tokens	$150.00	$600.00	—	✅	✅
o3 200K tokens·$2.00/1M	200K tokens	$2.00	$8.00	—	✅	✅
o3 Deep Research 200K tokens·$10.00/1M	200K tokens	$10.00	$40.00	—	✅	✅
o3 Mini 200K tokens·$1.10/1M	200K tokens	$1.10	$4.40	—	✅	✅
o3 Mini High 200K tokens·$1.10/1M	200K tokens	$1.10	$4.40	—	✅	✅
o3 Pro 200K tokens·$20.00/1M	200K tokens	$20.00	$80.00	—	✅	✅
o4 Mini 200K tokens·$1.10/1M	200K tokens	$1.10	$4.40	—	✅	✅
o4 Mini Deep Research 200K tokens·$2.00/1M	200K tokens	$2.00	$8.00	—	✅	✅
o4 Mini High 200K tokens·$1.10/1M	200K tokens	$1.10	$4.40	—	✅	✅
OpenAI: GPT-3.5 Turbo 16k 16K tokens·$3.00/1M	16K tokens	$3.00	$4.00	—	—	✅
OpenAI: GPT-4 8K tokens·$30.00/1M	8K tokens	$30.00	$60.00	—	—	✅
OpenAI: GPT-4 Turbo (older v1106) 128K tokens·$10.00/1M	128K tokens	$10.00	$30.00	—	—	✅
OpenAI: GPT-4o 128K tokens·$2.50/1M	128K tokens	$2.50	$10.00	—	✅	✅
OpenAI: GPT-4o (2024-05-13) 128K tokens·$5.00/1M	128K tokens	$5.00	$15.00	—	✅	✅
OpenAI: GPT-4o-mini 128K tokens·$0.15/1M	128K tokens	$0.15	$0.60	—	✅	✅
OpenAI: GPT-5.6 Luna Pro 1.1M tokens·$1.00/1M	1.1M tokens	$1.00	$6.00	—	✅	✅
OpenAI: GPT-5.6 Sol Pro 1.1M tokens·$5.00/1M	1.1M tokens	$5.00	$30.00	—	✅	✅
OpenAI: GPT-5.6 Terra Pro 1.1M tokens·$2.50/1M	1.1M tokens	$2.50	$15.00	—	✅	✅
Sora	—	—	—	—	—	✅
Sora 2	—	—	—	—	✅	✅

OpenBMB

2 models

Model	Context	Input Price	Output Price	OS	MM	API
MiniCPM-V 4.6 1.3B 0	—	—	—	—	—	✅
MiniCPM5-1B (Non-reasoning) 0	—	—	—	—	—	✅

OpenChat

1 model

Model	Context	Input Price	Output Price	OS	MM	API
OpenChat 3.5 (1210) 0	—	—	—	—	—	✅

Perplexity

6 models

Model	Context	Input Price	Output Price	OS	MM	API
Perplexity: Sonar Deep Research 128K tokens·$2.00/1M	128K tokens	$2.00	$8.00	—	—	✅
Perplexity: Sonar Pro Search 200K tokens·$3.00/1M	200K tokens	$3.00	$15.00	—	✅	✅
R1 1776 0	—	—	—	—	—	✅
Sonar 127K tokens00	127K tokens	—	—	—	—	✅
Sonar Reasoning 127K tokens00	127K tokens	—	—	—	—	✅
Sonar Reasoning Pro 128K tokens00	128K tokens	—	—	—	✅	✅

Pika

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Pika 2.5	—	—	—	—	✅	✅

Pika Labs

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Pika 2.1	—	—	—	—	—	✅

Prime Intellect

1 model

Model	Context	Input Price	Output Price	OS	MM	API
INTELLECT-3 131K tokens00	131K tokens	—	—	✅	—	✅

ReMM SLERP 13B

1 model

Model	Context	Input Price	Output Price	OS	MM	API
ReMM SLERP 13B 6K tokens·$0.45/1M	6K tokens	$0.45	$0.65	✅	—	✅

Recraft

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Recraft V4.1	—	—	—	—	✅	✅

Reka Edge

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Reka Edge 16K tokens·$0.10/1M	16K tokens	$0.10	$0.10	✅	✅	✅

Reka Flash 3

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Reka Flash 3 66K tokens·$0.20/1M	66K tokens	$0.20	$0.80	✅	—	✅

Relace

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Relace: Relace Apply 3 256K tokens·$0.85/1M	256K tokens	$0.85	$1.25	—	—	✅
Relace: Relace Search 256K tokens·$1.00/1M	256K tokens	$1.00	$3.00	—	—	✅

Runway

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Runway Gen-3 Alpha	—	—	—	—	—	✅
Runway Gen-4.5	—	—	—	—	✅	✅

Sao10K

4 models

Model	Context	Input Price	Output Price	OS	MM	API
Sao10K: Llama 3 8B Lunaris 8K tokens·$0.04/1M	8K tokens	$0.04	$0.05	✅	—	✅
Sao10K: Llama 3.1 70B Hanami x1 16K tokens·$3.00/1M	16K tokens	$3.00	$3.00	✅	—	✅
Sao10K: Llama 3.1 Euryale 70B v2.2 131K tokens·$0.85/1M	131K tokens	$0.85	$0.85	✅	—	✅
Sao10K: Llama 3.3 Euryale 70B 131K tokens·$0.65/1M	131K tokens	$0.65	$0.75	✅	—	✅

Sao10k

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Sao10k: Llama 3 Euryale 70B v2.1 8K tokens·$1.48/1M	8K tokens	$1.48	$1.48	✅	—	✅

Sarvam

3 models

Model	Context	Input Price	Output Price	OS	MM	API
Sarvam 105B (high) $0.04/1M	—	$0.04	$0.17	—	—	✅
Sarvam 30B $0.03/1M	—	$0.03	$0.11	—	—	✅
Sarvam M (Reasoning) 0	—	—	—	—	—	✅

ServiceNow

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Apriel-v1.5-15B-Thinker 0	—	—	—	—	—	✅
Apriel-v1.6-15B-Thinker 0	—	—	—	—	—	✅

Snowflake

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Arctic Instruct 0	—	—	—	—	—	✅

Stability AI

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Stable Diffusion 3.5 Large	—	—	—	✅	✅	✅
Stable Video Diffusion 3D	—	—	—	✅	—	✅

StepFun

4 models

Model	Context	Input Price	Output Price	OS	MM	API
Step 3.5 Flash $0.10/1M	—	$0.10	$0.30	—	—	✅
Step 3.5 Flash 262K tokens·$0.10/1M	262K tokens	$0.10	$0.30	✅	—	✅
Step 3.7 Flash $0.20/1M	—	$0.20	$1.15	—	—	✅
Step3 VL 10B 0	—	—	—	—	—	✅

Suno

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Suno v4.5	—	—	—	—	✅	—

Swiss AI Initiative

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Apertus 70B Instruct $0.82/1M	—	$0.82	$2.92	—	—	✅
Apertus 8B Instruct $0.10/1M	—	$0.10	$0.20	—	—	✅

TII UAE

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Falcon-H1R-7B 0	—	—	—	—	—	✅

Tencent

3 models

Model	Context	Input Price	Output Price	OS	MM	API
Hy3-preview (Reasoning) 262K tokens·$0.12/1M	262K tokens	$0.12	$0.43	—	—	✅
Hy3-preview (Reasoning) 262K tokens·$0.12/1M	262K tokens	$0.12	$0.43	—	—	✅
Tencent: Hunyuan A13B Instruct 131K tokens·$0.14/1M	131K tokens	$0.14	$0.57	✅	—	✅

TheDrummer

4 models

Model	Context	Input Price	Output Price	OS	MM	API
TheDrummer: Cydonia 24B V4.1 131K tokens·$0.30/1M	131K tokens	$0.30	$0.50	✅	—	✅
TheDrummer: Rocinante 12B 33K tokens·$0.17/1M	33K tokens	$0.17	$0.43	✅	—	✅
TheDrummer: Skyfall 36B V2 33K tokens·$0.55/1M	33K tokens	$0.55	$0.80	✅	—	✅
TheDrummer: UnslopNemo 12B 33K tokens·$0.40/1M	33K tokens	$0.40	$0.40	✅	—	✅

Thinking Machines

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Inkling $1.87/1M	—	$1.87	$4.68	—	—	✅

Tongyi DeepResearch 30B A3B

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Tongyi DeepResearch 30B A3B 131K tokens·$0.09/1M	131K tokens	$0.09	$0.45	✅	—	✅

Trillion Labs

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Tri-21B-Think 0	—	—	—	—	—	✅
Tri-21B-think Preview 0	—	—	—	—	—	✅

Unknown

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Pro 400	—	—	—	—	—	✅

Upstage

6 models

Model	Context	Input Price	Output Price	OS	MM	API
Solar Mini $0.15/1M	—	$0.15	$0.15	—	—	✅
Solar Open 100B (Reasoning) 0	—	—	—	—	—	✅
Solar Pro 2 (Non-reasoning) 0	—	—	—	—	—	✅
Solar Pro 2 (Preview) (Non-reasoning) 0	—	—	—	—	—	✅
Solar Pro 2 (Preview) (Reasoning) 0	—	—	—	—	—	✅
Solar Pro 3 128K tokens00	128K tokens	—	—	—	—	✅

Writer

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Writer: Palmyra X5 1.0M tokens·$0.60/1M	1.0M tokens	$0.60	$6.00	—	—	✅

Xiaomi

7 models

Model	Context	Input Price	Output Price	OS	MM	API
MiMo-V2-Flash (Feb 2026) 0	—	—	—	—	—	✅
MiMo-V2-Flash (Reasoning) 262K tokens·$0.10/1M	262K tokens	$0.10	$0.30	—	—	✅
MiMo-V2-Omni-0327 0	—	—	—	—	—	✅
MiMo-V2.5 $0.14/1M	—	$0.14	$0.28	—	—	✅
Xiaomi: MiMo-V2-Omni 262K tokens00	262K tokens	—	—	—	✅	✅
Xiaomi: MiMo-V2-Pro 1.0M tokens00	1.0M tokens	—	—	—	—	✅
Xiaomi: MiMo-V2.5-Pro 1.0M tokens·$0.43/1M	1.0M tokens	$0.43	$0.87	—	—	✅

Z.ai

18 models

Model	Context	Input Price	Output Price	OS	MM	API
GLM 5V Turbo (Reasoning) 203K tokens00	203K tokens	—	—	—	—	✅
GLM-4.5 (Reasoning) 131K tokens00	131K tokens	—	—	—	—	✅
GLM-4.5-Air $0.17/1M	—	$0.17	$0.98	—	—	✅
GLM-4.5V (Non-reasoning) $0.60/1M	—	$0.60	$1.80	—	—	✅
GLM-4.5V (Reasoning) $0.60/1M	—	$0.60	$1.80	—	—	✅
GLM-4.6 (Non-reasoning) $0.57/1M	—	$0.57	$2.20	—	—	✅
GLM-4.6 (Reasoning) $0.55/1M	—	$0.55	$2.20	—	—	✅
GLM-4.6V (Non-reasoning) $0.30/1M	—	$0.30	$0.90	—	—	✅
GLM-4.6V (Reasoning) $0.30/1M	—	$0.30	$0.90	—	—	✅
GLM-4.7 (Reasoning) $0.60/1M	—	$0.60	$2.20	—	—	✅
GLM-4.7-Flash (Reasoning) $0.07/1M	—	$0.07	$0.40	—	—	✅
GLM-5 (Non-reasoning) $1.00/1M	—	$1.00	$3.20	—	—	✅
GLM-5 (Reasoning) 203K tokens·$1.00/1M	203K tokens	$1.00	$3.20	—	—	✅
GLM-5-Turbo 203K tokens00	203K tokens	—	—	—	—	✅
GLM-5.1 (Non-reasoning) $1.40/1M	—	$1.40	$4.40	—	—	✅
GLM-5.1 (Reasoning) $1.40/1M	—	$1.40	$4.40	—	—	✅
GLM-5.2 (max) $1.40/1M	—	$1.40	$4.40	—	—	✅
Z.ai: GLM 4 32B 128K tokens·$0.10/1M	128K tokens	$0.10	$0.10	—	—	✅

anthropic

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Claude Sonnet 5 1.0M tokens·$2.00/1M	1.0M tokens	$2.00	$10.00	—	—	✅

cohere

1 model

Model	Context	Input Price	Output Price	OS	MM	API
command	—	—	—	—	—	✅

deepseek

1 model

Model	Context	Input Price	Output Price	OS	MM	API
DeepSeek V4	—	—	—	—	—	✅

google

2 models

Model	Context	Input Price	Output Price	OS	MM	API
Gemini 3.1	—	—	—	—	—	✅
Gemini 3.6	—	—	—	—	—	✅

openai

1 model

Model	Context	Input Price	Output Price	OS	MM	API
GPT-5.6	—	—	—	—	—	✅

xAI

17 models

Model	Context	Input Price	Output Price	OS	MM	API
Grok 2 (Dec '24) 0	—	—	—	—	—	✅
Grok 3 131K tokens·$4.00/1M	131K tokens	$4.00	$20.00	—	—	✅
Grok 3 Beta 131K tokens·$3.00/1M	131K tokens	$3.00	$15.00	—	—	✅
Grok 3 Mini 131K tokens·$0.30/1M	131K tokens	$0.30	$0.50	—	—	✅
Grok 3 Mini Beta 131K tokens·$0.30/1M	131K tokens	$0.30	$0.50	—	—	✅
Grok 4 256K tokens·$5.50/1M	256K tokens	$5.50	$27.50	—	✅	✅
Grok 4 Fast 2.0M tokens·$0.20/1M	2.0M tokens	$0.20	$0.50	—	✅	✅
Grok 4.1 Fast 2.0M tokens00	2.0M tokens	—	—	—	✅	✅
Grok 4.20 2.0M tokens·$2.00/1M	2.0M tokens	$2.00	$6.00	—	✅	✅
Grok 4.20 0309 (Reasoning) $2.00/1M	—	$2.00	$6.00	—	—	✅
Grok 4.20 Multi-Agent 2.0M tokens·$1.25/1M	2.0M tokens	$1.25	$2.50	—	✅	✅
Grok 4.3 1.0M tokens·$1.25/1M	1.0M tokens	$1.25	$2.50	—	✅	✅
Grok Beta 0	—	—	—	—	—	✅
Grok Build 0.1 0616 $1.00/1M	—	$1.00	$2.00	—	—	✅
Grok Code Fast 1 256K tokens00	256K tokens	—	—	—	—	✅
Grok-1 0	—	—	—	—	—	✅
xAI: Grok Build 0.1 256K tokens·$1.00/1M	256K tokens	$1.00	$2.00	—	✅	✅

xai

1 model

Model	Context	Input Price	Output Price	OS	MM	API
Grok 4.5 500K tokens·$2.00/1M	500K tokens	$2.00	$6.00	—	—	✅

Guide to AI Models in 2026

The AI model ecosystem in 2026 is dominated by four major families: GPT from OpenAI, Claude from Anthropic, Gemini from Google, and Llama from Meta. Each family has models of different sizes and specializations, with varying prices and capabilities for different use cases.

GPT Family (OpenAI)

OpenAI offers the GPT-4o line as its main model, with variants at different costs and speeds. GPT-4o-mini is the most affordable option with excellent cost-effectiveness. The OpenAI API is the most widely supported by third-party tools and integrations, making it the default choice for many applications.

Claude Family (Anthropic)

Anthropic positions Claude with a focus on safety and following complex instructions. Claude Opus is the most capable model in the lineup, with a 200K token context window — ideal for analyzing long documents. Claude Haiku is the fastest and cheapest option. Anthropic has a strong presence in enterprise and compliance-sensitive use cases.

Gemini Family (Google)

Gemini is notable for its 1 million token context window — the largest among commercial models — and native integration with the Google ecosystem (Search, Workspace, Cloud). Gemini Flash is the most affordable option with exceptional speed.

Open Source: Llama, Qwen & DeepSeek

The open source segment has advanced significantly. Meta AI released Llama 4 with competitive performance in certain tasks. Alibaba maintains the Qwen family with a focus on multilingual support. DeepSeek surprised with frontier performance at substantially lower cost than equivalent proprietary models.

Frequently Asked Questions

What is the difference between GPT-4o and Claude Opus?

GPT-4o from OpenAI and Claude Opus from Anthropic are both frontier models with similar capabilities. GPT-4o has better speed and integration with the OpenAI ecosystem. Claude Opus excels at tasks with long context and complex reasoning.

What is context window in AI models?

Context window is the maximum amount of text the model can process in a single request, measured in tokens (approximately 4 characters per token in English). Models with larger context windows can analyze complete documents and extensive codebases.

Which AI model is open source?

Open source models include Llama (Meta), Qwen (Alibaba), Mistral, DeepSeek, and Gemma (Google). They are available under licenses that allow use, modification, and self-deployment, without depending on paid APIs.

How does per-token pricing work?

LLMs charge per tokens processed — separated by input tokens (what you send) and output tokens (what the model generates). Prices are in USD per 1 million tokens. Output tokens typically cost 3-5x more than input tokens.

Explore

Benchmark Compare Comparisons Tools Glossary Guides