AI Models 2026: Complete Spec Sheets

Technical specifications, pricing, context windows, and capabilities for 500 AI models from 61 companies. The sheets are updated weekly with data published by each vendor.

For performance ranking, visit the AI Benchmark.

500 Models · 61 Companies · 92 Open Source · 122 Multimodal

AI21 Labs

7 models
ModelOS
AI21: Jamba Large 1.7
256K tokens·$2.00/1M
Jamba 1.5 Large
$2.00/1M
Jamba 1.5 Mini
$0.20/1M
Jamba 1.6 Large
$2.00/1M
Jamba 1.6 Mini
$0.20/1M
Jamba 1.7 Mini
0
Jamba Reasoning 3B
0

AionLabs

3 models
ModelOS
AionLabs: Aion-1.0
131K tokens·$4.00/1M
AionLabs: Aion-2.0
131K tokens·$0.80/1M
AionLabs: Aion-RP 1.0 (8B)
33K tokens·$0.80/1M

AlfredPros

1 model
ModelOS
AlfredPros: CodeLLaMa 7B Instruct Solidity
4K tokens·$0.80/1M

Alibaba

64 models
ModelOS
Qwen Chat 14B
0
Qwen Chat 72B
0
Qwen: Qwen2.5 7B Instruct
33K tokens·$0.04/1M
Qwen: Qwen2.5 VL 72B Instruct
32K tokens·$0.25/1M
Qwen: Qwen3 235B A22B Instruct 2507
262K tokens·$0.20/1M
Qwen: Qwen3 235B A22B Thinking 2507
131K tokens·$0.15/1M
Qwen: Qwen3 30B A3B Instruct 2507
262K tokens·$0.08/1M
Qwen: Qwen3 30B A3B Thinking 2507
131K tokens·$0.08/1M
Qwen: Qwen3 Coder 30B A3B Instruct
160K tokens·$0.19/1M
Qwen: Qwen3 Next 80B A3B Instruct
262K tokens·$0.50/1M
Qwen: Qwen3 VL 235B A22B Instruct
262K tokens·$0.30/1M
Qwen: Qwen3 VL 30B A3B Instruct
131K tokens·$0.20/1M
Qwen: Qwen3 VL 32B Instruct
131K tokens·$0.70/1M
Qwen: Qwen3 VL 8B Instruct
131K tokens·$0.18/1M
Qwen1.5 Chat 110B
0
Qwen2 Instruct 72B
0
Qwen2.5 72B Instruct
33K tokens·$0.36/1M
Qwen2.5 Coder 32B Instruct
33K tokens00
Qwen2.5 Coder Instruct 7B
0
Qwen2.5 Instruct 32B
0
Qwen2.5 Max
$1.60/1M
Qwen3 0.6B (Non-reasoning)
$0.11/1M
Qwen3 0.6B (Reasoning)
$0.11/1M
Qwen3 1.7B (Non-reasoning)
$0.11/1M
Qwen3 1.7B (Reasoning)
$0.11/1M
Qwen3 14B (Non-reasoning)
$0.23/1M
Qwen3 14B (Reasoning)
$0.23/1M
Qwen3 235B A22B (Reasoning)
$0.70/1M
Qwen3 30B A3B (Reasoning)
$0.09/1M
Qwen3 30B A3B 2507 (Reasoning)
$0.28/1M
Qwen3 30B A3B 2507 Instruct
$0.15/1M
Qwen3 32B (Non-reasoning)
$0.15/1M
Qwen3 32B (Reasoning)
$0.20/1M
Qwen3 4B (Non-reasoning)
$0.11/1M
Qwen3 4B (Reasoning)
$0.11/1M
Qwen3 4B 2507 (Reasoning)
0
Qwen3 4B 2507 Instruct
0
Qwen3 8B (Non-reasoning)
$0.18/1M
Qwen3 8B (Reasoning)
$0.11/1M
Qwen3 Coder 480B A35B Instruct
$0.30/1M
Qwen3 Max (Preview)
$1.20/1M
Qwen3 Max Thinking (Preview)
$1.20/1M
Qwen3 Next 80B A3B (Reasoning)
$0.50/1M
Qwen3 Omni 30B A3B (Reasoning)
$0.25/1M
Qwen3 Omni 30B A3B Instruct
$0.25/1M
Qwen3 VL 235B A22B (Reasoning)
$0.84/1M
Qwen3 VL 30B A3B (Reasoning)
$0.20/1M
Qwen3 VL 32B (Reasoning)
$0.70/1M
Qwen3 VL 4B (Reasoning)
0
Qwen3 VL 4B Instruct
0
Qwen3 VL 8B (Reasoning)
$0.18/1M
Qwen3.5 0.8B (Non-reasoning)
$0.01/1M
Qwen3.5 0.8B (Reasoning)
$0.01/1M
Qwen3.5 2B (Reasoning)
$0.02/1M
Qwen3.5 4B (Non-reasoning)
$0.03/1M
Qwen3.5 4B (Reasoning)
$0.03/1M
Qwen3.5 9B (Reasoning)
0
Qwen3.5 Omni Flash
$0.10/1M
Qwen3.5 Omni Plus
$0.40/1M
Qwen3.6 Max Preview
$1.30/1M
Qwen3.7 Max
$2.50/1M
QwQ 32B
$0.66/1M
QwQ 32B-Preview
0
Wan 2.1

Allen Institute for AI

8 models
ModelOS
Llama 3.1 Tulu3 405B
0
Molmo 7B-D
0
Molmo2-8B
0
OLMo 2 32B
0
OLMo 2 7B
0
Olmo 3 7B Instruct
$0.10/1M
Olmo 3 7B Think
0
Olmo 3.1 32B Think
0

AllenAI

2 models
ModelOS
Olmo 3 32B Think
66K tokens00
Olmo 3.1 32B Instruct
66K tokens00

Amazon

13 models
ModelOS
Amazon: Nova 2 Lite
1.0M tokens·$0.30/1M
Amazon: Nova Lite 1.0
300K tokens·$0.06/1M
Amazon: Nova Micro 1.0
128K tokens·$0.04/1M
Amazon: Nova Premier 1.0
1.0M tokens·$2.50/1M
Amazon: Nova Pro 1.0
300K tokens·$0.80/1M
Nova 2.0 Lite (high)
$0.30/1M
Nova 2.0 Omni (low)
$0.30/1M
Nova 2.0 Omni (medium)
$0.30/1M
Nova 2.0 Omni (Non-reasoning)
$0.30/1M
Nova 2.0 Pro Preview (medium)
$1.25/1M
Nova Lite
$0.06/1M
Nova Micro
$0.04/1M
Nova Pro
$0.80/1M

Anthropic

35 models
ModelOS
Anthropic: Claude 3 Haiku
200K tokens·$0.25/1M
Anthropic: Claude Opus 4.8 (Fast)
1.0M tokens·$10.00/1M
Claude 2.0
0
Claude 2.1
0
Claude 3 Opus
$18.75/1M
Claude 3 Sonnet
$3.00/1M
Claude 3.5 Haiku
200K tokens·$1.00/1M
Claude 3.5 Sonnet (June '24)
$3.75/1M
Claude 3.5 Sonnet (Oct '24)
$3.75/1M
Claude 3.7 Sonnet
200K tokens·$3.75/1M
Claude 3.7 Sonnet (thinking)
200K tokens00
Claude 4 Opus (Reasoning)
$18.75/1M
Claude 4 Sonnet (Reasoning)
$3.75/1M
Claude 4.1 Opus (Non-reasoning)
$18.75/1M
Claude 4.1 Opus (Reasoning)
$18.75/1M
Claude 4.5 Haiku (Reasoning)
$1.25/1M
Claude 4.5 Sonnet (Non-reasoning)
$3.75/1M
Claude 4.5 Sonnet (Reasoning)
$3.75/1M
Claude Haiku 4.5
200K tokens·$1.25/1M
Claude Instant
0
Claude Opus 4
200K tokens·$18.75/1M
Claude Opus 4.1
200K tokens·$15.00/1M
Claude Opus 4.5
200K tokens·$6.25/1M
Claude Opus 4.5 (Reasoning)
$6.25/1M
Claude Opus 4.6
1.0M tokens·$6.25/1M
Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
$6.25/1M
Claude Opus 4.6 (Fast)
1.0M tokens·$30.00/1M
Claude Opus 4.7
1.0M tokens·$6.25/1M
Claude Opus 4.7 (Fast)
1.0M tokens·$30.00/1M
Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
1.0M tokens·$6.25/1M
Claude Sonnet 4
1.0M tokens·$3.75/1M
Claude Sonnet 4.5
1.0M tokens·$3.00/1M
Claude Sonnet 4.6
1.0M tokens·$3.75/1M
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
$3.75/1M
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
$3.75/1M

Arcee AI

7 models
ModelOS
Arcee AI: Coder Large
33K tokens·$0.50/1M
Arcee AI: Maestro Reasoning
131K tokens·$0.90/1M
Arcee AI: Spotlight
131K tokens·$0.18/1M
Arcee AI: Trinity Large Thinking
262K tokens·$0.22/1M
Arcee AI: Trinity Mini
131K tokens·$0.04/1M
Arcee AI: Virtuoso Large
131K tokens·$0.75/1M
Trinity Large Thinking
$0.23/1M

Baidu

5 models
ModelOS
Baidu: ERNIE 4.5 21B A3B Thinking
131K tokens·$0.07/1M
Baidu: ERNIE 4.5 300B A47B
123K tokens·$0.28/1M
Baidu: ERNIE 4.5 VL 28B A3B
30K tokens·$0.14/1M
Baidu: ERNIE 4.5 VL 424B A47B
123K tokens·$0.42/1M
ERNIE 5.0 Thinking Preview
0

ByteDance

2 models
ModelOS
ByteDance: UI-TARS 7B
128K tokens·$0.10/1M
Doubao Seed Code
0

ByteDance Seed

4 models
ModelOS
ByteDance Seed: Seed 1.6 Flash
262K tokens·$0.07/1M
ByteDance Seed: Seed-2.0-Lite
262K tokens·$0.25/1M
Doubao Seed Code
0
Seed-OSS-36B-Instruct
$0.21/1M

China Mobile

3 models
ModelOS
JT-35B-Flash
0
JT-35B-Flash
0
JT-MINI
0

Cohere

6 models
ModelOS
Cohere: Command R+ (08-2024)
128K tokens·$2.50/1M
Cohere: Command R7B (12-2024)
128K tokens·$0.04/1M
Command A+
0
Command-R (Mar '24)
$0.50/1M
Command-R+ (Apr '24)
$3.00/1M
Tiny Aya Global
0

Databricks

1 model
ModelOS
DBRX Instruct
0

Deep Cogito

2 models
ModelOS
Cogito v2.1 (Reasoning)
$1.25/1M
Deep Cogito: Cogito v2.1 671B
128K tokens·$1.25/1M

DeepSeek

25 models
ModelOS
DeepSeek Coder V2 Lite Instruct
0
DeepSeek LLM 67B Chat (V1)
0
DeepSeek R1 (Jan '25)
$1.68/1M
DeepSeek R1 0528 Qwen3 8B
0
DeepSeek R1 Distill Llama 8B
0
DeepSeek R1 Distill Qwen 1.5B
0
DeepSeek R1 Distill Qwen 14B
0
DeepSeek V3
131K tokens·$0.23/1M
DeepSeek V3 0324
$1.20/1M
DeepSeek V3.1
164K tokens·$0.56/1M
DeepSeek V3.1 Terminus
164K tokens·$0.27/1M
DeepSeek V3.2
131K tokens·$0.50/1M
DeepSeek V3.2 Exp
164K tokens·$0.27/1M
DeepSeek V3.2 Exp (Non-reasoning)
$0.28/1M
DeepSeek V3.2 Exp (Reasoning)
$0.28/1M
DeepSeek V3.2 Speciale
164K tokens00
DeepSeek V4 Flash
1.0M tokens·$0.14/1M
DeepSeek V4 Pro
1.0M tokens·$0.43/1M
DeepSeek-Coder-V2
0
DeepSeek-V2-Chat
0
DeepSeek-V2.5
0
DeepSeek-V2.5 (Dec '24)
0
DeepSeek: R1
164K tokens·$0.70/1M
DeepSeek: R1 Distill Qwen 32B
128K tokens00
R1 Distill Llama 70B
131K tokens·$0.70/1M

EssentialAI

1 model
ModelOS
EssentialAI: Rnj 1 Instruct
33K tokens·$0.15/1M

Goliath 120B

1 model
ModelOS
Goliath 120B
6K tokens·$3.75/1M

Google

61 models
ModelOS
Gemini 1.0 Pro
0
Gemini 1.0 Ultra
0
Gemini 1.5 Flash (May '24)
0
Gemini 1.5 Flash (Sep '24)
0
Gemini 1.5 Flash-8B
0
Gemini 1.5 Pro (May '24)
0
Gemini 1.5 Pro (Sep '24)
0
Gemini 2.0 Flash
1.0M tokens·$0.15/1M
Gemini 2.0 Flash (experimental)
0
Gemini 2.0 Flash Lite
1.0M tokens·$0.07/1M
Gemini 2.0 Flash Thinking Experimental (Dec '24)
0
Gemini 2.0 Flash Thinking Experimental (Jan '25)
0
Gemini 2.0 Flash-Lite (Feb '25)
0
Gemini 2.0 Flash-Lite (Preview)
0
Gemini 2.0 Pro Experimental (Feb '25)
0
Gemini 2.5 Flash
1.0M tokens·$0.30/1M
Gemini 2.5 Flash Lite
1.0M tokens·$0.10/1M
Gemini 2.5 Flash Preview (Non-reasoning)
0
Gemini 2.5 Flash Preview (Reasoning)
0
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)
0
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.10/1M
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.10/1M
Gemini 2.5 Pro
1.0M tokens·$1.25/1M
Gemini 2.5 Pro Preview (Mar '25)
0
Gemini 2.5 Pro Preview (May '25)
$1.25/1M
Gemini 2.5 Pro Preview 05-06
1.0M tokens·$1.25/1M
Gemini 2.5 Pro Preview 06-05
1.0M tokens·$1.25/1M
Gemini 3 Deep Think
0
Gemini 3 Flash Preview
1.0M tokens·$0.50/1M
Gemini 3 Flash Preview (Non-reasoning)
$0.50/1M
Gemini 3 Flash Preview (Reasoning)
$0.50/1M
Gemini 3 Pro Preview (high)
$2.00/1M
Gemini 3 Pro Preview (low)
$2.00/1M
Gemini 3.1 Flash Lite
1.0M tokens·$0.25/1M
Gemini 3.1 Flash Lite Preview
1.0M tokens·$0.25/1M
Gemini 3.1 Pro Preview
1.0M tokens·$2.00/1M
Gemini 3.1 Pro Preview Custom Tools
1.0M tokens·$2.00/1M
Gemini 3.5 Flash (minimal)
$1.50/1M
Gemma 2 27B
8K tokens·$0.65/1M
Gemma 3 12B
131K tokens·$0.09/1M
Gemma 3 1B Instruct
0
Gemma 3 270M
0
Gemma 3 27B
131K tokens·$0.11/1M
Gemma 3 4B
131K tokens·$0.04/1M
Gemma 3n 4B
33K tokens·$0.06/1M
Gemma 3n E2B Instruct
0
Gemma 3n E4B Instruct
$0.02/1M
Gemma 3n E4B Instruct Preview (May '25)
0
Gemma 4 26B A4B
262K tokens·$0.13/1M
Gemma 4 31B
262K tokens·$0.14/1M
Gemma 4 E2B (Non-reasoning)
0
Gemma 4 E2B (Reasoning)
0
Gemma 4 E4B (Non-reasoning)
0
Gemma 4 E4B (Reasoning)
0
Google: Gemini 3.5 Flash
1.0M tokens·$1.50/1M
Lyria 3 Clip Preview
1.0M tokens
Lyria 3 Pro Preview
1.0M tokens
Nano Banana (Gemini 2.5 Flash Image)
33K tokens·$0.30/1M
Nano Banana 2 (Gemini 3.1 Flash Image Preview)
131K tokens·$0.50/1M
Nano Banana Pro (Gemini 3 Pro Image Preview)
66K tokens·$2.00/1M
PaLM 2
0

IBM

10 models
ModelOS
Granite 3.3 8B (Non-reasoning)
$0.03/1M
Granite 4.0 1B
0
Granite 4.0 350M
0
Granite 4.0 H 1B
0
Granite 4.0 H 350M
0
Granite 4.0 H Small
$0.06/1M
Granite 4.0 Micro
131K tokens00
Granite 4.1 30B
0
Granite 4.1 3B
0
Granite 4.1 8B
$0.05/1M

Inception

1 model
ModelOS
Inception: Mercury 2
128K tokens·$0.25/1M

Inclusion AI

2 models
ModelOS
Ling 2.6 Flash
$0.10/1M
Ling-2.6-1T
$0.30/1M

InclusionAI

6 models
ModelOS
Ling-1T
0
Ling-flash-2.0
$0.14/1M
Ling-mini-2.0
0
Ring-1T
0
Ring-2.6-1T
$0.30/1M
Ring-flash-2.0
$0.14/1M

Inflection

2 models
ModelOS
Inflection: Inflection 3 Pi
8K tokens·$2.50/1M
Inflection: Inflection 3 Productivity
8K tokens·$2.50/1M

Kimi

2 models
ModelOS
Kimi K2 Thinking
262K tokens·$0.60/1M
Kimi Linear 48B A3B Instruct
0

Korea Telecom

2 models
ModelOS
Mi:dm K 2.5 Pro
0
Mi:dm K 2.5 Pro Preview
0

Kuaishou

1 model
ModelOS
Kling AI 2.0

KwaiKAT

1 model
ModelOS
KAT-Coder-Pro V1
$0.30/1M

Kwaipilot

1 model
ModelOS
Kwaipilot: KAT-Coder-Pro V2
256K tokens·$0.30/1M

LG AI

2 models
ModelOS
EXAONE 4.5 33B
0
K-EXAONE (Reasoning)
0

LG AI Research

3 models
ModelOS
EXAONE 4.0 1.2B (Non-reasoning)
0
EXAONE 4.0 32B (Non-reasoning)
0
EXAONE 4.0 32B (Reasoning)
0

Liquid AI

7 models
ModelOS
LFM 40B
0
LFM2 1.2B
0
LFM2 2.6B
0
LFM2 8B A1B
0
LFM2.5-1.2B-Instruct
0
LFM2.5-1.2B-Thinking
0
LFM2.5-VL-1.6B
0

LiquidAI

1 model
ModelOS
LFM2-24B-A2B
33K tokens·$0.03/1M

LongCat

1 model
ModelOS
LongCat Flash Lite
0

Luma AI

1 model
ModelOS
Luma Dream Machine 1.6

MBZUAI Institute of Foundation Models

3 models
ModelOS
K2 Think V2
0
K2-V2 (high)
0
K2-V2 (medium)
0

Magnum v4 72B

1 model
ModelOS
Magnum v4 72B
16K tokens·$3.00/1M

Mancer

1 model
ModelOS
Mancer: Weaver (alpha)
8K tokens·$0.75/1M

Meta

19 models
ModelOS
Llama 2 Chat 13B
0
Llama 2 Chat 70B
0
Llama 2 Chat 7B
$0.05/1M
Llama 3 70B Instruct
8K tokens·$0.65/1M
Llama 3 8B Instruct
8K tokens·$0.04/1M
Llama 3.1 70B Instruct
131K tokens·$0.56/1M
Llama 3.1 8B Instruct
16K tokens·$0.10/1M
Llama 3.1 Instruct 405B
$2.75/1M
Llama 3.2 11B Vision Instruct
131K tokens·$0.24/1M
Llama 3.2 1B Instruct
60K tokens·$0.05/1M
Llama 3.2 3B Instruct
80K tokens·$0.15/1M
Llama 3.2 Instruct 90B (Vision)
$1.38/1M
Llama 3.3 70B Instruct
131K tokens·$0.58/1M
Llama 4 Maverick
1.0M tokens·$0.35/1M
Llama 4 Scout
10.0M tokens·$0.17/1M
Llama 65B
0
Llama Guard 3 8B
131K tokens·$0.48/1M
Llama Guard 4 12B
164K tokens·$0.18/1M
Muse Spark
0

Microsoft

5 models
ModelOS
Microsoft: Phi 4
16K tokens·$0.13/1M
Phi-3 Mini Instruct 3.8B
0
Phi-4 Mini Instruct
0
Phi-4 Multimodal Instruct
0
WizardLM-2 8x22B
66K tokens·$0.62/1M

MiniMax

10 models
ModelOS
Hailuo MiniMax Video-01
MiniMax M1 40k
0
MiniMax M1 80k
$0.55/1M
MiniMax-M2
205K tokens·$0.30/1M
MiniMax: MiniMax M1
1.0M tokens·$0.40/1M
MiniMax: MiniMax M2-her
66K tokens·$0.30/1M
MiniMax: MiniMax M2.1
197K tokens·$0.30/1M
MiniMax: MiniMax M2.5
197K tokens·$0.30/1M
MiniMax: MiniMax M2.7
197K tokens·$0.30/1M
MiniMax: MiniMax-01
1.0M tokens·$0.20/1M

Mistral

21 models
ModelOS
Devstral 2
0
Devstral Small (Jul '25)
131K tokens·$0.10/1M
Devstral Small (May '25)
0
Devstral Small 2
$0.10/1M
Magistral Medium 1
0
Magistral Small 1
0
Magistral Small 1.2
0
Ministral 3 14B
$0.20/1M
Ministral 3 3B
$0.10/1M
Ministral 3 8B
$0.15/1M
Mistral 7B Instruct
$0.20/1M
Mistral Large 2 (Jul '24)
131K tokens·$2.00/1M
Mistral Large 2 (Nov '24)
$2.00/1M
Mistral Large 3
$4.00/1M
Mistral Medium
$2.75/1M
Mistral Small (Feb '24)
$1.00/1M
Mistral Small (Sep '24)
$0.20/1M
Mistral Small 3
$0.07/1M
Mistral Small 3.1
$0.10/1M
Mistral Small 3.2
$0.09/1M
Mixtral 8x22B Instruct
0

Mistral AI

23 models
ModelOS
Magistral Medium 1.2
0
Mistral Large
128K tokens·$2.00/1M
Mistral: Codestral 2508
256K tokens·$0.30/1M
Mistral: Devstral 2 2512
262K tokens·$0.40/1M
Mistral: Devstral Medium
131K tokens·$0.40/1M
Mistral: Devstral Small 1.1
131K tokens·$0.10/1M
Mistral: Ministral 3 14B 2512
262K tokens·$0.20/1M
Mistral: Ministral 3 3B 2512
131K tokens·$0.10/1M
Mistral: Ministral 3 8B 2512
262K tokens·$0.15/1M
Mistral: Mistral 7B Instruct v0.1
3K tokens·$0.11/1M
Mistral: Mistral Medium 3
131K tokens·$0.40/1M
Mistral: Mistral Medium 3.1
131K tokens·$0.40/1M
Mistral: Mistral Medium 3.5
262K tokens·$1.50/1M
Mistral: Mistral Nemo
131K tokens·$0.02/1M
Mistral: Mistral Small 3.1 24B
128K tokens·$0.35/1M
Mistral: Mistral Small 3.2 24B
128K tokens·$0.07/1M
Mistral: Mistral Small 4
262K tokens·$0.20/1M
Mistral: Mistral Small Creative
33K tokens·$0.10/1M
Mistral: Mixtral 8x22B Instruct
66K tokens·$2.00/1M
Mistral: Mixtral 8x7B Instruct
33K tokens·$0.45/1M
Mistral: Pixtral Large 2411
131K tokens·$2.00/1M
Mistral: Saba
33K tokens00
Mistral: Voxtral Small 24B 2507
32K tokens·$0.10/1M

Moonshot AI

1 model
ModelOS
Kimi K2
131K tokens·$0.58/1M

MoonshotAI

4 models
ModelOS
MoonshotAI: Kimi K2 0711
131K tokens·$0.57/1M
MoonshotAI: Kimi K2 0905
262K tokens·$0.60/1M
MoonshotAI: Kimi K2.5
262K tokens·$0.60/1M
MoonshotAI: Kimi K2.6
262K tokens·$0.95/1M

Morph

2 models
ModelOS
Morph: Morph V3 Fast
82K tokens·$0.80/1M
Morph: Morph V3 Large
262K tokens·$0.90/1M

Motif Technologies

1 model
ModelOS
Motif-2-12.7B-Reasoning
0

MythoMax 13B

1 model
ModelOS
MythoMax 13B
4K tokens·$0.06/1M

NVIDIA

17 models
ModelOS
Llama 3.1 Nemotron 70B Instruct
131K tokens·$1.20/1M
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
0
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
$0.60/1M
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
0
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
0
Llama Nemotron Super 49B v1.5 (Non-reasoning)
$0.10/1M
Llama Nemotron Super 49B v1.5 (Reasoning)
$0.10/1M
Nemotron 3 Nano Omni 30B A3B Reasoning
$0.07/1M
Nemotron Cascade 2 30B A3B
0
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
262K tokens·$0.05/1M
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
$0.06/1M
NVIDIA Nemotron 3 Nano 4B
0
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
1.0M tokens·$0.30/1M
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
$0.20/1M
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
$0.20/1M
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
131K tokens·$0.05/1M
NVIDIA Nemotron Nano 9B V2 (Reasoning)
$0.04/1M

Nanbeige

1 model
ModelOS
Nanbeige4.1-3B
0

Naver

1 model
ModelOS
HyperCLOVA X SEED Think (32B)
0

Nex AGI

1 model
ModelOS
Nex AGI: DeepSeek V3.1 Nex N1
131K tokens·$0.14/1M

Nous

4 models
ModelOS
Nous: Hermes 3 405B Instruct
131K tokens·$1.00/1M
Nous: Hermes 3 70B Instruct
131K tokens·$0.30/1M
Nous: Hermes 4 405B
131K tokens·$1.00/1M
Nous: Hermes 4 70B
131K tokens·$0.13/1M

Nous Research

7 models
ModelOS
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
0
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
0
Hermes 3 - Llama-3.1 70B
$0.30/1M
Hermes 4 - Llama-3.1 405B (Non-reasoning)
$1.00/1M
Hermes 4 - Llama-3.1 405B (Reasoning)
$1.00/1M
Hermes 4 - Llama-3.1 70B (Non-reasoning)
$0.13/1M
Hermes 4 - Llama-3.1 70B (Reasoning)
$0.13/1M

NousResearch

1 model
ModelOS
NousResearch: Hermes 2 Pro - Llama-3 8B
8K tokens·$0.14/1M

OpenAI

75 models
ModelOS
GPT Audio
128K tokens·$2.50/1M
GPT Audio Mini
128K tokens·$0.60/1M
GPT Chat Latest
400K tokens·$5.00/1M
GPT-3.5 Turbo
16K tokens·$0.50/1M
GPT-3.5 Turbo
$0.50/1M
GPT-3.5 Turbo (0613)
0
GPT-4 Turbo
128K tokens·$10.00/1M
GPT-4 Turbo Preview
128K tokens·$10.00/1M
GPT-4.1
1.0M tokens·$2.00/1M
GPT-4.1 Mini
1.0M tokens·$0.40/1M
GPT-4.1 Nano
1.0M tokens·$0.10/1M
GPT-4.5 (Preview)
0
GPT-4o (2024-08-06)
128K tokens·$2.50/1M
GPT-4o (2024-11-20)
128K tokens·$2.50/1M
GPT-4o (ChatGPT)
0
GPT-4o (March 2025, chatgpt-4o-latest)
0
GPT-4o Audio
128K tokens·$2.50/1M
GPT-4o mini Realtime (Dec '24)
0
GPT-4o Realtime (Dec '24)
0
GPT-4o Search Preview
128K tokens·$2.50/1M
GPT-4o-mini (2024-07-18)
128K tokens·$0.15/1M
GPT-4o-mini Search Preview
128K tokens·$0.15/1M
GPT-5
400K tokens·$1.25/1M
GPT-5 (ChatGPT)
$1.25/1M
GPT-5 (minimal)
$1.25/1M
GPT-5 Chat
128K tokens·$1.25/1M
GPT-5 Codex
400K tokens·$1.25/1M
GPT-5 Image
400K tokens·$10.00/1M
GPT-5 Image Mini
400K tokens·$2.50/1M
GPT-5 Mini
400K tokens·$0.25/1M
GPT-5 mini (minimal)
$0.25/1M
GPT-5 Nano
400K tokens·$0.05/1M
GPT-5 nano (minimal)
$0.05/1M
GPT-5 Pro
400K tokens·$15.00/1M
GPT-5.1
400K tokens·$1.25/1M
GPT-5.1 Chat
128K tokens·$1.25/1M
GPT-5.1-Codex
400K tokens·$1.25/1M
GPT-5.1-Codex-Max
400K tokens·$1.25/1M
GPT-5.1-Codex-Mini
400K tokens·$0.25/1M
GPT-5.2
400K tokens·$1.75/1M
GPT-5.2 Chat
128K tokens·$1.75/1M
GPT-5.2 Pro
400K tokens·$21.00/1M
GPT-5.2-Codex
400K tokens·$1.75/1M
GPT-5.3 Chat
128K tokens·$1.75/1M
GPT-5.3-Codex
400K tokens·$1.75/1M
GPT-5.4
1.1M tokens·$2.50/1M
GPT-5.4 Image 2
272K tokens·$8.00/1M
GPT-5.4 Mini
400K tokens·$0.75/1M
GPT-5.4 Nano
400K tokens·$0.20/1M
GPT-5.4 Pro
1.1M tokens·$30.00/1M
GPT-5.5
1.1M tokens·$5.00/1M
GPT-5.5 Instant (May 2026)
$5.00/1M
GPT-5.5 Pro
1.1M tokens00
gpt-oss-120b
131K tokens·$0.15/1M
gpt-oss-20b
131K tokens·$0.06/1M
gpt-oss-safeguard-20b
131K tokens·$0.07/1M
o1
200K tokens·$15.00/1M
o1-mini
0
o1-preview
$16.50/1M
o1-pro
200K tokens·$150.00/1M
o3
200K tokens·$2.00/1M
o3 Deep Research
200K tokens·$10.00/1M
o3 Mini
200K tokens·$1.10/1M
o3 Mini High
200K tokens·$1.10/1M
o3 Pro
200K tokens·$20.00/1M
o4 Mini
200K tokens·$1.10/1M
o4 Mini Deep Research
200K tokens·$2.00/1M
o4 Mini High
200K tokens·$1.10/1M
OpenAI: GPT-3.5 Turbo 16k
16K tokens·$3.00/1M
OpenAI: GPT-4
8K tokens·$30.00/1M
OpenAI: GPT-4 Turbo (older v1106)
128K tokens·$10.00/1M
OpenAI: GPT-4o
128K tokens·$2.50/1M
OpenAI: GPT-4o (2024-05-13)
128K tokens·$5.00/1M
OpenAI: GPT-4o-mini
128K tokens·$0.15/1M
Sora

OpenBMB

1 model
ModelOS
MiniCPM-V 4.6 1.3B
0

anthropic

4 models
ModelOS
Claude 3.5
Claude Opus 4.8
Opus 4.7
Opus 4.8

google

1 model
ModelOS
Gemini 3.5

mistral

1 model
ModelOS
Mistral

Guide to AI Models in 2026

The AI model ecosystem in 2026 is dominated by four major families: GPT from OpenAI, Claude from Anthropic, Gemini from Google, and Llama from Meta. Each family has models of different sizes and specializations, with varying prices and capabilities for different use cases.

GPT Family (OpenAI)

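OpenAI's lineup spans several generations. The listing above includes GPT-4o (128K tokens, $2.50/1M), GPT-4.1 (1.0M tokens, $2.00/1M), the GPT-5 series (from 400K tokens for GPT-5 up to 1.1M tokens for GPT-5.4 and GPT-5.5), and the o-series reasoning models. Among the hosted GPT models listed, GPT-5 Nano is the cheapest at $0.05/1M, while GPT-4o-mini is listed at $0.15/1M. The OpenAI API is widely supported by third-party tools and integrations, which makes it a common default choice for applications.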

Claude Family (Anthropic)

Anthropic positions Claude around safety and the ability to follow complex instructions. Claude Opus is the most capable tier in the lineup. In the listing above, Opus 4 through Opus 4.5 have a 200K token context window, and Opus 4.6 and later are listed at 1.0M tokens, which suits the analysis of long documents. Claude Haiku is the fastest and cheapest tier. Anthropic has a strong presence in enterprise and compliance-sensitive use cases.

Gemini Family (Google)

Gemini is notable for its 1 million token context window, among the largest offered by commercial models (the listing above also shows GPT-5.4 at 1.1M tokens and Llama 4 Scout at 10.0M), and for native integration with the Google ecosystem (Search, Workspace, Cloud). The Flash and Flash Lite tiers are the most affordable options and are built for speed.

Open Source: Llama, Qwen & DeepSeek

The open-source segment has advanced significantly. Meta released Llama 4, which is competitive on some tasks. Alibaba maintains the Qwen family with a focus on multilingual support. DeepSeek drew attention by delivering frontier-level performance at substantially lower cost than comparable proprietary models.

Frequently Asked Questions

What is the difference between GPT-4o and Claude Opus?

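GPT-4o from OpenAI and Claude Opus from Anthropic are both frontier-class models with broadly similar capabilities. GPT-4o is generally faster and cheaper (listed above at $2.50/1M with a 128K token window) and benefits from the OpenAI ecosystem of integrations. Claude Opus is priced higher (Opus 4.1 is listed at $15.00/1M with a 200K token window) and is strongest on long-context work and complex reasoning.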

What is a context window in AI models?

The context window is the maximum amount of text a model can process in a single request, measured in tokens (roughly 4 characters per token in English). The limit typically covers the prompt and the generated response together. Models with larger context windows can analyze complete documents and extensive codebases in one pass.
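As a rough illustration, the 4-characters-per-token rule of thumb can be turned into a quick fit check. This is a minimal sketch, not a tokenizer: the function names and the `reserved_for_output` parameter are hypothetical, real token counts depend on each model's tokenizer, and the sketch assumes the response shares the window with the prompt, which is common but provider-specific.

```python
import math

CHARS_PER_TOKEN = 4  # rule of thumb for English text; real tokenizers vary


def estimate_tokens(text: str) -> int:
    """Rough token estimate using the ~4 characters per token heuristic."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_in_context(text: str, context_window: int, reserved_for_output: int = 0) -> bool:
    """Return True if the estimated prompt plus an output reserve fits the window."""
    return estimate_tokens(text) + reserved_for_output <= context_window


document = "x" * 600_000  # stand-in for a 600,000-character document
print(estimate_tokens(document))                                      # 150000
print(fits_in_context(document, 128_000))                             # False
print(fits_in_context(document, 200_000, reserved_for_output=4_000))  # True
```

For anything close to the limit, count tokens with the provider's own tokenizer instead of relying on this estimate.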

Which AI models are open source?

Open source models include Llama (Meta), Qwen (Alibaba), Mistral, DeepSeek, and Gemma (Google). Their weights are published under licenses that allow use, modification, and self-hosting without depending on paid APIs, although the exact terms vary by model.

How does per-token pricing work?

LLM APIs charge per token processed, with separate rates for input tokens (what you send) and output tokens (what the model generates). Prices are quoted in USD per 1 million tokens. Output tokens typically cost 3-5x more than input tokens.
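The billing arithmetic can be sketched in a few lines. The function name and the rates below are hypothetical: $2.00 per 1M input tokens and $8.00 per 1M output tokens were chosen only so that output costs 4x input, inside the 3-5x range mentioned above. They are not the prices of any listed model.

```python
TOKENS_PER_MILLION = 1_000_000


def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Cost of one request, given USD prices per 1 million tokens."""
    input_cost = input_tokens * input_price_per_m
    output_cost = output_tokens * output_price_per_m
    return (input_cost + output_cost) / TOKENS_PER_MILLION


# Hypothetical request: 20,000 tokens sent, 2,000 tokens generated.
cost = request_cost_usd(20_000, 2_000, input_price_per_m=2.00, output_price_per_m=8.00)
print(f"${cost:.4f}")  # $0.0560
```

In this example the input accounts for $0.04 and the output for $0.016, so a long prompt can dominate the bill even when the output rate is higher.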
