Technical specifications, pricing, context window, and capabilities of 500 AI models from 61 companies. Sheets updated weekly with manufacturer data.
For performance ranking, visit the AI Benchmark.
500
Models
61
Companies
92
Open Source
122
Multimodal
| Model | OS |
|---|---|
| AI21: Jamba Large 1.7 256K tokens·$2.00/1M | ✅ |
| Jamba 1.5 Large $2.00/1M | — |
| Jamba 1.5 Mini $0.20/1M | — |
| Jamba 1.6 Large $2.00/1M | — |
| Jamba 1.6 Mini $0.20/1M | — |
| Jamba 1.7 Mini 0 | — |
| Jamba Reasoning 3B 0 | — |
| Model | OS |
|---|---|
| AionLabs: Aion-1.0 131K tokens·$4.00/1M | — |
| AionLabs: Aion-2.0 131K tokens·$0.80/1M | — |
| AionLabs: Aion-RP 1.0 (8B) 33K tokens·$0.80/1M | — |
| Model | OS |
|---|---|
| AlfredPros: CodeLLaMa 7B Instruct Solidity 4K tokens·$0.80/1M | ✅ |
| Model | OS |
|---|---|
| Llama 3.1 Tulu3 405B 0 | — |
| Molmo 7B-D 0 | — |
| Molmo2-8B 0 | — |
| OLMo 2 32B 0 | — |
| OLMo 2 7B 0 | — |
| Olmo 3 7B Instruct $0.10/1M | — |
| Olmo 3 7B Think 0 | — |
| Olmo 3.1 32B Think 0 | — |
| Model | OS |
|---|---|
| Olmo 3 32B Think 66K tokens00 | ✅ |
| Olmo 3.1 32B Instruct 66K tokens00 | ✅ |
| Model | OS |
|---|---|
| Amazon: Nova 2 Lite 1.0M tokens·$0.30/1M | — |
| Amazon: Nova Lite 1.0 300K tokens·$0.06/1M | — |
| Amazon: Nova Micro 1.0 128K tokens·$0.04/1M | — |
| Amazon: Nova Premier 1.0 1.0M tokens·$2.50/1M | — |
| Amazon: Nova Pro 1.0 300K tokens·$0.80/1M | — |
| Nova 2.0 Lite (high) $0.30/1M | — |
| Nova 2.0 Omni (low) $0.30/1M | — |
| Nova 2.0 Omni (medium) $0.30/1M | — |
| Nova 2.0 Omni (Non-reasoning) $0.30/1M | — |
| Nova 2.0 Pro Preview (medium) $1.25/1M | — |
| Nova Lite $0.06/1M | — |
| Nova Micro $0.04/1M | — |
| Nova Pro $0.80/1M | — |
| Model | OS |
|---|---|
| Arcee AI: Coder Large 33K tokens·$0.50/1M | — |
| Arcee AI: Maestro Reasoning 131K tokens·$0.90/1M | — |
| Arcee AI: Spotlight 131K tokens·$0.18/1M | — |
| Arcee AI: Trinity Large Thinking 262K tokens·$0.22/1M | ✅ |
| Arcee AI: Trinity Mini 131K tokens·$0.04/1M | ✅ |
| Arcee AI: Virtuoso Large 131K tokens·$0.75/1M | — |
| Trinity Large Thinking $0.23/1M | — |
| Model | OS |
|---|---|
| Baidu: ERNIE 4.5 21B A3B Thinking 131K tokens·$0.07/1M | ✅ |
| Baidu: ERNIE 4.5 300B A47B 123K tokens·$0.28/1M | ✅ |
| Baidu: ERNIE 4.5 VL 28B A3B 30K tokens·$0.14/1M | ✅ |
| Baidu: ERNIE 4.5 VL 424B A47B 123K tokens·$0.42/1M | ✅ |
| ERNIE 5.0 Thinking Preview 0 | — |
| Model | OS |
|---|---|
| ByteDance: UI-TARS 7B 128K tokens·$0.10/1M | ✅ |
| Doubao Seed Code 0 | — |
| Model | OS |
|---|---|
| ByteDance Seed: Seed 1.6 Flash 262K tokens·$0.07/1M | — |
| ByteDance Seed: Seed-2.0-Lite 262K tokens·$0.25/1M | — |
| Doubao Seed Code 0 | — |
| Seed-OSS-36B-Instruct $0.21/1M | — |
| Model | OS |
|---|---|
| JT-35B-Flash 0 | — |
| JT-35B-Flash 0 | — |
| JT-MINI 0 | — |
| Model | OS |
|---|---|
| Cohere: Command R+ (08-2024) 128K tokens·$2.50/1M | — |
| Cohere: Command R7B (12-2024) 128K tokens·$0.04/1M | — |
| Command A+ 0 | — |
| Command-R (Mar '24) $0.50/1M | — |
| Command-R+ (Apr '24) $3.00/1M | — |
| Tiny Aya Global 0 | — |
| Model | OS |
|---|---|
| DBRX Instruct 0 | — |
| Model | OS |
|---|---|
| Cogito v2.1 (Reasoning) $1.25/1M | — |
| Deep Cogito: Cogito v2.1 671B 128K tokens·$1.25/1M | — |
| Model | OS |
|---|---|
| DeepSeek Coder V2 Lite Instruct 0 | — |
| DeepSeek LLM 67B Chat (V1) 0 | — |
| DeepSeek R1 (Jan '25) $1.68/1M | — |
| DeepSeek R1 0528 Qwen3 8B 0 | — |
| DeepSeek R1 Distill Llama 8B 0 | — |
| DeepSeek R1 Distill Qwen 1.5B 0 | — |
| DeepSeek R1 Distill Qwen 14B 0 | — |
| DeepSeek V3 131K tokens·$0.23/1M | ✅ |
| DeepSeek V3 0324 $1.20/1M | — |
| DeepSeek V3.1 164K tokens·$0.56/1M | ✅ |
| DeepSeek V3.1 Terminus 164K tokens·$0.27/1M | ✅ |
| DeepSeek V3.2 131K tokens·$0.50/1M | ✅ |
| DeepSeek V3.2 Exp 164K tokens·$0.27/1M | ✅ |
| DeepSeek V3.2 Exp (Non-reasoning) $0.28/1M | — |
| DeepSeek V3.2 Exp (Reasoning) $0.28/1M | — |
| DeepSeek V3.2 Speciale 164K tokens00 | ✅ |
| DeepSeek V4 Flash 1.0M tokens·$0.14/1M | ✅ |
| DeepSeek V4 Pro 1.0M tokens·$0.43/1M | ✅ |
| DeepSeek-Coder-V2 0 | — |
| DeepSeek-V2-Chat 0 | — |
| DeepSeek-V2.5 0 | — |
| DeepSeek-V2.5 (Dec '24) 0 | — |
| DeepSeek: R1 164K tokens·$0.70/1M | ✅ |
| DeepSeek: R1 Distill Qwen 32B 128K tokens00 | ✅ |
| R1 Distill Llama 70B 131K tokens·$0.70/1M | ✅ |
| Model | OS |
|---|---|
| EssentialAI: Rnj 1 Instruct 33K tokens·$0.15/1M | ✅ |
| Model | OS |
|---|---|
| Goliath 120B 6K tokens·$3.75/1M | ✅ |
| Model | OS |
|---|---|
| Granite 3.3 8B (Non-reasoning) $0.03/1M | — |
| Granite 4.0 1B 0 | — |
| Granite 4.0 350M 0 | — |
| Granite 4.0 H 1B 0 | — |
| Granite 4.0 H 350M 0 | — |
| Granite 4.0 H Small $0.06/1M | — |
| Granite 4.0 Micro 131K tokens00 | ✅ |
| Granite 4.1 30B 0 | — |
| Granite 4.1 3B 0 | — |
| Granite 4.1 8B $0.05/1M | — |
| Model | OS |
|---|---|
| Inception: Mercury 2 128K tokens·$0.25/1M | — |
| Model | OS |
|---|---|
| Ling 2.6 Flash $0.10/1M | — |
| Ling-2.6-1T $0.30/1M | — |
| Model | OS |
|---|---|
| Ling-1T 0 | — |
| Ling-flash-2.0 $0.14/1M | — |
| Ling-mini-2.0 0 | — |
| Ring-1T 0 | — |
| Ring-2.6-1T $0.30/1M | — |
| Ring-flash-2.0 $0.14/1M | — |
| Model | OS |
|---|---|
| Inflection: Inflection 3 Pi 8K tokens·$2.50/1M | — |
| Inflection: Inflection 3 Productivity 8K tokens·$2.50/1M | — |
| Model | OS |
|---|---|
| Kimi K2 Thinking 262K tokens·$0.60/1M | — |
| Kimi Linear 48B A3B Instruct 0 | — |
| Model | OS |
|---|---|
| Mi:dm K 2.5 Pro 0 | — |
| Mi:dm K 2.5 Pro Preview 0 | — |
| Model | OS |
|---|---|
| Kling AI 2.0 | — |
| Model | OS |
|---|---|
| KAT-Coder-Pro V1 $0.30/1M | — |
| Model | OS |
|---|---|
| Kwaipilot: KAT-Coder-Pro V2 256K tokens·$0.30/1M | — |
| Model | OS |
|---|---|
| EXAONE 4.5 33B 0 | — |
| K-EXAONE (Reasoning) 0 | — |
| Model | OS |
|---|---|
| LFM2-24B-A2B 33K tokens·$0.03/1M | ✅ |
| Model | OS |
|---|---|
| LongCat Flash Lite 0 | — |
| Model | OS |
|---|---|
| Luma Dream Machine 1.6 | — |
| Model | OS |
|---|---|
| K2 Think V2 0 | — |
| K2-V2 (high) 0 | — |
| K2-V2 (medium) 0 | — |
| Model | OS |
|---|---|
| Magnum v4 72B 16K tokens·$3.00/1M | ✅ |
| Model | OS |
|---|---|
| Mancer: Weaver (alpha) 8K tokens·$0.75/1M | — |
| Model | OS |
|---|---|
| Llama 2 Chat 13B 0 | — |
| Llama 2 Chat 70B 0 | — |
| Llama 2 Chat 7B $0.05/1M | — |
| Llama 3 70B Instruct 8K tokens·$0.65/1M | ✅ |
| Llama 3 8B Instruct 8K tokens·$0.04/1M | ✅ |
| Llama 3.1 70B Instruct 131K tokens·$0.56/1M | ✅ |
| Llama 3.1 8B Instruct 16K tokens·$0.10/1M | ✅ |
| Llama 3.1 Instruct 405B $2.75/1M | — |
| Llama 3.2 11B Vision Instruct 131K tokens·$0.24/1M | ✅ |
| Llama 3.2 1B Instruct 60K tokens·$0.05/1M | ✅ |
| Llama 3.2 3B Instruct 80K tokens·$0.15/1M | ✅ |
| Llama 3.2 Instruct 90B (Vision) $1.38/1M | — |
| Llama 3.3 70B Instruct 131K tokens·$0.58/1M | ✅ |
| Llama 4 Maverick 1.0M tokens·$0.35/1M | ✅ |
| Llama 4 Scout 10.0M tokens·$0.17/1M | ✅ |
| Llama 65B 0 | — |
| Llama Guard 3 8B 131K tokens·$0.48/1M | ✅ |
| Llama Guard 4 12B 164K tokens·$0.18/1M | ✅ |
| Muse Spark 0 | — |
| Model | OS |
|---|---|
| Microsoft: Phi 4 16K tokens·$0.13/1M | ✅ |
| Phi-3 Mini Instruct 3.8B 0 | — |
| Phi-4 Mini Instruct 0 | — |
| Phi-4 Multimodal Instruct 0 | — |
| WizardLM-2 8x22B 66K tokens·$0.62/1M | ✅ |
| Model | OS |
|---|---|
| Hailuo MiniMax Video-01 | — |
| MiniMax M1 40k 0 | — |
| MiniMax M1 80k $0.55/1M | — |
| MiniMax-M2 205K tokens·$0.30/1M | — |
| MiniMax: MiniMax M1 1.0M tokens·$0.40/1M | — |
| MiniMax: MiniMax M2-her 66K tokens·$0.30/1M | — |
| MiniMax: MiniMax M2.1 197K tokens·$0.30/1M | ✅ |
| MiniMax: MiniMax M2.5 197K tokens·$0.30/1M | ✅ |
| MiniMax: MiniMax M2.7 197K tokens·$0.30/1M | ✅ |
| MiniMax: MiniMax-01 1.0M tokens·$0.20/1M | ✅ |
| Model | OS |
|---|---|
| Devstral 2 0 | — |
| Devstral Small (Jul '25) 131K tokens·$0.10/1M | — |
| Devstral Small (May '25) 0 | — |
| Devstral Small 2 $0.10/1M | — |
| Magistral Medium 1 0 | — |
| Magistral Small 1 0 | — |
| Magistral Small 1.2 0 | — |
| Ministral 3 14B $0.20/1M | — |
| Ministral 3 3B $0.10/1M | — |
| Ministral 3 8B $0.15/1M | — |
| Mistral 7B Instruct $0.20/1M | — |
| Mistral Large 2 (Jul '24) 131K tokens·$2.00/1M | — |
| Mistral Large 2 (Nov '24) $2.00/1M | — |
| Mistral Large 3 $4.00/1M | — |
| Mistral Medium $2.75/1M | — |
| Mistral Small (Feb '24) $1.00/1M | — |
| Mistral Small (Sep '24) $0.20/1M | — |
| Mistral Small 3 $0.07/1M | — |
| Mistral Small 3.1 $0.10/1M | — |
| Mistral Small 3.2 $0.09/1M | — |
| Mixtral 8x22B Instruct 0 | — |
| Model | OS |
|---|---|
| Kimi K2 131K tokens·$0.58/1M | — |
| Model | OS |
|---|---|
| MoonshotAI: Kimi K2 0711 131K tokens·$0.57/1M | ✅ |
| MoonshotAI: Kimi K2 0905 262K tokens·$0.60/1M | ✅ |
| MoonshotAI: Kimi K2.5 262K tokens·$0.60/1M | ✅ |
| MoonshotAI: Kimi K2.6 262K tokens·$0.95/1M | ✅ |
| Model | OS |
|---|---|
| Morph: Morph V3 Fast 82K tokens·$0.80/1M | — |
| Morph: Morph V3 Large 262K tokens·$0.90/1M | — |
| Model | OS |
|---|---|
| Motif-2-12.7B-Reasoning 0 | — |
| Model | OS |
|---|---|
| MythoMax 13B 4K tokens·$0.06/1M | ✅ |
| Model | OS |
|---|---|
| Llama 3.1 Nemotron 70B Instruct 131K tokens·$1.20/1M | ✅ |
| Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) 0 | — |
| Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) $0.60/1M | — |
| Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) 0 | — |
| Llama 3.3 Nemotron Super 49B v1 (Reasoning) 0 | — |
| Llama Nemotron Super 49B v1.5 (Non-reasoning) $0.10/1M | — |
| Llama Nemotron Super 49B v1.5 (Reasoning) $0.10/1M | — |
| Nemotron 3 Nano Omni 30B A3B Reasoning $0.07/1M | — |
| Nemotron Cascade 2 30B A3B 0 | — |
| NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) 262K tokens·$0.05/1M | — |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) $0.06/1M | — |
| NVIDIA Nemotron 3 Nano 4B 0 | — |
| NVIDIA Nemotron 3 Super 120B A12B (Reasoning) 1.0M tokens·$0.30/1M | — |
| NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) $0.20/1M | — |
| NVIDIA Nemotron Nano 12B v2 VL (Reasoning) $0.20/1M | — |
| NVIDIA Nemotron Nano 9B V2 (Non-reasoning) 131K tokens·$0.05/1M | — |
| NVIDIA Nemotron Nano 9B V2 (Reasoning) $0.04/1M | — |
| Model | OS |
|---|---|
| Nanbeige4.1-3B 0 | — |
| Model | OS |
|---|---|
| HyperCLOVA X SEED Think (32B) 0 | — |
| Model | OS |
|---|---|
| Nex AGI: DeepSeek V3.1 Nex N1 131K tokens·$0.14/1M | ✅ |
| Model | OS |
|---|---|
| Nous: Hermes 3 405B Instruct 131K tokens·$1.00/1M | ✅ |
| Nous: Hermes 3 70B Instruct 131K tokens·$0.30/1M | ✅ |
| Nous: Hermes 4 405B 131K tokens·$1.00/1M | ✅ |
| Nous: Hermes 4 70B 131K tokens·$0.13/1M | ✅ |
| Model | OS |
|---|---|
| NousResearch: Hermes 2 Pro - Llama-3 8B 8K tokens·$0.14/1M | ✅ |
| Model | OS |
|---|---|
| GPT Audio 128K tokens·$2.50/1M | — |
| GPT Audio Mini 128K tokens·$0.60/1M | — |
| GPT Chat Latest 400K tokens·$5.00/1M | — |
| GPT-3.5 Turbo 16K tokens·$0.50/1M | — |
| GPT-3.5 Turbo $0.50/1M | — |
| GPT-3.5 Turbo (0613) 0 | — |
| GPT-4 Turbo 128K tokens·$10.00/1M | — |
| GPT-4 Turbo Preview 128K tokens·$10.00/1M | — |
| GPT-4.1 1.0M tokens·$2.00/1M | — |
| GPT-4.1 Mini 1.0M tokens·$0.40/1M | — |
| GPT-4.1 Nano 1.0M tokens·$0.10/1M | — |
| GPT-4.5 (Preview) 0 | — |
| GPT-4o (2024-08-06) 128K tokens·$2.50/1M | — |
| GPT-4o (2024-11-20) 128K tokens·$2.50/1M | — |
| GPT-4o (ChatGPT) 0 | — |
| GPT-4o (March 2025, chatgpt-4o-latest) 0 | — |
| GPT-4o Audio 128K tokens·$2.50/1M | — |
| GPT-4o mini Realtime (Dec '24) 0 | — |
| GPT-4o Realtime (Dec '24) 0 | — |
| GPT-4o Search Preview 128K tokens·$2.50/1M | — |
| GPT-4o-mini (2024-07-18) 128K tokens·$0.15/1M | — |
| GPT-4o-mini Search Preview 128K tokens·$0.15/1M | — |
| GPT-5 400K tokens·$1.25/1M | — |
| GPT-5 (ChatGPT) $1.25/1M | — |
| GPT-5 (minimal) $1.25/1M | — |
| GPT-5 Chat 128K tokens·$1.25/1M | — |
| GPT-5 Codex 400K tokens·$1.25/1M | — |
| GPT-5 Image 400K tokens·$10.00/1M | — |
| GPT-5 Image Mini 400K tokens·$2.50/1M | — |
| GPT-5 Mini 400K tokens·$0.25/1M | — |
| GPT-5 mini (minimal) $0.25/1M | — |
| GPT-5 Nano 400K tokens·$0.05/1M | — |
| GPT-5 nano (minimal) $0.05/1M | — |
| GPT-5 Pro 400K tokens·$15.00/1M | — |
| GPT-5.1 400K tokens·$1.25/1M | — |
| GPT-5.1 Chat 128K tokens·$1.25/1M | — |
| GPT-5.1-Codex 400K tokens·$1.25/1M | — |
| GPT-5.1-Codex-Max 400K tokens·$1.25/1M | — |
| GPT-5.1-Codex-Mini 400K tokens·$0.25/1M | — |
| GPT-5.2 400K tokens·$1.75/1M | — |
| GPT-5.2 Chat 128K tokens·$1.75/1M | — |
| GPT-5.2 Pro 400K tokens·$21.00/1M | — |
| GPT-5.2-Codex 400K tokens·$1.75/1M | — |
| GPT-5.3 Chat 128K tokens·$1.75/1M | — |
| GPT-5.3-Codex 400K tokens·$1.75/1M | — |
| GPT-5.4 1.1M tokens·$2.50/1M | — |
| GPT-5.4 Image 2 272K tokens·$8.00/1M | — |
| GPT-5.4 Mini 400K tokens·$0.75/1M | — |
| GPT-5.4 Nano 400K tokens·$0.20/1M | — |
| GPT-5.4 Pro 1.1M tokens·$30.00/1M | — |
| GPT-5.5 1.1M tokens·$5.00/1M | — |
| GPT-5.5 Instant (May 2026) $5.00/1M | — |
| GPT-5.5 Pro 1.1M tokens00 | — |
| gpt-oss-120b 131K tokens·$0.15/1M | — |
| gpt-oss-20b 131K tokens·$0.06/1M | — |
| gpt-oss-safeguard-20b 131K tokens·$0.07/1M | — |
| o1 200K tokens·$15.00/1M | — |
| o1-mini 0 | — |
| o1-preview $16.50/1M | — |
| o1-pro 200K tokens·$150.00/1M | — |
| o3 200K tokens·$2.00/1M | — |
| o3 Deep Research 200K tokens·$10.00/1M | — |
| o3 Mini 200K tokens·$1.10/1M | — |
| o3 Mini High 200K tokens·$1.10/1M | — |
| o3 Pro 200K tokens·$20.00/1M | — |
| o4 Mini 200K tokens·$1.10/1M | — |
| o4 Mini Deep Research 200K tokens·$2.00/1M | — |
| o4 Mini High 200K tokens·$1.10/1M | — |
| OpenAI: GPT-3.5 Turbo 16k 16K tokens·$3.00/1M | — |
| OpenAI: GPT-4 8K tokens·$30.00/1M | — |
| OpenAI: GPT-4 Turbo (older v1106) 128K tokens·$10.00/1M | — |
| OpenAI: GPT-4o 128K tokens·$2.50/1M | — |
| OpenAI: GPT-4o (2024-05-13) 128K tokens·$5.00/1M | — |
| OpenAI: GPT-4o-mini 128K tokens·$0.15/1M | — |
| Sora | — |
| Model | OS |
|---|---|
| MiniCPM-V 4.6 1.3B 0 | — |
| Model | OS |
|---|---|
| Claude 3.5 | — |
| Claude Opus 4.8 | — |
| Opus 4.7 | — |
| Opus 4.8 | — |
| Model | OS |
|---|---|
| Gemini 3.5 | — |
| Model | OS |
|---|---|
| Mistral | — |
The AI model ecosystem in 2026 is dominated by four major families: GPT from OpenAI, Claude from Anthropic, Gemini from Google, and Llama from Meta. Each family has models of different sizes and specializations, with varying prices and capabilities for different use cases.
OpenAI offers the GPT-4o line as its main model, with variants at different costs and speeds. GPT-4o-mini is the most affordable option with excellent cost-effectiveness. The OpenAI API is the most widely supported by third-party tools and integrations, making it the default choice for many applications.
Anthropic positions Claude with a focus on safety and following complex instructions. Claude Opus is the most capable model in the lineup, with a 200K token context window — ideal for analyzing long documents. Claude Haiku is the fastest and cheapest option. Anthropic has a strong presence in enterprise and compliance-sensitive use cases.
Gemini is notable for its 1 million token context window — the largest among commercial models — and native integration with the Google ecosystem (Search, Workspace, Cloud). Gemini Flash is the most affordable option with exceptional speed.
The open source segment has advanced significantly. Meta AI released Llama 4 with competitive performance in certain tasks. Alibaba maintains the Qwen family with a focus on multilingual support. DeepSeek surprised with frontier performance at substantially lower cost than equivalent proprietary models.
GPT-4o from OpenAI and Claude Opus from Anthropic are both frontier models with similar capabilities. GPT-4o has better speed and integration with the OpenAI ecosystem. Claude Opus excels at tasks with long context and complex reasoning.
Context window is the maximum amount of text the model can process in a single request, measured in tokens (approximately 4 characters per token in English). Models with larger context windows can analyze complete documents and extensive codebases.
Open source models include Llama (Meta), Qwen (Alibaba), Mistral, DeepSeek, and Gemma (Google). They are available under licenses that allow use, modification, and self-deployment, without depending on paid APIs.
LLMs charge per tokens processed — separated by input tokens (what you send) and output tokens (what the model generates). Prices are in USD per 1 million tokens. Output tokens typically cost 3-5x more than input tokens.