Compare GPT vs Claude vs GeminiSide by Side in 2026

Interactive tool to compare 500+ AI models side by side: price per token, speed, benchmarks and context window. Find which model is best for your use case in 2026.

By Luis Fernando RoquetteLast updated: June 01, 2026 500 models available

Compare dois modelos agora

NOVO

Selecione dois modelos para ver a comparação detalhada lado a lado.

vs

All Available Models

AI21: Jamba Large 1.7(AI21 Labs)Jamba 1.5 Large(AI21 Labs)Jamba 1.5 Mini(AI21 Labs)Jamba 1.6 Large(AI21 Labs)Jamba 1.6 Mini(AI21 Labs)Jamba 1.7 Mini(AI21 Labs)Jamba Reasoning 3B(AI21 Labs)AionLabs: Aion-1.0(AionLabs)AionLabs: Aion-2.0(AionLabs)AionLabs: Aion-RP 1.0 (8B)(AionLabs)AlfredPros: CodeLLaMa 7B Instruct Solidity(AlfredPros)Qwen Chat 14B(Alibaba)Qwen Chat 72B(Alibaba)Qwen: Qwen2.5 7B Instruct(Alibaba)Qwen: Qwen2.5 VL 72B Instruct(Alibaba)Qwen: Qwen3 235B A22B Instruct 2507(Alibaba)Qwen: Qwen3 235B A22B Thinking 2507(Alibaba)Qwen: Qwen3 30B A3B Instruct 2507(Alibaba)Qwen: Qwen3 30B A3B Thinking 2507(Alibaba)Qwen: Qwen3 Coder 30B A3B Instruct(Alibaba)Qwen: Qwen3 Next 80B A3B Instruct(Alibaba)Qwen: Qwen3 VL 235B A22B Instruct(Alibaba)Qwen: Qwen3 VL 30B A3B Instruct(Alibaba)Qwen: Qwen3 VL 32B Instruct(Alibaba)Qwen: Qwen3 VL 8B Instruct(Alibaba)Qwen1.5 Chat 110B(Alibaba)Qwen2 Instruct 72B(Alibaba)Qwen2.5 72B Instruct(Alibaba)Qwen2.5 Coder 32B Instruct(Alibaba)Qwen2.5 Coder Instruct 7B (Alibaba)Qwen2.5 Instruct 32B(Alibaba)Qwen2.5 Max(Alibaba)Qwen3 0.6B (Non-reasoning)(Alibaba)Qwen3 0.6B (Reasoning)(Alibaba)Qwen3 1.7B (Non-reasoning)(Alibaba)Qwen3 1.7B (Reasoning)(Alibaba)Qwen3 14B (Non-reasoning)(Alibaba)Qwen3 14B (Reasoning)(Alibaba)Qwen3 235B A22B (Reasoning)(Alibaba)Qwen3 30B A3B (Reasoning)(Alibaba)Qwen3 30B A3B 2507 (Reasoning)(Alibaba)Qwen3 30B A3B 2507 Instruct(Alibaba)Qwen3 32B (Non-reasoning)(Alibaba)Qwen3 32B (Reasoning)(Alibaba)Qwen3 4B (Non-reasoning)(Alibaba)Qwen3 4B (Reasoning)(Alibaba)Qwen3 4B 2507 (Reasoning)(Alibaba)Qwen3 4B 2507 Instruct(Alibaba)Qwen3 8B (Non-reasoning)(Alibaba)Qwen3 8B (Reasoning)(Alibaba)Qwen3 Coder 480B A35B Instruct(Alibaba)Qwen3 Max (Preview)(Alibaba)Qwen3 Max Thinking (Preview)(Alibaba)Qwen3 Next 80B A3B (Reasoning)(Alibaba)Qwen3 Omni 30B A3B (Reasoning)(Alibaba)Qwen3 Omni 30B A3B Instruct(Alibaba)Qwen3 VL 235B A22B (Reasoning)(Alibaba)Qwen3 VL 30B A3B (Reasoning)(Alibaba)Qwen3 VL 32B (Reasoning)(Alibaba)Qwen3 VL 4B (Reasoning)(Alibaba)Qwen3 VL 4B Instruct(Alibaba)Qwen3 VL 8B (Reasoning)(Alibaba)Qwen3.5 0.8B (Non-reasoning)(Alibaba)Qwen3.5 0.8B (Reasoning)(Alibaba)Qwen3.5 2B (Reasoning)(Alibaba)Qwen3.5 4B (Non-reasoning)(Alibaba)Qwen3.5 4B (Reasoning)(Alibaba)Qwen3.5 9B (Reasoning)(Alibaba)Qwen3.5 Omni Flash(Alibaba)Qwen3.5 Omni Plus(Alibaba)Qwen3.6 Max Preview(Alibaba)Qwen3.7 Max(Alibaba)QwQ 32B(Alibaba)QwQ 32B-Preview(Alibaba)Wan 2.1(Alibaba)Llama 3.1 Tulu3 405B(Allen Institute for AI)Molmo 7B-D(Allen Institute for AI)Molmo2-8B(Allen Institute for AI)OLMo 2 32B(Allen Institute for AI)OLMo 2 7B(Allen Institute for AI)Olmo 3 7B Instruct(Allen Institute for AI)Olmo 3 7B Think(Allen Institute for AI)Olmo 3.1 32B Think(Allen Institute for AI)Olmo 3 32B Think(AllenAI)Olmo 3.1 32B Instruct(AllenAI)Amazon: Nova 2 Lite(Amazon)Amazon: Nova Lite 1.0(Amazon)Amazon: Nova Micro 1.0(Amazon)Amazon: Nova Premier 1.0(Amazon)Amazon: Nova Pro 1.0(Amazon)Nova 2.0 Lite (high)(Amazon)Nova 2.0 Omni (low)(Amazon)Nova 2.0 Omni (medium)(Amazon)Nova 2.0 Omni (Non-reasoning)(Amazon)Nova 2.0 Pro Preview (medium)(Amazon)Nova Lite(Amazon)Nova Micro(Amazon)Nova Pro(Amazon)Claude 3.5(anthropic)Claude Opus 4.8(anthropic)Opus 4.7(anthropic)Opus 4.8(anthropic)Anthropic: Claude 3 Haiku(Anthropic)Anthropic: Claude Opus 4.8 (Fast)(Anthropic)Claude 2.0(Anthropic)Claude 2.1(Anthropic)Claude 3 Opus(Anthropic)Claude 3 Sonnet(Anthropic)Claude 3.5 Haiku(Anthropic)Claude 3.5 Sonnet (June '24)(Anthropic)Claude 3.5 Sonnet (Oct '24)(Anthropic)Claude 3.7 Sonnet(Anthropic)Claude 3.7 Sonnet (thinking)(Anthropic)Claude 4 Opus (Reasoning)(Anthropic)Claude 4 Sonnet (Reasoning)(Anthropic)Claude 4.1 Opus (Non-reasoning)(Anthropic)Claude 4.1 Opus (Reasoning)(Anthropic)Claude 4.5 Haiku (Reasoning)(Anthropic)Claude 4.5 Sonnet (Non-reasoning)(Anthropic)Claude 4.5 Sonnet (Reasoning)(Anthropic)Claude Haiku 4.5(Anthropic)Claude Instant(Anthropic)Claude Opus 4(Anthropic)Claude Opus 4.1(Anthropic)Claude Opus 4.5(Anthropic)Claude Opus 4.5 (Reasoning)(Anthropic)Claude Opus 4.6(Anthropic)Claude Opus 4.6 (Adaptive Reasoning, Max Effort)(Anthropic)Claude Opus 4.6 (Fast)(Anthropic)Claude Opus 4.7(Anthropic)Claude Opus 4.7 (Fast)(Anthropic)Claude Opus 4.8 (Adaptive Reasoning, Max Effort)(Anthropic)Claude Sonnet 4(Anthropic)Claude Sonnet 4.5(Anthropic)Claude Sonnet 4.6(Anthropic)Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)(Anthropic)Claude Sonnet 4.6 (Non-reasoning, Low Effort)(Anthropic)Arcee AI: Coder Large(Arcee AI)Arcee AI: Maestro Reasoning(Arcee AI)Arcee AI: Spotlight(Arcee AI)Arcee AI: Trinity Large Thinking(Arcee AI)Arcee AI: Trinity Mini(Arcee AI)Arcee AI: Virtuoso Large(Arcee AI)Trinity Large Thinking(Arcee AI)Baidu: ERNIE 4.5 21B A3B Thinking(Baidu)Baidu: ERNIE 4.5 300B A47B (Baidu)Baidu: ERNIE 4.5 VL 28B A3B(Baidu)Baidu: ERNIE 4.5 VL 424B A47B (Baidu)ERNIE 5.0 Thinking Preview(Baidu)ByteDance: UI-TARS 7B (ByteDance)Doubao Seed Code(ByteDance)ByteDance Seed: Seed 1.6 Flash(ByteDance Seed)ByteDance Seed: Seed-2.0-Lite(ByteDance Seed)Doubao Seed Code(ByteDance Seed)Seed-OSS-36B-Instruct(ByteDance Seed)JT-35B-Flash(China Mobile)JT-35B-Flash(China Mobile)JT-MINI(China Mobile)Cohere: Command R+ (08-2024)(Cohere)Cohere: Command R7B (12-2024)(Cohere)Command A+(Cohere)Command-R (Mar '24)(Cohere)Command-R+ (Apr '24)(Cohere)Tiny Aya Global(Cohere)DBRX Instruct(Databricks)Cogito v2.1 (Reasoning)(Deep Cogito)Deep Cogito: Cogito v2.1 671B(Deep Cogito)DeepSeek Coder V2 Lite Instruct(DeepSeek)DeepSeek LLM 67B Chat (V1)(DeepSeek)DeepSeek R1 (Jan '25)(DeepSeek)DeepSeek R1 0528 Qwen3 8B(DeepSeek)DeepSeek R1 Distill Llama 8B(DeepSeek)DeepSeek R1 Distill Qwen 1.5B(DeepSeek)DeepSeek R1 Distill Qwen 14B(DeepSeek)DeepSeek V3(DeepSeek)DeepSeek V3 0324(DeepSeek)DeepSeek V3.1(DeepSeek)DeepSeek V3.1 Terminus(DeepSeek)DeepSeek V3.2(DeepSeek)DeepSeek V3.2 Exp(DeepSeek)DeepSeek V3.2 Exp (Non-reasoning)(DeepSeek)DeepSeek V3.2 Exp (Reasoning)(DeepSeek)DeepSeek V3.2 Speciale(DeepSeek)DeepSeek V4 Flash(DeepSeek)DeepSeek V4 Pro(DeepSeek)DeepSeek-Coder-V2(DeepSeek)DeepSeek-V2-Chat(DeepSeek)DeepSeek-V2.5(DeepSeek)DeepSeek-V2.5 (Dec '24)(DeepSeek)DeepSeek: R1(DeepSeek)DeepSeek: R1 Distill Qwen 32B(DeepSeek)R1 Distill Llama 70B(DeepSeek)EssentialAI: Rnj 1 Instruct(EssentialAI)Goliath 120B(Goliath 120B)Gemini 3.5(google)Gemini 1.0 Pro(Google)Gemini 1.0 Ultra(Google)Gemini 1.5 Flash (May '24)(Google)Gemini 1.5 Flash (Sep '24)(Google)Gemini 1.5 Flash-8B(Google)Gemini 1.5 Pro (May '24)(Google)Gemini 1.5 Pro (Sep '24)(Google)Gemini 2.0 Flash(Google)Gemini 2.0 Flash (experimental)(Google)Gemini 2.0 Flash Lite(Google)Gemini 2.0 Flash Thinking Experimental (Dec '24)(Google)Gemini 2.0 Flash Thinking Experimental (Jan '25)(Google)Gemini 2.0 Flash-Lite (Feb '25)(Google)Gemini 2.0 Flash-Lite (Preview)(Google)Gemini 2.0 Pro Experimental (Feb '25)(Google)Gemini 2.5 Flash(Google)Gemini 2.5 Flash Lite(Google)Gemini 2.5 Flash Preview (Non-reasoning)(Google)Gemini 2.5 Flash Preview (Reasoning)(Google)Gemini 2.5 Flash Preview (Sep '25) (Reasoning)(Google)Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)(Google)Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)(Google)Gemini 2.5 Pro(Google)Gemini 2.5 Pro Preview (Mar' 25)(Google)Gemini 2.5 Pro Preview (May' 25)(Google)Gemini 2.5 Pro Preview 05-06(Google)Gemini 2.5 Pro Preview 06-05(Google)Gemini 3 Deep Think(Google)Gemini 3 Flash Preview(Google)Gemini 3 Flash Preview (Non-reasoning)(Google)Gemini 3 Flash Preview (Reasoning)(Google)Gemini 3 Pro Preview (high)(Google)Gemini 3 Pro Preview (low)(Google)Gemini 3.1 Flash Lite(Google)Gemini 3.1 Flash Lite Preview(Google)Gemini 3.1 Pro Preview(Google)Gemini 3.1 Pro Preview Custom Tools(Google)Gemini 3.5 Flash (minimal)(Google)Gemma 2 27B(Google)Gemma 3 12B(Google)Gemma 3 1B Instruct(Google)Gemma 3 270M(Google)Gemma 3 27B(Google)Gemma 3 4B(Google)Gemma 3n 4B(Google)Gemma 3n E2B Instruct(Google)Gemma 3n E4B Instruct(Google)Gemma 3n E4B Instruct Preview (May '25)(Google)Gemma 4 26B A4B (Google)Gemma 4 31B(Google)Gemma 4 E2B (Non-reasoning)(Google)Gemma 4 E2B (Reasoning)(Google)Gemma 4 E4B (Non-reasoning)(Google)Gemma 4 E4B (Reasoning)(Google)Google: Gemini 3.5 Flash(Google)Lyria 3 Clip Preview(Google)Lyria 3 Pro Preview(Google)Nano Banana (Gemini 2.5 Flash Image)(Google)Nano Banana 2 (Gemini 3.1 Flash Image Preview)(Google)Nano Banana Pro (Gemini 3 Pro Image Preview)(Google)PALM-2(Google)Granite 3.3 8B (Non-reasoning)(IBM)Granite 4.0 1B(IBM)Granite 4.0 350M(IBM)Granite 4.0 H 1B(IBM)Granite 4.0 H 350M(IBM)Granite 4.0 H Small(IBM)Granite 4.0 Micro(IBM)Granite 4.1 30B(IBM)Granite 4.1 3B(IBM)Granite 4.1 8B(IBM)Inception: Mercury 2(Inception)Ling 2.6 Flash(Inclusion AI)Ling-2.6-1T(Inclusion AI)Ling-1T(InclusionAI)Ling-flash-2.0(InclusionAI)Ling-mini-2.0(InclusionAI)Ring-1T(InclusionAI)Ring-2.6-1T(InclusionAI)Ring-flash-2.0(InclusionAI)Inflection: Inflection 3 Pi(Inflection)Inflection: Inflection 3 Productivity(Inflection)Kimi K2 Thinking(Kimi)Kimi Linear 48B A3B Instruct(Kimi)Mi:dm K 2.5 Pro(Korea Telecom)Mi:dm K 2.5 Pro Preview(Korea Telecom)Kling AI 2.0(Kuaishou)KAT-Coder-Pro V1(KwaiKAT)Kwaipilot: KAT-Coder-Pro V2(Kwaipilot)EXAONE 4.5 33B(LG AI)K-EXAONE (Reasoning)(LG AI)Exaone 4.0 1.2B (Non-reasoning)(LG AI Research)EXAONE 4.0 32B (Non-reasoning)(LG AI Research)EXAONE 4.0 32B (Reasoning)(LG AI Research)LFM 40B(Liquid AI)LFM2 1.2B(Liquid AI)LFM2 2.6B(Liquid AI)LFM2 8B A1B(Liquid AI)LFM2.5-1.2B-Instruct(Liquid AI)LFM2.5-1.2B-Thinking(Liquid AI)LFM2.5-VL-1.6B(Liquid AI)LFM2-24B-A2B(LiquidAI)LongCat Flash Lite(LongCat)Luma Dream Machine 1.6(Luma AI)Magnum v4 72B(Magnum v4 72B)Mancer: Weaver (alpha)(Mancer)K2 Think V2(MBZUAI Institute of Foundation Models)K2-V2 (high)(MBZUAI Institute of Foundation Models)K2-V2 (medium)(MBZUAI Institute of Foundation Models)Llama 2 Chat 13B(Meta)Llama 2 Chat 70B(Meta)Llama 2 Chat 7B(Meta)Llama 3 70B Instruct(Meta)Llama 3 8B Instruct(Meta)Llama 3.1 70B Instruct(Meta)Llama 3.1 8B Instruct(Meta)Llama 3.1 Instruct 405B(Meta)Llama 3.2 11B Vision Instruct(Meta)Llama 3.2 1B Instruct(Meta)Llama 3.2 3B Instruct(Meta)Llama 3.2 Instruct 90B (Vision)(Meta)Llama 3.3 70B Instruct(Meta)Llama 4 Maverick(Meta)Llama 4 Scout(Meta)Llama 65B(Meta)Llama Guard 3 8B(Meta)Llama Guard 4 12B(Meta)Muse Spark(Meta)Microsoft: Phi 4(Microsoft)Phi-3 Mini Instruct 3.8B(Microsoft)Phi-4 Mini Instruct(Microsoft)Phi-4 Multimodal Instruct(Microsoft)WizardLM-2 8x22B(Microsoft)Hailuo MiniMax Video-01(MiniMax)MiniMax M1 40k(MiniMax)MiniMax M1 80k(MiniMax)MiniMax-M2(MiniMax)MiniMax: MiniMax M1(MiniMax)MiniMax: MiniMax M2-her(MiniMax)MiniMax: MiniMax M2.1(MiniMax)MiniMax: MiniMax M2.5(MiniMax)MiniMax: MiniMax M2.7(MiniMax)MiniMax: MiniMax-01(MiniMax)Mistral(mistral)Devstral 2(Mistral)Devstral Small (Jul '25)(Mistral)Devstral Small (May '25)(Mistral)Devstral Small 2(Mistral)Magistral Medium 1(Mistral)Magistral Small 1(Mistral)Magistral Small 1.2(Mistral)Ministral 3 14B(Mistral)Ministral 3 3B(Mistral)Ministral 3 8B(Mistral)Mistral 7B Instruct(Mistral)Mistral Large 2 (Jul '24)(Mistral)Mistral Large 2 (Nov '24)(Mistral)Mistral Large 3(Mistral)Mistral Medium(Mistral)Mistral Small (Feb '24)(Mistral)Mistral Small (Sep '24)(Mistral)Mistral Small 3(Mistral)Mistral Small 3.1(Mistral)Mistral Small 3.2(Mistral)Mixtral 8x22B Instruct(Mistral)Magistral Medium 1.2(Mistral AI)Mistral Large(Mistral AI)Mistral: Codestral 2508(Mistral AI)Mistral: Devstral 2 2512(Mistral AI)Mistral: Devstral Medium(Mistral AI)Mistral: Devstral Small 1.1(Mistral AI)Mistral: Ministral 3 14B 2512(Mistral AI)Mistral: Ministral 3 3B 2512(Mistral AI)Mistral: Ministral 3 8B 2512(Mistral AI)Mistral: Mistral 7B Instruct v0.1(Mistral AI)Mistral: Mistral Medium 3(Mistral AI)Mistral: Mistral Medium 3.1(Mistral AI)Mistral: Mistral Medium 3.5(Mistral AI)Mistral: Mistral Nemo(Mistral AI)Mistral: Mistral Small 3.1 24B(Mistral AI)Mistral: Mistral Small 3.2 24B(Mistral AI)Mistral: Mistral Small 4(Mistral AI)Mistral: Mistral Small Creative(Mistral AI)Mistral: Mixtral 8x22B Instruct(Mistral AI)Mistral: Mixtral 8x7B Instruct(Mistral AI)Mistral: Pixtral Large 2411(Mistral AI)Mistral: Saba(Mistral AI)Mistral: Voxtral Small 24B 2507(Mistral AI)Kimi K2(Moonshot AI)MoonshotAI: Kimi K2 0711(MoonshotAI)MoonshotAI: Kimi K2 0905(MoonshotAI)MoonshotAI: Kimi K2.5(MoonshotAI)MoonshotAI: Kimi K2.6(MoonshotAI)Morph: Morph V3 Fast(Morph)Morph: Morph V3 Large(Morph)Motif-2-12.7B-Reasoning(Motif Technologies)MythoMax 13B(MythoMax 13B)Nanbeige4.1-3B(Nanbeige)HyperCLOVA X SEED Think (32B)(Naver)Nex AGI: DeepSeek V3.1 Nex N1(Nex AGI)Nous: Hermes 3 405B Instruct(Nous)Nous: Hermes 3 70B Instruct(Nous)Nous: Hermes 4 405B(Nous)Nous: Hermes 4 70B(Nous)DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)(Nous Research)DeepHermes 3 - Mistral 24B Preview (Non-reasoning)(Nous Research)Hermes 3 - Llama-3.1 70B(Nous Research)Hermes 4 - Llama-3.1 405B (Non-reasoning)(Nous Research)Hermes 4 - Llama-3.1 405B (Reasoning)(Nous Research)Hermes 4 - Llama-3.1 70B (Non-reasoning)(Nous Research)Hermes 4 - Llama-3.1 70B (Reasoning)(Nous Research)NousResearch: Hermes 2 Pro - Llama-3 8B(NousResearch)Llama 3.1 Nemotron 70B Instruct(NVIDIA)Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)(NVIDIA)Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)(NVIDIA)Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)(NVIDIA)Llama 3.3 Nemotron Super 49B v1 (Reasoning)(NVIDIA)Llama Nemotron Super 49B v1.5 (Non-reasoning)(NVIDIA)Llama Nemotron Super 49B v1.5 (Reasoning)(NVIDIA)Nemotron 3 Nano Omni 30B A3B Reasoning(NVIDIA)Nemotron Cascade 2 30B A3B(NVIDIA)NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)(NVIDIA)NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)(NVIDIA)NVIDIA Nemotron 3 Nano 4B(NVIDIA)NVIDIA Nemotron 3 Super 120B A12B (Reasoning)(NVIDIA)NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)(NVIDIA)NVIDIA Nemotron Nano 12B v2 VL (Reasoning)(NVIDIA)NVIDIA Nemotron Nano 9B V2 (Non-reasoning)(NVIDIA)NVIDIA Nemotron Nano 9B V2 (Reasoning)(NVIDIA)GPT Audio(OpenAI)GPT Audio Mini(OpenAI)GPT Chat Latest(OpenAI)GPT-3.5 Turbo(OpenAI)GPT-3.5 Turbo(OpenAI)GPT-3.5 Turbo (0613)(OpenAI)GPT-4 Turbo(OpenAI)GPT-4 Turbo Preview(OpenAI)GPT-4.1(OpenAI)GPT-4.1 Mini(OpenAI)GPT-4.1 Nano(OpenAI)GPT-4.5 (Preview)(OpenAI)GPT-4o (2024-08-06)(OpenAI)GPT-4o (2024-11-20)(OpenAI)GPT-4o (ChatGPT)(OpenAI)GPT-4o (March 2025, chatgpt-4o-latest)(OpenAI)GPT-4o Audio(OpenAI)GPT-4o mini Realtime (Dec '24)(OpenAI)GPT-4o Realtime (Dec '24)(OpenAI)GPT-4o Search Preview(OpenAI)GPT-4o-mini (2024-07-18)(OpenAI)GPT-4o-mini Search Preview(OpenAI)GPT-5(OpenAI)GPT-5 (ChatGPT)(OpenAI)GPT-5 (minimal)(OpenAI)GPT-5 Chat(OpenAI)GPT-5 Codex(OpenAI)GPT-5 Image(OpenAI)GPT-5 Image Mini(OpenAI)GPT-5 Mini(OpenAI)GPT-5 mini (minimal)(OpenAI)GPT-5 Nano(OpenAI)GPT-5 nano (minimal)(OpenAI)GPT-5 Pro(OpenAI)GPT-5.1(OpenAI)GPT-5.1 Chat(OpenAI)GPT-5.1-Codex(OpenAI)GPT-5.1-Codex-Max(OpenAI)GPT-5.1-Codex-Mini(OpenAI)GPT-5.2(OpenAI)GPT-5.2 Chat(OpenAI)GPT-5.2 Pro(OpenAI)GPT-5.2-Codex(OpenAI)GPT-5.3 Chat(OpenAI)GPT-5.3-Codex(OpenAI)GPT-5.4(OpenAI)GPT-5.4 Image 2(OpenAI)GPT-5.4 Mini(OpenAI)GPT-5.4 Nano(OpenAI)GPT-5.4 Pro(OpenAI)GPT-5.5(OpenAI)GPT-5.5 Instant (May 2026)(OpenAI)GPT-5.5 Pro(OpenAI)gpt-oss-120b(OpenAI)gpt-oss-20b(OpenAI)gpt-oss-safeguard-20b(OpenAI)o1(OpenAI)o1-mini(OpenAI)o1-preview(OpenAI)o1-pro(OpenAI)o3(OpenAI)o3 Deep Research(OpenAI)o3 Mini(OpenAI)o3 Mini High(OpenAI)o3 Pro(OpenAI)o4 Mini(OpenAI)o4 Mini Deep Research(OpenAI)o4 Mini High(OpenAI)OpenAI: GPT-3.5 Turbo 16k(OpenAI)OpenAI: GPT-4(OpenAI)OpenAI: GPT-4 Turbo (older v1106)(OpenAI)OpenAI: GPT-4o(OpenAI)OpenAI: GPT-4o (2024-05-13)(OpenAI)OpenAI: GPT-4o-mini(OpenAI)Sora(OpenAI)MiniCPM-V 4.6 1.3B(OpenBMB)

Top 10 Models — 10-Axis Comparison

Data from ELO Chatbot Arena, Artificial Analysis and OpenRouter. ELO: daily • Prices: weekly.

ModelELOIntel.Code$/1M in$/1M outtok/sContextMultiOSS
1,49752.948.1$30.00$150.001.0M
1,47751.376.45$1.75$14.00128K
1,46246.473.9$0.50$3.001.0M
1,45178.18$1.75$14.00128K
1,42621.872.11$15.00$120.00400K
1,42332.973.19$0.27$0.41164K
1,417$0.57$2.30131K
1,39976.07$3.00$15.001.0M

Intel. = Intelligence Index (0–100) · Code = Coding Index · tok/s = tokens per second · Multi = multimodal · OSS = open source. See full methodology →

How to Compare AI Models in 2026

Comparison Criteria

Comparing AI models requires multidimensional analysis. There is no single “best model” — the choice depends on the use case, budget, and technical requirements. The key criteria are: response quality (measured by benchmarks like MMLU and GPQA), cost per token, inference speed, context window size, tool calling support, multimodality, and language-specific performance.

Price per Token: The Real Cost

AI models are generally charged per “token” — units of processed text. One token is roughly 3/4 of a word in English. Pricing varies dramatically: from $0.01/1M tokens (lightweight models) to $60+/1M tokens (frontier models). For high-volume applications like customer support chatbots, the cost difference can add up to thousands of dollars per month.

Context Window: How Much Text the Model Processes

The context window determines how much text the model can “see” at once. Models with a small context window (8K–32K tokens) are suited for simple queries and short conversations. Models with large context (128K–200K) process entire documents, contracts, and codebases. Gemini 1.5 Pro leads with 2M tokens — enough for entire books.

Speed and Latency

For real-time applications (chatbots, code autocomplete), generation speed (tokens per second) and initial latency (time to first token) are crucial. Smaller models (GPT-4o-mini, Claude Haiku, Mistral Small) are significantly faster than frontier models. Latency also varies by region — consider your proximity to the provider’s data centers when evaluating performance.

Benchmarks: What They Actually Measure

MMLU (Massive Multitask Language Understanding) tests general knowledge across 57 disciplines. GPQA Diamond tests reasoning in physics, chemistry, and biology at PhD level. SWE-bench tests real-world code bug resolution. Chatbot Arena (LMSYS) measures human preference in conversations. No single benchmark tells the full story — use multiple for a balanced view.

Popular Comparisons

The most popular comparisons include: GPT-4o vs Claude 3.5 Sonnet (the two most widely used models), Gemini vs ChatGPT (Google vs OpenAI ecosystem), Claude vs GPT for code (which is better for programming), and open source vs proprietary models (Llama vs GPT — when to use each). Use the tool above to compare any combination of models.

Popular Comparisons

Frequently Asked Questions

How do you compare AI models?

A proper comparison should consider multiple factors: quality benchmarks (MMLU, GPQA), price per token, inference speed, context window size, tool calling support, multimodality, and performance on your specific task. There is no universal "best" — it depends on your use case.

What is the difference between GPT and Claude?

GPT (OpenAI) and Claude (Anthropic) are the two most popular frontier models. GPT tends to be more versatile and integrated (ChatGPT, Copilot). Claude excels at following complex instructions, long contexts (200K tokens), and safety. Both deliver strong performance across English and other languages.

GPT-5 or Claude Opus?

GPT-5 and Claude Opus compete at the top of the rankings. GPT-5 is faster at generation. Claude Opus is more precise for reasoning and long-form analysis. For coding, both are excellent. For cost-efficiency at high volume, smaller versions (GPT-4o-mini, Claude Haiku) are recommended.

Is Gemini better than ChatGPT?

Gemini (Google) has advantages in context window (up to 2M tokens), Google Search integration, and native multimodal processing. ChatGPT (GPT-4o/5) has advantages in ecosystem (plugins, GPT Store) and speed. For general-purpose use, both are highly competitive.

What is the cheapest AI model?

Models like GPT-4o-mini, Claude Haiku, and DeepSeek V3 offer excellent quality for less than $0.30/1M tokens. For free local use, open source models like Llama and Qwen can be run via Ollama at zero API cost.

Explore Other Categories