Best AI for Code in 2026Updated by AA Coding Index

Which AI codes best in 2026? Ranking of 480 models using AA Coding Index as the primary signal, with fallback to LiveCodeBench and SciCode. Built to favor current coding models instead of stale one-off spikes.

Synced: July 17, 2026 • 480 models with coding benchmarks

Use Cases

Code Autocomplete

Inline suggestions as you type. Ideal for IDEs like Cursor and VS Code.

Top models: GPT-5.6 Sol (xhigh), GPT-5.6 Sol (max), GPT-5.6 Sol (high)

Code Generation

Create functions, classes and full projects from natural language descriptions.

Top models: GPT-5.6 Sol (xhigh), GPT-5.6 Sol (max), GPT-5.6 Sol (high)

Debug & Code Review

Identify bugs, suggest fixes and review pull requests automatically.

Top models: GPT-5.6 Sol (xhigh), GPT-5.6 Sol (max), GPT-5.6 Sol (high)

Coding Ranking — Top Models

#	Model	Company	Coding Score	Benchmark	Context	Input Price	Release	Open Source
🥇	GPT-5.6 Sol (xhigh)	OpenAI	78.3	AA Coding Index	—	$5.00	Jul 2026	—
🥈	GPT-5.6 Sol (max)	OpenAI	77.4	AA Coding Index	1.1M tokens	$5.00	Jul 2026	—
🥉	GPT-5.6 Sol (high)	OpenAI	77.2	AA Coding Index	—	$5.00	Jul 2026	—
4	GPT-5.6 Terra (max)	OpenAI	76.7	AA Coding Index	1.1M tokens	$2.50	Jul 2026	—
5	Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)	Anthropic	76.5	AA Coding Index	1.0M tokens	$10.00	Jun 2026	—
6	GPT-5.6 Sol (medium)	OpenAI	76.3	AA Coding Index	—	$5.00	Jul 2026	—
7	Kimi K3	Kimi	76.2	AA Coding Index	—	$3.00	Jul 2026	—
8	GPT-5.5	OpenAI	74.9	AA Coding Index	1.1M tokens	$5.00	Apr 2026	—
9	Claude Opus 4.8 (Adaptive Reasoning, Max Effort)	Anthropic	74.3	AA Coding Index	1.0M tokens	$5.00	May 2026	—
10	Claude Opus 4.7	Anthropic	73.6	AA Coding Index	1.0M tokens	$5.00	Apr 2026	—
11	Claude Opus 4.7 (Fast)	Anthropic	73.6	AA Coding Index	1.0M tokens	$30.00	May 2026	—
12	Grok 4.5	xai	72.4	AA Coding Index	500K tokens	$2.00	Jul 2026	—
13	Claude Sonnet 5	anthropic	71.5	AA Coding Index	1.0M tokens	$2.00	Jun 2026	—
14	GPT-5.5 Pro	OpenAI	71.5	AA Coding Index	1.1M tokens	—	Apr 2026	—
15	GPT-5.6 Luna (max)	OpenAI	71.4	AA Coding Index	1.1M tokens	$1.00	Jul 2026	—
16	Muse Spark 1.1 (xhigh)	Meta	71.3	AA Coding Index	—	$1.25	Jul 2026	—
17	GPT-5.4	OpenAI	71.1	AA Coding Index	1.1M tokens	$2.50	Mar 2026	—
18	GPT-5.4 Pro	OpenAI	71.1	AA Coding Index	1.1M tokens	$30.00	Mar 2026	—
19	GPT-5.6 Terra (xhigh)	OpenAI	70.6	AA Coding Index	—	$2.50	Jul 2026	—
20	Google: Gemini 3.5 Flash	Google	70.1	AA Coding Index	1.0M tokens	$1.50	May 2026	—
21	Gemini 3.5 Flash (minimal)	Google	70.1	AA Coding Index	—	$1.50	May 2026	—
22	GPT-5.6 Sol (low)	OpenAI	69.7	AA Coding Index	—	$5.00	Jul 2026	—
23	Gemini 3.1 Pro Preview	Google	68.8	AA Coding Index	1.0M tokens	$2.00	Feb 2026	—
24	GLM-5.2 (max)	Z.ai	68.8	AA Coding Index	—	$1.40	Jun 2026	—
25	Gemini 3.1	google	68.8	AA Coding Index	—	—	—	—
26	GPT-5.6 Luna (xhigh)	OpenAI	68.6	AA Coding Index	—	$1.00	Jul 2026	—
27	GPT-5.6 Terra (high)	OpenAI	67.1	AA Coding Index	—	$2.50	Jul 2026	—
28	Qwen3.7 Max	Alibaba	66.0	AA Coding Index	—	$2.50	May 2026	—
29	GPT-5.6 Sol (Non-reasoning)	OpenAI	65.1	AA Coding Index	—	$5.00	Jul 2026	—
30	GPT-5.6 Terra (medium)	OpenAI	64.7	AA Coding Index	—	$2.50	Jul 2026	—
31	GPT-5.6 Luna (high)	OpenAI	63.3	AA Coding Index	—	$1.00	Jul 2026	—
32	Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)	Anthropic	63.0	AA Coding Index	—	$3.00	Feb 2026	—
33	MoonshotAI: Kimi K2.6	MoonshotAI	61.8	AA Coding Index	262K tokens	$0.95	Apr 2026	✅
34	Kimi K2.7 Code	Kimi	60.8	AA Coding Index	—	$0.95	Jun 2026	—
35	Xiaomi: MiMo-V2.5-Pro	Xiaomi	60.2	AA Coding Index	1.0M tokens	$0.43	Apr 2026	—
36	Kwaipilot: KAT-Coder-Pro V2	Kwaipilot	59.5	AA Coding Index	256K tokens	$0.30	Mar 2026	—
37	DeepSeek V4 Pro	DeepSeek	59.4	AA Coding Index	1.0M tokens	$0.43	Apr 2026	✅
38	Nex-N2-Pro	Nex AGI	59.1	AA Coding Index	262K tokens	$0.50	Jun 2026	—
39	KAT-Coder-Pro V1	KwaiKAT	58.9	AA Coding Index	—	—	Nov 2025	—
40	Hy3-preview (Reasoning)	Tencent	58.8	AA Coding Index	262K tokens	$0.12	Apr 2026	—
41	Hy3-preview (Reasoning)	Tencent	58.8	AA Coding Index	262K tokens	$0.12	Apr 2026	—
42	Muse Spark	Meta	58.6	AA Coding Index	—	—	Apr 2026	—
43	MiniMax-M3	MiniMax	58.6	AA Coding Index	1.0M tokens	$0.30	Jun 2026	—
44	GPT-5.6 Terra (low)	OpenAI	58.1	AA Coding Index	—	$2.50	Jul 2026	—
45	MiMo-V2.5	Xiaomi	56.8	AA Coding Index	—	$0.14	Apr 2026	—
46	DeepSeek V4 Flash	DeepSeek	56.2	AA Coding Index	1.0M tokens	$0.14	Apr 2026	✅
47	GPT-5.4 Mini	OpenAI	56.1	AA Coding Index	400K tokens	$0.75	Mar 2026	—
48	GPT-5.4 Nano	OpenAI	56.1	AA Coding Index	400K tokens	$0.20	Mar 2026	—
49	Qwen3.7 Plus	Alibaba	55.9	AA Coding Index	—	$0.40	Jun 2026	—
50	GLM-5.1 (Reasoning)	Z.ai	55.8	AA Coding Index	—	$1.40	Apr 2026	—
51	MiniMax: MiniMax M2.7	MiniMax	52.6	AA Coding Index	197K tokens	$0.30	Mar 2026	✅
52	JT-4.1 Flash 236B A21B	China Mobile	52.4	AA Coding Index	—	—	Jul 2026	—
53	GPT-5.6 Terra (Non-reasoning)	OpenAI	52.3	AA Coding Index	—	$2.50	Jul 2026	—
54	Claude 4.5 Sonnet (Reasoning)	Anthropic	52.1	AA Coding Index	—	$3.00	Sep 2025	—
55	Claude 4.5 Sonnet (Non-reasoning)	Anthropic	52.1	AA Coding Index	—	$3.00	Sep 2025	—
56	Inkling	Thinking Machines	52.1	AA Coding Index	—	$1.87	Jul 2026	—
57	Grok Build 0.1 0616	xAI	51.5	AA Coding Index	—	$1.00	—	—
58	GPT-5.6 Luna (medium)	OpenAI	50.7	AA Coding Index	—	$1.00	Jul 2026	—
59	MiMo-V2-Flash (Reasoning)	Xiaomi	49.8	AA Coding Index	262K tokens	$0.10	Dec 2025	—
60	MiMo-V2-Flash (Feb 2026)	Xiaomi	49.8	AA Coding Index	—	—	Dec 2025	—
61	GPT-5.1	OpenAI	49.4	AA Coding Index	400K tokens	$1.25	Nov 2025	—
62	GPT-5.1-Codex	OpenAI	49.4	AA Coding Index	400K tokens	$1.25	Nov 2025	—
63	GPT-5.1 Chat	OpenAI	49.4	AA Coding Index	128K tokens	$1.25	Nov 2025	—
64	Nemotron 3 Ultra 550B A55B (Reasoning)	NVIDIA	49.3	AA Coding Index	1.0M tokens	$0.68	Jun 2026	—
65	Mistral: Mistral Medium 3.5	Mistral AI	46.9	AA Coding Index	262K tokens	$1.50	Apr 2026	✅
66	Kimi K2 Thinking	Kimi	46.8	AA Coding Index	262K tokens	$0.60	Nov 2025	—
67	MoonshotAI: Kimi K2.5	MoonshotAI	46.8	AA Coding Index	262K tokens	$0.60	Jan 2026	✅
68	Gemini 2.5 Pro Preview (Mar' 25)	Google	46.7	AA Coding Index	—	—	Mar 2025	—
69	Gemini 2.5 Pro Preview (May' 25)	Google	46.7	AA Coding Index	—	$1.25	May 2025	—
70	GLM-4.6 (Reasoning)	Z.ai	45.8	AA Coding Index	—	$0.55	Sep 2025	—
71	GLM-4.6 (Non-reasoning)	Z.ai	45.8	AA Coding Index	—	$0.57	Sep 2025	—
72	GLM-4.7 (Reasoning)	Z.ai	45.3	AA Coding Index	—	$0.60	Dec 2025	—
73	LongCat 2.0	LongCat	45.3	AA Coding Index	—	—	Jun 2026	—
74	DeepSeek V3.2 Exp (Reasoning)	DeepSeek	44.2	AA Coding Index	—	$0.28	Dec 2025	—
75	DeepSeek V3.2	DeepSeek	44.2	AA Coding Index	164K tokens	$0.28	Dec 2025	✅
76	GPT-5.6 Luna (low)	OpenAI	44.2	AA Coding Index	—	$1.00	Jul 2026	—
77	Claude 4.5 Haiku (Reasoning)	Anthropic	43.9	AA Coding Index	—	$1.00	Oct 2025	—
78	DeepSeek V3.1 Terminus	DeepSeek	43.5	AA Coding Index	131K tokens	$1.64	Sep 2025	✅
79	Gemma 4 31B	Google	43.4	AA Coding Index	262K tokens	$0.14	Apr 2026	—
80	Ring-2.6-1T	InclusionAI	42.8	AA Coding Index	—	$0.30	May 2026	—
81	Grok 4.3	xAI	42.2	AA Coding Index	1.0M tokens	$1.25	Apr 2026	—
82	o1	OpenAI	39.7	AA Coding Index	200K tokens	$15.00	Dec 2024	—
83	Step 3.7 Flash	StepFun	39.6	AA Coding Index	—	$0.20	May 2026	—
84	GPT-5.5 Instant (May 2026)	OpenAI	39.4	AA Coding Index	—	$5.00	May 2026	—
85	GPT-5.5 Instant (June 2026)	OpenAI	39.4	AA Coding Index	—	$5.00	Jun 2026	—
86	GPT-5.6 Luna (Non-reasoning)	OpenAI	39.3	AA Coding Index	—	$1.00	Jul 2026	—
87	Gemma 4 26B A4B	Google	39.3	AA Coding Index	262K tokens	$0.13	Apr 2026	—
88	GPT-5	OpenAI	37.8	AA Coding Index	400K tokens	$1.25	Aug 2025	—
89	GPT-5 (ChatGPT)	OpenAI	37.8	AA Coding Index	—	$1.25	Aug 2025	—
90	NVIDIA Nemotron 3 Super 120B A12B (Reasoning)	NVIDIA	37.7	AA Coding Index	1.0M tokens	$0.25	Mar 2026	—
91	Claude 4 Sonnet (Reasoning)	Anthropic	37.6	AA Coding Index	—	$3.00	May 2025	—
92	North Mini Code	Cohere	36.5	AA Coding Index	—	—	Jun 2026	—
93	Claude 3.7 Sonnet (thinking)	Anthropic	36.4	AA Coding Index	200K tokens	—	Feb 2025	—
94	Claude 3.7 Sonnet	Anthropic	36.4	AA Coding Index	200K tokens	$3.00	Feb 2025	—
95	Gemini 3.1 Flash Lite Preview	Google	34.7	AA Coding Index	1.0M tokens	$0.25	Mar 2026	—
96	Gemini 3.1 Flash Lite	Google	34.7	AA Coding Index	1.0M tokens	$0.25	May 2026	—
97	Nova 2.0 Pro Preview (medium)	Amazon	34.0	AA Coding Index	—	$1.25	Nov 2025	—
98	o1-preview	OpenAI	34.0	AA Coding Index	—	$16.50	Sep 2024	—
99	Gemini 2.5	Google	33.3	AA Coding Index	—	—	—	—
100	Gemini 2.5 Pro	Google	33.3	AA Coding Index	1.0M tokens	$1.25	Jun 2025	—
101	Devstral 2	Mistral	31.3	AA Coding Index	—	—	Dec 2025	—
102	gpt-oss-120b	OpenAI	30.4	AA Coding Index	131K tokens	$0.15	Aug 2025	—
103	Claude 3.5 Sonnet (Oct '24)	Anthropic	30.2	AA Coding Index	—	$3.00	Oct 2024	—
104	Devstral Small 2	Mistral	29.3	AA Coding Index	—	—	Dec 2025	—
105	Qwen3.5 9B (Reasoning)	Alibaba	28.7	AA Coding Index	—	—	Mar 2026	—
106	Command A+	Cohere	27.8	AA Coding Index	—	—	May 2026	—
107	Mistral: Mistral Small 4	Mistral AI	26.6	AA Coding Index	262K tokens	$0.15	Mar 2026	✅
108	Mistral Small 3.1	Mistral	26.3	AA Coding Index	—	$0.10	Mar 2025	—
109	Claude 3.5 Sonnet (June '24)	Anthropic	26.0	AA Coding Index	—	$3.00	Jun 2024	—
110	Gemini 2.0 Pro Experimental (Feb '25)	Google	25.5	AA Coding Index	—	—	Feb 2025	—
111	Ling 2.6 Flash	Inclusion AI	25.3	AA Coding Index	—	$0.10	Apr 2026	—
112	DeepSeek R1 (Jan '25)	DeepSeek	24.6	AA Coding Index	—	$1.35	May 2025	—
113	DeepSeek: R1	DeepSeek	24.6	AA Coding Index	164K tokens	$0.70	May 2025	✅
114	GPT-4o (March 2025, chatgpt-4o-latest)	OpenAI	24.2	AA Coding Index	—	—	Mar 2025	—
115	OpenAI: GPT-4o (2024-05-13)	OpenAI	24.2	AA Coding Index	128K tokens	$5.00	May 2024	—
116	GPT-4o (2024-08-06)	OpenAI	24.2	AA Coding Index	128K tokens	$2.50	Aug 2024	—
117	OpenAI: GPT-4o	OpenAI	24.2	AA Coding Index	128K tokens	$2.50	Nov 2024	—
118	Gemini 2.0 Flash	Google	24.1	AA Coding Index	1.0M tokens	$0.15	Feb 2025	—
119	Gemini 2.0 Flash Thinking Experimental (Dec '24)	Google	24.1	AA Coding Index	—	—	Dec 2024	—
120	Gemini 2.0 Flash Thinking Experimental (Jan '25)	Google	24.1	AA Coding Index	—	—	Jan 2025	—
121	Gemini 2.0 Flash (experimental)	Google	24.1	AA Coding Index	—	—	Dec 2024	—
122	Gemini 1.5 Pro (Sep '24)	Google	23.6	AA Coding Index	—	—	Sep 2024	—
123	HyperNova 60B 2605	Multiverse Computing	23.2	AA Coding Index	—	$0.04	May 2026	—
124	Nova 2.0 Lite (high)	Amazon	23.0	AA Coding Index	—	$0.30	Oct 2025	—
125	DeepSeek V3	DeepSeek	23.0	AA Coding Index	131K tokens	$0.36	Dec 2024	✅
126	Qwen3.5 4B (Reasoning)	Alibaba	22.6	AA Coding Index	—	$0.03	Mar 2026	—
127	Qwen: Qwen3 235B A22B Instruct 2507	Alibaba	22.1	AA Coding Index	262K tokens	$0.70	Jul 2025	✅
128	Qwen: Qwen3 235B A22B Thinking 2507	Alibaba	22.1	AA Coding Index	131K tokens	$0.15	—	✅
129	GPT-4 Turbo	OpenAI	21.5	AA Coding Index	128K tokens	$10.00	Nov 2023	—
130	Magistral Medium 1.2	Mistral AI	21.3	AA Coding Index	—	$2.00	Sep 2025	—
131	DeepSeek V3 0324	DeepSeek	21.2	AA Coding Index	—	$1.14	Mar 2025	—
132	K2 Think V2	MBZUAI Institute of Foundation Models	21.0	AA Coding Index	—	—	Dec 2025	—
133	gpt-oss-20b	OpenAI	20.7	AA Coding Index	131K tokens	$0.05	Aug 2025	—
134	Mistral: Mistral Medium 3.1	Mistral AI	20.5	AA Coding Index	131K tokens	$0.40	Aug 2025	✅
135	Qwen3.5 4B (Non-reasoning)	Alibaba	20.3	AA Coding Index	—	$0.03	Mar 2026	—
136	GPT-4.1 Mini	OpenAI	20.2	AA Coding Index	1.0M tokens	$0.40	Apr 2025	—
137	Mistral Large 3	Mistral	20.1	AA Coding Index	—	$4.00	Feb 2024	—
138	Gemini 1.5 Pro (May '24)	Google	19.8	AA Coding Index	—	—	May 2024	—
139	DiffusionGemma 26B A4B	Google	19.7	AA Coding Index	—	—	Jun 2026	—
140	Claude 3 Opus	Anthropic	19.5	AA Coding Index	—	$15.00	Mar 2024	—
141	Gemini 1.0 Ultra	Google	17.6	AA Coding Index	—	—	Dec 2023	—
142	Qwen3 Next 80B A3B (Reasoning)	Alibaba	17.4	AA Coding Index	—	$0.50	Sep 2025	—
143	o3 Mini High	OpenAI	16.3	AA Coding Index	200K tokens	$1.10	Jan 2025	—
144	Llama 4 Maverick	Meta	16.3	AA Coding Index	1.0M tokens	$0.35	Apr 2025	✅
145	Solar Pro 3	Upstage	16.2	AA Coding Index	128K tokens	—	Apr 2026	—
146	Claude 3.5 Haiku	Anthropic	15.9	AA Coding Index	200K tokens	$0.80	Oct 2024	—
147	GPT-5 Mini	OpenAI	15.6	AA Coding Index	400K tokens	$0.25	Aug 2025	—
148	GPT-5 mini (minimal)	OpenAI	15.6	AA Coding Index	—	$0.25	Aug 2025	—
149	Qwen3 32B (Reasoning)	Alibaba	15.3	AA Coding Index	—	$0.70	Apr 2025	—
150	Magistral Small 1.2	Mistral	14.7	AA Coding Index	—	$0.50	Sep 2025	—
151	NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)	NVIDIA	14.4	AA Coding Index	—	$0.05	Dec 2025	—
152	Ministral 3 14B	Mistral	14.4	AA Coding Index	—	$0.20	Dec 2025	—
153	Claude 2.1	Anthropic	14.0	AA Coding Index	—	—	Nov 2023	—
154	Qwen3 14B (Reasoning)	Alibaba	13.8	AA Coding Index	—	$0.35	Apr 2025	—
155	OpenAI: GPT-4	OpenAI	13.1	AA Coding Index	8K tokens	$30.00	Mar 2023	—
156	Claude 2.0	Anthropic	12.9	AA Coding Index	—	—	Jul 2023	—
157	Mistral Small 3.2	Mistral	12.5	AA Coding Index	—	$0.10	Jun 2025	—
158	Qwen3 30B A3B 2507 (Reasoning)	Alibaba	12.1	AA Coding Index	—	$0.20	Jul 2025	—
159	Llama 3.3 70B Instruct	Meta	11.9	AA Coding Index	131K tokens	$0.58	Dec 2024	✅
160	OpenAI: GPT-4o-mini	OpenAI	11.4	AA Coding Index	128K tokens	$0.15	Jul 2024	—
161	GPT-4.1 Nano	OpenAI	11.1	AA Coding Index	1.0M tokens	$0.10	Apr 2025	—
162	GPT-3.5 Turbo	OpenAI	10.7	AA Coding Index	—	$0.50	Nov 2022	—
163	Granite 4.1 30B	IBM	10.4	AA Coding Index	—	—	Apr 2026	—
164	Gemma 3 27B	Google	10.1	AA Coding Index	131K tokens	—	Mar 2025	—
165	Ministral 3 8B	Mistral	9.7	AA Coding Index	—	$0.15	Dec 2025	—
166	Nanbeige4.1-3B	Nanbeige	9.6	AA Coding Index	—	—	Feb 2026	—
167	Granite 4.1 8B	IBM	9.5	AA Coding Index	—	$0.05	Apr 2026	—
168	Qwen3 8B (Reasoning)	Alibaba	9.0	AA Coding Index	—	$0.18	Apr 2025	—
169	Llama 4 Scout	Meta	8.2	AA Coding Index	10.0M tokens	$0.17	Apr 2025	✅
170	NVIDIA Nemotron 3 Nano 4B	NVIDIA	8.0	AA Coding Index	—	—	Mar 2026	—
171	Claude Instant	Anthropic	7.8	AA Coding Index	—	—	Mar 2023	—
172	Gemma 3 12B	Google	5.8	AA Coding Index	131K tokens	—	Mar 2025	—
173	Llama 3.1 8B Instruct	Meta	5.4	AA Coding Index	16K tokens	$0.10	Jul 2024	✅
174	Ministral 3 3B	Mistral	4.8	AA Coding Index	—	$0.10	Dec 2025	—
175	Granite 4.1 3B	IBM	4.7	AA Coding Index	—	—	Apr 2026	—
176	PALM-2	Google	4.6	AA Coding Index	—	—	May 2023	—
177	Phi-4 Mini Instruct	Microsoft	3.8	AA Coding Index	—	—	Feb 2024	—
178	Gemma 3n E4B Instruct	Google	3.2	AA Coding Index	—	$0.02	Jun 2025	—
179	Qwen3.5 2B (Reasoning)	Alibaba	2.9	AA Coding Index	—	—	Mar 2026	—
180	Gemma 3 4B	Google	2.7	AA Coding Index	131K tokens	—	Mar 2025	—
181	Qwen3.5 0.8B (Non-reasoning)	Alibaba	1.2	AA Coding Index	—	—	Mar 2026	—
182	MiniCPM-V 4.6 1.3B	OpenBMB	0.7	AA Coding Index	—	—	May 2026	—
183	Qwen3.5 0.8B (Reasoning)	Alibaba	0.0	AA Coding Index	—	—	Mar 2026	—
184	Gemini 3 Pro Preview (high)	Google	92.0	LiveCodeBench	—	$2.00	Nov 2025	—
185	Gemini 3 Flash Preview (Reasoning)	Google	91.0	LiveCodeBench	—	$0.50	Dec 2025	—
186	DeepSeek V3.2 Speciale	DeepSeek	90.0	LiveCodeBench	164K tokens	—	Dec 2025	✅
187	GPT-5.2	OpenAI	89.0	LiveCodeBench	400K tokens	$1.75	Dec 2025	—
188	Claude Opus 4.5 (Reasoning)	Anthropic	87.0	LiveCodeBench	—	$5.00	Nov 2025	—
189	Gemini 3 Pro Preview (low)	Google	86.0	LiveCodeBench	—	$2.00	Nov 2025	—
190	o4 Mini	OpenAI	86.0	LiveCodeBench	200K tokens	$1.10	Apr 2025	—
191	o4 Mini High	OpenAI	85.9	LiveCodeBench	200K tokens	$1.10	Apr 2025	—
192	GPT-5.1-Codex-Max	OpenAI	84.9	LiveCodeBench	400K tokens	$1.25	Dec 2025	—
193	GPT-5.1-Codex-Mini	OpenAI	84.0	LiveCodeBench	400K tokens	$0.25	Nov 2025	—
194	GPT-5 Codex	OpenAI	84.0	LiveCodeBench	400K tokens	$1.25	Sep 2025	—
195	Grok 4 Fast	xAI	83.0	LiveCodeBench	2.0M tokens	$0.20	Sep 2025	—
196	MiniMax-M2	MiniMax	83.0	LiveCodeBench	205K tokens	$0.30	Oct 2025	—
197	Grok 4	xAI	82.0	LiveCodeBench	256K tokens	$5.50	Jul 2025	—
198	Grok 4.1 Fast	xAI	82.0	LiveCodeBench	2.0M tokens	—	Nov 2025	—
199	MiniMax: MiniMax M2.1	MiniMax	81.0	LiveCodeBench	197K tokens	$0.30	Dec 2025	✅
200	o3	OpenAI	81.0	LiveCodeBench	200K tokens	$2.00	Apr 2025	—
201	ERNIE 5.0 Thinking Preview	Baidu	81.0	LiveCodeBench	—	—	Nov 2025	—
202	Apriel-v1.6-15B-Thinker	ServiceNow	81.0	LiveCodeBench	—	—	Nov 2025	—
203	o3 Pro	OpenAI	80.8	LiveCodeBench	200K tokens	$20.00	Jun 2025	—
204	Gemini 3 Flash Preview (Non-reasoning)	Google	80.0	LiveCodeBench	—	$0.50	Dec 2025	—
205	GPT-5 Nano	OpenAI	79.0	LiveCodeBench	400K tokens	$0.05	Aug 2025	—
206	INTELLECT-3	Prime Intellect	78.0	LiveCodeBench	131K tokens	—	Nov 2025	✅
207	DeepSeek V3.1	DeepSeek	78.0	LiveCodeBench	164K tokens	$0.59	Aug 2025	✅
208	Doubao Seed Code	ByteDance Seed	77.0	LiveCodeBench	—	—	Nov 2025	—
209	Seed-OSS-36B-Instruct	ByteDance Seed	77.0	LiveCodeBench	—	$0.21	Aug 2025	—
210	K-EXAONE (Reasoning)	LG AI	77.0	LiveCodeBench	—	—	Dec 2025	—
211	GPT-5.2 Pro	OpenAI	76.5	LiveBench Coding	400K tokens	$21.00	Dec 2025	—
212	Claude Sonnet 4.5	Anthropic	76.1	LiveBench Coding	1.0M tokens	$3.00	Sep 2025	—
213	EXAONE 4.0 32B (Reasoning)	LG AI Research	75.0	LiveCodeBench	—	—	Jul 2025	—
214	Gemini 3.5	Google	74.6	LiveBench Coding	—	—	—	—
215	Claude Opus 4.5	Anthropic	74.0	LiveCodeBench	200K tokens	$5.00	Nov 2025	—
216	GLM-4.5 (Reasoning)	Z.ai	74.0	LiveCodeBench	131K tokens	—	Jul 2025	—
217	Llama Nemotron Super 49B v1.5 (Reasoning)	NVIDIA	74.0	LiveCodeBench	—	$0.40	Jul 2025	—
218	Qwen3 VL 32B (Reasoning)	Alibaba	74.0	LiveCodeBench	—	$0.70	Oct 2025	—
219	Apriel-v1.5-15B-Thinker	ServiceNow	73.0	LiveCodeBench	—	—	Sep 2025	—
220	o3 Mini	OpenAI	72.0	LiveCodeBench	200K tokens	$1.10	Jan 2025	—
221	Falcon-H1R-7B	TII UAE	72.0	LiveCodeBench	—	—	Jan 2026	—
222	NVIDIA Nemotron Nano 9B V2 (Reasoning)	NVIDIA	72.0	LiveCodeBench	—	$0.04	Aug 2025	—
223	Gemini 2.5 Flash Preview (Sep '25) (Reasoning)	Google	71.0	LiveCodeBench	—	—	Sep 2025	—
224	MiniMax M1 80k	MiniMax	71.0	LiveCodeBench	—	$0.55	Jun 2025	—
225	Grok 3 Mini	xAI	70.0	LiveCodeBench	131K tokens	$0.30	Feb 2025	—
226	Gemini 2.5 Flash Preview (Reasoning)	Google	70.0	LiveCodeBench	—	$0.30	May 2025	—
227	Olmo 3.1 32B Think	Allen Institute for AI	70.0	LiveCodeBench	—	—	Dec 2025	—
228	Qwen3 VL 30B A3B (Reasoning)	Alibaba	70.0	LiveCodeBench	—	$0.20	Oct 2025	—
229	NVIDIA Nemotron Nano 9B V2 (Non-reasoning)	NVIDIA	70.0	LiveCodeBench	131K tokens	$0.05	Aug 2025	—
230	Cogito v2.1 (Reasoning)	Deep Cogito	69.0	LiveCodeBench	—	$1.25	Nov 2025	—
231	K2-V2 (high)	MBZUAI Institute of Foundation Models	69.0	LiveCodeBench	—	—	Dec 2025	—
232	Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)	Google	69.0	LiveCodeBench	—	$0.10	Sep 2025	—
233	NVIDIA Nemotron Nano 12B v2 VL (Reasoning)	NVIDIA	69.0	LiveCodeBench	—	$0.20	Oct 2025	—
234	Hermes 4 - Llama-3.1 405B (Reasoning)	Nous Research	69.0	LiveCodeBench	—	$1.00	Aug 2025	—
235	Ling-1T	InclusionAI	68.0	LiveCodeBench	—	—	Oct 2025	—
236	Qwen3 Omni 30B A3B (Reasoning)	Alibaba	68.0	LiveCodeBench	—	$0.25	Sep 2025	—
237	Qwen: Qwen3 Next 80B A3B Instruct	Alibaba	68.0	LiveCodeBench	262K tokens	$0.50	Sep 2025	✅
238	GLM-4.5-Air	Z.ai	68.0	LiveCodeBench	—	$0.17	Jul 2025	—
239	Olmo 3 32B Think	AllenAI	67.0	LiveCodeBench	66K tokens	—	Nov 2025	✅
240	MiniMax M1 40k	MiniMax	66.0	LiveCodeBench	—	—	Jun 2025	—
241	Nova 2.0 Omni (medium)	Amazon	66.0	LiveCodeBench	—	$0.30	Nov 2025	—
242	Grok Code Fast 1	xAI	66.0	LiveCodeBench	256K tokens	—	Aug 2025	—
243	Mi:dm K 2.5 Pro	Korea Telecom	66.0	LiveCodeBench	—	—	Dec 2025	—
244	Claude 4.1 Opus (Non-reasoning)	Anthropic	65.4	LiveCodeBench	—	$15.00	Aug 2025	—
245	xAI: Grok Build 0.1	xAI	65.4	LiveBench Coding	256K tokens	$1.00	May 2026	—
246	Claude 4.1 Opus (Reasoning)	Anthropic	65.0	LiveCodeBench	—	$15.00	Aug 2025	—
247	Qwen3 VL 235B A22B (Reasoning)	Alibaba	65.0	LiveCodeBench	—	$0.70	Sep 2025	—
248	Qwen3 Max (Preview)	Alibaba	65.0	LiveCodeBench	—	$1.20	Sep 2025	—
249	Hermes 4 - Llama-3.1 70B (Reasoning)	Nous Research	65.0	LiveCodeBench	—	$0.13	Aug 2025	—
250	Motif-2-12.7B-Reasoning	Motif Technologies	65.0	LiveCodeBench	—	—	Dec 2025	—
251	Claude 4 Opus (Reasoning)	Anthropic	64.0	LiveCodeBench	—	$15.00	May 2025	—
252	Ring-1T	InclusionAI	64.0	LiveCodeBench	—	—	Oct 2025	—
253	Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)	NVIDIA	64.0	LiveCodeBench	—	$0.60	Apr 2025	—
254	Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)	Google	64.0	LiveCodeBench	—	$0.10	Sep 2025	—
255	Qwen3 4B 2507 (Reasoning)	Alibaba	64.0	LiveCodeBench	—	—	Aug 2025	—
256	QwQ 32B	Alibaba	63.0	LiveCodeBench	—	$0.66	Mar 2025	—
257	HyperCLOVA X SEED Think (32B)	Naver	63.0	LiveCodeBench	—	—	Dec 2025	—
258	Ring-flash-2.0	InclusionAI	63.0	LiveCodeBench	—	$0.14	Sep 2025	—
259	Qwen3 235B A22B (Reasoning)	Alibaba	62.0	LiveCodeBench	—	$0.70	Apr 2025	—
260	Solar Pro 2 (Non-reasoning)	Upstage	62.0	LiveCodeBench	—	—	Jul 2025	—
261	Olmo 3 7B Think	Allen Institute for AI	62.0	LiveCodeBench	—	—	Nov 2025	—
262	MoonshotAI: Kimi K2 0905	MoonshotAI	61.0	LiveCodeBench	262K tokens	$0.60	Sep 2025	✅
263	GLM-4.5V (Reasoning)	Z.ai	60.0	LiveCodeBench	—	$0.60	Aug 2025	—
264	Qwen: Qwen3 VL 235B A22B Instruct	Alibaba	59.0	LiveCodeBench	262K tokens	$0.70	Sep 2025	✅
265	Qwen3 Coder 480B A35B Instruct	Alibaba	59.0	LiveCodeBench	—	$1.50	Jul 2025	—
266	Nova 2.0 Omni (low)	Amazon	59.0	LiveCodeBench	—	$0.30	Nov 2025	—
267	Ling-flash-2.0	InclusionAI	59.0	LiveCodeBench	—	$0.14	Sep 2025	—
268	Gemini 2.5 Flash Lite	Google	59.0	LiveCodeBench	1.0M tokens	$0.10	Jun 2025	—
269	o1-mini	OpenAI	58.0	LiveCodeBench	—	—	Sep 2024	—
270	Mi:dm K 2.5 Pro Preview	Korea Telecom	58.0	LiveCodeBench	—	—	Dec 2025	—
271	GPT-5 (minimal)	OpenAI	56.0	LiveCodeBench	—	$1.25	Aug 2025	—
272	Kimi K2	Moonshot AI	56.0	LiveCodeBench	131K tokens	$0.58	Jul 2025	—
273	DeepSeek V3.2 Exp (Non-reasoning)	DeepSeek	55.0	LiveCodeBench	—	$0.28	Sep 2025	—
274	Hermes 4 - Llama-3.1 405B (Non-reasoning)	Nous Research	55.0	LiveCodeBench	—	$1.00	Aug 2025	—
275	GPT-5.2-Codex	OpenAI	55.0	SciCode	400K tokens	$1.75	Dec 2025	—
276	Claude Opus 4	Anthropic	54.0	LiveCodeBench	200K tokens	$15.00	May 2025	—
277	Qwen3 Max Thinking (Preview)	Alibaba	54.0	LiveCodeBench	—	$1.20	Nov 2025	—
278	K2-V2 (medium)	MBZUAI Institute of Foundation Models	54.0	LiveCodeBench	—	—	Dec 2025	—
279	Magistral Medium 1	Mistral	53.0	LiveCodeBench	—	—	Jun 2025	—
280	GPT-5.3-Codex	OpenAI	53.0	SciCode	400K tokens	$1.75	Feb 2026	—
281	Qwen3 30B A3B 2507 Instruct	Alibaba	52.0	LiveCodeBench	—	$0.20	Jul 2025	—
282	Exaone 4.0 1.2B (Non-reasoning)	LG AI Research	52.0	LiveCodeBench	—	—	Jul 2025	—
283	Claude Opus 4.6 (Adaptive Reasoning, Max Effort)	Anthropic	52.0	SciCode	—	$5.00	Feb 2026	—
284	Claude Haiku 4.5	Anthropic	51.0	LiveCodeBench	200K tokens	$1.00	Oct 2025	—
285	Qwen: Qwen3 VL 32B Instruct	Alibaba	51.0	LiveCodeBench	131K tokens	$0.70	Oct 2025	✅
286	Qwen3 30B A3B (Reasoning)	Alibaba	51.0	LiveCodeBench	—	$0.20	Apr 2025	—
287	Magistral Small 1	Mistral	51.0	LiveCodeBench	—	—	Jun 2025	—
288	DeepSeek R1 0528 Qwen3 8B	DeepSeek	51.0	LiveCodeBench	—	—	May 2025	—
289	Qwen: Qwen3 30B A3B Thinking 2507	Alibaba	50.6	LiveCodeBench	131K tokens	$0.08	—	✅
290	Gemini 2.5 Flash	Google	50.0	LiveCodeBench	1.0M tokens	$0.30	May 2025	—
291	Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)	NVIDIA	49.0	LiveCodeBench	—	—	May 2025	—
292	Qwen: Qwen3 VL 30B A3B Instruct	Alibaba	48.0	LiveCodeBench	131K tokens	$0.20	Oct 2025	✅
293	Baidu: ERNIE 4.5 300B A47B	Baidu	47.0	LiveCodeBench	123K tokens	$0.28	Jun 2025	✅
294	GPT-5 nano (minimal)	OpenAI	47.0	LiveCodeBench	—	$0.05	Aug 2025	—
295	EXAONE 4.0 32B (Non-reasoning)	LG AI Research	47.0	LiveCodeBench	—	—	Jul 2025	—
296	Qwen3 4B (Reasoning)	Alibaba	47.0	LiveCodeBench	—	—	Apr 2025	—
297	Qwen3.6 Max Preview	Alibaba	47.0	SciCode	—	$1.30	Apr 2026	—
298	Claude Sonnet 4.6	Anthropic	47.0	SciCode	1.0M tokens	$3.00	Feb 2026	—
299	GPT-4.1	OpenAI	46.0	LiveCodeBench	1.0M tokens	$2.00	Apr 2025	—
300	Solar Pro 2 (Preview) (Reasoning)	Upstage	46.0	LiveCodeBench	—	—	May 2025	—
301	Grok 4.20	xAI	46.0	SciCode	2.0M tokens	$2.00	Apr 2026	—
302	GLM-5 (Reasoning)	Z.ai	46.0	SciCode	203K tokens	$1.00	Feb 2026	—
303	Claude Opus 4.6	Anthropic	46.0	SciCode	1.0M tokens	$5.00	Feb 2026	—
304	Claude Sonnet 4	Anthropic	45.0	LiveCodeBench	1.0M tokens	$3.00	May 2025	—
305	Grok 4.20 0309 (Reasoning)	xAI	45.0	SciCode	—	$2.00	Mar 2026	—
306	Reka Flash 3	Reka Flash 3	44.0	LiveCodeBench	66K tokens	$0.20	Oct 2024	✅
307	GLM 5V Turbo (Reasoning)	Z.ai	44.0	SciCode	203K tokens	—	Apr 2026	—
308	GLM-5-Turbo	Z.ai	44.0	SciCode	203K tokens	—	Mar 2026	—
309	Claude Sonnet 4.6 (Non-reasoning, Low Effort)	Anthropic	44.0	SciCode	—	$3.00	Feb 2026	—
310	Grok 3	xAI	43.0	LiveCodeBench	131K tokens	$4.00	Feb 2025	—
311	Ling-mini-2.0	InclusionAI	43.0	LiveCodeBench	—	—	Sep 2025	—
312	Xiaomi: MiMo-V2-Pro	Xiaomi	43.0	SciCode	1.0M tokens	—	Mar 2026	—
313	MiniMax: MiniMax M2.5	MiniMax	43.0	SciCode	197K tokens	$0.30	Feb 2026	✅
314	Qwen3 Omni 30B A3B Instruct	Alibaba	42.0	LiveCodeBench	—	$0.25	Sep 2025	—
315	GLM-4.6V (Non-reasoning)	Z.ai	41.0	LiveCodeBench	—	$0.30	Dec 2025	—
316	Gemini 2.5 Flash Preview (Non-reasoning)	Google	41.0	LiveCodeBench	—	—	Apr 2025	—
317	Qwen3.5 Omni Plus	Alibaba	41.0	SciCode	—	$0.40	Mar 2026	—
318	Mistral: Mistral Medium 3	Mistral AI	40.0	LiveCodeBench	131K tokens	$0.40	May 2025	✅
319	Qwen: Qwen3 Coder 30B A3B Instruct	Alibaba	40.0	LiveCodeBench	160K tokens	$0.45	Jul 2025	✅
320	MiMo-V2-Omni-0327	Xiaomi	40.0	SciCode	—	—	Mar 2026	—
321	Step 3.5 Flash	StepFun	40.0	SciCode	—	$0.10	Feb 2026	—
322	Solar Pro 2 (Preview) (Non-reasoning)	Upstage	39.0	LiveCodeBench	—	—	May 2025	—
323	Step 3.5 Flash	StepFun	39.0	SciCode	262K tokens	$0.10	Apr 2026	✅
324	Inception: Mercury 2	Inception	39.0	SciCode	128K tokens	$0.25	Feb 2026	—
325	DeepSeek R1 Distill Qwen 14B	DeepSeek	38.0	LiveCodeBench	—	—	Jan 2025	—
326	Kimi Linear 48B A3B Instruct	Kimi	38.0	LiveCodeBench	—	—	Oct 2025	—
327	Qwen3 4B 2507 Instruct	Alibaba	38.0	LiveCodeBench	—	—	Aug 2025	—
328	Gemma 4 12B (Reasoning)	Google	38.0	SciCode	—	$0.10	Jun 2026	—
329	GLM-5 (Non-reasoning)	Z.ai	38.0	SciCode	—	$1.00	Feb 2026	—
330	Ling-2.6-1T	Inclusion AI	37.0	SciCode	—	$0.30	Apr 2026	—
331	Xiaomi: MiMo-V2-Omni	Xiaomi	37.0	SciCode	262K tokens	—	Mar 2026	—
332	Qwen2.5 Max	Alibaba	36.0	LiveCodeBench	—	—	Jan 2025	—
333	NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)	NVIDIA	36.0	LiveCodeBench	262K tokens	$0.05	Dec 2025	—
334	GLM-5.1 (Non-reasoning)	Z.ai	36.0	SciCode	—	$1.40	Apr 2026	—
335	Trinity Large Thinking	Arcee AI	36.0	SciCode	—	$0.23	Apr 2026	—
336	Qwen3 VL 8B (Reasoning)	Alibaba	35.0	LiveCodeBench	—	$0.18	Oct 2025	—
337	GLM-4.5V (Non-reasoning)	Z.ai	35.0	LiveCodeBench	—	$0.60	Aug 2025	—
338	NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)	NVIDIA	35.0	LiveCodeBench	—	$0.20	Oct 2025	—
339	Nemotron Cascade 2 30B A3B	NVIDIA	35.0	SciCode	—	—	Mar 2026	—
340	Mistral: Devstral Medium	Mistral AI	34.0	LiveCodeBench	131K tokens	$0.40	Jul 2025	✅
341	QwQ 32B-Preview	Alibaba	34.0	LiveCodeBench	—	—	Nov 2024	—
342	GLM-4.7-Flash (Reasoning)	Z.ai	34.0	SciCode	—	$0.07	Jan 2026	—
343	Qwen: Qwen3 VL 8B Instruct	Alibaba	33.0	LiveCodeBench	131K tokens	$0.18	Oct 2025	✅
344	GPT-4o (ChatGPT)	OpenAI	33.0	SciCode	—	—	Feb 2025	—
345	Amazon: Nova Premier 1.0	Amazon	32.0	LiveCodeBench	1.0M tokens	$2.50	Apr 2025	—
346	Qwen: Qwen3 30B A3B Instruct 2507	Alibaba	32.0	LiveCodeBench	262K tokens	$0.20	Apr 2025	✅
347	Qwen3 VL 4B (Reasoning)	Alibaba	32.0	LiveCodeBench	—	—	Oct 2025	—
348	Llama 3.1 Instruct 405B	Meta	31.0	LiveCodeBench	—	$2.75	Jul 2024	—
349	Nova 2.0 Omni (Non-reasoning)	Amazon	31.0	LiveCodeBench	—	$0.30	Nov 2025	—
350	Qwen3 1.7B (Reasoning)	Alibaba	31.0	LiveCodeBench	—	—	Apr 2025	—
351	Step3 VL 10B	StepFun	31.0	SciCode	—	—	Jan 2026	—
352	Qwen2.5 Coder 32B Instruct	Alibaba	30.0	LiveCodeBench	33K tokens	—	Nov 2024	✅
353	Sonar	Perplexity	30.0	LiveCodeBench	127K tokens	—	Jan 2025	—
354	Sarvam M (Reasoning)	Sarvam	30.0	LiveCodeBench	—	—	May 2025	—
355	Llama 3.1 Tulu3 405B	Allen Institute for AI	29.0	LiveCodeBench	—	—	Jan 2025	—
356	Mistral Large 2 (Nov '24)	Mistral	29.0	LiveCodeBench	—	$2.00	Nov 2024	—
357	Qwen3 32B (Non-reasoning)	Alibaba	29.0	LiveCodeBench	—	$0.70	Apr 2025	—
358	Llama Nemotron Super 49B v1.5 (Non-reasoning)	NVIDIA	29.0	LiveCodeBench	—	$0.40	Jul 2025	—
359	Qwen3 VL 4B Instruct	Alibaba	29.0	LiveCodeBench	—	—	Oct 2025	—
360	JT-35B-Flash	China Mobile	29.0	SciCode	—	—	May 2026	—
361	Llama 3.3 Nemotron Super 49B v1 (Reasoning)	NVIDIA	28.0	LiveCodeBench	—	—	Mar 2025	—
362	Qwen3 14B (Non-reasoning)	Alibaba	28.0	LiveCodeBench	—	$0.35	Apr 2025	—
363	Qwen2.5 72B Instruct	Alibaba	28.0	LiveCodeBench	33K tokens	$0.47	Sep 2024	✅
364	Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)	NVIDIA	28.0	LiveCodeBench	—	—	Mar 2025	—
365	Sonar Reasoning Pro	Perplexity	28.0	LiveCodeBench	128K tokens	—	Jan 2025	—
366	Nemotron 3 Nano Omni 30B A3B Reasoning	NVIDIA	28.0	SciCode	—	$0.07	Apr 2026	—
367	EXAONE 4.5 33B	LG AI	28.0	SciCode	—	—	Apr 2026	—
368	LongCat Flash Lite	LongCat	28.0	SciCode	—	—	Jan 2026	—
369	DeepSeek: R1 Distill Qwen 32B	DeepSeek	27.0	LiveCodeBench	128K tokens	—	Jan 2025	✅
370	R1 Distill Llama 70B	DeepSeek	27.0	LiveCodeBench	128K tokens	$0.70	Jan 2025	✅
371	Hermes 4 - Llama-3.1 70B (Non-reasoning)	Nous Research	27.0	LiveCodeBench	—	$0.13	Aug 2025	—
372	Grok 2 (Dec '24)	xAI	27.0	LiveCodeBench	—	—	Dec 2024	—
373	Gemini 1.5 Flash (Sep '24)	Google	27.0	LiveCodeBench	—	—	Sep 2024	—
374	Mistral Large 2 (Jul '24)	Mistral	27.0	LiveCodeBench	131K tokens	$2.00	Jul 2024	—
375	Olmo 3 7B Instruct	Allen Institute for AI	27.0	LiveCodeBench	—	$0.10	Nov 2025	—
376	JT-MINI	China Mobile	27.0	SciCode	—	—	Apr 2026	—
377	Solar Open 100B (Reasoning)	Upstage	27.0	SciCode	—	—	Dec 2025	—
378	Mistral: Pixtral Large 2411	Mistral AI	26.0	LiveCodeBench	131K tokens	$2.00	Nov 2024	—
379	Devstral Small (May '25)	Mistral	26.0	LiveCodeBench	—	—	May 2025	—
380	Qwen3.5 Omni Flash	Alibaba	26.0	SciCode	—	$0.10	Mar 2026	—
381	Sarvam 105B (high)	Sarvam	26.0	SciCode	—	$0.04	Mar 2026	—
382	Devstral Small (Jul '25)	Mistral	25.0	LiveCodeBench	131K tokens	$0.10	Jul 2025	—
383	Mistral Small 3	Mistral	25.0	LiveCodeBench	—	$0.10	Jan 2025	—
384	Qwen2.5 Instruct 32B	Alibaba	25.0	LiveCodeBench	—	—	Sep 2024	—
385	Granite 4.0 H Small	IBM	25.0	LiveCodeBench	—	$0.06	Sep 2025	—
386	Grok Beta	xAI	24.0	LiveCodeBench	—	—	Aug 2024	—
387	Gemma 4 E4B (Reasoning)	Google	24.0	SciCode	—	—	Apr 2026	—
388	Mistral: Saba	Mistral AI	24.0	SciCode	33K tokens	—	Feb 2025	✅
389	Llama 3.1 70B Instruct	Meta	23.0	LiveCodeBench	131K tokens	$0.56	Jul 2024	✅
390	Microsoft: Phi 4	Microsoft	23.0	LiveCodeBench	16K tokens	$0.13	Dec 2024	✅
391	Nova Pro	Amazon	23.0	LiveCodeBench	—	$0.80	Dec 2024	—
392	Qwen3 4B (Non-reasoning)	Alibaba	23.0	LiveCodeBench	—	—	Apr 2025	—
393	DeepSeek R1 Distill Llama 8B	DeepSeek	23.0	LiveCodeBench	—	—	Jan 2025	—
394	Gemini 1.5 Flash-8B	Google	22.0	LiveCodeBench	—	—	Oct 2024	—
395	Llama 3.2 Instruct 90B (Vision)	Meta	21.0	LiveCodeBench	—	$1.38	Sep 2024	—
396	Jamba Reasoning 3B	AI21 Labs	21.0	LiveCodeBench	—	—	Oct 2025	—
397	Gemma 4 E2B (Reasoning)	Google	21.0	SciCode	—	—	Apr 2026	—
398	DeepHermes 3 - Mistral 24B Preview (Non-reasoning)	Nous Research	20.0	LiveCodeBench	—	—	Mar 2025	—
399	Llama 3 70B Instruct	Meta	20.0	LiveCodeBench	8K tokens	$0.65	Apr 2024	✅
400	Gemini 1.5 Flash (May '24)	Google	20.0	LiveCodeBench	—	—	May 2024	—
401	Qwen3 8B (Non-reasoning)	Alibaba	20.0	LiveCodeBench	—	$0.18	Apr 2025	—
402	Gemma 4 E2B (Non-reasoning)	Google	20.0	SciCode	—	—	Apr 2026	—
403	Gemini 2.0 Flash-Lite (Feb '25)	Google	19.0	LiveCodeBench	—	—	Feb 2025	—
404	Hermes 3 - Llama-3.1 70B	Nous Research	19.0	LiveCodeBench	—	$0.70	Aug 2024	—
405	Sarvam 30B	Sarvam	19.0	SciCode	—	$0.03	Mar 2026	—
406	Gemini 2.0 Flash-Lite (Preview)	Google	18.0	LiveCodeBench	—	—	Feb 2025	—
407	Claude 3 Sonnet	Anthropic	18.0	LiveCodeBench	—	$3.00	Mar 2024	—
408	AI21: Jamba Large 1.7	AI21 Labs	18.0	LiveCodeBench	256K tokens	$2.00	Jul 2025	✅
409	Granite 4.0 Micro	IBM	18.0	LiveCodeBench	131K tokens	—	Sep 2025	✅
410	Tri-21B-think Preview	Trillion Labs	18.0	SciCode	—	—	Feb 2026	—
411	Mistral Large	Mistral AI	17.8	LiveCodeBench	128K tokens	$2.00	Feb 2024	✅
412	Llama 3.1 Nemotron 70B Instruct	NVIDIA	17.0	LiveCodeBench	131K tokens	$1.20	Oct 2024	✅
413	Jamba 1.6 Large	AI21 Labs	17.0	LiveCodeBench	—	$2.00	Mar 2025	—
414	Nova Lite	Amazon	17.0	LiveCodeBench	—	$0.06	Dec 2024	—
415	Tri-21B-Think	Trillion Labs	17.0	SciCode	—	—	Feb 2026	—
416	Olmo 3.1 32B Instruct	AllenAI	17.0	SciCode	66K tokens	—	Jan 2026	✅
417	GLM-4.6V (Reasoning)	Z.ai	16.0	LiveCodeBench	—	$0.30	Dec 2025	—
418	Qwen2 Instruct 72B	Alibaba	16.0	LiveCodeBench	—	—	Jun 2024	—
419	DeepSeek Coder V2 Lite Instruct	DeepSeek	16.0	LiveCodeBench	—	—	Jun 2024	—
420	Mixtral 8x22B Instruct	Mistral	15.0	LiveCodeBench	—	—	Apr 2024	—
421	Anthropic: Claude 3 Haiku	Anthropic	15.0	LiveCodeBench	200K tokens	$0.25	Mar 2024	—
422	LFM2 8B A1B	Liquid AI	15.0	LiveCodeBench	—	—	Oct 2025	—
423	Mistral: Mixtral 8x22B Instruct	Mistral AI	14.8	LiveCodeBench	66K tokens	$2.00	—	✅
424	Mistral Small (Sep '24)	Mistral	14.0	LiveCodeBench	—	$0.20	Sep 2024	—
425	Jamba 1.5 Large	AI21 Labs	14.0	LiveCodeBench	—	$2.00	Aug 2024	—
426	Gemma 3n E4B Instruct Preview (May '25)	Google	14.0	LiveCodeBench	—	—	May 2025	—
427	Nova Micro	Amazon	14.0	LiveCodeBench	—	$0.04	Dec 2024	—
428	Qwen2.5 Coder Instruct 7B	Alibaba	13.0	LiveCodeBench	—	—	Sep 2024	—
429	Phi-4 Multimodal Instruct	Microsoft	13.0	LiveCodeBench	—	—	Feb 2025	—
430	Granite 3.3 8B (Non-reasoning)	IBM	13.0	LiveCodeBench	—	$0.03	Apr 2025	—
431	Qwen3 1.7B (Non-reasoning)	Alibaba	13.0	LiveCodeBench	—	—	Apr 2025	—
432	Molmo2-8B	Allen Institute for AI	13.0	SciCode	—	—	Dec 2025	—
433	Cohere: Command R+ (08-2024)	Cohere	12.2	LiveCodeBench	128K tokens	$2.50	—	—
434	Command-R+ (Apr '24)	Cohere	12.0	LiveCodeBench	—	$3.00	Apr 2024	—
435	Gemini 1.0 Pro	Google	12.0	LiveCodeBench	—	—	Dec 2023	—
436	Phi-3 Mini Instruct 3.8B	Microsoft	12.0	LiveCodeBench	—	—	Apr 2024	—
437	Granite 4.0 H 1B	IBM	12.0	LiveCodeBench	—	—	Oct 2025	—
438	Qwen3 0.6B (Reasoning)	Alibaba	12.0	LiveCodeBench	—	—	Apr 2025	—
439	OpenChat 3.5 (1210)	OpenChat	12.0	LiveCodeBench	—	—	Dec 2023	—
440	Mistral Small (Feb '24)	Mistral	11.0	LiveCodeBench	—	$1.00	Feb 2024	—
441	Llama 3.2 11B Vision Instruct	Meta	11.0	LiveCodeBench	131K tokens	$0.34	Sep 2024	✅
442	LFM2-24B-A2B	LiquidAI	11.0	SciCode	33K tokens	—	Feb 2026	✅
443	Llama 3 8B Instruct	Meta	10.0	LiveCodeBench	8K tokens	$0.04	Apr 2024	✅
444	Mistral Medium	Mistral	10.0	LiveCodeBench	—	$2.75	Dec 2023	—
445	Llama 2 Chat 13B	Meta	10.0	LiveCodeBench	—	—	Jul 2023	—
446	LFM 40B	Liquid AI	10.0	LiveCodeBench	—	—	Sep 2024	—
447	Gemma 3n E2B Instruct	Google	10.0	LiveCodeBench	—	—	Jun 2025	—
448	Llama 2 Chat 70B	Meta	10.0	LiveCodeBench	—	—	Jul 2023	—
449	DBRX Instruct	Databricks	9.0	LiveCodeBench	—	—	Mar 2024	—
450	DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)	Nous Research	9.0	LiveCodeBench	—	—	Feb 2025	—
451	Llama 3.2 3B Instruct	Meta	8.0	LiveCodeBench	80K tokens	$0.15	Sep 2024	✅
452	LFM2 2.6B	Liquid AI	8.0	LiveCodeBench	—	—	Sep 2025	—
453	LFM2.5-8B-A1B	Liquid AI	8.0	SciCode	—	—	May 2026	—
454	Jamba 1.6 Mini	AI21 Labs	7.0	LiveCodeBench	—	$0.20	Mar 2025	—
455	OLMo 2 32B	Allen Institute for AI	7.0	LiveCodeBench	—	—	Mar 2025	—
456	DeepSeek R1 Distill Qwen 1.5B	DeepSeek	7.0	LiveCodeBench	—	—	Jan 2025	—
457	Qwen3 0.6B (Non-reasoning)	Alibaba	7.0	LiveCodeBench	—	—	Apr 2025	—
458	Mistral: Mixtral 8x7B Instruct	Mistral AI	7.0	LiveCodeBench	33K tokens	$0.45	Dec 2023	✅
459	Jamba 1.7 Mini	AI21 Labs	6.0	LiveCodeBench	—	—	Jul 2025	—
460	Jamba 1.5 Mini	AI21 Labs	6.0	LiveCodeBench	—	$0.20	Aug 2024	—
461	Apertus 70B Instruct	Swiss AI Initiative	6.0	SciCode	—	$0.82	Sep 2025	—
462	Granite 4.0 1B	IBM	5.0	LiveCodeBench	—	—	Oct 2025	—
463	Command-R (Mar '24)	Cohere	5.0	LiveCodeBench	—	$0.50	Mar 2024	—
464	Mistral 7B Instruct	Mistral	5.0	LiveCodeBench	—	$0.25	Sep 2023	—
465	OLMo 2 7B	Allen Institute for AI	4.0	LiveCodeBench	—	—	Nov 2024	—
466	Molmo 7B-D	Allen Institute for AI	4.0	LiveCodeBench	—	—	Sep 2024	—
467	MiniCPM5-1B (Non-reasoning)	OpenBMB	4.0	SciCode	—	—	May 2026	—
468	Gemma 4 E4B (Non-reasoning)	Google	4.0	SciCode	—	—	Apr 2026	—
469	Tiny Aya Global	Cohere	4.0	SciCode	—	—	Feb 2026	—
470	LFM2.5-1.2B-Thinking	Liquid AI	4.0	SciCode	—	—	Jan 2026	—
471	Apertus 8B Instruct	Swiss AI Initiative	4.0	SciCode	—	$0.10	Sep 2025	—
472	LFM2.5-VL-1.6B	Liquid AI	3.0	SciCode	—	—	Jan 2026	—
473	LFM2 1.2B	Liquid AI	2.0	LiveCodeBench	—	—	Jul 2025	—
474	Granite 4.0 H 350M	IBM	2.0	LiveCodeBench	—	—	Oct 2025	—
475	Llama 3.2 1B Instruct	Meta	2.0	LiveCodeBench	60K tokens	$0.10	Sep 2024	✅
476	Granite 4.0 350M	IBM	2.0	LiveCodeBench	—	—	Oct 2025	—
477	Gemma 3 1B Instruct	Google	2.0	LiveCodeBench	—	—	Mar 2025	—
478	LFM2.5-1.2B-Instruct	Liquid AI	2.0	SciCode	—	—	Jan 2026	—
479	Gemma 3 270M	Google	0.0	LiveCodeBench	—	—	Aug 2025	—
480	Llama 2 Chat 7B	Meta	0.0	LiveCodeBench	—	$0.05	Jul 2023	—

+ 180 models without coding benchmarks available.View all models

Complete Guide: AI for Programming in 2026

The State of AI for Code in 2026

Artificial intelligence has fundamentally changed software development. In 2026, large language models (LLMs) can generate working code in dozens of languages, fix bugs in production codebases, and build complete applications from plain-English descriptions. SWE-bench — the most rigorous coding benchmark — evaluates models on real software engineering tasks pulled from GitHub issues.

SWE-bench: The Gold Standard

SWE-bench (Software Engineering Benchmark) is widely considered the gold standard for evaluating LLM coding ability. Unlike academic benchmarks like HumanEval (which tests isolated functions), SWE-bench presents real issues from popular repositories such as Django, Flask, scikit-learn, and requests. The model must understand the project context, locate the relevant files, and generate a patch that resolves the bug — mirroring the actual workflow of a professional developer.

The “Verified” variant (SWE-bench Verified) is curated by human engineers to ensure every task has a clear, verifiable solution. Scores on this benchmark correlate strongly with real-world coding performance, making it the single most informative metric when choosing an AI coding assistant.

HumanEval and LiveCodeBench

HumanEval, created by OpenAI, tests a model's ability to generate Python functions from docstrings. It is simpler than SWE-bench but useful for gauging basic code fluency. LiveCodeBench raises the bar by using problems that are refreshed regularly, reducing the risk of data contamination — a concern when a model may have seen the answers during training.

How to Choose the Best AI Model for Code

The right model depends on your specific use case. For real-time code autocomplete (Cursor, Copilot), speed and latency matter more than peak benchmark scores — lighter mini and flash-class models often deliver the best speed-to-quality ratio. For full project generation or complex debugging, frontier models like Claude Fable 5, GPT-5.5 and Gemini 3.5-class systems are better suited, despite higher costs.

Teams with strict data control requirements (compliance, security) should consider open-source models like DeepSeek Coder, Code Llama, and StarCoder, which can be deployed on-premises with competitive performance. The trade-off between proprietary and open-source involves cost, latency, privacy, and quality considerations.

AI-Powered Coding Tools

The leading AI-assisted development tools in 2026 include Cursor, GitHub Copilot, Windsurf and agentic IDE workflows that let you swap between multiple frontier models. Each tool uses different models under the hood, and the quality of generated code depends directly on the LLM powering it.

Trends for 2026 and Beyond

The most significant trends in AI for code include autonomous software engineering agents (that solve complex tasks without supervision), automated test generation, intelligent refactoring, and native CI/CD pipeline integration. The frontier is shifting from “code assistant” to “autonomous engineer”, with models increasingly capable of navigating large codebases and making architectural decisions.

Frequently Asked Questions

What is the best AI for coding?

In 2026, the top models on coding benchmarks are GPT-5.6 Sol (xhigh), GPT-5.6 Sol (max), GPT-5.6 Sol (high). The best choice depends on your use case: code autocomplete, full project generation, debugging, or code review.

ChatGPT or Claude for code?

Today the strongest current options usually cluster around Claude Fable 5 / Opus 4.8, GPT-5.5 and Gemini 3.5-class models. Claude tends to shine in long-context refactors; GPT is strong in fast generation and tool use. Test with your own repository and workflow.

What is SWE-bench?

SWE-bench (Software Engineering Benchmark) evaluates how well models can resolve real issues from open-source GitHub repositories. It is considered the most realistic coding benchmark because it tests bug resolution in real projects, not academic exercises.

Which free LLMs are good for coding?

Open-source models like DeepSeek Coder, Qwen Coder, and Code Llama offer excellent coding performance with no API cost. You can run them locally with Ollama or access them for free on platforms like Together AI and Groq.

What coding benchmarks matter most?

SWE-bench Verified is the gold standard for real-world coding ability. HumanEval tests basic function generation, while LiveCodeBench uses regularly updated problems to reduce data contamination. For a complete picture, look at all three.

Code & Programming

Best LLMs for Code

Compare models optimized for programming, debugging, and code generation. Evaluated on HumanEval, SWE-Bench, and WebDev Arena.

Code Autocomplete

Smart suggestions as you type

Coming soon

Code Generation

Create complete functions from descriptions

Coming soon

Debug & Refactoring

Find bugs and improve code quality

Coming soon

Code Ranking

0 models

Compare Models Side by Side

Use our comparison tool to see detailed benchmarks between specific models.

Compare Models

Best AI for Code in 2026Updated by AA Coding Index

Use Cases

Code Autocomplete

Code Generation

Debug & Code Review

Coding Ranking — Top Models

Complete Guide: AI for Programming in 2026

The State of AI for Code in 2026

SWE-bench: The Gold Standard

HumanEval and LiveCodeBench

How to Choose the Best AI Model for Code

AI-Powered Coding Tools

Trends for 2026 and Beyond

Frequently Asked Questions

What is the best AI for coding?

ChatGPT or Claude for code?

What is SWE-bench?

Which free LLMs are good for coding?

What coding benchmarks matter most?

Explore Other Categories

Best LLMs for Code

Code Autocomplete

Code Generation

Debug & Refactoring

Code Ranking

Compare Models Side by Side

Best LLMs for Code

Code Autocomplete

Code Generation

Debug & Refactoring

Code Ranking

Compare Models Side by Side