DeepSeek • llm
Large language model (LLM) developed by DeepSeek. Intelligence Index: 33/100 on Artificial Analysis; US$0.275 per 1M input tokens.
| Metric | Value |
|---|---|
| Context Window | — |
| Input Price/1M | $0.28 |
| Output Price/1M | $0.41 |
| Parameters | — |
DeepSeek V3.2 Exp (Reasoning) results on the main AI model evaluation benchmarks. Higher scores indicate better performance.
| Category | Benchmark | Score | Maximum | Methodology |
|---|---|---|---|---|
| Agentic | Terminal-Bench Hard | 36.0 | 100.0 | — |
| Coding | LiveCodeBench | 86.0 | 100.0 | — |
| Coding | SciCode | 40.0 | 100.0 | — |
| Coding | AA Coding Index | 36.7 | 100.0 | — |
| Knowledge | MMLU-Pro | 86.0 | 100.0 | — |
| Long Context | AA-LCR | 69.0 | 100.0 | — |
| Math | AIME 2025 | 92.0 | 100.0 | — |
| Math | AA Math Index | 92.0 | 100.0 | — |
| Overall | AA Intelligence Index | 41.7 | 100.0 | — |
| Reasoning | GPQA Diamond | 84.0 | 100.0 | — |
| Reasoning | IFBench | 61.0 | 100.0 | — |
| Reasoning | HLE | 22.0 | 100.0 | — |
| Tool Use | Tau²-Bench | 91.0 | 100.0 | — |
DeepSeek V3.2 Exp (Reasoning) is a large language model (LLM) developed by DeepSeek. It focuses on text processing and natural language generation. It is a proprietary model, available via DeepSeek's cloud API.
DeepSeek V3.2 Exp (Reasoning) is billed by usage: $0.275 per 1M input tokens and $0.415 per 1M output tokens. For context, 1 million tokens is approximately 750,000 words, or about 10 average-length books. At this price it is one of the most cost-effective options on the market, which suits high-volume applications such as chatbots, bulk document analysis, and automation.
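To make the per-token pricing concrete, here is a minimal sketch that turns the two quoted prices into a per-request cost. The function names and the example token counts are illustrative assumptions, not part of any DeepSeek SDK. The words-per-token ratio is only the rule of thumb quoted above.

```python
# Prices quoted in the text, in USD per 1 million tokens.
INPUT_PRICE_PER_M = 0.275
OUTPUT_PRICE_PER_M = 0.415

# Rule of thumb from the text: 1M tokens is roughly 750,000 words.
WORDS_PER_TOKEN = 0.75


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def tokens_to_words(tokens: int) -> float:
    """Approximate word count for a token count, using the rule of thumb."""
    return tokens * WORDS_PER_TOKEN


if __name__ == "__main__":
    # Assumed example request: 2,000 prompt tokens and 1,000 generated tokens.
    cost = request_cost(input_tokens=2_000, output_tokens=1_000)
    print(f"Estimated cost per request: ${cost:.6f}")
    print(f"1M tokens is about {tokens_to_words(1_000_000):,.0f} words")
```

Actual token counts vary by tokenizer and language, and a reasoning model may bill its intermediate reasoning as output tokens, so treat the result as an estimate only.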
DeepSeek V3.2 Exp (Reasoning) was evaluated on 13 benchmarks across eight categories: Agentic, Coding, Knowledge, Long Context, Math, Overall, Reasoning, and Tool Use. Scores are strongest in math (AIME 2025: 92) and tool use (Tau²-Bench: 91) and weakest on HLE (22) and Terminal-Bench Hard (36).
It's important to note that benchmarks measure specific aspects and don't capture the full user experience. Factors like instruction adherence, behavior in long conversations, and real-world task quality vary significantly between models and aren't always reflected in standard scores.
DeepSeek V3.2 Exp (Reasoning) is suitable for a wide range of AI applications: high-volume chatbots and automated support, text generation, summarization, translation, and general assistance.
In the 2026 AI model ecosystem, DeepSeek V3.2 Exp (Reasoning) competes with similarly capable models from OpenAI, Anthropic, Google, and Meta. The choice between models depends on the specific use case, budget, latency requirements, and need for features like multimodality and tool calling.
For a detailed side-by-side comparison, use our comparison tool or check the overall model ranking.
DeepSeek V3.2 Exp (Reasoning) is a large language model (LLM) developed by DeepSeek.
DeepSeek V3.2 Exp (Reasoning) costs $0.275 per 1M input tokens and $0.415 per 1M output tokens. For heavy usage (e.g., a chatbot handling 100k messages/month), costs can range from roughly $10 to $1,000 per month depending on token volume.
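The spread in that monthly estimate comes mostly from how many tokens each message consumes. The sketch below works through three workloads at 100k messages/month. The scenario names and the tokens-per-message figures are assumptions chosen for illustration, not measured values.

```python
from dataclasses import dataclass

# Prices quoted in the text, in USD per 1 million tokens.
INPUT_PRICE_PER_M = 0.275
OUTPUT_PRICE_PER_M = 0.415


@dataclass(frozen=True)
class Workload:
    name: str
    messages_per_month: int
    input_tokens_per_message: int   # assumed average
    output_tokens_per_message: int  # assumed average


def monthly_cost(w: Workload) -> float:
    """Estimate the monthly USD cost of a workload (hypothetical helper)."""
    input_total = w.messages_per_month * w.input_tokens_per_message
    output_total = w.messages_per_month * w.output_tokens_per_message
    return (input_total * INPUT_PRICE_PER_M + output_total * OUTPUT_PRICE_PER_M) / 1_000_000


# Illustrative scenarios, all at 100k messages/month.
SCENARIOS = [
    Workload("short replies", 100_000, 200, 100),
    Workload("typical chat", 100_000, 500, 300),
    Workload("long context + reasoning", 100_000, 20_000, 8_000),
]

if __name__ == "__main__":
    for w in SCENARIOS:
        print(f"{w.name:<26} ${monthly_cost(w):>8.2f}/month")
```

Under these assumptions the three scenarios land near $10, $26, and $880 per month, which is consistent with the range quoted above.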
In available benchmarks, DeepSeek V3.2 Exp (Reasoning) scored 36/100 on Terminal-Bench Hard, 86/100 on LiveCodeBench, and 40/100 on SciCode. See the full table above for a detailed comparison.
DeepSeek V3.2 Exp (Reasoning) is not open source; it is a proprietary model from DeepSeek, available via cloud API. For open source alternatives, check our open source model ranking.
DeepSeek V3.2 Exp (Reasoning) excels at general-purpose language tasks.
Last updated: June 01, 2026