DeepSeek · 671B (37B active) · Mixture of Experts
Improved V3 with hybrid thinking and tool use
280.2K downloads
3.1K likes
2025-08 128K context
Use Cases
chat code reasoning
Mixture of Experts
Total experts: 256
Active experts: 8
Active params: 37.0B
| Quant | Bits | VRAM | Quality | Status |
|---|---|---|---|---|
| Q2_K | 2 | 215.3 GB | low | — |
| Q3_K_M | 3 | 301.2 GB | moderate | — |
| Q4_K_M | 4 | 344.2 GB | good | — |
| Q5_K_M | 5 | 430.1 GB | good | — |
| Q6_K | 6 | 516.1 GB | excellent | — |
| Q8_0 | 8 | 687.9 GB | excellent | — |
| F16 | 16 | 1375.3 GB | lossless | — |
About this model
DeepSeek-V3.1-Terminus update builds on V3.1’s strengths while addressing key user feedback:
- 🌐 Language consistency: fewer CN/EN mix-ups & no more random chars.
- 🤖 Agent upgrades: stronger Code Agent & Search Agent performance.
Hybrid thinking mode: One model supports both thinking mode and non-thinking mode by changing the chat template.
Smarter tool calling: Through post-training optimization, the model’s performance in tool usage and agent tasks has significantly improved.
Higher thinking efficiency: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly.