DeepSeek V3.1

Name: DeepSeek V3.1
Author: DeepSeek

MIT

DeepSeek · 671B (37B active) · Mixture of Experts

Improved V3 with hybrid thinking and tool use

280.2K downloads 3.1K likes 2025-08 128K context

Use Cases

chat code reasoning

Total experts: 256

Active experts: 8

Active params: 37.0B

Quant	Bits	VRAM	Quality	Status
Q2_K	2	215.3 GB	low	—
Q3_K_M	3	301.2 GB	moderate	—
Q4_K_M	4	344.2 GB	good	—
Q5_K_M	5	430.1 GB	good	—
Q6_K	6	516.1 GB	excellent	—
Q8_0	8	687.9 GB	excellent	—
F16	16	1375.3 GB	lossless	—

DeepSeek-V3.1-Terminus update builds on V3.1’s strengths while addressing key user feedback:

Hybrid thinking mode: One model supports both thinking mode and non-thinking mode by changing the chat template.

Smarter tool calling: Through post-training optimization, the model’s performance in tool usage and agent tasks has significantly improved.

Higher thinking efficiency: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly.