NVIDIA · 9B · Dense
Hybrid Mamba2 architecture for reasoning
245.2K downloads
482 likes
2025-06 128K context
Use Cases
reasoning
| Quant | Bits | VRAM | Quality | Status |
|---|---|---|---|---|
| Q2_K | 2 | 3.4 GB | low | — |
| Q3_K_M | 3 | 4.5 GB | moderate | — |
| Q4_K_M | 4 | 5.1 GB | good | — |
| Q5_K_M | 5 | 6.3 GB | good | — |
| Q6_K | 6 | 7.4 GB | excellent | — |
| Q8_0 | 8 | 9.7 GB | excellent | — |
| F16 | 16 | 18.9 GB | lossless | — |