Mistral AI · 12B · Dense
Multilingual 12B with 128K context
88.4K downloads
1.7K likes
2024-07 128K context
Use Cases
chat multilingual
| Quant | Bits | VRAM | Quality | Status |
|---|---|---|---|---|
| Q2_K | 2 | 4.3 GB | low | — |
| Q3_K_M | 3 | 5.9 GB | moderate | — |
| Q4_K_M | 4 | 6.6 GB | good | — |
| Q5_K_M | 5 | 8.2 GB | good | — |
| Q6_K | 6 | 9.7 GB | excellent | — |
| Q8_0 | 8 | 12.8 GB | excellent | — |
| F16 | 16 | 25.1 GB | lossless | — |
About this model
Mistral NeMo is a 12B model built in collaboration with NVIDIA. Mistral NeMo offers a large context window of up to 128k tokens. Its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. As it relies on standard architecture, Mistral NeMo is easy to use and a drop-in replacement in any system using Mistral 7B.