back

Mistral Nemo 12B

Apache 2.0

Mistral AI · 12B · Dense

Multilingual 12B with 128K context

88.4K downloads 1.7K likes 2024-07 128K context

Use Cases

chat multilingual

Quantization Options

Quant Bits VRAM Quality Status
Q2_K 2 4.3 GB low
Q3_K_M 3 5.9 GB moderate
Q4_K_M 4 6.6 GB good
Q5_K_M 5 8.2 GB good
Q6_K 6 9.7 GB excellent
Q8_0 8 12.8 GB excellent
F16 16 25.1 GB lossless

About this model

Mistral NeMo is a 12B model built in collaboration with NVIDIA. Mistral NeMo offers a large context window of up to 128k tokens. Its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. As it relies on standard architecture, Mistral NeMo is easy to use and a drop-in replacement in any system using Mistral 7B.

nemo-base-performance.png

Reference

Blog

Hugging Face