Mistral AI · 47B (12.9B active) · Mixture of Experts
MoE with 12.9B active params
772.5K downloads
4.6K likes
2023-12 32K context
Use Cases
chat code
Mixture of Experts
Total experts: 8
Active experts: 2
Active params: 12.9B
| Quant | Bits | VRAM | Quality | Status |
|---|---|---|---|---|
| Q2_K | 2 | 15.5 GB | low | — |
| Q3_K_M | 3 | 21.6 GB | moderate | — |
| Q4_K_M | 4 | 24.6 GB | good | — |
| Q5_K_M | 5 | 30.6 GB | good | — |
| Q6_K | 6 | 36.6 GB | excellent | — |
| Q8_0 | 8 | 48.6 GB | excellent | — |
| F16 | 16 | 96.8 GB | lossless | — |
About this model
The Mixtral large Language Models (LLM) are a set of pretrained generative Sparse Mixture of Experts.
Sizes
mixtral:8x22bmixtral:8x7b
Mixtral 8x22b
ollama run mixtral:8x22b
Mixtral 8x22B sets a new standard for performance and efficiency within the AI community. It is a sparse Mixture-of-Experts (SMoE) model that uses only 39B active parameters out of 141B, offering unparalleled cost efficiency for its size.
Mixtral 8x22B comes with the following strengths:
- It is fluent in English, French, Italian, German, and Spanish
- It has strong maths and coding capabilities
- It is natively capable of function calling
- 64K tokens context window allows precise information recall from large documents