Cohere · 35B · Dense
Optimized for retrieval-augmented generation
12.1K downloads
1.1K likes
2024-03 128K context
Use Cases
chat rag
| Quant | Bits | VRAM | Quality | Status |
|---|---|---|---|---|
| Q2_K | 2 | 11.7 GB | low | — |
| Q3_K_M | 3 | 16.2 GB | moderate | — |
| Q4_K_M | 4 | 18.4 GB | good | — |
| Q5_K_M | 5 | 22.9 GB | good | — |
| Q6_K | 6 | 27.4 GB | excellent | — |
| Q8_0 | 8 | 36.4 GB | excellent | — |
| F16 | 16 | 72.2 GB | lossless | — |
About this model
Command R is a generative model optimized for long context tasks such as retrieval-augmented generation (RAG) and using external APIs and tools. As a model built for companies to implement at scale, Command R boasts:
- Strong accuracy on RAG and Tool Use
- Low latency, and high throughput
- Longer 128k context
- Strong capabilities across 10 key languages
There are currently two versions of Command R:
- Original release tagged v0.1
- August 2024 update tagged 08-2024