TinyLlama 1.1B
Apache 2.0Community · 1.1B · Dense
Ultralight model for constrained devices
1.9M downloads
1.5K likes
2024-01 2K context
Use Cases
chat edge
| Quant | Bits | VRAM | Quality | Status |
|---|---|---|---|---|
| Q2_K | 2 | 0.9 GB | low | — |
| Q3_K_M | 3 | 1 GB | moderate | — |
| Q4_K_M | 4 | 1.1 GB | good | — |
| Q5_K_M | 5 | 1.2 GB | good | — |
| Q6_K | 6 | 1.3 GB | excellent | — |
| Q8_0 | 8 | 1.6 GB | excellent | — |
| F16 | 16 | 2.8 GB | lossless | — |
About this model
TinyLlama is a compact model with only 1.1B parameters. This compactness allows it to cater to a multitude of applications demanding a restricted computation and memory footprint.