TinyLlama 1.1B

Apache 2.0

Community · 1.1B · Dense

Ultralight model for constrained devices

1.9M downloads · 1.5K likes · 2024-01 · 2K context

Use Cases

chat · edge

Quantization Options

Quant    Bits  VRAM    Quality
Q2_K     2     0.9 GB  low
Q3_K_M   3     1 GB    moderate
Q4_K_M   4     1.1 GB  good
Q5_K_M   5     1.2 GB  good
Q6_K     6     1.3 GB  excellent
Q8_0     8     1.6 GB  excellent
F16      16    2.8 GB  lossless
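
As a rough cross-check of the table above, the weight-only footprint can be estimated as parameters × bits ÷ 8. The sketch below is only an approximation under stated assumptions: it uses the nominal bit widths from the Bits column and the 1.1B parameter count, and the helper names (`weight_gb`, `estimates`) are hypothetical, not part of any library. The listed VRAM figures are higher than these estimates, most likely because they also cover runtime overhead such as the context cache and because K-quant formats store extra scale data per block.

```python
# Rough, weights-only memory estimate for each quantization level.
# Assumption: nominal bits per weight from the "Bits" column; real
# quantized files use slightly more bits per weight than the nominal value.

PARAMS = 1.1e9  # parameter count stated on the model card

NOMINAL_BITS = {
    "Q2_K": 2,
    "Q3_K_M": 3,
    "Q4_K_M": 4,
    "Q5_K_M": 5,
    "Q6_K": 6,
    "Q8_0": 8,
    "F16": 16,
}


def weight_gb(params: float, bits: float) -> float:
    """Return weight storage in GB (10^9 bytes) for a given bit width."""
    return params * bits / 8 / 1e9


def estimates(params: float = PARAMS) -> dict:
    """Map each quantization name to its estimated weight size in GB."""
    return {name: weight_gb(params, bits) for name, bits in NOMINAL_BITS.items()}


if __name__ == "__main__":
    for name, gb in estimates().items():
        print(f"{name:8s} {gb:5.2f} GB (weights only)")
```

For F16 this gives about 2.2 GB of weights against the 2.8 GB listed, so the remaining headroom should be treated as overhead rather than as a property of the format itself.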

About this model

TinyLlama is a compact model with only 1.1B parameters. Its small size makes it suitable for applications with tight compute and memory budgets, such as chat on edge devices.