back

TinyLlama 1.1B

Name: TinyLlama 1.1B
Author: Community

Apache 2.0

Community · 1.1B · Dense

Ultralight model for constrained devices

HuggingFace Ollama

1.9M downloads 1.5K likes 2024-01 2K context

Use Cases

chat edge

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q2_K	2	0.9 GB	low	—
Q3_K_M	3	1 GB	moderate	—
Q4_K_M	4	1.1 GB	good	—
Q5_K_M	5	1.2 GB	good	—
Q6_K	6	1.3 GB	excellent	—
Q8_0	8	1.6 GB	excellent	—
F16	16	2.8 GB	lossless	—

About this model

TinyLlama is a compact model with only 1.1B parameters. This compactness allows it to cater to a multitude of applications demanding a restricted computation and memory footprint.

References

Hugging Face

GitHub