back

Llama 3.2 1B

Name: Llama 3.2 1B
Author: Meta

Llama 3.2 Community

Meta · 1B · Dense

Meta's smallest Llama for edge devices

HuggingFace Ollama

1.6M downloads 2.3K likes 2024-09 128K context

Use Cases

chat edge

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q2_K	2	0.8 GB	low	—
Q3_K_M	3	0.9 GB	moderate	—
Q4_K_M	4	1 GB	good	—
Q5_K_M	5	1.1 GB	good	—
Q6_K	6	1.3 GB	excellent	—
Q8_0	8	1.5 GB	excellent	—
F16	16	2.5 GB	lossless	—

About this model

The Meta Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out). The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. They outperform many of the available open source and closed chat models on common industry benchmarks.

Sizes

3B parameters (default)

The 3B model outperforms the Gemma 2 2.6B and Phi 3.5-mini models on tasks such as:

Following instructions
Summarization
Prompt rewriting
Tool use

ollama run llama3.2

1B parameters

The 1B model is competitive with other 1-3B parameter models. It’s use cases include:

Personal information management
Multilingual knowledge retrieval
Rewriting tasks running locally on edge

ollama run llama3.2:1b

Benchmarks

Llama 3.2 instruction-tuned benchmarks

Supported Languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai are officially supported. Llama 3.2 has been trained on a broader collection of languages than these 8 supported languages.