back

Meta · 1B · Dense

Meta's smallest Llama for edge devices

1.6M downloads 2.3K likes 2024-09 128K context

Use Cases

chat edge

Quantization Options

Quant Bits VRAM Quality Status
Q2_K 2 0.8 GB low
Q3_K_M 3 0.9 GB moderate
Q4_K_M 4 1 GB good
Q5_K_M 5 1.1 GB good
Q6_K 6 1.3 GB excellent
Q8_0 8 1.5 GB excellent
F16 16 2.5 GB lossless

About this model

The Meta Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out). The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. They outperform many of the available open source and closed chat models on common industry benchmarks.

Sizes

3B parameters (default)

The 3B model outperforms the Gemma 2 2.6B and Phi 3.5-mini models on tasks such as:

  • Following instructions
  • Summarization
  • Prompt rewriting
  • Tool use
ollama run llama3.2

1B parameters

The 1B model is competitive with other 1-3B parameter models. It’s use cases include:

  • Personal information management
  • Multilingual knowledge retrieval
  • Rewriting tasks running locally on edge
ollama run llama3.2:1b

Benchmarks

Llama 3.2 instruction-tuned benchmarks

Supported Languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai are officially supported. Llama 3.2 has been trained on a broader collection of languages than these 8 supported languages.