Llama 3.1 8B

Name: Llama 3.1 8B
Author: Meta

Llama 3.1 Community

Meta · 8B · Dense

Meta's versatile 8B — great quality/speed ratio

HuggingFace Ollama

7.3M downloads 5.5K likes 2024-07 128K context

Use Cases

chat code reasoning

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q2_K	2	3.1 GB	low	—
Q3_K_M	3	4.1 GB	moderate	—
Q4_K_M	4	4.6 GB	good	—
Q5_K_M	5	5.6 GB	good	—
Q6_K	6	6.6 GB	excellent	—
Q8_0	8	8.7 GB	excellent	—
F16	16	16.9 GB	lossless	—
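
As a rough illustration of how the table above might be used, the sketch below picks the highest-footprint quantization that fits a given amount of GPU memory. The VRAM figures are copied from the table. The `pick_quant` helper and its 1 GB `headroom_gb` default (reserved for the KV cache and runtime overhead) are assumptions made for this example, not part of any tool or of Meta's guidance. Actual memory use grows with context length.

```python
from typing import NamedTuple, Optional


class Quant(NamedTuple):
    name: str
    bits: int
    vram_gb: float
    quality: str


# Values copied from the "Quantization Options" table.
QUANTS = [
    Quant("Q2_K", 2, 3.1, "low"),
    Quant("Q3_K_M", 3, 4.1, "moderate"),
    Quant("Q4_K_M", 4, 4.6, "good"),
    Quant("Q5_K_M", 5, 5.6, "good"),
    Quant("Q6_K", 6, 6.6, "excellent"),
    Quant("Q8_0", 8, 8.7, "excellent"),
    Quant("F16", 16, 16.9, "lossless"),
]


def pick_quant(available_gb: float, headroom_gb: float = 1.0) -> Optional[Quant]:
    """Return the largest listed quantization that fits, or None if none does.

    headroom_gb is a hypothetical safety margin (an assumption of this
    sketch) left free for the KV cache and runtime overhead.
    """
    budget = available_gb - headroom_gb
    fitting = [q for q in QUANTS if q.vram_gb <= budget]
    if not fitting:
        return None
    # The table is ordered so that a larger footprint means higher fidelity.
    return max(fitting, key=lambda q: q.vram_gb)


if __name__ == "__main__":
    for gpu_gb in (4, 8, 12, 24):
        choice = pick_quant(gpu_gb)
        label = choice.name if choice else "none fits"
        print(f"{gpu_gb} GB GPU: {label}")
```

Under these assumptions, an 8 GB card leaves a 7 GB budget, so the helper selects Q6_K (6.6 GB) rather than Q8_0 (8.7 GB). Raise the headroom when planning to use long contexts.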

About this model

Meta Llama 3.1

Llama 3.1 family of models available:

8B
70B
405B

Llama 3.1 405B is the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation.

The upgraded versions of the 8B and 70B models are multilingual and have a significantly longer context length of 128K tokens, state-of-the-art tool use, and overall stronger reasoning capabilities. This enables Meta’s latest models to support advanced use cases, such as long-form text summarization, multilingual conversational agents, and coding assistants.

Meta has also changed its license to allow developers to use the outputs from Llama models, including the 405B model, to improve other models.

Model evaluations

For this release, Meta evaluated performance on over 150 benchmark datasets that span a wide range of languages. In addition, Meta performed extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios. Meta’s experimental evaluation suggests that its flagship model is competitive with leading foundation models, including GPT-4, GPT-4o, and Claude 3.5 Sonnet, across a range of tasks. Additionally, Meta’s smaller models are competitive with closed and open models that have a similar number of parameters.

References

Meta AI Llama 3.1 launch blog post