back

Meta · 8B · Dense

Meta's versatile 8B — great quality/speed ratio

7.3M downloads 5.5K likes 2024-07 128K context

Use Cases

chat code reasoning

Quantization Options

Quant Bits VRAM Quality Status
Q2_K 2 3.1 GB low
Q3_K_M 3 4.1 GB moderate
Q4_K_M 4 4.6 GB good
Q5_K_M 5 5.6 GB good
Q6_K 6 6.6 GB excellent
Q8_0 8 8.7 GB excellent
F16 16 16.9 GB lossless

About this model

Meta Llama 3.1

image.png

Llama 3.1 family of models available:

  • 8B
  • 70B
  • 405B

Llama 3.1 405B is the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation.

The upgraded versions of the 8B and 70B models are multilingual and have a significantly longer context length of 128K, state-of-the-art tool use, and overall stronger reasoning capabilities. This enables Meta’s latest models to support advanced use cases, such as long-form text summarization, multilingual conversational agents, and coding assistants.

Meta also has made changes to their license, allowing developers to use the outputs from Llama models, including the 405B model, to improve other models.

Model evaluations

For this release, Meta has evaluation the performance on over 150 benchmark datasets that span a wide range of languages. In addition, Meta performed extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios. Meta’s experimental evaluation suggests that our flagship model is competitive with leading foundation models across a range of tasks, including GPT-4, GPT-4o, and Claude 3.5 Sonnet. Additionally, Meta’s smaller models are competitive with closed and open models that have a similar number of parameters.

image.png

image.png

image.png

References