back

Qwen 2.5 32B

Apache 2.0

Alibaba · 32B · Dense

High-quality reasoning and multilingual

3.0M downloads 334 likes 2024-09 128K context

Use Cases

chat multilingual reasoning

Quantization Options

Quant Bits VRAM Quality Status
Q2_K 2 10.7 GB low
Q3_K_M 3 14.8 GB moderate
Q4_K_M 4 16.9 GB good
Q5_K_M 5 21 GB good
Q6_K 6 25.1 GB excellent
Q8_0 8 33.3 GB excellent
F16 16 66.1 GB lossless

About this model

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, a range of base language models and instruction-tuned models are released, with sizes ranging from 0.5 to 72 billion parameters. Qwen2.5 introduces the following improvements over Qwen2:

  • It possesses significantly more knowledge and has greatly enhanced capabilities in coding and mathematics, due to specialized expert models in these domains.
  • It demonstrates significant advancements in instruction following, long-text generation (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially in JSON format. It is also more resilient to diverse system prompts, improving role-play and condition-setting for chatbots.
  • It supports long contexts of up to 128K tokens and can generate up to 8K tokens.
  • It offers multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Please note: all models except the 3B and 72B are released under the Apache 2.0 license, while the 3B and 72B models are under the Qwen license.

References

GitHub

Blog post

HuggingFace