back

Qwen 2.5 32B

Name: Qwen 2.5 32B
Author: Alibaba

Apache 2.0

Alibaba · 32B · Dense

High-quality reasoning and multilingual

HuggingFace Ollama

3.0M downloads 334 likes 2024-09 128K context

Use Cases

chat multilingual reasoning

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q2_K	2	10.7 GB	low	—
Q3_K_M	3	14.8 GB	moderate	—
Q4_K_M	4	16.9 GB	good	—
Q5_K_M	5	21 GB	good	—
Q6_K	6	25.1 GB	excellent	—
Q8_0	8	33.3 GB	excellent	—
F16	16	66.1 GB	lossless	—

About this model

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, a range of base language models and instruction-tuned models are released, with sizes ranging from 0.5 to 72 billion parameters. Qwen2.5 introduces the following improvements over Qwen2:

It possesses significantly more knowledge and has greatly enhanced capabilities in coding and mathematics, due to specialized expert models in these domains.
It demonstrates significant advancements in instruction following, long-text generation (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially in JSON format. It is also more resilient to diverse system prompts, improving role-play and condition-setting for chatbots.
It supports long contexts of up to 128K tokens and can generate up to 8K tokens.
It offers multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Please note: all models except the 3B and 72B are released under the Apache 2.0 license, while the 3B and 72B models are under the Qwen license.

References

GitHub

Blog post

HuggingFace