Download and use a multi part GGUF? #403

kumikumi · 2024-12-26T15:31:36Z

kumikumi
Dec 26, 2024

Some models such as the QWEN 2.5 Coder appears to be uploaded in multiple parts, see here:

https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct-GGUF/tree/main

I can see that the q8_0 (8-bit quantized model) is a single big gguf file, but the FP16 one consists of multiple smaller gguf files. (qwen2.5-coder-32b-instruct-fp16-00001-of-00009.gguf, qwen2.5-coder-32b-instruct-fp16-00002-of-00009.gguf, etc.)

In such a case, how can I pull and use the model?

Answered by giladgd

Dec 26, 2024

You can use the pull command to download such models; just point to the first file, and it'll download all the required files.
To use it, load the first file, and it'll handle the rest automatically.

View full answer

giladgd · 2024-12-26T17:36:25Z

giladgd
Dec 26, 2024
Maintainer

You can use the pull command to download such models; just point to the first file, and it'll download all the required files.
To use it, load the first file, and it'll handle the rest automatically.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Download and use a multi part GGUF? #403

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

Download and use a multi part GGUF? #403

Uh oh!

kumikumi Dec 26, 2024

Replies: 1 comment

Uh oh!

giladgd Dec 26, 2024 Maintainer

kumikumi
Dec 26, 2024

giladgd
Dec 26, 2024
Maintainer