[Feature] Support QLoRA weights #1826

zzh-www · 2024-10-28T07:39:48Z

Does SGLang support QLoRA? If so, could you provide some instructions on how to use it?

zzh-www · 2024-10-29T03:59:35Z

Hi! I ran it with the following command:
python -m sglang.bench_latency --model-path TheBloke/TinyLlama-1.1B-Chat-v0.3-GPTQ --batch 1 --input-len 1 --output-len 512 --trust-remote-code --lora-paths jashing/tinyllama-colorist-lora
It runs without any apparent problem. Is there a way to verify that this is the correct way to do it?

zzh-www · 2024-10-30T11:57:16Z

I verified the results of QLoRA on my personal fine-tuning task (deployed as a service using the above command), and concluded that QLoRA did not actually take effect. I believe it's better to throw an error to alert users and avoid misunderstandings. @Ying1123
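For readers who want to run a similar check, here is a minimal sketch of one way to test whether an adapter changes the model's behaviour. It is not the procedure used above, and it does not call SGLang: `adapter_changes_outputs` and the `fake_*` generators are hypothetical names introduced only for illustration. In practice the two callables would query the deployed model with and without the LoRA adapter, using greedy decoding so that any difference is not caused by sampling.

```python
from typing import Callable, Sequence


def adapter_changes_outputs(
    generate_base: Callable[[str], str],
    generate_lora: Callable[[str], str],
    prompts: Sequence[str],
) -> bool:
    """Return True if at least one prompt yields a different completion
    with the adapter enabled than with the base model alone."""
    return any(generate_base(p) != generate_lora(p) for p in prompts)


# Stand-in generators (assumptions, for illustration only). Replace them
# with real calls to the served model, with and without the adapter.
def fake_base(prompt: str) -> str:
    return prompt.upper()


def fake_lora_ignored(prompt: str) -> str:
    # Same output as the base model: the adapter was silently ignored.
    return prompt.upper()


def fake_lora_applied(prompt: str) -> str:
    # Output differs from the base model: the adapter had an effect.
    return prompt.upper() + "!"


prompts = ["give me a sky blue color", "give me a forest green color"]
print(adapter_changes_outputs(fake_base, fake_lora_ignored, prompts))  # False
print(adapter_changes_outputs(fake_base, fake_lora_applied, prompts))  # True
```

Identical completions on every prompt only suggest that the adapter was ignored. They are not proof, so prompts taken from the fine-tuning task, where the adapter is expected to change the answer, make the check more convincing.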

ByronHsu assigned Ying1123 Oct 28, 2024

merrymercy changed the title ~~sgl & qlora~~ [Feature] Support QLoRA weights Nov 1, 2024

merrymercy added the enhancement (New feature or request) label Nov 1, 2024
