google/gemma-2-27b-it QLORA #1743
Hey, sorry for the late response. Since it's QLoRA, the impact on the weights is not as large as with LoRA or full fine-tuning. Have you tried those other options?
Hi everyone,
I tried QLoRA fine-tuning gemma-2-27b with ChatML and flash-attention yesterday, and the resulting model seems confused, even though the loss went down and everything looked smooth overall. Could someone please share tips and tricks for fine-tuning this model?
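For reference, a QLoRA setup for this model would typically look something like the sketch below (assuming the Hugging Face transformers + peft + bitsandbytes stack; the rank, alpha, and target-module choices are illustrative, not a known-good recipe). Note that early FlashAttention 2 builds did not support Gemma 2's attention logit soft-capping, which is one possible cause of confused outputs; the Hugging Face docs recommended eager attention for this model.

```python
# QLoRA config sketch for gemma-2-27b-it (illustrative values, not a verified recipe)
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 quantization for the frozen base weights
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-27b-it",
    quantization_config=bnb_config,
    # Gemma 2 uses attention logit soft-capping; eager attention was the
    # recommended implementation for training when flash-attn lacked support.
    attn_implementation="eager",
    torch_dtype=torch.bfloat16,
)

# LoRA adapters on the attention and MLP projections (module names per the
# Gemma architecture; r/alpha/dropout are illustrative defaults)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

This is a configuration fragment only; the training loop (e.g. TRL's `SFTTrainer` with a ChatML-formatted dataset) would go on top of it.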