You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, this is actually a follow up to this issue #1415 but is a bit of a different problem. I verified essentially that the backpropogation works, however, essentially the last piece of the puzzle is when sampling from the adapters, I noticed that the original weights do not change and therefore the generations do not change. When generating from the adapters. However, when generating from the merged16bit model that I uploaded, it does in fact does generate better responses after many online DPO updates. I know for lora W = Wo + A@B and in the Peft layers for the unsloth model (I am using llama 3.2 1b parameters):
The Q projection for example has the lora_A and lora_B model dict and the base_layer, I want to know how with the parameters below I would perform W = Wo + A@B in order to get the updated weights or is there code for it already somewhere in the unsloth repo? Essentially, I need to add the changes that were updated in the A and B matrices into the initial weights in order for the actual weights for the models to be updated. Thank you.
Hello, this is actually a follow up to this issue #1415 but is a bit of a different problem. I verified essentially that the backpropogation works, however, essentially the last piece of the puzzle is when sampling from the adapters, I noticed that the original weights do not change and therefore the generations do not change. When generating from the adapters. However, when generating from the merged16bit model that I uploaded, it does in fact does generate better responses after many online DPO updates. I know for lora W = Wo + A@B and in the Peft layers for the unsloth model (I am using llama 3.2 1b parameters):
The Q projection for example has the lora_A and lora_B model dict and the base_layer, I want to know how with the parameters below I would perform W = Wo + A@B in order to get the updated weights or is there code for it already somewhere in the unsloth repo? Essentially, I need to add the changes that were updated in the A and B matrices into the initial weights in order for the actual weights for the models to be updated. Thank you.
The text was updated successfully, but these errors were encountered: