Enabling LR scaling for a specific layer (e.g. down-projection...) during pretraining #1262

Open
dhia680 wants to merge 3 commits into NVIDIA:main from dhia680:downproj-lr-scaling