You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
+1 to @shamanez request. It would be great to elaborate more on how to integrate "JetMoE" with Megatron both for pretraining and finetuning via Megablocks.
@tgale96
The JetMoE technical report has mentioned how they used Megablocks with Megatrone to train the model.
Then the author shared this fork of the megablokcs used during the training.
Could you please let us know how we can proceed with a fine-tuning script?
The text was updated successfully, but these errors were encountered: