You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The error occurs during the model compilation phase when the neuronx-cc compiler attempts to generate the NEFF (Neuron Executable File Format) file for the model with batch size 4. The same configuration works successfully with batch sizes 1 and 2.
The text was updated successfully, but these errors were encountered:
I am following the tutorial at https://awsdocs-neuron.readthedocs-hosted.com/en/latest/libraries/nxd-inference/tutorials/trn2-llama3.3-70b-tutorial.html#scenario-1-run-llama3-3-70b-on-trn2 to compile and run Llama 3.3 70B model on trn1. While the compilation works successfully with batch sizes 1 and 2, it fails when attempting to compile with batch size 4.
The compiler (neuronx-cc) terminates with the following error:
Environment:
Command Configuration:
Full error traceback:
The error occurs during the model compilation phase when the neuronx-cc compiler attempts to generate the NEFF (Neuron Executable File Format) file for the model with batch size 4. The same configuration works successfully with batch sizes 1 and 2.
The text was updated successfully, but these errors were encountered: