-
Notifications
You must be signed in to change notification settings - Fork 9
Issues: aws-neuron/neuronx-distributed
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Wrong output fp16/bf16 dtype in ParallelEmbedding when sharding accross vocab
bug
Something isn't working
#35
opened Nov 20, 2024 by
dacorvo
Error in NXD LLama Inference
documentation
Improvements or additions to documentation
#32
opened Oct 4, 2024 by
EmilyWebber
Potential bugs for the llama inference example
bug
Something isn't working
#30
opened Sep 18, 2024 by
yinsong1986
XLA_DISABLE_FUNCTIONALIZATION=0
with ZeRO-1 diverges for Mistral on NxD
bug
#26
opened Jul 17, 2024 by
michaelbenayoun
MPMD detected error when using Something isn't working
optimum-neuron
with TP
bug
#24
opened Jun 27, 2024 by
michaelbenayoun
Error: MPMD detected but reload is not supported yet for neuron distributed environment with EAGER DEBUG MODE
bug
Something isn't working
#21
opened May 4, 2024 by
wenboqian
The Llama inference examples needs to be updated to maintain parity with transformers==4.36
documentation
Improvements or additions to documentation
Trn1
#14
opened Jan 24, 2024 by
sol0invictus
doc link broken on main README.md
documentation
Improvements or additions to documentation
#6
opened Dec 19, 2023 by
cfregly
ProTip!
Exclude everything labeled
bug
with -label:bug.