-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Issues: NVIDIA/NeMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
I am trying to train the FastConformer 120M model from scratch, but it is not converging?
#12167
opened Feb 13, 2025 by
PhamDangNguyen
Error in saving nemo checkpoint with Llama-3.1-70B SFT. /opt/NeMo/nemo/utils/callbacks/nemo_model_checkpoint.py
bug
Something isn't working
#12157
opened Feb 12, 2025 by
songwang41
[HELP] Run into the NaN grad problem while going through the exmaple of official document with fp16
bug
Something isn't working
#12134
opened Feb 11, 2025 by
twotwoiscute
Fail to convert trained checkpoint to HF format
bug
Something isn't working
#12124
opened Feb 10, 2025 by
Zhihan1996
Loss Fails to Converge in Nemo2-sft.ipynb with Precision 16
#12102
opened Feb 8, 2025 by
twotwoiscute
ASR Lhoste dataloader : TypeError: object of type 'IterableDatasetWrapper' has no len()
bug
Something isn't working
#12093
opened Feb 7, 2025 by
AudranBert
AttributeError: 'HFDatasetDataModule' object has no attribute 'tokenizer'
bug
Something isn't working
#12080
opened Feb 6, 2025 by
j40903272
extra_loggers is not used to log metrics or hyperparameters
bug
Something isn't working
#12046
opened Feb 4, 2025 by
chajath
llava-like dataset implementation "LazySupervisedDataset" likely fails to handle large dataset
#12034
opened Feb 3, 2025 by
bernardhan33
num_sanity_val_steps too large issue
bug
Something isn't working
#11978
opened Jan 28, 2025 by
shanesyy
Add option for prefetch factor of data loader to config
#11977
opened Jan 28, 2025 by
shengshiqi-google
Megatron BERT Embedding conversion inconsistency
bug
Something isn't working
#11970
opened Jan 28, 2025 by
aditya-malte
Pickling error when trying to save checkpoints with custom checkpointIO
bug
Something isn't working
#11955
opened Jan 24, 2025 by
jdnurme
Gemma 2 NeMo 2.0 to HF conversion bug
bug
Something isn't working
#11951
opened Jan 24, 2025 by
domenVres
MegatronGPTModel trains much worse when reducing micro_batch_size
bug
Something isn't working
#11939
opened Jan 23, 2025 by
m-harmonic
Have a nemo training container without additional framework elements
#11933
opened Jan 23, 2025 by
gabwow
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.