Replies: 1 comment
-
Hi, check tokenizer encode and decode options to validate for arabic you are able to see the text as intended. Start with transducer instead of hybrid and also start from a pretrained encoder (like parakeet-tdt-0.6b-v3). Would also recommend to use fastconformer rather than conformer configs. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello team;
I am trying to train an ASR model using NeMo framework, tried many recipies, but all of them are predicting blank for my dataset.
details:
any thoughts on what could be the issue? like encoding of the json file can be an issue?
example of the outputs:
samples of the date:
I build my tokenizer using this command:
any insights please?
Beta Was this translation helpful? Give feedback.
All reactions