-
Notifications
You must be signed in to change notification settings - Fork 657
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can we modified it for training on Hindi dataset by adding Hindi dictionary in the utils.py #160
Comments
Hiii my |
The tokeniser used is OpenAI BPE based, and can support any unicode character (although hasn't seen data using those tokens), so you'd just need to retrain on that data, without making changes in |
We have created our own Hindi_speaker_encoder.pt based on metavoice encoder architecture, and we have fine-tuned first_stage.pt using our hindi_speaker_encoder.pt just to get an idea of result, but it gives a silent output when we give a hindi text as an input. We have also noticed that their is no training script for firs_stage and second_stage so will it still work or not if we train our hindi dataset on this two pretrained models? |
No description provided.
The text was updated successfully, but these errors were encountered: