-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
For other language's handwriting generation--how to properly fine-tune the model? #15
Comments
I haven't tried to fine-tune for a different language, so I can not tell for sure what may happen. However, here's what I think should happen: If you proceed in this manner, you may encounter training instabilities though. To deal with this, you can pretrain the OCR separately before plugging it into the end-to-end training with the GAN. Having said that, I'd suggest retraining it from scratch. It will be more straightforward :). |
@ankanbhunia Thanks for the reply! Just a quick follow-up question: if I just tried to work on a historical English handwriting dataset (maybe extracted from historical handwriting script through old bibles), putting the OCR network issue aside, will you suggest training it from scratch or fine-tuning the model? Thanks in advance for letting me know it! I'm trying to extend your cool paper to do some projects. |
In that case, I'd suggest to try first fine-tuning the model. |
Thanks! |
@ankanbhunia
While you mention:
"You can train the model in any custom dataset other than IAM and CVL. The process involves creating a dataset_name.pickle file and placing it inside files folder. The structure of dataset_name.pickle is a simple python dictionary."
I go through the codes and the way you suggest seems to retrain the model on another dataset from scratch rather than fine-tuning your model. As your paper has not yet much discussed this, I come to ask your opinion. If I want to apply the model to generate other language's handwritings e.g. Japanese, is there a way to quickly fine-tune your model upon a new Japanese handwriting data? Or do I need to retrain it from scratch?
The text was updated successfully, but these errors were encountered: