Update the training scripts to incorporate the latest versions of Transformers and other dependencies. #84

jzhang533 · 2024-12-10T11:24:53Z

Thanks for your great project, I've played a little bit to run the training scripts.
but found several issues related to the latest dependencies, mainly caused by development iteration of transformers, albumentations, etc. in recent years.

Finally make it work after several tweaks, will explain the reasons by commenting this PR.

manga_ocr_dev/data/process_manga109s.py

manga_ocr_dev/training/dataset.py

manga_ocr_dev/training/metrics.py

manga_ocr_dev/training/train.py

jzhang533 · 2024-12-12T03:38:29Z

manga_ocr_dev/training/get_model.py

@@ -48,7 +47,7 @@ def get_model(encoder_name, decoder_name, max_length, num_decoder_layers=None):

        decoder_config.num_hidden_layers = num_decoder_layers

-    config = VisionEncoderDecoderConfig.from_encoder_decoder_configs(encoder.config, decoder.config)
+    config = VisionEncoderDecoderConfig.from_encoder_decoder_configs(encoder_config, decoder_config)


Probably this is a existing bug ?

Otherwise, the saved config file and model will be different, i.e.: in the config for decoder, there are more layers than the model. When load the model for inference, those layers will be randomly initialized.

manga_ocr_dev/training/metrics.py

kha-white · 2025-01-01T19:50:36Z

@jzhang533 Thanks, the comments were very helpful. Most of the changes seem fine, I'll just have to take a look at that thing with configs.

jzhang533 · 2025-01-02T05:35:27Z

Thanks for review.

I think I have reproduced the training of the model in my repository: https://github.com/jzhang533/manga-ocr. I achieved a CER of 0.1017, which is comparable to the original model's CER of 0.1056 on the Manga109s test split. However, the dataset split I used might differ from the one you used when training the model several years ago. That is because there are randomness involved in splitting the Manga109s dataset here: https://github.com/kha-white/manga-ocr/blob/master/manga_ocr_dev/data/process_manga109s.py#L82.

For the model config, I have updated the model config scripts to https://github.com/jzhang533/manga-ocr/blob/master/manga_ocr_dev/training/my_get_model.py, you may have interest to take a look.

jzhang533 · 2025-01-03T11:03:58Z

@kha-white I have uploaded my trained model to HF: https://huggingface.co/jzhang533/manga-ocr-base-2025

by running manga_ocr -p "jzhang533/manga-ocr-base-2025", the warning reported in #85 disappeared.

jzhang533 added 2 commits December 10, 2024 11:13

update training scripts

5463379

minor

30aa240

jzhang533 commented Dec 10, 2024

View reviewed changes

more update to training scripts

2129004

jzhang533 commented Dec 12, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the training scripts to incorporate the latest versions of Transformers and other dependencies. #84

Update the training scripts to incorporate the latest versions of Transformers and other dependencies. #84

jzhang533 commented Dec 10, 2024 •

edited

Loading

jzhang533 Dec 12, 2024

kha-white commented Jan 1, 2025

jzhang533 commented Jan 2, 2025

jzhang533 commented Jan 3, 2025

Update the training scripts to incorporate the latest versions of Transformers and other dependencies. #84

Are you sure you want to change the base?

Update the training scripts to incorporate the latest versions of Transformers and other dependencies. #84

Conversation

jzhang533 commented Dec 10, 2024 • edited Loading

jzhang533 Dec 12, 2024

Choose a reason for hiding this comment

kha-white commented Jan 1, 2025

jzhang533 commented Jan 2, 2025

jzhang533 commented Jan 3, 2025

jzhang533 commented Dec 10, 2024 •

edited

Loading