Training for vertical Japanese texts. #1884
Unanswered
Subhajyotidas
asked this question in
Q&A
Replies: 1 comment 4 replies
-
If only vertical texts are presented in your dataset, you may consider rotating all the images by 90 degrees to fits the assumption. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have been trying to train mmocr for vertical Japanese texts (only Hiragana and Katakana).
I am trying to do it with SAR model.
_base_sar_resnet31_parallel-decoder.py file has some resizing features...
I tried to modify it for padding in a vertical manner with replacing width with height, like following.
But as expected it threw the following error.
TypeError: class
EpochBasedTrainLoopin mmengine/runner/loops.py: class
ConcatDatasetin mmocr/datasets/dataset_wrapper.py: class
RescaleToHeightin mmocr/datasets/transforms/textrecog_transforms.py: __init__() missing 1 required positional argument: 'height'
Issue: The results with training original resizing, provides very bad results, as it was meant for horizontal writing.
Question: Is there any way to resize the data meant for vertical OCR training?
Thank you in advance for any help.
Beta Was this translation helpful? Give feedback.
All reactions