Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New branch for english corpus #84

Open
ashwath98 opened this issue Jan 4, 2020 · 1 comment
Open

New branch for english corpus #84

ashwath98 opened this issue Jan 4, 2020 · 1 comment

Comments

@ashwath98
Copy link

Hey, I've been using your library for dataset generation for my ocr project.
I think it would be useful to have a branch dedicated to English, also adding an English corpus file in the data directory.
While using your code I have added some features like random subset selection(from a sentence selected to be the text), which can be useful while training in applications where not all sentences are of fixed length.

Do you suggest I make a Pull Request containing these features?

@starry-xin
Copy link

Hey, I've been using your library for dataset generation for my ocr project. I think it would be useful to have a branch dedicated to English, also adding an English corpus file in the data directory. While using your code I have added some features like random subset selection(from a sentence selected to be the text), which can be useful while training in applications where not all sentences are of fixed length.

Do you suggest I make a Pull Request containing these features?

May I ask you to send your English version to my mailbox? I want to use it for study, not for commercial use.Thank you!
My Email: [email protected]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants