-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve translation of short sentences #215
Comments
I wonder if we can load in dictionaries where it's literally one word to one word. |
Or maybe even synthesize it with the alignment data. |
This behavior is also visible with numbers. A good example is to do a list of numbers. |
Verify the fix with: |
Here is a word count distribution for the merged corpus sl-en: https://firefox-ci-tc.services.mozilla.com/tasks/groups/PPCzZRHaTT6Ys4BIhPGT5w Generated via:
|
So we have basically 0 sentences with 1 word? |
I filed #878 which suggests augmenting with statistically synthesized single word translations. |
I filed #879 which suggests harvesting short sentences from parallel datasets. |
I filed #880 which suggests statistically synthesizing short sentence translations from monolingual data sources. |
We have issues with translating shorter sentences and single words.
What might help:
The text was updated successfully, but these errors were encountered: