Training numbers #2

impactcolor · 2017-10-18T19:06:53Z

This is probably outside the scope of "issues", but I figured I'd ask.
I notice it doesn't handle numbers. Is there a way to add numbers to the XML datasets so it can generate numbers too?

Grzego · 2017-10-18T19:36:22Z

You should be able to generate numbers like:

python generate.py --text="1 2 3 4 5 " --noinfo --bias=4.

although the quality will probably be quite bad (too few examples in the dataset).

You can add your own examples in .xml format, but you will have to match them to those already in the dataset (the content should contain tags like <Transcription>, <Text> and <StrokeSet>, structured as in the dataset).
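As a rough illustration of what such a file could look like, here is a minimal sketch that writes one sample with Python's standard `xml.etree.ElementTree`. Only the `<Transcription>`, `<Text>` and `<StrokeSet>` tags come from the description above; the root tag, the nesting, the `<Stroke>`/`<Point>` elements and their `x`/`y` attributes are assumptions, so compare the output against a real file from the dataset before relying on it.

```python
import xml.etree.ElementTree as ET


def build_sample(text, strokes):
    """Build one XML sample.

    `strokes` is a list of strokes; each stroke is a list of (x, y) points
    in the order the pen visited them.
    """
    # Root tag name is an assumption, not taken from the thread.
    root = ET.Element("WhiteboardCaptureSession")

    # Label for the sample: <Transcription><Text>...</Text></Transcription>
    transcription = ET.SubElement(root, "Transcription")
    ET.SubElement(transcription, "Text").text = text

    # Pen data: one <Stroke> per pen-down segment (assumed layout).
    stroke_set = ET.SubElement(root, "StrokeSet")
    for stroke in strokes:
        stroke_el = ET.SubElement(stroke_set, "Stroke")
        for x, y in stroke:
            ET.SubElement(stroke_el, "Point", x=str(x), y=str(y))
    return root


# A hypothetical digit "1" drawn as a single downward stroke.
sample = build_sample("1", [[(10, 0), (12, 40), (12, 80)]])
xml_text = ET.tostring(sample, encoding="unicode")
print(xml_text)
```

The real files may carry extra attributes (timestamps, ids) that the loader expects, so this is only a starting point.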

Alternatively, if you have data with consecutive points representing how to draw numbers (with labels), you could create your own dataset.

So depending on the format of your dataset, it might be easier or harder. :)

impactcolor · 2017-10-18T20:51:42Z

I'm really new to this so I'm not sure how to go about creating a dataset. Do you have any articles or direction you can point me to?

Grzego · 2017-10-20T13:06:58Z

Sorry for the delay. I get the feeling you have no data, which is problematic. Could you please elaborate a little bit more on what you are trying to achieve? :)

impactcolor · 2017-10-20T19:08:46Z

It's no problem, thank you for taking the time to even discuss this with me. I found a dataset of handwritten numbers, however it isn't set up like the current IAM dataset in XML files. What I'm trying to accomplish is to use the handwriting generation, but it also has to include numbers, and currently the numbers do not come out well.

Grzego · 2017-10-21T10:05:23Z

OK, is this dataset publicly available? I can look into it to see if there is a way to make it compatible with my code. :)

impactcolor · 2017-10-21T18:50:50Z

Awesome! Here goes, I found these two:
http://yann.lecun.com/exdb/mnist/
http://archive.ics.uci.edu/ml/machine-learning-databases/semeion/

Grzego · 2017-10-23T21:36:55Z

Unfortunately, those datasets represent numbers as images. For handwriting generation you would need a list of consecutive points showing how a digit is written, so those datasets cannot be used here.
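To make the difference concrete, here is a minimal sketch of the kind of data meant by "consecutive points": each stroke is an ordered list of pen positions, which can be flattened into per-step offsets plus an end-of-stroke flag. This `(dx, dy, end_of_stroke)` layout is a common convention for handwriting models and is an assumption here, not necessarily the exact format this code uses; the function name and sample coordinates are made up for illustration.

```python
def strokes_to_offsets(strokes):
    """Flatten strokes into (dx, dy, end_of_stroke) steps.

    `strokes` is a list of strokes; each stroke is a list of (x, y) points
    in drawing order. The first step has offset (0, 0).
    """
    # Tag the last point of every stroke with end_of_stroke = 1.
    flat = [
        (x, y, 1 if i == len(stroke) - 1 else 0)
        for stroke in strokes
        for i, (x, y) in enumerate(stroke)
    ]

    steps = []
    prev_x, prev_y = flat[0][0], flat[0][1]
    for x, y, end in flat:
        # Store movement relative to the previous pen position.
        steps.append((x - prev_x, y - prev_y, end))
        prev_x, prev_y = x, y
    return steps


# A hypothetical "7": one stroke going right, then diagonally down.
seven = [[(0, 0), (10, 0), (4, 20)]]
offsets = strokes_to_offsets(seven)
print(offsets)  # [(0, 0, 0), (10, 0, 0), (-6, 20, 1)]
```

An image only stores which pixels are dark, so the drawing order and pen lifts needed for this representation cannot be read from it directly.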

impactcolor · 2017-10-23T22:16:37Z

Would this one work? This has the stroke data: https://github.com/edwin-de-jong/mnist-digits-stroke-sequence-data/wiki/MNIST-digits-stroke-sequence-data

Grzego · 2017-10-23T23:30:25Z

This one might work. :) Can you give some examples of sequences you want to generate? I just want to figure out what kind of dataset augmentation might be needed.

impactcolor · 2017-10-23T23:43:11Z

About 5-digit random sequences. For example: 11445, 8013, 1507, etc.

Grzego · 2017-11-08T20:50:43Z

Sorry for the very late response. I tried this dataset and unfortunately it doesn't work well :/ The results are even worse than with the original IAM dataset. If by any chance I find a better dataset for this task, I will post it here.

impactcolor · 2017-11-08T21:02:36Z

THANK YOU!!!!

Grzego · 2017-12-08T11:34:39Z

Well, it's been a while, but I was kind of interested in this problem and created an MNIST handwriting dataset. If you still need to generate numbers, you may find it useful. One simple solution is to just pick the needed digits from this dataset and concatenate them together. :)
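A minimal sketch of that concatenation idea, assuming each digit is available as a list of strokes made of absolute (x, y) points. The `digit_strokes` dictionary, the `gap` parameter and the sample coordinates are hypothetical; how the actual dataset stores its samples is not described here.

```python
def concat_digits(text, digit_strokes, gap=5.0):
    """Place the strokes for each character of `text` side by side.

    `digit_strokes` maps a character to a list of strokes, where each
    stroke is a list of (x, y) points. Returns one combined stroke list.
    """
    result = []
    cursor = 0.0  # x position where the next digit starts
    for ch in text:
        strokes = digit_strokes[ch]
        xs = [x for stroke in strokes for x, _ in stroke]
        min_x, max_x = min(xs), max(xs)
        for stroke in strokes:
            # Shift the digit so its left edge sits at the cursor.
            result.append([(x - min_x + cursor, y) for x, y in stroke])
        # Advance by the digit's width plus a fixed gap.
        cursor += (max_x - min_x) + gap
    return result


# Hypothetical single-sample "dataset" with two digits.
digits = {
    "1": [[(0, 0), (0, 10)]],
    "7": [[(0, 0), (6, 0), (2, 10)]],
}
sequence = concat_digits("17", digits, gap=4.0)
print(sequence)
```

In practice you would probably pick a random sample per digit and perhaps add small vertical or size jitter so repeated digits do not look identical.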

impactcolor · 2017-12-28T01:11:50Z

@Grzego THANK YOU!
