Missing License #37

salisaresama · 2020-08-25T07:59:55Z

Dear authors of the annotated corpus,

The data you have shared is quite interesting. Could I ask under what license it is released? I was not able to find any clear statement apart from "released for the purpose of contributing to the research of natural language processing". Is it for research purposes only, or can it also be used to train models for commercial use?

Thank you in advance for your answers!

V.

polm · 2020-08-27T15:16:58Z

I am not a maintainer of this project, but this has come up before and I believe it's impossible for them to release the corpus under any ordinary license because of how it's collected. From the README:

Since the collected documents are fragmentary, i.e., only the lead three sentences of each Web document, we have not obtained permission from copyright owners of the Web documents and do not provide source information such as URL. If copyright owners of Web documents request addition of source information or deletion of these documents, we will update the corpus and newly release it. In this case, please delete the downloaded old version and replace it with the new version.
