You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
the data you have shared is quite interesting. Could I ask you under what license is it released, as I was not able to find any clear statement apart from "released for the purpose of contributing to the research of natural language processing"? Is it only for research purposes then or can it also be used for training of commercially used models?
Thank you in advance for your answers!
V.
The text was updated successfully, but these errors were encountered:
I am not a maintainer of this project, but this has come up before and I believe it's impossible for them to release the corpus under any ordinary license because of how it's collected. From the README:
Since the collected documents are fragmentary, i.e., only the lead three sentences of each Web document, we have not obtained permission from copyright owners of the Web documents and do not provide source information such as URL. If copyright owners of Web documents request addition of source information or deletion of these documents, we will update the corpus and newly release it. In this case, please delete the downloaded old version and replace it with the new version.
Dear authors of the annotated corpus,
the data you have shared is quite interesting. Could I ask you under what license is it released, as I was not able to find any clear statement apart from "released for the purpose of contributing to the research of natural language processing"? Is it only for research purposes then or can it also be used for training of commercially used models?
Thank you in advance for your answers!
V.
The text was updated successfully, but these errors were encountered: