Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A Knowledge Graph resource of NLP-progress #617

Open
jd-coderepos opened this issue Aug 1, 2022 · 7 comments
Open

A Knowledge Graph resource of NLP-progress #617

jd-coderepos opened this issue Aug 1, 2022 · 7 comments

Comments

@jd-coderepos
Copy link

Dear authors, this repository is such a great resource! Many thanks for creating it. I would like to suggest that maybe the Open Research Knowledge Graph (https://orkg.org/) could be leveraged to enlist such resources for persistence, knowledge sharing, and querying. Please find below some resources I created related to the information in this repository.

Named Entity Recognition Tasks in the MUC series

https://orkg.org/comparison/R162797/

NER in the Automatic Content Extraction (ACE) Series

https://orkg.org/comparison/R162851/

Named Entity Recognition in the CoNLL Series and the OntoNotes corpus as a related resource

https://orkg.org/comparison/R166315/

Named Entity Recognition Based on Wikipedia

https://orkg.org/comparison/R166240/

A comparison of the annotated resources of software mentions in scholarly articles

https://orkg.org/comparison/R166560/

NLP Datasets for Named Entity Recognition and Relation Extraction from Biomedicine Scholarly Articles

https://orkg.org/comparison/R163265/

Comparisons and Visualizations of the CrossNER Benchmark Corpus for its Source and Target Domains

https://orkg.org/comparison/R163843/

Surveying BioNLP Shared Tasks Corpora for Named Entity Recognition

https://orkg.org/comparison/R165702/

Surveying BioCreAtIvE Shared Tasks Corpora for Named Entity Recognition

https://orkg.org/comparison/R172155/


The benefits of such machine-encoded data is that Reviews can be automatically created thereby.

Surveying the BioCreAtIvE Shared Task Series

https://orkg.org/review/R172166

Surveying the BioNLP Shared Task Series

https://orkg.org/review/R165924

I would be very happy to offer support in this direction. :)

@RicardoUsbeck
Copy link
Contributor

I am sure, my team at NFDI4DS (https://www.nfdi4datascience.de/) would also be happy to help to convert data

@sebastianruder
Copy link
Owner

This is a great set of resources! What do you think would be the best way to integrate them?

@jd-coderepos
Copy link
Author

jd-coderepos commented Aug 1, 2022

@sebastianruder could we perhaps schedule a call as a starting point, where I could present the ORKG and its features to you? Perhaps then we could elicit a set of requirements to integrate the data. My contact information is here https://sites.google.com/view/jen-web/contact?authuser=0

@RicardoUsbeck happy to hear your thoughts on how best you think perhaps we could go about it, also relaying the information to your team at NFDI4DS. :)

@RicardoUsbeck
Copy link
Contributor

Sure, will do. Actual work will start in Oct.

We wanted to crawl this website to feed the (O)RKG. On the other hand, it would be nice to have a manual (?) export form ORKG (which will undoubtedly grow faster) here...but there are also downsides to this approach. Happy to help you discuss ideas.

@jd-coderepos
Copy link
Author

jd-coderepos commented Aug 1, 2022

Indeed I fully agree with having crawling scripts for the website to structure the data. Perhaps then additional curational support on top of it, to ensure the data quality...

If the resulting dataset can be structured in an excel sheet and is only a one level graph, the csv import feature https://orkg.org/csv-import can be leveraged to bulk import the papers themselves. The individual comparison views themselves then can be created manually...

Furthermore, I kindly suggest also that a template can be defined https://orkg.org/templates for such data enabling new users to seamlessly leverage the defined template when adding new data.

Happy to continue the discussion thread. Please let me know.

@sebastianruder
Copy link
Owner

Hi both, are you ok meeting without me? I think you are both more up-to-date on this type of data. I'm happy to go with whatever you decide, as long as it can be reasonably integrated into the website.

@jd-coderepos
Copy link
Author

@sebastianruder Will be happy to share updates here, in due course.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants