This is a project to model linked data schemas (ie, lightweight ontologies for the fields of agriculture, food, agri-business, plant biology.
The work leverages mainly schema.org and bioschemas. As in those projects, we aim at very simple and practical modelling, which can be useful to share data in an interoperable way, especially by means of APIs and annotated web pages.
The work was born within the Design Future Wheat project, and for the moment it's focused on the use cases dealt with in it. In fact, so far we have been building our schemas starting from well known use cases within crop improvement research. There have been two DFW hackathons where we have done most of the work so far.
This work has started with a first DFW hackathon in 2018, where we first outlined the possible types (ie, classes, properties/relations) that we might need in DFW-relevant data.
Leveraging that, we have started considering use cases from real data, and we have started modelling them using existing schemas. At the same time, we have started to propose new types as needed.
These are the use cases considered:
- Molecular Biology, pathways/genes/proteins (from Knetminer)
- Molecular Biology, ontology terms (from Knetminer)
- Semantic annotations (ex, text mining annotations)
- Molecular Biology, host-pathogen interactions (PHI-Base)
- Dataset description, TODO (based on
bioschema:Dataset
, but CKAN might need DCAT, VoID is also worth to consider) - Field trials
- Links between field trials, cultivars and genes
- Includes the EBI Gene Expression Atlas use case
- Weather conditions and forecasts
Some notes on general criteria adopted to outline the use case models above.
A collection of references and links to various similar projects hackathons, schemas, etc.