-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Capturing the time of mapping review #309
Comments
IMO there should be a reasonably high bar now to add additional metadata elements. But given how important reviews may be for some organisations (especially for non-exact matches), I am absolutely convincable to vote either side. |
Re-using
I agree. |
We would like In general there may be many stages in the lifecycle of a mapping. Currently it is only possible to record the timing of two stages - using |
@dr-shorthair feel free to use the issue template for new elements here: https://github.com/mapping-commons/sssom/issues/new/choose I will respond to the other suggestions individually when the tickets come. As you say, there are a lot of things that we could capture about a mapping. It is important that we only add elements to SSSOM that are of general importance (multiple stakeholders agree they are needed). This could be the case with your requests! I gather you don't agree with the idea to interpret Note that you can always add a "semapv:MappingReview" as a separate justification about a mapping. This seems a bit idiosyncratic at first, but it could do the job. |
Yes, and I am inclined to think this should be the way to go. Rather than having a new distinct field to record the time of every step in the “life cycle” of a mapping (e.g.
This way we would only ever need one date field. |
A more scalable pattern might be to add a Combine this with the one-row-per-status-transition pattern proposed above by @gouttegd Potential list of status values:
|
The current version of the schema allows to capture the fact that a mapping has been reviewed, through the use of the
reviewer_id
and/orreviewer_label
fields.Should the time at which a mapping has been reviewed also be captured, and if so, how?
The use case is a (group of) human reviewer(s) checking whether a mapping is still correct, possibly years after the mapping was first established, and concluding that indeed, the mapping is still 100% correct, there’s nothing to change.
(The case where the reviewers find that the mapping needs some changes is out of scope: if the mapping needs changing, that’s a new mapping that is asserted, so this can be recorded in
mapping_date
.)There are several options.
A) Not actually recording the review date (more or less the current situation). The reviewers just add their IDs to the
reviewer_id
field but leave the mapping otherwise untouched. Pros: nothing to do. Cons: consumers of the mapping have no way to know when the mapping was reviewed, just that it has been reviewed. Has the review occurred in the past 5 weeks or the past 5 years?A1) Variant: not recording the review date, but considering that when a mapping set has been reviewed, it is a new mapping set that is being published after review. The new mapping set therefore has a new
publication_date
, which can be inferred as the date the mappings of the set were last reviewed. Pros: no changes required. Cons: assuming all mappings are systematically (re-)reviewed prior to any new publication of the mapping set seems like a bold assumption.B) Re-using
mapping_date
. That field is intended to capture the “date the mapping was asserted”. We can consider that reviewing of a mapping, even if the review does not lead to any change to the mapping, is tantamount to asserting that mapping (again). Reviewers should re-set that date to the date of the review. Pros: allow to capture the date of review without requiring any change to the schema. Cons: we loose the information about when the mapping was first asserted.C) Adding a new field
review_date
, to be filled in by the reviewers at the same time they fillreviewer_id
. Pros: review date is capture without any loss of information. Cons: require adding a new field to the schema.Other ideas?
The text was updated successfully, but these errors were encountered: