-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Similarity Score - usage question #348
Comments
I have never wondered about this :D I am not entirely sure. What are the use cases for these kinds of alignments? What can you do with a 10% alignment practically I mean? |
Implementation of this solution requires an intermediate assumption that links the membership estimates (0.3, 0.6, 0.1 in your example) to spatial expression. For example, a membership estimate of 0.3 means that, based on the information in the class descriptions, there is a subjective probability of 0.3 that source:Class345 belongs to target:ClassDFG (ideally, subjective probabilities should be estimated by averaging estimates of replicated subject experts). To give this spatial expression in the way proposed, we need to assume that subjective probabilities of membership are directly related to spatial extent. i.e. membership = 0.3 means that 30% of the mapped extent of source:Class345 occurs [somewhere] within the mapped area of target:ClassDFG, but we do not know which 30% of 345. In many cases, uses may decide the assumption is reasonable for their application, though ideally it should be empirically evaluated with test data. |
To provide a bit more context: the goal is to determine some property (function) of a spatial region, where we have a classification of the region using System 1, but the assessment requires its classification using System 2. i.e.
Using a (notional) example Region
Using the proportions in the example above, this would mean that so you do the assessment based on the latter three ... (of course we don't know which 138 sq.km is |
This discussion is a tad out of my depth, I am sorry; I hope someone else from the @mapping-commons/sssom-core team can chip in and give feedback. Without understanding this exactly, I would say that such fuzzy matches are out of scope for SSSOM, but this does not have to keep you from using the
Maybe however my internal and your internal model of your question only have a very low "semantic overlap" and what I am saying here is completely off topic 😛 |
My 2 cents:
|
Hmm. That would be disappointing. I really doubt it is really just a niche concern - it certainly isn't in linguistics. Partial matches are supported by I understand that Perhaps we just adopt a local convention in the context of our project to use the slots in this way. But I thought it was worth canvassing this list to see if a similar use case had already been encountered. |
I am not sure using SSSOM to describe the extent of overlap of regions is the right use of SSSOM. This seems more like a more general kind of relationship instead of a mapping. From what I understand, the semantic similarity measurement should be something like "ontological similarity" like what https://github.com/related-sciences/nxontology implements, but it's understandable that this up to interpretation since the docs are completely empty for https://mapping-commons.github.io/sssom/semantic_similarity_score/ |
I'm assisting the development of some mappings within a set of ecosystem and land-use classifications.
The actual mappings are all done manually by subject-matter experts, so the
mapping justification
issemapv:ManualMappingCuration
.However, many of the mappings are partial, in this sense - a class from the source scheme maps to n classes in the target scheme, in known proportions, e.g.
source:Class345
will correspond totarget:ClassDFG
source:Class345
will correspond totarget:ClassHJK
source:Class345
will correspond totarget:ClassZXC
Is this where
semantic_similarity_score
comes in? Would it be correct to setpredicate_id
toskos:relatedMatch
(or should it beskos:narrowMatch
?)semantic_similarity_score
to0.3
0.1
0.6
respectivelyOr is this all application-dependent - i.e. it is up to us, since we are the ones who will be using the mappings.
The text was updated successfully, but these errors were encountered: