Skip to content
This repository has been archived by the owner on May 22, 2021. It is now read-only.

As a User I want partial match finds also contain results with genus and the lowest infraspecies. #67

Open
dimus opened this issue Oct 5, 2018 · 4 comments

Comments

@dimus
Copy link
Member

dimus commented Oct 5, 2018

@dimus commented on Fri Oct 05 2018

Currently we cut word by word from canonical binomial, trinomial, tetranomial until we find something.

In addition to that it will be useful to return results if we find combination of first word of multinomial name (genus) and the very last epithet in canonical. Such addition seem to be beneficial for both zoologists and botanists.

@alexander-myltsev
Copy link
Contributor

@dimus , please verify the algorithm:

  • if there are fuzzy matches for 2 or more words then we return it. And additionally genus and the lowest infraspecies fuzzy matches if any
  • if there is the only fuzzy match for genus or nothing at all, then we return genus and the lowest infraspecies fuzzy matches if any

@dimus
Copy link
Member Author

dimus commented Nov 7, 2018

If there is only fuzzy match for genus -- we return - no match

@dimus
Copy link
Member Author

dimus commented Nov 7, 2018

if there are fuzzy matches for 2 or more words then we return it. And additionally genus and the lowest
infraspecies fuzzy matches if any

If we found fuzzy match, we return that, and do not go to partial match at all

@dimus
Copy link
Member Author

dimus commented Nov 7, 2018

The algorithm looks like this to me.

  1. If we found any other match -- we do not do partial match at all
  2. For partial match we remove a word and try to match (exact and fuzzy) the rest. if it does not work, we remove one more word and try to match (exact and fuzzy) the rest. If only one word left, we only do exact match.
  3. In addition we remove everything in the middle and try to match genus and last word (exact and fuzzy). If we did get result, we return it together with result from 2 (if any)

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants