Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove false positive on en:eu-agriculture and en:non-eu-agriculture tags #1245

Open
raphael0202 opened this issue Aug 29, 2023 · 0 comments

Comments

@raphael0202
Copy link
Collaborator

We detect "en:eu-agriculture" and "en:non-eu-agriculture" using regex on OCRs, in labels.py. This label can be found just below the european agriculture label (en:eu-organic).

We have a significant fraction of these detections that are false positive, as "eu/non-eu agriculture" is also possible, and the negative lookbehind use in the regex doesn't alway detect the "eu" part. Only "en:eu-non-eu-agriculture" is trustworthy.

These insights are particularly tricky to validate, as you have to look closely at the packaging.
We temporarily disabled the generation of these insights, until we find a proper way to handle it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants