@rbayet
Collaborator

@rbayet rbayet commented Dec 4, 2025

No description provided.

@rbayet rbayet changed the title from "[Thesaurus] Manage stemming in multi-word thesaurus" to "[Thesaurus] Manage stemming in multi-word thesaurus (only)" Dec 4, 2025
Comment on lines +190 to +191
$settings = $this->addAnalyzerSettings($settings, 'synonym', $synonyms, $stemmingFilters);
$settings = $this->addAnalyzerSettings($settings, 'expansion', $expansions, $stemmingFilters);
Collaborator Author

Somehow the expansion filters are not generated properly.
For instance, in French (stemmer light_french), the rule
"activité => football,handball,running,tennis,trail"
becomes
"activite=>fotbal,handbal,runing,teni,trail"
which makes the rule NOT work for "activités".
=> The "clean" analysis of "activités" or "activité" is "activit", which does not match the "activite" left-hand side of the generated rule.

So there is clearly something wrong somewhere.
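
To make the mismatch concrete, here is a minimal Python sketch (not Elasticsuite code) that compares the left-hand side of the generated rule with the token reported for the "clean" analysis. The helper name `rule_applies` is hypothetical, it only mimics exact-token matching rather than the real Elasticsearch synonym filter, and the token values are copied from the example above instead of being computed by a stemmer.

```python
def rule_applies(rule: str, analyzed_token: str) -> bool:
    """Return True if the analyzed token equals one of the rule's left-hand terms.

    Hypothetical helper: a simplified stand-in for how a "lhs => rhs"
    rule is triggered, assuming exact token equality.
    """
    lhs, _, _ = rule.partition("=>")
    return analyzed_token in {term.strip() for term in lhs.split(",")}


# Values copied from the comment above, not computed by a real stemmer.
generated_rule = "activite=>fotbal,handbal,runing,teni,trail"
clean_token = "activit"  # reported analysis of "activités" / "activité"

# The generated left-hand side "activite" differs from "activit".
print(rule_applies(generated_rule, clean_token))  # False

# Illustration only: a rule whose left-hand side were stemmed the same
# way as the query token would be triggered.
print(rule_applies("activit=>fotbal,handbal,runing,teni,trail", clean_token))  # True
```

Under that assumption, the rule can only fire if its left-hand side goes through the same analysis chain as the query text.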

@rbayet
Collaborator Author

rbayet commented Dec 4, 2025

@PierreGauthier for reference, sample data

image

If needed, Elasticsuite Premium PR 410 is required for the Explain screen to work.

@rbayet
Collaborator Author

rbayet commented Dec 4, 2025

Replacing #3765

@rbayet rbayet force-pushed the feat-stemMultiwordThesaurus-only branch from 6bca69d to bbd91c2 Compare December 8, 2025 11:55
@rbayet rbayet marked this pull request as draft December 8, 2025 14:17
3 participants