documentation icon indicating copy to clipboard operation
documentation copied to clipboard

Suggestion: Stop words for languages

Open devrck opened this issue 4 years ago • 2 comments

Hi guys,

I know that Meilisearch starts from scratch :smile: but maybe it's useful to have on the docs page some standard stop words for each language that it supports.

https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-stop-tokenfilter.html#analysis-stop-tokenfilter-stop-words-by-lang

What do you guys think?

Sorry if i opened the issue in this project, please move it to the correct one if needed.

devrck avatar Jul 09 '20 14:07 devrck

Hello @devrck! If I understand well, you would need a documentation page with stop-words lists according to different languages. It could be interesting, but that's indeed not posted in the right repo 😉

I'm going to ask a Meili team member who has the permission to move your issue in the documentation repo!

curquiza avatar Jul 09 '20 14:07 curquiza

Although I can definitely see why it is useful to have a list of common stopwords for supported languages, are we the best people to host such a list? Do we want that responsibility?

According to our docs we support pretty much any white-space separated language (which includes but is not limited to pretty much every language written in the latin alphabet) and Chinese. That's a pretty daunting list, and one I'm not sure the @meilisearch/docs-team has the proper skillset to create.

guimachiavelli avatar Jan 19 '22 15:01 guimachiavelli