schema
schema copied to clipboard
elasticsearch schema files and tooling
For years now we've been fighting the `TF/IDF` algorithm and more recently we've changed to the `BM25` similarity algo which is much better for short texts like ours but it's...
I *think* this is the correct change to make it so "A&P Deli" can be matched by any of these queries: "A&P" "A & P" and "A and P" I...
- sometimes a street is in OSM as "twenty fifth street", which means if a user searches for it as "25th st", they won't find it - sometimes a user...
There is currently no linting or syntax checking enabled for this repo, which means that there are variable levels of code quality throughout the codebase. There are a few options...
as of [today](https://github.com/pelias/schema/pull/407) we finally removed all unidirectional synonyms (ones using the `a=>b` syntax) from our default synonyms file 🎉 unfortunately, I realized that there is a bug which is...
Despite having used the Elasticsearch [match_phrase](https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-match-query-phrase.html) query for years, we actually don't know much about the details of its scoring. In particular, does `slop` have any impact on the score?...
There has been a lot of discussion and chatter about synonyms lately, so I wanted to throw out an idea we've probably all had: What would it take to have...
We should handle difference is spelling of `grey` vs `gray` with a synoynm. We probably cannot rely on users to type the one that matches in a streetname, venue name,...
I noticed that the query `5 5th St, Medford, MA 02155, USA` fails on geocode.earth https://pelias.github.io/compare/#/v1/search?sources=oa%2Cosm&text=5+5th+St%2C+Medford%2C+MA+02155%2C+USA&debug=0 but it succeeds if I spell out "fifth" - https://pelias.github.io/compare/#/v1/search?sources=oa%2Cosm&text=5+fifth+St%2C+Medford%2C+MA+02155%2C+USA&debug=0 the street in OSM...
As reported in https://github.com/pelias/pelias/issues/847, we can improve fuzzy-matching by applying an apostrophe tokenfilter. https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-apostrophe-tokenfilter.html