schema icon indicating copy to clipboard operation
schema copied to clipboard

elasticsearch schema files and tooling

Results 37 schema issues
Sort by recently updated
recently updated
newest added

For years now we've been fighting the `TF/IDF` algorithm and more recently we've changed to the `BM25` similarity algo which is much better for short texts like ours but it's...

discuss

I *think* this is the correct change to make it so "A&P Deli" can be matched by any of these queries: "A&P" "A & P" and "A and P" I...

- sometimes a street is in OSM as "twenty fifth street", which means if a user searches for it as "25th st", they won't find it - sometimes a user...

There is currently no linting or syntax checking enabled for this repo, which means that there are variable levels of code quality throughout the codebase. There are a few options...

as of [today](https://github.com/pelias/schema/pull/407) we finally removed all unidirectional synonyms (ones using the `a=>b` syntax) from our default synonyms file 🎉 unfortunately, I realized that there is a bug which is...

bug

Despite having used the Elasticsearch [match_phrase](https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-match-query-phrase.html) query for years, we actually don't know much about the details of its scoring. In particular, does `slop` have any impact on the score?...

enhancement

There has been a lot of discussion and chatter about synonyms lately, so I wanted to throw out an idea we've probably all had: What would it take to have...

enhancement

We should handle difference is spelling of `grey` vs `gray` with a synoynm. We probably cannot rely on users to type the one that matches in a streetname, venue name,...

enhancement

I noticed that the query `5 5th St, Medford, MA 02155, USA` fails on geocode.earth https://pelias.github.io/compare/#/v1/search?sources=oa%2Cosm&text=5+5th+St%2C+Medford%2C+MA+02155%2C+USA&debug=0 but it succeeds if I spell out "fifth" - https://pelias.github.io/compare/#/v1/search?sources=oa%2Cosm&text=5+fifth+St%2C+Medford%2C+MA+02155%2C+USA&debug=0 the street in OSM...

bug

As reported in https://github.com/pelias/pelias/issues/847, we can improve fuzzy-matching by applying an apostrophe tokenfilter. https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-apostrophe-tokenfilter.html