Peter Johnson

Results 1014 comments of Peter Johnson

We're currently removing apostrophe characters in the `punctuation` filter. The effect of this is to convert `mcdonald's => mcdonalds`. I had a play with introducing the apostrophe tokenfilter linked above...

Yeah good point about the spelling correction, I wonder if using a non-printing char would mitigate that, either way I'm somewhat reluctant to introduce too much 'magic' which would be...

see: https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-compound-word-tokenfilter.html

this is how the `peliasTwoEdgeGram` currently tokenizes that address: `[ '51', 'fr', 'fri', 'frie', 'fried', 'friedr', 'friedri', 'friedric', 'friedrich', 'friedrich-' ]`

`Leonardo da Vinci–Fiumicino Airport` should be searchable by `Fiumicino Airport` http://pelias.mapzen.com/doc?id=geoname:6299619

This feature will require `alt-names` as the street name above can have 3 forms: ``` Friedrich-Richter-Straße Friedrich Richter Straße FriedrichRichterStraße ``` moving to alt-names milestone as it can only be...

> My guess is that we would want to parse any streetnames coming in with formats like "Friedrich-Richter-Straße or Friedrich Richter Straße and store an alt-name of "FriedrichRichterStraße". This combined...

I was expecting the build size to be reduced since it's not storing the 1 byte per document with the norms. It's not significant compared to the rest of the...

Some examples of improvements, in both cases the more popular, yet wordier names are now being scored higher than the exact matching or succinct names. > note: 'Angkor Wat Putt'...

So surprisingly the testing was fairly favourable, as expected it had the positive effect of fixing the field length scoring discrepancy introduced by adding aliases, and produced better sorting in...