Results 26 comments of linuzer

Well, you can reproduce it. Maybe I wasn't clear enough, and I just realized that I also used the semicolon inside the columns, so yes, sorry, that's really confusing!...

I thought the general procedure was already clear, but here it is: I extract the addresses from russia-latest.osm.pbf using the [gazetteer](https://github.com/kiselev-dv/gazetteer) tool, which brings together all hierarchical parts of an...

Here's the same file again, but simpler: just 3 columns, semicolon-separated, with the address parts comma-separated. The columns are the matching level, the raw OSM address, and the parser result. [result.txt](https://github.com/openvenues/libpostal/files/681406/result.txt)
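For anyone who wants to read the file programmatically, here is a minimal Python sketch of splitting one line into the three columns described above. The contents of result.txt are not shown here, so the sample line is made up, and `ResultRow` and `parse_result_line` are hypothetical names chosen for illustration. It assumes semicolons appear only as column separators.

```python
from typing import List, NamedTuple


class ResultRow(NamedTuple):
    match_level: str         # kept as a string; its exact format is an assumption
    osm_address: List[str]   # comma-separated parts of the raw OSM address
    parsed: List[str]        # comma-separated parts of the parser result


def split_parts(text: str) -> List[str]:
    """Split a comma-separated column into trimmed, non-empty parts."""
    return [part.strip() for part in text.split(",") if part.strip()]


def parse_result_line(line: str) -> ResultRow:
    """Split one line into its three semicolon-separated columns."""
    level, osm, parsed = line.rstrip("\n").split(";", 2)
    return ResultRow(level.strip(), split_parts(osm), split_parts(parsed))


# Made-up sample line, not taken from result.txt
sample = "2;Main Street 5, Springfield;main street 5, springfield"
row = parse_result_line(sample)
print(row.match_level, row.osm_address, row.parsed)
```

Limiting the split to two semicolons means any stray semicolon ends up in the last column instead of raising an error, which matters given the earlier confusion about semicolons inside columns.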

OK, maybe I still have a deeper misconception here. I'll take this example (not because I don't like the result above, just because in that example my query-address is of...

> In a full-text search engine that would be a strong match because a word like "улица" is very frequent and would have a low IDF score, but in terms...

```
Автолюбителейроезд Петрозаводск Карелия Российская Федерация 185013
{
  "house": "автолюбителейроезд",
  "city": "петрозаводск",
  "state": "карелия",
  "country": "российская федерация",
  "postcode": "185013"
}
```

The "house" is the road.