Paul Masurel
Paul Masurel
Another common use case is synonym expansion. A tokenizer might emit synonyms too. It is possible to emit synonyms at index time (faster search, large index, less flexible), or at...
@PSeitz can you have a look at this PR and eventually merge it? @mlvzk In order to ease the release process, we try to keep CHANGELOG.md up to date. Can...
Duplicate of #799
@ppodolsky do you want to pick this ticket?
it is using a levenshtein automaton.
You are absolutely correct. So a simple solution would be to change the score of the fuzzytermquery depending on the distance it hit. Bad news it was never implemented because...
Rephrasing. Are you interesting in addressing this ticket? Do you need pointers to the stuff I was talking about above?
no problem I can search for volunteers on the chat :)
@hntd187 Can you ask his author? He is nice and usually replies really fast. Also if you do not rewrite this code yourself, make sure that there is no license...
@lerouxrgd This is a perfect summary of what needs to done. The last step is not too difficult. The state can be converted into a levenshtein distance. https://github.com/tantivy-search/levenshtein-automata/blob/master/src/dfa.rs#L70 Scoring given...