StringZilla icon indicating copy to clipboard operation
StringZilla copied to clipboard

Levenstein automata for even moar perf?

Open LifeIsStrange opened this issue 3 months ago • 1 comments

Describe what you are looking for

Noob question: would a levenstein automata in stringzilla allow for even faster fuzzy search? https://blog.mikemccandless.com/2011/03/lucenes-fuzzyquery-is-100-times-faster.html?m=1 There is also https://arxiv.org/abs/1008.1191

Can you contribute to the implementation?

  • [ ] I can contribute

Is your feature request specific to a certain interface?

It applies to everything

Contact Details

No response

Is there an existing issue for this?

  • [x] I have searched the existing issues

Code of Conduct

  • [x] I agree to follow this project's Code of Conduct

LifeIsStrange avatar Sep 24 '25 11:09 LifeIsStrange

Yes, @LifeIsStrange, absolutely, automata will be more efficient than our current algorithm. For the unbounded case they are often trickier to implement, so I didn’t rush that for now. Let me know, if you know someone curious to work on that 🤗

ashvardanian avatar Sep 24 '25 11:09 ashvardanian