setup: add repo URL

Open bertsky opened this issue 3 years ago • 2 comments

The project page on PyPI lacks a backlink to GitHub because of this.

(Most images in the README do not render there either.)

bertsky avatar Oct 24 '22 15:10 bertsky

(Most images in the README do not render there either.)

should be fixed by 2nd commit

bertsky avatar Oct 24 '22 15:10 bertsky

Also, I wonder whether you'd want to mention https://github.com/maxbachmann/RapidFuzz as another fast (C++ based) versatile string alignment library for Python?

bertsky avatar Oct 24 '22 15:10 bertsky

Not looked into details, but https://github.com/maxbachmann/RapidFuzz looks more like a string distance computation without any alignments.

poke1024 avatar Nov 30 '22 10:11 poke1024

Not looked into details, but https://github.com/maxbachmann/RapidFuzz looks more like a string distance computation without any alignments.

It implements its own fast Needleman-Wunsch alignment (based on Hyyrö's algorithm or Wagner-Fischer) in C++: https://github.com/maxbachmann/RapidFuzz/tree/main/src/rapidfuzz/distance, which then uses https://github.com/maxbachmann/rapidfuzz-cpp. (You can get the alignment result itself via .editops or .opcodes, but only the first-best one.) It supports a fixed set of distance/similarity metrics and includes semiglobal (prefix/postfix) variants. There is no Smith-Waterman, though, and no custom weights.
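To illustrate what "getting the alignment result itself" means here, below is a minimal plain-Python sketch of Wagner-Fischer with a backtrace that returns one optimal edit script. It is not RapidFuzz's code or API: the function name `wagner_fischer_editops` and the `(tag, src_pos, dst_pos)` tuple layout are assumptions made for this example, and unit costs are assumed throughout.

```python
def wagner_fischer_editops(src, dst):
    """Return (distance, ops) for unit-cost edit distance.

    ops is ONE optimal edit script (the first one the backtrace finds),
    as hypothetical (tag, src_pos, dst_pos) tuples.
    """
    n, m = len(src), len(dst)
    # dp[i][j] = edit distance between src[:i] and dst[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i
    for j in range(1, m + 1):
        dp[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if src[i - 1] == dst[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,         # delete from src
                dp[i][j - 1] + 1,         # insert into src
                dp[i - 1][j - 1] + cost,  # match or replace
            )

    # Walk back from the bottom-right corner, preferring the diagonal.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and src[i - 1] == dst[j - 1] and dp[i][j] == dp[i - 1][j - 1]:
            i, j = i - 1, j - 1  # match: no edit operation recorded
        elif i > 0 and j > 0 and dp[i][j] == dp[i - 1][j - 1] + 1:
            ops.append(("replace", i - 1, j - 1))
            i, j = i - 1, j - 1
        elif i > 0 and dp[i][j] == dp[i - 1][j] + 1:
            ops.append(("delete", i - 1, j))
            i -= 1
        else:
            ops.append(("insert", i, j - 1))
            j -= 1
    ops.reverse()
    return dp[n][m], ops


distance, ops = wagner_fischer_editops("kitten", "sitting")
print(distance)  # 3
print(ops)
```

The fixed tie-breaking order in the backtrace is why only a single alignment comes out, even when several are equally good.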

No idea how it fits into your schema (how it scales time- and memory-wise, how well it utilises computing resources, etc.). Just thought you might want to mention it in your select list of other libraries.

bertsky avatar Nov 30 '22 10:11 bertsky

OK, interesting, I added it to the README list in any case.

poke1024 avatar Nov 30 '22 11:11 poke1024