Phylign icon indicating copy to clipboard operation
Phylign copied to clipboard

When too many query/ref pairs are reported, final_stats uses too much of memory due to its naive Python implementation

Open karel-brinda opened this issue 1 year ago • 0 comments

Happens with eg 1M queries

This is the problematic part: https://github.com/karel-brinda/mof-search/blob/e79b0c842ed919f1787a3071a52065d2317c8f71/scripts/final_stats.py#L109

Probably should be possible optimize by that the output is sorted according to ref (so it's sufficient to keep just stats for the last ref in memory)

karel-brinda avatar Sep 23 '22 13:09 karel-brinda