biohansel
Purpose of max kmer frequency?
It's not clear why this threshold was implemented or what kinds of situations it's supposed to help with.
It seems like it would accidentally exclude certain kmers from subtype calling if the frequency of those kmers is "too high":
https://github.com/phac-nml/biohansel/blob/d20a00be813e72a678c3e8448ca783268b71c3cd/bio_hansel/subtyper.py#L287
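To illustrate the concern, here is a minimal sketch of how an upper frequency cutoff could drop a kmer that would otherwise contribute to the subtype call. This is not the actual biohansel code: the function name, the kmer labels, the observed counts, and the threshold values are all hypothetical, and it only assumes that kmers outside a min/max frequency range are filtered out before subtyping.

```python
def split_by_frequency(kmer_freqs, min_freq, max_freq):
    """Partition kmers into kept/excluded by an inclusive frequency range.

    Hypothetical helper for illustration only; not biohansel's API.
    """
    kept = {}
    excluded = {}
    for kmer, freq in kmer_freqs.items():
        if min_freq <= freq <= max_freq:
            kept[kmer] = freq
        else:
            excluded[kmer] = freq
    return kept, excluded


# Made-up observed kmer counts from a deeply sequenced sample.
observed = {
    "subtype-1-kmer": 40,
    "subtype-1.1-kmer": 35,
    "subtype-1.1.2-kmer": 5000,  # real kmer, but very high coverage
}

# Illustrative thresholds, not necessarily biohansel's defaults.
kept, excluded = split_by_frequency(observed, min_freq=8, max_freq=1000)

print("kept:", kept)
print("excluded:", excluded)
```

In this sketch the `subtype-1.1.2-kmer` entry is excluded purely because its count exceeds the maximum, so a call based only on `kept` would stop at the less specific subtype.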