biohansel icon indicating copy to clipboard operation
biohansel copied to clipboard

Purpose of max kmer frequency?

Open peterk87 opened this issue 3 years ago • 5 comments

It's not clear why this threshold was implemented and what kinds of situations it's supposed to be help with.

It seems like it would accidentally exclude certain kmers from subtype calling if the frequency of those kmers is "too high":

https://github.com/phac-nml/biohansel/blob/d20a00be813e72a678c3e8448ca783268b71c3cd/bio_hansel/subtyper.py#L287

peterk87 avatar Mar 04 '21 18:03 peterk87