
Inconsistent results in comparison with JellyFish and KMER_COUNTER

Open YanaHrytsenko opened this issue 3 years ago • 3 comments

Correction: Actually, all three produced different results.


Hello, I compared the output of DSK with the outputs of JellyFish and KMER_COUNTER. I used the same .fa file for all three and generated 7-mers with each package. While all three produced the same number of 7-mers (8192), only JellyFish and KMER_COUNTER produced identical k-mer profiles (i.e. the same k-mers with the same frequencies). DSK differs from both of them by 1344 k-mers. All k-mers were sorted lexicographically, and I used a set difference to calculate the results. Since two out of three produced the same results, I was wondering whether DSK does anything differently. I know that DSK counts canonical k-mers, and I tried searching by the reversed k-mer string, but the missing k-mers still aren't in the output. Could you please let me know if there is something I am missing, perhaps in the flag settings? Thank you.
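One way to rule out a pure representation mismatch is to normalise every profile to the same canonical form before taking the set difference. The sketch below is only an illustration, not how any of the three tools works internally: it assumes each profile has already been loaded into a plain `dict` mapping k-mer strings to counts, and it picks the lexicographically smaller of a k-mer and its reverse complement as the representative, which is a choice made here for the comparison only. The helper names (`reverse_complement`, `canonical`, `normalise`, `diff_profiles`) are hypothetical. For odd k there are 4^k / 2 canonical k-mers, which for k = 7 gives 8192, so a count of 8192 is at least consistent with canonical counting.

```python
from collections import Counter

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(kmer):
    """Complement every base, then reverse the string."""
    return kmer.upper().translate(_COMPLEMENT)[::-1]


def canonical(kmer):
    """Representative chosen for this comparison only (an assumption):
    the lexicographically smaller of the k-mer and its reverse complement."""
    kmer = kmer.upper()
    return min(kmer, reverse_complement(kmer))


def normalise(profile):
    """Merge counts of k-mers that share a canonical representative."""
    merged = Counter()
    for kmer, count in profile.items():
        merged[canonical(kmer)] += count
    return merged


def diff_profiles(a, b):
    """Return {kmer: (count_a, count_b)} for canonical k-mers whose counts differ."""
    a, b = normalise(a), normalise(b)
    return {
        kmer: (a.get(kmer, 0), b.get(kmer, 0))
        for kmer in sorted(set(a) | set(b))
        if a.get(kmer, 0) != b.get(kmer, 0)
    }


# Made-up toy profiles: the same k-mers reported on opposite strands.
tool_a = {"AAAAAAC": 3, "ACGTACG": 2}
tool_b = {"GTTTTTT": 3, "CGTACGT": 2}
print(diff_profiles(tool_a, tool_b))
```

If the difference is empty after normalisation, the tools agree and only report different strands. Note that reversing a k-mer string without complementing it does not give the other strand, so a lookup by the reversed string alone would not find such matches.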

YanaHrytsenko, Nov 04 '20 01:11