
Is it possible to output the source reads of each k-mer?

Open • bzvew opened this issue 7 years ago • 1 comment

If the names of the reads containing each k-mer could be retrieved, I think Jellyfish could be used for more creative work than just counting k-mers, e.g. removing reads that contain a high-occurrence k-mer.
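To make the read-removal idea concrete, here is a minimal pure-Python sketch. It does not call Jellyfish at all: the counting is done with `collections.Counter`, and the function name `drop_reads_with_frequent_kmers`, the in-memory `reads` dict, and the `max_count` threshold are all hypothetical names chosen for illustration. It also treats k-mers as plain strings, so reverse complements are not merged.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every k-mer of seq, left to right."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def drop_reads_with_frequent_kmers(reads, k, max_count):
    """Keep only reads in which no k-mer occurs more than max_count times overall.

    reads: dict of read name -> sequence (hypothetical in-memory input).
    """
    # First pass: count every k-mer across all reads.
    counts = Counter()
    for seq in reads.values():
        counts.update(kmers(seq, k))
    # Second pass: keep a read only if all of its k-mers are at or below the threshold.
    return {
        name: seq
        for name, seq in reads.items()
        if all(counts[km] <= max_count for km in kmers(seq, k))
    }


reads = {
    "r1": "ACGTAC",
    "r2": "ACGTTT",
    "r3": "GGGCCC",
}
kept = drop_reads_with_frequent_kmers(reads, k=4, max_count=1)
print(sorted(kept))  # r1 and r2 share ACGT, so only r3 survives
```

For real data sets the first pass is the part a dedicated k-mer counter would replace; the second pass over the reads is what associates k-mers back to read names.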

bzvew, Jan 10 '18 12:01

Hi @bzvew and Guillaume @gmarcais,

Following up on the request above: we have just used the subset k-mer counting and are pretty happy with the results. Also pulling out the (relatively few) read IDs/sequences which match the k-mer counts would be fantastic, ideally in a separate file. Is this information internally stored by Jellyfish at some (final?) stage, or would it require additional coding to store reads once they have been identified as a hit/target during the hashing steps?
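As an illustration of what such a separate output could contain, the sketch below scans reads a second time and records which read IDs contain each target k-mer. This is an assumption about a possible workaround, not a description of Jellyfish internals, which this thread does not cover. The function `reads_by_target_kmer`, the `targets` set, and the tab-separated output layout are hypothetical.

```python
from collections import defaultdict


def reads_by_target_kmer(reads, targets, k):
    """Map each target k-mer to the IDs of the reads that contain it.

    reads:   iterable of (read_id, sequence) pairs (hypothetical input).
    targets: set of k-mers of length k to look for.
    """
    index = defaultdict(list)
    for read_id, seq in reads:
        seen = set()  # report each read at most once per k-mer
        for i in range(len(seq) - k + 1):
            km = seq[i:i + k]
            if km in targets and km not in seen:
                seen.add(km)
                index[km].append(read_id)
    return dict(index)


reads = [("read_1", "ACGTAC"), ("read_2", "TTACGT"), ("read_3", "GGGCCC")]
hits = reads_by_target_kmer(reads, targets={"ACGT", "GCCC"}, k=4)

# One line per k-mer: the k-mer, a tab, then the matching read IDs.
for kmer in sorted(hits):
    print(kmer, ",".join(hits[kmer]), sep="\t")
```

Because the target set is small in the subset-counting scenario described here, the index stays small even when the read file is large.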

Thanks a lot, Max

MaximilianStammnitz, May 10 '18 11:05