How to generate a multi-sample kmer count table?

Open zhangrengang opened this issue 2 years ago • 10 comments

Hi, I want a count table (m × n) like this:

# kmer seq sample1 sample2 ...
AAA... 1 2 ...
AAC... 0 5 ...
AAG... 4 2 ...

where m is the number of kmer species (rows) and n is the number of samples or individuals (columns), so each row gives the counts of one kmer across multiple samples.

I want to use this count table to compare samples kmer by kmer (i.e., to identify kmers whose counts differ between samples). Is there a way to do this quickly and with low memory usage?
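
To make the target format concrete, here is a minimal Python sketch of one way such a table could be built. It is not a KMC feature. It assumes each sample has already been dumped to a plain-text file of `kmer<TAB>count` lines, sorted by kmer in the same order across all samples. The function names (`read_dump`, `merge_counts`) and the file layout are hypothetical. It does a streaming k-way merge, so only one table row is held in memory at a time:

```python
import heapq
from typing import Iterable, Iterator, List, Tuple


def read_dump(path: str) -> Iterator[Tuple[str, int]]:
    """Yield (kmer, count) pairs from a text dump (assumed: 'kmer<TAB>count', sorted by kmer)."""
    with open(path) as handle:
        for line in handle:
            kmer, count = line.split()
            yield kmer, int(count)


def _tag(index: int, stream: Iterable[Tuple[str, int]]) -> Iterator[Tuple[str, int, int]]:
    """Attach the sample index to every record so the merge knows which column to fill."""
    for kmer, count in stream:
        yield kmer, index, count


def merge_counts(streams: List[Iterable[Tuple[str, int]]]) -> Iterator[Tuple[str, List[int]]]:
    """Merge sorted per-sample streams into rows (kmer, [count_sample1, count_sample2, ...])."""
    n_samples = len(streams)
    merged = heapq.merge(*(_tag(i, s) for i, s in enumerate(streams)))
    current, row = None, [0] * n_samples
    for kmer, index, count in merged:
        if kmer != current:
            if current is not None:
                yield current, row
            # Start a new row; samples that lack this kmer keep a count of 0.
            current, row = kmer, [0] * n_samples
        row[index] += count
    if current is not None:
        yield current, row


if __name__ == "__main__":
    # Toy in-memory streams standing in for read_dump("sample1.txt"), etc.
    sample1 = [("AAA", 1), ("AAG", 4)]
    sample2 = [("AAA", 2), ("AAC", 5), ("AAG", 2)]
    print("# kmer", "sample1", "sample2", sep="\t")
    for kmer, counts in merge_counts([sample1, sample2]):
        print(kmer, *counts, sep="\t")
```

Memory use in this sketch depends on the number of samples rather than the number of kmers, but it only works if every input is sorted consistently. A pure-Python merge is also likely to be slow on large genomes.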

zhangrengang • Mar 15 '22 05:03