KMC
KMC copied to clipboard
How to generate multi-samples kmer count table?
Hi, I want a count table (m× n) like this:
# kmer seq | sample1 | sample2 | ... |
---|---|---|---|
AAA... | 1 | 2 | ... |
AAC... | 0 | 5 | ... |
AAG... | 4 | 2 | ... |
where m is number of samples or individuals and n is number of kmer species, and each row indicates the counts of a kmer across mutiple samples.
I aim to compare the difference between samples for each kmer (i.e., to identify different kmers) with this count table. Do any way to do this with fast speed and efficient memory cost?