KMC
KMC copied to clipboard
Computing jacquard distances
Hi, I have a set of individuals I want to compute jacquard distances for.
I've produced a kmc database for each individual. One way I thought of to compute these distances is for each pair of individuals, use kmc_tools to make a union and an intersection kmc database.
Then use the number of distinct kmers (could be done with wc -l
of the text dump of the two databases) in the two databases to compute the jacquard distance.
Is there an easier way to compute the jacquard score for two KMC databases?
I've also noticed with some of my union and intersection databses, when I try to use transform to dump them as text kmc_tools hits a segmentation fault 11, why might this be?
Thanks!