KMC icon indicating copy to clipboard operation
KMC copied to clipboard

Computing jacquard distances

Open TransGirlCodes opened this issue 5 years ago • 1 comments

Hi, I have a set of individuals I want to compute jacquard distances for.

I've produced a kmc database for each individual. One way I thought of to compute these distances is for each pair of individuals, use kmc_tools to make a union and an intersection kmc database.

Then use the number of distinct kmers (could be done with wc -l of the text dump of the two databases) in the two databases to compute the jacquard distance.

Is there an easier way to compute the jacquard score for two KMC databases?

I've also noticed with some of my union and intersection databses, when I try to use transform to dump them as text kmc_tools hits a segmentation fault 11, why might this be?

Thanks!

TransGirlCodes avatar Sep 19 '18 13:09 TransGirlCodes