foldseek icon indicating copy to clipboard operation
foldseek copied to clipboard

Cluster based on e-value/tmscore ?

Open Wangchentong opened this issue 4 months ago • 3 comments

Expected Behavior

When i run the easy-cluster wih a set of rfdiffusion generated structures, i obeserve that with foldseek cluster program which based e-value will give the ooposite trend, compared to cluster based on tm-score threshold(tmscore cutoff 0.6)

Current Behavior

image The light blue is the total count of scaffold of each length, the drak blue is the count of clusters, why when use e-values there will be less cluster when length increase while use tm-score the trend is opposite? what;s your recommondation cluster creterion when calculates the structure diversity of structure generation model?

Wangchentong avatar Oct 11 '24 04:10 Wangchentong