foldseek icon indicating copy to clipboard operation
foldseek copied to clipboard

Cluster purity analysis with structurealign?

Open BinhongLiu opened this issue 1 year ago • 4 comments

Expected Behavior

Hi, I found the cluster purity analysis using structurealign here (https://www.biorxiv.org/content/10.1101/2023.03.09.531927v1). The representative structure was aligned to the cluster members using the "structurealign -e INF -a" module in Foldseek to calculate the average LDDT and average TM-score per cluster. Could you provide a more detailed guide about this? I'm not sure if I need to complete the analysis with a loop script.

BinhongLiu avatar Apr 16 '23 13:04 BinhongLiu

We calculated the TM-scores and LDDT scores using 3Di/AA structural alignments (structurealign). To obtain the TM-score and average LDDT score for the alignments we used convertalis modul.

martin-steinegger avatar Apr 26 '23 16:04 martin-steinegger

Should this analysis be completed in the clusters one by one using a loop script?

BinhongLiu avatar Apr 28 '23 12:04 BinhongLiu

Could you please provide more details on that topic? Eg. how did you generate prefilterdb comprising all query-target alignments which is a required input for structurealign?

EDIT: isn't it so that all that steps can be done using just one command easy-search with --exhaustive-search 1?

PawelSzczerbiak avatar Jan 25 '24 08:01 PawelSzczerbiak

We do have the scripts how to compute the purity per cluster here: https://github.com/steineggerlab/afdb-clusters-analysis

martin-steinegger avatar Jan 26 '24 04:01 martin-steinegger