foldseek icon indicating copy to clipboard operation
foldseek copied to clipboard

clustering of multimers

Open adrienchaton opened this issue 1 year ago • 6 comments

Hello,

I am trying to cluster pdbs with several chains, in this case they are all dimers (antibodies).

I have not found whether or not foldseek would allow that but since I could run latest foldseek version on a folder with monomers, I guess the issue I am encountering now is related to limitations of the algorithm to work on single chains only ?

foldseek createdb ./pdbs DB this one works fine but at the second step to call foldseek search then I get an error

k-mer similarity threshold: 78
Starting prefiltering scores calculation (step 1 of 1)
Query db start 1 to 2531172
Target db start 1 to 2531172
Segmentation fault (core dumped)                                  ] 0.00% 1 eta -
Error: Kmer matching step died

Is there a procedure to cluster dimers with FoldSeek please ? Since I will work with antibodies, it is fine for me to assume all the pdbs have the same number of chain = 2.

Thanks!

adrienchaton avatar May 11 '23 10:05 adrienchaton

Currently this is not supported. We are working on this.

martin-steinegger avatar May 25 '23 06:05 martin-steinegger

thanks for your reply and the good job done! that will be a great addition to this tool

adrienchaton avatar May 25 '23 10:05 adrienchaton

one thought I had, could anything related to the glycine linker trick as in e.g. AF multimer, be applicable to FoldSeek?

adrienchaton avatar May 25 '23 10:05 adrienchaton

I'm working on the same

I tried to do a clustering of antibodies based on the orientation of the VH/VL domains.

And I had to designate the same chains id of both as one (I treated H and L chains as A) and I renumbered the residues so as not to have overlapping numbers.

edumenezes77 avatar Jun 01 '23 21:06 edumenezes77

Hi, have you tried this linker or other tools in dimer structure clustering? I have meet the same issue

sirius777coder avatar Jun 24 '24 12:06 sirius777coder