ColabFold icon indicating copy to clipboard operation
ColabFold copied to clipboard

How to save time to predict multimer structure by reusing previous monomer structure results

Open xiaoxinbioinfo opened this issue 1 year ago • 2 comments

Expected Behavior

I used colabfold to predict two monomer proteins cas9 and deaminase, and I wanted to continue to predict their possible complex structure by multimer method. While the cas9 protein was too large (~1400) AA), how can I adjust the code to reuse the previous monomer structure prediction results so that the prediction time of the multimer structure is greatly reduced? How, if at all, can the code be tweaked to achieve this?

xiaoxinbioinfo avatar Apr 10 '24 06:04 xiaoxinbioinfo

Funny, we have been just discussing this to speed up AlphaPulldown ;-) In the simplest way of implementing it, you should be able to do this by providing precalculated monomeric models (e.g. from AlphaFold DB) as monomeric templates, but I don't know if ColabFold supports multiple custom templates. In AlphaPulldown it should be possible (using the mode for multimeric templates https://github.com/KosinskiLab/AlphaPulldown/blob/main/manuals/example_3.md but turning the multimeric_mode off). Perhaps that would allow reaching good multimeric predictions in less recycles. We were discussing benchmarking it over different numbers of recycles, but maybe @sokrypton, you tested that already? In addition, one could possibly use the monomeric models as ColabFold's initial guess, even if they are in the arbitrary orientation of monomers, but I'm not sure if that would work well.

jkosinski avatar Apr 10 '24 07:04 jkosinski

Colabfold allows multiple custom templates if alignable by HHsearch.

martin-steinegger avatar Apr 10 '24 08:04 martin-steinegger