openfold icon indicating copy to clipboard operation
openfold copied to clipboard

The procedure of making the self-distillation data.

Open JinyuanSun opened this issue 3 years ago • 2 comments

I'm curious about .pdb files your kindly provided for self-distillation. Were they predicted using official AF2 weights or the openfold initial training weights trained from scratch?

JinyuanSun avatar Oct 27 '22 09:10 JinyuanSun

image Oh, I found this. Still curious about which model was used. I checked some cases, those structures seem not precisely the same as in the Alphafold DB, especially some regions having low plddt.

JinyuanSun avatar Oct 27 '22 09:10 JinyuanSun

Also, a following up discussion, if the sequences were removed greedily, should we be concerned about enriching too many 'orphan' sequences? I see you kept sequences with more than 50 MSA only, I guess there are many sequences that were dropped.

JinyuanSun avatar Oct 27 '22 09:10 JinyuanSun