alphafold icon indicating copy to clipboard operation
alphafold copied to clipboard

how is alpha fold 3 trained

Open joepareti54 opened this issue 4 months ago • 0 comments

This paper says that the loss function is the weighted sum of L_distogram, L_diffusion and L_confidence. But how is it implemented? taking the derivative to update the weights ? When the processing is in the trunk there is no diffusion yet which occurs later in time. When the processing is in diffusion, the trunk does not seem to execute again. Is there any backpropagation? Moreover the diffusion process training involves predicting the noise injected at a time step, but then the only loss term to base the weight updates ought to be Ldiffusion. How about the attention layers and MLP training ?

joepareti54 avatar Oct 16 '24 14:10 joepareti54