Layer weights for ESM-Fold

Open zhenyuhe00 opened this issue 3 years ago • 4 comments

Hi, congrats on this series of great work! I'm using your pre-trained model for downstream applications. In the ESM-2 paper, a weighted sum of the embeddings from all ESM-2 layers is fed to ESMFold. I wonder whether it would be enough to use only the embeddings from the last layer of ESM-2, since storing the embeddings from every layer is expensive. Would performance drop?

Thanks in advance!

zhenyuhe00 avatar Aug 24 '22 01:08 zhenyuhe00

I'd expect only a very minor performance drop. If you use fixed weights for the weighted sum, you can avoid the additional memory consumption by summing into a single result tensor as you go through the network.
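A minimal sketch of the running-sum idea, assuming the layer weights are fixed and known before the forward pass. The function name `weighted_layer_sum`, the toy `tanh` layers, and the uniform weights are all hypothetical stand-ins for illustration (NumPy is used in place of the real model); this is not ESM-2 or ESMFold code.

```python
import numpy as np

def weighted_layer_sum(x, layers, weights):
    """Return sum_i weights[i] * h_i without keeping every h_i in memory.

    h_0 is the input embedding x, and h_i = layers[i-1](h_{i-1}).
    `weights` must have len(layers) + 1 entries (one per representation).
    """
    if len(weights) != len(layers) + 1:
        raise ValueError("need one weight per layer plus one for the input")
    acc = weights[0] * x          # running result tensor (a new array)
    h = x
    for w, layer in zip(weights[1:], layers):
        h = layer(h)              # only the current layer output is alive
        acc += w * h              # fold it into the result, then move on
    return acc

# Toy stand-in for a stack of transformer layers (assumption, not ESM-2).
rng = np.random.default_rng(0)
mats = [rng.standard_normal((8, 8)) for _ in range(3)]
layers = [lambda h, W=W: np.tanh(h @ W) for W in mats]

# Hypothetical fixed weights: uniform over the 3 layers plus the input.
weights = np.full(len(layers) + 1, 1.0 / (len(layers) + 1))

x = rng.standard_normal((5, 8))   # (sequence length, embedding dim)
out = weighted_layer_sum(x, layers, weights)
```

Peak memory here is one activation plus one accumulator, instead of one tensor per layer. This only works when the weights are available up front; if they were being learned jointly, the per-layer outputs would still be needed for the backward pass.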

tomsercu avatar Aug 24 '22 09:08 tomsercu

Thanks!

zhenyuhe00 avatar Aug 24 '22 09:08 zhenyuhe00

> I'd expect only a very minor performance drop. If you use fixed weights to make the weighted sum you can avoid additional memory consumption by just summing into a result tensor throughout the network.

How can I get the weights? Are they learned when training ESMFold, or just set manually to fixed values? Thanks in advance!

zhenyuhe00 avatar Aug 24 '22 09:08 zhenyuhe00

They will be released with ESMFold.

tomsercu avatar Aug 24 '22 09:08 tomsercu

Excited to share that ESMFold was released on November 1st!

tomsercu avatar Nov 09 '22 21:11 tomsercu

Could you tell us the key under which the layer weights are stored in esmfold_3B_v1.pt? Thank you!

y-hwang avatar Dec 08 '22 23:12 y-hwang