MFM
MFM copied to clipboard

Published 20 hours ago •

Reame
Issues

The article states that the decoder uses a linear layer, but the pre-training code uses Conv2d, and I'm confused about this.

Open shidanW opened this issue 11 months ago • 0 comments

I‘m so glad if you could help me with it.

Feb 28 '24 14:02 shidanW