1999kevin

Results 11 issues of 1999kevin

Hi, I notice that you only use Transformer to generate the parameters of an MLP with ReLU activation. I wonder whether we can use Transformer to generate the parameters of...