lbx73737373

Results 2 comments of lbx73737373

请问m1的话是要下载toolkit源码,再编译嘛

It is more common to set dim_feedforward == hidden_dim * nheads