fortytwotokens

Results 1 issues of fortytwotokens

thx for sharing in ur paper, u mentioned that u used sgd but ur code used adam instead i changed sgd to adam, but the model couldn't get converged, why...