fortytwotokens
Thanks for sharing. In your paper, you mentioned that you used SGD, but your code uses Adam instead. I changed Adam to SGD, but the model could not converge. Why...