hengyi

Results 1 issues of hengyi

Without the proper seq2seq pretraining, the training of the model seems not to be able to converge to a stable point.