helo-word
helo-word copied to clipboard
bad result when i replicate this model
when I replicate this model by the instruction on track1, I just got the f score about 28 on valid set, less than 10 on test set. Have anyone replicate this model? Does it work? Or someone gives me some instructions?
The instruction was made for the base model, you can try the t2t model for good result