Nicole
Results
4
comments of
Nicole
where did you get the 30k-clean.model ?
I ran the code yesterday and received a result of 156.097 averaged validation PPL, 149.565 averaged test PPL. So I am reading your code and the original.The first different thing...
Yes, pad_token_id should be 0
@Johnny-xyz 你好,ziya-reader的长度限制是8k,采用后向截断,只保留前面的context