mango

Results 22 comments of mango

> ### Reminder > * [x] I have read the README and searched the existing issues. > > ### Reproduction > 有几个问题请教哈: 1 段落之间需要加\n吗 2.如果模型预处理最长能处理4096个token,那么没有样本的长度是不是尽量在4096以内,且稍微小于4096呢 3.一本书处理成多个样本后需不需要shuf打散呢 4.特殊符号,\t ,需要去掉吗 5.有没有想过的资料介绍呢 >...

@yichuangzhang I have the same problem.Please,have you found the corresponding code?