Xiaotian Han

Results: 1 issue by Xiaotian Han

Current training uses ConstantLengthDataset. This dataset returns a fixed number of tokens (2048) at every step; however, the total number of steps is calculated from the number of samples. I...
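To make the mismatch concrete, here is a minimal sketch of the arithmetic, not the project's actual training code. It assumes samples are packed into fixed-length sequences of 2048 tokens. The helper names, sample count, average sample length, and batch size are all hypothetical and chosen only for illustration.

```python
import math

SEQ_LENGTH = 2048  # fixed number of tokens per packed sequence (from the issue)


def steps_from_samples(num_samples: int, batch_size: int) -> int:
    """Steps per epoch if every raw sample counted as one training example."""
    return math.ceil(num_samples / batch_size)


def steps_from_packed_tokens(total_tokens: int, seq_length: int, batch_size: int) -> int:
    """Steps per epoch if the data is packed into fixed-length sequences."""
    num_sequences = total_tokens // seq_length  # leftover tokens are dropped
    return math.ceil(num_sequences / batch_size)


# Hypothetical dataset: 10,000 samples averaging 512 tokens each.
num_samples = 10_000
avg_tokens_per_sample = 512
batch_size = 8

sample_steps = steps_from_samples(num_samples, batch_size)
packed_steps = steps_from_packed_tokens(
    num_samples * avg_tokens_per_sample, SEQ_LENGTH, batch_size
)

print(f"steps computed from samples:       {sample_steps}")
print(f"steps needed for packed sequences: {packed_steps}")
```

Under these assumed numbers, a step count derived from the sample count is about four times the number of steps needed to pass over the packed data once, so the run would cover the data several times rather than once. With samples longer than 2048 tokens on average, the mismatch would go the other way.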