Xiaotian Han
Current training uses ConstantLengthDataset. This dataset returns a fixed-length sequence of tokens (2048) at every step; however, the total number of steps is calculated based on the number of samples. I...
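To make the mismatch concrete, here is a rough arithmetic sketch. It does not call ConstantLengthDataset; it only mimics the idea of packing tokens into 2048-token blocks. The helper names, the batch size of 8, and the 1000 samples of 512 tokens each are made-up values for illustration, and separator tokens between samples are ignored.

```python
import math


def steps_from_samples(num_samples, batch_size, epochs=1):
    """Step count when one dataset item is assumed to be one raw sample."""
    return math.ceil(num_samples / batch_size) * epochs


def steps_from_packed_tokens(sample_lengths, seq_length, batch_size, epochs=1):
    """Step count when samples are concatenated and cut into fixed-length blocks.

    Assumption: a trailing partial block is dropped and no separator
    tokens are inserted between samples.
    """
    total_tokens = sum(sample_lengths)
    num_blocks = total_tokens // seq_length
    return math.ceil(num_blocks / batch_size) * epochs


# Hypothetical corpus: 1000 samples of 512 tokens each.
sample_lengths = [512] * 1000

by_samples = steps_from_samples(len(sample_lengths), batch_size=8)
by_tokens = steps_from_packed_tokens(sample_lengths, seq_length=2048, batch_size=8)

print(by_samples)  # 125
print(by_tokens)   # 32
```

With these assumed numbers, a step count derived from the sample count is roughly four times larger than the number of steps needed to see every packed block once, so the data would be cycled several times. When samples are longer than 2048 tokens on average, the error goes the other way and training stops before the data is exhausted.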