Xiaotian Han issues

Results: 1 issue by Xiaotian Han

SFT training doesn't fully go through all samples

Current training uses ConstantLengthDataset. This dataset returns a fixed length of tokens (2048) at every step; however, the total number of steps is calculated from the number of samples. I...
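
The mismatch described above can be illustrated with a little arithmetic. The following is only a rough sketch in plain Python, not the library's actual code: the helper names, the batch size of 8, and the 5000-token sample length are hypothetical, and separator tokens between packed samples are ignored. It compares a step count derived from the number of samples with the step count needed to consume every packed 2048-token sequence.

```python
import math


def steps_from_samples(num_samples: int, batch_size: int) -> int:
    """Steps per epoch if one step is assumed to consume `batch_size` samples."""
    return math.ceil(num_samples / batch_size)


def steps_from_tokens(sample_lengths, seq_length: int, batch_size: int) -> int:
    """Steps per epoch if one step consumes `batch_size` packed sequences.

    Packing is modeled as concatenating all tokens and cutting them into
    full chunks of `seq_length`; a trailing partial chunk is dropped.
    """
    total_tokens = sum(sample_lengths)
    num_chunks = total_tokens // seq_length
    return math.ceil(num_chunks / batch_size)


# Hypothetical corpus: 1000 samples of 5000 tokens each.
sample_lengths = [5000] * 1000
seq_length = 2048   # fixed length mentioned in the issue
batch_size = 8      # assumed for illustration

planned = steps_from_samples(len(sample_lengths), batch_size)
needed = steps_from_tokens(sample_lengths, seq_length, batch_size)

print(f"steps planned from sample count: {planned}")
print(f"steps needed to see all tokens:  {needed}")
print(f"fraction of data covered:        {planned / needed:.2%}")
```

Under these assumptions, whenever samples average more than 2048 tokens the sample-based count falls short of the token-based one, so an epoch ends before all data has been seen. When samples are shorter than 2048 tokens the relationship reverses and the same data may be revisited.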