Student Tian.

1 issue by Student Tian.

The processed data size is listed as 55G. Is that size correct? If so, could you provide separate download links for the processed SFT data and the pre-training data? Thanks for open-sourcing this. 🙏🏻