HuggingFaceH4/oasst1_en - missing dataset

Open • erap129 opened this issue 1 year ago • 1 comment

Hello, I wish to reproduce the StarChat training for educational purposes, but I see the dataset (HuggingFaceH4/oasst1_en) has been removed. Is there any way to download it?
If not, any suggestions for similar datasets? I want to use the current code (chat/train.py) with the least amount of friction.

erap129 • Nov 27 '23 19:11

Hi, can anyone help find the dataset?

jiagaoxiang • Feb 28 '24 01:02