Open-Assistant
Resolve issue with ShareGPT_vicuna_unfiltered
In the SFT-8 branch we use the HF dataset gozfarb/ShareGPT_Vicuna_unfiltered. However, gozfarb's account has been nuked from HF, including all of its datasets, so this dataset can no longer be used by anyone who doesn't have it cached.
We need to reach out to gozfarb to get a copy of that dataset and republish it.
why was gozfarb's account deleted?
should we use https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered instead?
> why was gozfarb's account deleted?
Not sure.
> should we use https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered instead?
This seems like the best option, although I think it's slightly less cleaned than gozfarb's latest version.
There is now also: Aeala/ShareGPT_Vicuna_unfiltered
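For anyone patching this locally, here is a rough sketch of falling back across the repo IDs mentioned in this thread. The helper name `load_first_available`, the `CANDIDATE_REPOS` list, and the order of the candidates are my own assumptions, not what the SFT-8 branch does. The loader is passed in as a callable so the snippet stays independent of any particular library version.

```python
from typing import Callable, Iterable, Tuple, TypeVar

T = TypeVar("T")

# Repo IDs mentioned in this thread. The order is an assumption.
CANDIDATE_REPOS = [
    "gozfarb/ShareGPT_Vicuna_unfiltered",  # original, reported removed
    "anon8231489123/ShareGPT_Vicuna_unfiltered",
    "Aeala/ShareGPT_Vicuna_unfiltered",
]


def load_first_available(
    repos: Iterable[str],
    loader: Callable[[str], T],
) -> Tuple[str, T]:
    """Try each repo ID in order; return (repo_id, dataset) for the first success."""
    errors = []
    for repo in repos:
        try:
            return repo, loader(repo)
        except Exception as exc:  # broad on purpose: hub errors differ by library
            errors.append(f"{repo}: {exc!r}")
    raise RuntimeError(
        "no candidate dataset could be loaded:\n" + "\n".join(errors)
    )
```

With the Hugging Face `datasets` library, `loader` could be something like `lambda repo: datasets.load_dataset(repo)`. These repos may need an explicit `data_files` argument, and the file names are not given in this thread, so check each repo before relying on it. Since the mirrors may differ in how much cleaning was applied, it is worth logging which repo ID was actually used.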