WizardLM
WizardLM copied to clipboard
78k evolved code instructions
Hi WizardCoder team,
Is the dataset( 78k evolved code instructions) available for download? Thank you very much
+1 to this
+1 to this
+1 to this
Thank you for your wonderful work. The paper introduces a data volume of 78k, of which 20K comes from Alpaca. Where does the other instruction data come from?
+1, feels Evo-Instruct is even more useful for Code Generation than Normal Chat
+1
+1 to this
+1
+1
+1
I have created an open source version of the dataset. It took me 120,000 API calls over 3 days. The major caveat here is that I didn't do much post-processing as they didn't explain their process in the paper. So this uncleaned version of my dataset may not have the same performance as the paper. Feel free to use the following (It's also on HF Hub):
https://github.com/nickrosh/evol-teacher
@nickrosh Legend! Thank you.