Open-Assistant
Open-Assistant copied to clipboard
Create homework-lab essays dataset
Create essays dataset from https://homework-lab.com/examples/
Actually I did it already. Here is the result: https://huggingface.co/datasets/qwedsacf/homework-lab-essays But I only scraped the data without preprocessing. Essays were in .doc and .docx files so I extracted text via textract library. So there are a lot of spaces and tabulations in the texts.
You are a hero :)