Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Add Hippocorpus dataset script

Open MightyAlex200 opened this issue 2 years ago • 1 comments

This adds the Hippocorpus dataset script from #728 It's a notebook file which converts an existing Hippocorpus dataset into a Parquet file. This PR should stay a draft until the code quality and output quality is confirmed.

MightyAlex200 avatar Jan 16 '23 00:01 MightyAlex200

:x: pre-commit failed. Please run pre-commit run --all-files locally and commit the changes. Find more information in the repository's CONTRIBUTING.md

github-actions[bot] avatar Jan 16 '23 00:01 github-actions[bot]