data_tooling

Create license-compliant version of the Pile: subsets

Open albertvillanova opened this issue 4 years ago • 0 comments

Subsets of The Pile:

  • pubmed
  • ubuntu_irc
  • europarl
  • hacker_news
  • nih_exporter

albertvillanova, Dec 03 '21 13:12