pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language-modelling-friendly format.