https://oscar-project.org [email protected]
The Open Super-large Crawled Aggregated coRpus
OSCAR
:spider: The pipeline for the OSCAR corpus
oscar-project
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.