Shang Wang

12 comments by Shang Wang

> I see the meaning of this rule now. I'm actually not sure if you were looking at the right place in the training rules. The table in https://github.com/mlcommons/training_policies/blob/master/training_rules.adoc#94-quality-measure is...

Additional question: it seems that `the_pile/pile.py` only downloads and interleaves the data from the various data sources. `processing_scripts` contains many processing scripts; however, how do we know which script is supposed...