data_tooling
data_tooling copied to clipboard
add files to compute basic stats on pseudo crawl dataset
As discussed with @thomasw21, this PR add basic slurm and python scripts to compute an intermiadiary metadata dataset and some statistics for the Pseudo Crawl dataset