ldc_downloader
Script to download corpora from the Linguistic Data Consortium (LDC)
2 ldc_downloader issues
Filenames created by this script are somewhat abnormal, e.g. LDC2016E75, which is described in the 'file name' column of the LDC downloads page (an imperfect guess at the true filename...
The current `grep` matches only on the prefix, so when several corpus IDs share a prefix, multiple rows match and the download fails. For example, - `LDC93S1`: TIMIT dataset -...
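The prefix collision described in this issue can be illustrated with a minimal Python sketch. It is not the script's actual code: the rows, the made-up ID `LDC93S1X`, and the helpers `prefix_matches` and `exact_matches` are all hypothetical, and the whole-token match is only similar in spirit to `grep -w`.

```python
import re

# Hypothetical rows standing in for lines of the downloads listing.
# "LDC93S1" appears in the issue text; "LDC93S1X" is a made-up ID
# chosen only because it shares the same prefix.
rows = [
    "LDC93S1 corpus-a",
    "LDC93S1X corpus-b",
]


def prefix_matches(rows, corpus_id):
    """Mimic a bare substring search such as `grep LDC93S1`."""
    return [row for row in rows if corpus_id in row]


def exact_matches(rows, corpus_id):
    """Match the ID only when it is not surrounded by other word characters."""
    pattern = re.compile(r"(?<!\w)" + re.escape(corpus_id) + r"(?!\w)")
    return [row for row in rows if pattern.search(row)]


loose = prefix_matches(rows, "LDC93S1")
strict = exact_matches(rows, "LDC93S1")
print(len(loose), len(strict))
```

With these two rows, the substring search returns both lines while the whole-token search returns only the row for `LDC93S1`, which is the behavior a downloader needs when it expects exactly one match.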