ldc_downloader icon indicating copy to clipboard operation
ldc_downloader copied to clipboard

Script to download corpora from the Linguistic Data Consortium (LDC)

Results 2 ldc_downloader issues
Sort by recently updated
recently updated
newest added

filenames created by this script are somewhat abnormal. e.g. LDC2016E75, which is described in the 'file name' column of the ldc downloads page (an imperfect guess at the true filename...

The current `grep` only checks for the prefix; hence, if it has the same prefix, multiple rows are grepped, hence failing to download. For example, - `LDC93S1`: TIMIT dataset -...