Yves-Noel Weweler

Results 5 comments of Yves-Noel Weweler

I just take a look at all the outputs produced by `datasets` using the different log-levels. As far as i can tell using `datasets==1.17.0` they overall issue seems to be...

Did you just re-compile festival using gcc-4.8 or did you re-compile both speech_tools and festival? I forced gcc to be gcc-4.8 and executed `compile_other_speech_tools.sh` again. That seems to have solved...

It would be really great if you could provide some numbers to get a glimpse of the capabilities of this approach.

Since I also had issues with files beeing corrupt I computed the md5-sums for comparison. | MD5 | File | |----------------------------------|-----------------------------| | c702b68b84642b289b6de7b87bf004eb | DocBank_500K_ori_img.zip.001 | | a2328a17e582db16611483f218f7fac2 | DocBank_500K_ori_img.zip.002...

```bash wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.001 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.002 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.003 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.004 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.005 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.006 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.007 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.008 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.009 wget https://layoutlm.blob.core.windows.net/docbank/dataset/DocBank_500K_ori_img.zip.010 ``` Since I also had issues with files beeing...