Matthias comments

Results 35 comments of


                                            Matthias

M-AILABS Speech Dataset

Found the problem. The latest release of audiomate does not contain the updated common-voice reader. You could just install audiomate from master branch for now. I will just check if...

M-AILABS Speech Dataset

I created a new release (https://pypi.org/project/audiomate/4.0.0/). Let me know if it works.

I fixed another error. But now it seems that there are some corrupt files in the common-voice ds. ``` return rawread.RawAudioFile(path) File "/Users/matthi/Repos/deepspeech-german/.venv/lib/python3.7/site-packages/audioread/rawread.py", line 64, in __init__ self._file = aifc.open(self._fh)...

M-AILABS Speech Dataset

Cannot see any error in the output. The only difference to my implementation is, that i sorted by column 1 (https://github.com/ynop/audiomate/blob/master/audiomate/corpus/io/mozilla_deepspeech.py).

M-AILABS Speech Dataset

Thats already the code for splitting i used. Did it work?

M-AILABS Speech Dataset

For splitting, the only way would be to implement a reader for the common voice csv (https://audiomate.readthedocs.io/en/latest/documentation/new_dataset_format.html#corpus-reader). I haven't checked SWC for a while, so i can't really help you...

Can i Download the finished Model anywhere?

I am not sure, but i think you need the checkpoints (According to https://github.com/mozilla/DeepSpeech#checkpointing). Unfortunately I don't have them anymore.

Does not seem to work

I have used the prebuilt client, that can be downloaded via: ```python util/taskcluster.py --target /path/to/destination/folder```

Does not seem to work

I have used Ubuntu 16.04 in a docker container

Does not seem to work

As you can see in the README, the WER is pretty bad. That is on data similar to the training data (clean speech). I have tried some segments from news,...