Tatiana Likhomanenko comments

Results 242 comments of


                                            Tatiana Likhomanenko

trafficstars

Empty predictions/100 WER after training conv_glu on a different language

Ohh, you run test.cpp not decode.cpp. For decode.cpp "show" shows words transcriptions while "showletters" show the tokens transcription. For test.cpp we are printing only words transcription always so showletters is...

Empty predictions/100 WER after training conv_glu on a different language

@vineelpratap here is ctc criterion, not asg in the config.

Empty predictions/100 WER after training conv_glu on a different language

First, is this audio really 285476ms ~= 5min with this short transcription? because then it makes totally sense for the model training on one this sample to predict all blanks/silence,...

Empty predictions/100 WER after training conv_glu on a different language

yep, correct, you can use any number. Just was wondering if the problem with the very long input training. No idea for now why it doesn't work. First I would...

AMI data preparation "AssertionError: Unknown speaker!"

Hi @alkazap, Thanks for finding the bug, could you mind to send PR on the fix?

Pre-trained models

For most papers we release pre-trained models (acoustic and language models), so you can check here https://github.com/facebookresearch/wav2letter/tree/master/recipes/models in each folder, mainly in the readme there could be links to the...

[What is good learning rate and batchsize for training conv_glu (wav2letter) 2016? ]

One more thing: are you fine tuning the wav2vec features with the whole net or not? First start with frozen wav2vec features.

[What is good learning rate and batchsize for training conv_glu (wav2letter) 2016? ]

I think better to ask directly Mr Mai Long how he reproduced then. As far as I know in original paper they use frozen wav2vec features.

Is it possible that Huggingface Transformers are ported to C++ using Flashlight?

Yep, you can implement Transformer library with Flashliligh and we already have several implementations for it. What do you mean exactly by "Is such a feature planned to be released...

Installation with OpenBLAS instead of MKL

I think we don't support this option cc @jacobkahn