ericbolo comments

Results 35 comments of


                                            ericbolo

Real-time decoding

Thank you for your response! With the current BiLSTM setup, what would be required is then a feature window that slides over the input and includes for each input all...

For online decoding with neural nets, Kaldi recommends constructing an i-vector that summarizes speaker properties, and training the neural net with audio features + i-vector. In the absence of past...

Real-time decoding

Ok, I now have a pretty good understanding of the diarization/speaker normalization issues, none of them insurmountable in my application. For now, I can focus on the decoding of the...

Real-time decoding

Thank you for the link. I gave it a quick read, my only worry is that in the paper the data is pre-segmented with GMM/HMM, and the training does not...

Real-time decoding

Elaborating: from the paper I understand they sub-sample the data to avoid overfitting, so we don't have access to all the outputs of the utterance, possibly hampering CTC loss.

Real-time decoding

I've stumbled on this paper, which proposes a kind of BLSTM that is compatible with online decoding: http://ieeexplore.ieee.org/document/7953176/ Thought it might be of interest

Real-time decoding

I'm still very much interested in online decoding, yes, but unfortunately my hands are full this month and the next. If anyone is interested to work on this with me...

Real-time decoding

Hi Eric, I'm still interested in online decoding but company priorities have caught up to me and I can't do it single-handedly . This said, if we can team up...

Real-time decoding

From @fmeze 's answer above, implementing online decoding requires (1) audio streaming capability: work required but not exploratory, probably some open source tools available, (2) overhauling the model to be...

Real-time decoding

Regarding the branch, I for one am a lot more comfortable with tf. And sorry but what do you mean by "python in the loop". And what would be the...