tensorflow-speech-recognition Getting some ideas from Wavenet?

Getting some ideas from Wavenet?

Open grapemix opened this issue 7 years ago • 4 comments

Since this project is still in planning stage, I guess we are more open for new ideas. The README mentioned the LSTM, but Wavenet yields better results than LSTM accoring to DeepMind's paper. The Wavenet is explained in the following white paper. Do you think it will be too difficult for us to use the Wavenet approach?

https://drive.google.com/file/d/0B3cxcnOkPx9AeWpLVXhkTDJINDQ/view

Thanks.

Sep 10 '16 01:09 grapemix

Somebody did it already: https://github.com/ibab/tensorflow-wavenet

Sep 15 '16 01:09 andrenatal

@andrenatal That implementation doesn't do STT though right? It's an implementation of the generative material stated in the whitepaper I believe.

Sep 26 '16 15:09 thomasmurphycodes

@andrenatal , ty so much. That repo sounds really very interesting. I am not sure if anyone further discuss this direction. Since it is suggestion ticket, I will still leave this ticket open, but if anyone think the discussion is enough, feel free to close this ticket. And thanks all for your time.

Oct 27 '16 05:10 grapemix

1-d dilated/atrous convolution is the way to go ...

Nov 16 '16 10:11 pannous

tensorflow-speech-recognition tensorflow-speech-recognition copied to clipboard

Getting some ideas from Wavenet?

tensorflow-speech-recognition
tensorflow-speech-recognition copied to clipboard