Michael J Tanana comments

Results 29 comments of


                                            Michael J Tanana

Regarding the Language Model used

This is equivalent to the implementation before adding the language model. If you wanted to replicate the paper, you would spit out the top 1000 beams from the DS model...

Regarding the Language Model used

1. This should be easier than other problems because you can just take the outcome probabilities and walk through them saving the top 1000 at each step. (This will be...

Regarding the Language Model used

And note...the paper mentioned some weighting between the score from the DS model and the score from the LM....wasn't clear if this was estimated or set like a hyper-param

Regarding the Language Model used

I'm still playing with the base model code, but once I get better results, I'd be happy to help with this part...but I'm a month or two from where I'll...

running pretrained net on CPU only

If it was a pre-trained GPU model, you'd just have to de-cuda the model on a platform with a GPU (otherwise it won't load) and then re-save it... it's a...

High values of WER on Libri Dataset (test-clean and dev-clean)

Does the 1080 have 6GB? I'm not sure if that will be able to fit the full model. If you look back at my responses to the thread on running...

High values of WER on Libri Dataset (test-clean and dev-clean)

#71 There's a comment from me near the bottom that helps with memory..

High values of WER on Libri Dataset (test-clean and dev-clean)

Don't forget to downsize the minibatch for testing too

RAM Usage keep increasing

I had this issue as well. Is it confirmed that the memory leak happens on the CPU as well? I remember having some memory leaking for this project in CUDA...

RAM Usage keep increasing

CollectGarbage isn't doing the trick...I remember this was the case with my bug as well..I'll keep looking