practical-pytorch icon indicating copy to clipboard operation
practical-pytorch copied to clipboard

End-of-sentence never predicted

Open brijmohan opened this issue 6 years ago • 0 comments

Hi,

Thanks for the wonderful explanation. I am using this code as guideline to build a speech recognition network. I am giving speech frames (sequence of 40-dim feature vectors) as input to the encoder and trying to predict characters as output of the decoder. Speech frames can be very large in number (>1000) as compared to output length (<100).

So, I have set MAX_LENGTH of attention to be 5000. Unfortunately, it never predicts <eos> token and keeps predicting till 5000 characters.

I am using bidirectional LSTM as encoder. I concatenate the output and hidden from encoder to feed in the decoder, which is just LSTM.

I would highly appreciate any directions.

Thanks Brij

brijmohan avatar Nov 21 '18 18:11 brijmohan