Sung-Lin Yeh

Results 2 issues of Sung-Lin Yeh

The goal of this PR is to support pure CTC training and decoding (beam search). Users can set `ctc_weight: 1` and `ctc_weight_decode: 1` to perform pure CTC training and beam search...
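As a rough illustration of why a weight of 1 gives pure CTC behaviour, here is a minimal sketch. It assumes the weight linearly interpolates between a CTC loss and an attention (seq2seq) loss, as is common in joint CTC/attention setups. The helper name `joint_loss` and its arguments are hypothetical and are not taken from this PR's code.

```python
def joint_loss(ctc_loss: float, att_loss: float, ctc_weight: float) -> float:
    """Hypothetical helper: interpolate a CTC loss and an attention loss.

    Assumption: the weight acts as a linear interpolation factor in [0, 1].
    """
    if not 0.0 <= ctc_weight <= 1.0:
        raise ValueError("ctc_weight must be in [0, 1]")
    # With ctc_weight == 1 the attention term is multiplied by 0 and drops out.
    return ctc_weight * ctc_loss + (1.0 - ctc_weight) * att_loss


# Pure CTC: only the CTC term contributes to the result.
pure_ctc = joint_loss(ctc_loss=2.0, att_loss=5.0, ctc_weight=1.0)
```

Under the same assumption, `ctc_weight_decode: 1` would mean that beam search scores hypotheses with CTC probabilities only.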

enhancement
refactor
work in progress

I should not initialize the previous attention weights with zeros.