Chaitanya

12 comments by Chaitanya

I trained a speechconvtransformer_paper model from scratch on 4 GPUs without ASR pretraining, and I'm getting a BLEU score of only 0.33 after 80 epochs. Is that to...

OK, so ASR pretraining isn't an optional step after all, as it previously seemed. It appears to be the only way to get the model to converge.