block98k
Results: 2 comments by block98k
Hello. I have a question about the "alternatively training": why do you divide the training into two steps? Why not do both in the same mini-batch, i.e., get y_tilde and then compute...
> > I didn't record the results before, but you can try 50-100 epochs. Btw, the CER shown in the log is not the autoregressive CER. You need to run...