AdvancedLiterateMachinery

Lister converges extremely slowly

Open Apostatee opened this issue 2 years ago • 1 comments

I have not changed anything and am using SynthText for training, but the loss still stays around 3.0 after 2 hours of training?

Apostatee avatar Nov 10 '23 03:11 Apostatee

Hi,

I have re-implemented training using only SynthText, and everything is OK after 2000 steps. [image]

Please check the attention scaling operation here, in case it was changed in your setup. Otherwise, please provide more details.

ccx1997 avatar Nov 10 '23 08:11 ccx1997
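
For readers unsure what "attention scaling" refers to in the reply above, here is a minimal, generic sketch of scaled dot-product attention weights in plain Python. It is not the repository's code: the function names `softmax` and `attention_weights` and the `scale` flag are hypothetical, chosen only for illustration. The idea it shows is that dividing the query-key logits by sqrt(d) keeps the softmax from saturating, and a saturated softmax passes very small gradients, which is one possible cause of a loss that barely moves.

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(query, keys, scale=True):
    # Hypothetical helper: dot-product attention weights for one query.
    # With scale=True the logits are divided by sqrt(d), as in standard
    # scaled dot-product attention; scale=False omits that factor.
    d = len(query)
    factor = 1.0 / math.sqrt(d) if scale else 1.0
    logits = [factor * sum(q * k for q, k in zip(query, key)) for key in keys]
    return softmax(logits)

# Illustrative inputs only (not taken from the issue).
query = [1.0] * 64
keys = [[1.0] * 64, [0.5] * 64, [0.0] * 64]
scaled = attention_weights(query, keys, scale=True)
unscaled = attention_weights(query, keys, scale=False)
```

Comparing `max(scaled)` with `max(unscaled)` shows how much sharper the distribution becomes when the scaling factor is dropped.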