Jim-Song
Results
1
issues of
Jim-Song
the attention alignment converge in 35000 steps, while r = outputs_per_step = 2, like this:  and converge to a very good rusult after 105000 steps, like this:  but...