Patrick
Results
1
issues of
Patrick
I added scheduled sampling for the batched attention model. I don't have cuda or sconce set up on my computer, but it runs very slowly on a cpu.