Patrick

Results: 1 issue by Patrick

I added scheduled sampling to the batched attention model. I don't have CUDA or sconce set up on my computer, so I could only run it on a CPU, where it is very slow.
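For anyone unfamiliar with the idea, here is a minimal sketch of scheduled sampling in plain Python. It is not the code from this change: `step_fn`, `decode_with_scheduled_sampling`, and `teacher_forcing_prob` are hypothetical names, and a toy function stands in for the real decoder step.

```python
import random


def decode_with_scheduled_sampling(step_fn, targets, start_token,
                                   teacher_forcing_prob, rng=None):
    """Decode one sequence, picking each next input by scheduled sampling.

    step_fn(prev_token) -> predicted_token is a hypothetical stand-in
    for a single decoder step of the real model.
    """
    rng = rng or random.Random()
    prev = start_token
    predictions = []
    for target in targets:
        pred = step_fn(prev)
        predictions.append(pred)
        # With probability teacher_forcing_prob feed the ground-truth token
        # (teacher forcing); otherwise feed the model's own prediction.
        if rng.random() < teacher_forcing_prob:
            prev = target
        else:
            prev = pred
    return predictions


# Toy decoder step (assumption for illustration): predicts prev + 1.
toy_step = lambda token: token + 1

always_teacher = decode_with_scheduled_sampling(toy_step, [10, 20, 30], 0, 1.0)
never_teacher = decode_with_scheduled_sampling(toy_step, [10, 20, 30], 0, 0.0)
```

With a probability of 1.0 this reduces to ordinary teacher forcing, and with 0.0 the decoder always consumes its own output. In training, the probability would typically start high and be lowered over time.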