selfmonitoring-agent icon indicating copy to clipboard operation
selfmonitoring-agent copied to clipboard

Categorical Sampling during training

Open rmant opened this issue 4 years ago • 0 comments

I wonder what the differences are between using categorical sampling to choose the next action during training, vs using a student-forcing method. Is it the same?

Also, does your code provide a way of training with a teacher-forcing regime?

Thank you!

rmant avatar Apr 22 '20 21:04 rmant