Sylvain Marchienne

2 issues by Sylvain Marchienne

Hi, According to most PyTorch implementations of the REINFORCE algorithm, the policy gradient loss should sum the `log_probs` over the trajectory (sum over t = 1, ..., T) instead of taking their mean. In the...
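To make the sum-versus-mean distinction concrete, here is a minimal sketch in plain Python (no PyTorch, so it runs anywhere). The function name `reinforce_loss`, the `reduction` argument, and the choice to weight each log-probability by a per-step return are assumptions made for illustration; they are not taken from the issue or from any particular repository. In PyTorch the same choice would typically come down to calling `.sum()` or `.mean()` on the tensor of per-step loss terms.

```python
def reinforce_loss(log_probs, returns, reduction="sum"):
    """Surrogate policy-gradient loss for a single trajectory.

    log_probs: log pi(a_t | s_t) for t = 1..T (hypothetical input)
    returns:   the weight applied at each step, e.g. the return G_t
               (an assumption; the issue does not specify the weighting)
    """
    if len(log_probs) != len(returns):
        raise ValueError("log_probs and returns must have the same length")
    if not log_probs:
        raise ValueError("trajectory must contain at least one step")

    # One term per time step: -log pi(a_t | s_t) * G_t
    terms = [-lp * g for lp, g in zip(log_probs, returns)]
    total = sum(terms)

    if reduction == "sum":
        return total
    if reduction == "mean":
        # Same quantity scaled by 1/T, so the gradient is scaled by 1/T too
        return total / len(terms)
    raise ValueError(f"unknown reduction: {reduction!r}")


log_probs = [-0.5, -1.0, -0.25, -0.25]
returns = [4.0, 3.0, 2.0, 1.0]

loss_sum = reinforce_loss(log_probs, returns, reduction="sum")
loss_mean = reinforce_loss(log_probs, returns, reduction="mean")
```

For a single trajectory the two losses differ only by the factor 1/T, which acts like a smaller learning rate. When trajectories have different lengths, however, the mean rescales each trajectory by its own 1/T, so the two reductions no longer give proportional gradients.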

Hi, Is there a version of this dataset that has already been preprocessed, with each frame represented as a feature vector obtained by passing it through a CNN model? This would be very convenient to have, for instance like...