Sylvain Marchienne
Results
2
issues of
Sylvain Marchienne
Hi, According to most of PyTorch REINFORCE algorithm implementations, the policy gradient loss should sum the `log_probs` on the trajectory (sum over t=1...T) instead of computing the mean. In the...
Hi, Is there any version of this dataset already preprocessed with frames represented as vectors passed through a CNN model? This would be very convenient to have, for instance like...