rl_a3c_pytorch
A3C LSTM Atari with PyTorch plus A3G design
SpaceInvaders-v0, please
Is it normal to get this traceback in the console? It spams a few dozen times and then stops abruptly. Then, it starts logging the training session as...
Trained MsPacman model for 2 days. Averaging 6.7k score per 100 epochs, high score ~ 10.5k
Just want to clarify that there is only one saved model per environment and it will be overwritten each training epoch, right? For example, MsPacman will only have one saved...
Hi, due to a lack of resources, I can't train the models myself. Therefore, I need the pretrained models for the various games. Is it possible for you to share...
It seems SharedAdam does not acquire a lock when updating the network parameters. Isn't there a process-safety problem without a lock?
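To make the concern in the question above concrete, here is a minimal sketch of how an unlocked read-modify-write can lose an update. It is a deterministic single-process simulation, not code from the repository: the function names, the scalar "parameter", and the learning rate are all hypothetical. Lock-free updates of this kind are often tolerated on purpose in asynchronous training (Hogwild-style), but whether that is the intent here is not stated in the excerpt.

```python
def interleaved_updates(param, grads, lr=0.1):
    """Simulate workers that all read the same stale value before any write.

    Hypothetical illustration only: every worker computes its update from
    the same snapshot, so the last write overwrites the earlier ones.
    """
    stale = param  # every worker reads before any worker writes
    writes = [stale - lr * g for g in grads]
    return writes[-1]  # last writer wins; earlier updates are lost


def serialized_updates(param, grads, lr=0.1):
    """Simulate workers that hold a lock for the whole read-modify-write."""
    for g in grads:
        param = param - lr * g  # each worker sees the previous worker's result
    return param


if __name__ == "__main__":
    grads = [1.0, 2.0]
    print("no lock  :", interleaved_updates(0.0, grads))
    print("with lock:", serialized_updates(0.0, grads))
```

In the unlocked case only the second worker's gradient takes effect; in the serialized case both do. With many small, sparse updates the lost contributions are often treated as acceptable noise, which is the usual argument for skipping the lock.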
Hi, I have a question about the following block: https://github.com/dgriff777/rl_a3c_pytorch/blob/eb5c9b909abc02911b45e325f7a7c619d3b0fa46/test.py#L60
```
if player.done and not player.info:
    state = player.env.reset()
    player.eps_len += 2
    player.state = torch.from_numpy(state).float()
    if gpu_id >= 0:...
```
Hi @dgriff777. Thank you for your repo. It's great that it can achieve such a high score. But I ran into a problem when I tried to apply it to...
Is there a reason why the default for eps in the adam optimizer is so high? Currently, it is 1e-3 [line 104 in shared_optim.py]. Usually, it's around 1e-08. Just wanted...
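As a rough illustration of why the choice of `eps` matters, the sketch below computes the size of the first Adam step for a single scalar gradient using the standard update rule, with `eps` added to the square root of the second-moment estimate. The function name, the learning rate, and the example gradient are assumptions made for illustration; only the two `eps` values (1e-3 and 1e-8) come from the question, and this is not the repository's `shared_optim.py`.

```python
import math


def adam_first_step(grad, lr=1e-4, beta1=0.9, beta2=0.999, eps=1e-8):
    """Return the magnitude-signed first Adam update for one scalar gradient.

    Standard Adam with bias correction; lr and the betas are assumed
    illustrative defaults, not values taken from the repository.
    """
    m = (1 - beta1) * grad          # first-moment estimate after one step
    v = (1 - beta2) * grad * grad   # second-moment estimate after one step
    m_hat = m / (1 - beta1)         # bias-corrected first moment (equals grad)
    v_hat = v / (1 - beta2)         # bias-corrected second moment (equals grad**2)
    return lr * m_hat / (math.sqrt(v_hat) + eps)


if __name__ == "__main__":
    small_grad = 1e-4  # hypothetical small gradient
    for eps in (1e-8, 1e-3):
        step = adam_first_step(small_grad, eps=eps)
        print(f"eps={eps:g}: step / lr = {step / 1e-4:.4f}")
```

On the first step the update reduces to `lr * g / (|g| + eps)`. When `eps` is tiny, the step is close to `lr` regardless of gradient scale; when `eps` is comparable to or larger than the gradient, the step shrinks and the optimizer behaves more like plain scaled gradient descent, which damps updates driven by very small gradients.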