DeepRL-Tutorials
DeepRL-Tutorials copied to clipboard
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped and I suggest that you'd better use RMSprop as the optimizer and reduce the...
``` IndexError Traceback (most recent call last) in 43 try: 44 clear_output(True) ---> 45 plot_all_data(log_dir, env_id, 'DoubleDQN', config.MAX_FRAMES, bin_size=(10, 100, 100, 1), smooth=1, time=timedelta(seconds=int(timer()-start)), ipynb=True) 46 except IOError: 47 pass...