Deep-Reinforcement-Learning-Algorithms-with-PyTorch
Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard
PyTorch implementations of deep reinforcement learning algorithms and environments
I run the Cart_Pole.py and got the error.
Closes #49
The package `0.4.1.post2` does not seem to exist anymore. ``` ➜ Deep_RL_Implementations git:(master) pip3 install -r requirements.txt Collecting numpy==1.15.2 (from -r requirements.txt (line 1)) Using cached https://files.pythonhosted.org/packages/22/02/bae88c4aaea4256d890adbf3f7cf33e59a443f9985cf91cd08a35656676a/numpy-1.15.2-cp36-cp36m-manylinux1_x86_64.whl Collecting torch==0.4.1.post2 (from...
can you add PPO-HER as soon as possible ? i have tried,but failded.help me,please.
It's just to help improve this wondeful repo (so useful to test differents agents) I got some troubles with devices training in a custom environement. L.42 in BaseAgent, I replaced...
Hi, There are some issues that I encounter when running the h_DQN agent. Firstly, this line of code would not work if there is no "CONTROLLER" key in the dictionary....
First of all, great work with the library! My understanding of the library is that it starts training from scratch every time. Would there be an easy way of making...
Any ideas on adding callbacks to this repo? Similar to https://docs.fast.ai/callbacks.html ? I currently have a TensorboardCallback in my fork: [My Repo / branch](https://github.com/josiahls/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/develop/deep_reinforcement_learning_algorithms_with_pytorch/utilities/data_structures/Tensorboard_Callback.py) however it is a bit hacky,...
RuntimeError: Expected object of backend CPU but got backend CUDA for argument #4 'mat1' File "/home/matt/Dropbox/Receiver/Antenna/antenna_sims/pytorch/DRL/Agents/Base_Agent.py", line 128, in run_n_episodes self.step() File "/home/matt/Dropbox/Receiver/Antenna/antenna_sims/pytorch/DRL/Agents/DQN_Agents/DQN.py", line 32, in step self.learn() File "/home/matt/Dropbox/Receiver/Antenna/antenna_sims/pytorch/DRL/Agents/DQN_Agents/DQN.py",...
Add the A2C algorithm which is the synchronous version of the algorithm described in this paper https://arxiv.org/pdf/1602.01783.pdf and described here: https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-8-asynchronous-actor-critic-agents-a3c-c88f72a5e9f2