Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces copied to clipboard
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
Hi, in the Agent.py line 144 of ddpg, you use `state` to get target_action. I think it should be `state_2`. In the original ddpg.py of stevenpig's implementation, he also uses...
Hello, I'm a new in "large action space". And I'm trying to do some work about large discrete action space. So will it work or could it be applied for...
Hi, jimkon In the original paper, the action for training critic net comes from the full policy. But, in your master, the action is just given by the target actor...
When I change the k_ration in agrs to generate multiple actions, AssertionError shows as: Traceback (most recent call last): File "/Users/xx/Downloads/DROO-master/mec/rlmodel/LDAS/main.py", line 211, in train(args.train_iter, agent, env, evaluate, File "/Users/xx/Downloads/DROO-master/mec/rlmodel/LDAS/main.py",...
Could not find how to adjust exploration rate, exploration-exploitation policy, discount rate, number of warm up steps etc. Please help me out!