Robust-Multitask-RL
Robust-Multitask-RL copied to clipboard
How to apply distral algorithm to environments with continuous action space?
I found the RL algorithm in this code is DQN, but DQN can only apply in environments with discrete action space. What about environments with continuous action space?