DeepRL-Agents
Low Rewards for DRQN
I tried the DRQN code in both the partially observable and fully observable cases. However, I found that the agent sometimes gets trapped repeating the same action and obtains very low rewards. Have you run into the same problem before? Thanks.
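In case it helps to pin down the "repeated actions" symptom, here is a minimal sketch of how it could be quantified from a logged episode. It is not part of the repository's code: the function names `longest_action_run` and `repeat_fraction` are hypothetical, and it assumes the actions taken in one episode have been collected into a plain Python list of integers.

```python
from itertools import groupby


def longest_action_run(actions):
    """Return (action, length) for the longest run of identical consecutive actions.

    `actions` is assumed to be a list of discrete action ids from one episode.
    Returns (None, 0) for an empty episode.
    """
    best_action, best_len = None, 0
    for action, group in groupby(actions):
        run_len = sum(1 for _ in group)
        if run_len > best_len:
            best_action, best_len = action, run_len
    return best_action, best_len


def repeat_fraction(actions):
    """Return the fraction of steps whose action equals the previous step's action."""
    if len(actions) < 2:
        return 0.0
    repeats = sum(1 for prev, cur in zip(actions, actions[1:]) if prev == cur)
    return repeats / (len(actions) - 1)


if __name__ == "__main__":
    # Hypothetical episode log, for illustration only.
    episode_actions = [0, 1, 1, 1, 2, 2]
    print(longest_action_run(episode_actions))
    print(repeat_fraction(episode_actions))
```

Logging these two numbers next to the episode reward would show whether the low-reward episodes are the ones where the agent is stuck on a single action.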