Human-in-the-loop-Deep-Reinforcement-Learning icon indicating copy to clipboard operation
Human-in-the-loop-Deep-Reinforcement-Learning copied to clipboard

(Engineering) Toward human-in-the-loop AI: Enhancing deep reinforcement learning via real-time human guidance for autonomous driving

Results 3 Human-in-the-loop-Deep-Reinforcement-Learning issues
Sort by recently updated
recently updated
newest added

Hello, I find your work is really helpful and I really appreciate it, however I found a bug at critic update stage which affect the final performance. It is in...

Hi, In the paper, actor loss is ![1681374070503_4CE60915-EE65-45c9-B8AF-3BBA26A44F66](https://user-images.githubusercontent.com/57997827/231720952-f13001b9-8969-4e27-a5d3-3fd8d80d01e0.png) but the code that calculate actor loss for human intervention steps didn't consider the first term (see https://github.com/wujingda/Human-in-the-loop-Deep-Reinforcement-Learning/blob/main/TD3_based_DRL/TD3HUG.py#L148) Also, the human intervention...

吴同学, 您好! 您发表的论文《Toward human-in-the-loop AI: Enhancing deep reinforcement learning via real-time human guidance for autonomous driving》十分精彩。也非常感谢您无私的分享了论文的相关程序,我在学习您的算法时,可能发现了程序的一个小问题。主要是当算法训练的一个episode结束后,需要更新replay memory buffer时,errors在脚本中没有被定义。我附上了相应的报错信息。 File "...\TD3_based_DRL\TD3.py", line 115, in learn self.memory.batch_update(tree_idx, abs(errors.detach().cpu().numpy()) ) NameError: name 'errors'...