Human-in-the-loop-Deep-Reinforcement-Learning issues

Results 3 Human-in-the-loop-Deep-Reinforcement-Learning issues

Sort by recently updated

a bug about critic update

Hello, I find your work is really helpful and I really appreciate it, however I found a bug at critic update stage which affect the final performance. It is in...

EvergrowHook

Hi, In the paper, actor loss is ![1681374070503_4CE60915-EE65-45c9-B8AF-3BBA26A44F66](https://user-images.githubusercontent.com/57997827/231720952-f13001b9-8969-4e27-a5d3-3fd8d80d01e0.png) but the code that calculate actor loss for human intervention steps didn't consider the first term (see https://github.com/wujingda/Human-in-the-loop-Deep-Reinforcement-Learning/blob/main/TD3_based_DRL/TD3HUG.py#L148) Also, the human intervention...

EvergrowHook

论文程序请教

吴同学，您好！您发表的论文《Toward human-in-the-loop AI: Enhancing deep reinforcement learning via real-time human guidance for autonomous driving》十分精彩。也非常感谢您无私的分享了论文的相关程序，我在学习您的算法时，可能发现了程序的一个小问题。主要是当算法训练的一个episode结束后，需要更新replay memory buffer时，errors在脚本中没有被定义。我附上了相应的报错信息。 File "...\TD3_based_DRL\TD3.py", line 115, in learn self.memory.batch_update(tree_idx, abs(errors.detach().cpu().numpy()) ) NameError: name 'errors'...

jelly-961

Human-in-the-loop-Deep-Reinforcement-Learning
Human-in-the-loop-Deep-Reinforcement-Learning copied to clipboard

Metadata

a bug about critic update

question about actor loss

论文程序请教

← Metadata

Owner

Metadata

Human-in-the-loop-Deep-Reinforcement-Learning Human-in-the-loop-Deep-Reinforcement-Learning copied to clipboard

Metadata

a bug about critic update

question about actor loss

论文程序请教

← Metadata

Owner

Metadata

Human-in-the-loop-Deep-Reinforcement-Learning
Human-in-the-loop-Deep-Reinforcement-Learning copied to clipboard