easy-rl dqn算法问题

dqn算法问题

Open ecsfu opened this issue 1 year ago • 0 comments

在dqn的更新中，为什么没有下面的代码，不用复制策略网络？ if self.sample_count % self.target_update == 0: # 每隔一段时间，将策略网络的参数复制到目标网络 self.target_net.load_state_dict(self.policy_net.state_dict())

Feb 08 '24 08:02 ecsfu

easy-rl easy-rl copied to clipboard

dqn算法问题

easy-rl
easy-rl copied to clipboard