Deep-QLearning-Agent-for-Traffic-Signal-Control icon indicating copy to clipboard operation
Deep-QLearning-Agent-for-Traffic-Signal-Control copied to clipboard

Why are Q network and target network the same?

Open QionghuaLiao opened this issue 3 years ago • 1 comments

Usually, the Q Network is trained while the parameters of target network are fixed. And every certain steps, the parameters of Q Network will be copied to Target Network. But when I check your code, I find that the Q Network and the Target Network are the same neural network, which confuses me. Could you please help me out?

QionghuaLiao avatar Dec 27 '21 12:12 QionghuaLiao

你说的机制是为了减少训练的震荡,这个demo项目就没采用这个机制,直接每步都让q网络更新呗,这有啥confuse的。。。。。。。

wxwmd avatar Aug 09 '22 10:08 wxwmd