DRL-FlappyBird icon indicating copy to clipboard operation
DRL-FlappyBird copied to clipboard

How do 1 and -1 reward be used?

Open guotong1988 opened this issue 7 years ago • 1 comments

I find from here that all the rewards are add into the deque. We need to sample the 1 and -1 reward from the deque to use them. So do you think it may be slow.

In Chinese:是不是reward为1和-1的情况也都放在deque里,那么reward为1和-1的被sample出来的几率岂不是很低,反馈就会很慢?

@songrotek Thank you.

guotong1988 avatar Mar 07 '17 08:03 guotong1988

https://github.com/yenchenlin/DeepLearningFlappyBird/issues/32

guotong1988 avatar Apr 12 '17 11:04 guotong1988