DRL-FlappyBird
Playing Flappy Bird Using Deep Reinforcement Learning (based on Deep Q-Learning (DQN), implemented in TensorFlow)
Hi, thanks for your nice code and documentation. I saw the report by Kevin Chen [http://cs229.stanford.edu/proj2015/362_report.pdf], where he experimented with three difficulty levels (easy, medium, hard) of the game. Can...
I don't understand the purpose of copyTargetQNetwork. Why do we need QValueT to evaluate QValue_batch? Is it to make the training process more stable?
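For what it's worth, the usual answer is yes: QValueT is a *target network*, a frozen copy of the online Q-network that copyTargetQNetwork refreshes periodically, so the regression target doesn't shift on every gradient step. A minimal NumPy sketch of the idea (the toy linear network and variable names here are illustrative, not the repo's actual code):

```python
import numpy as np

# Hypothetical tiny linear Q-network: Q(s) = s @ W + b, with 4 state features
# and 2 actions. "online" plays the role of QValue, "target" of QValueT.
rng = np.random.default_rng(0)
online_W, online_b = rng.normal(size=(4, 2)), np.zeros(2)  # updated every step
target_W, target_b = online_W.copy(), online_b.copy()      # frozen copy

def q_target(s):
    return s @ target_W + target_b

def copy_target_q_network():
    """Sync the frozen target net with the online net (the copyTargetQNetwork step)."""
    global target_W, target_b
    target_W, target_b = online_W.copy(), online_b.copy()

# The TD target uses the *frozen* network, so it stays fixed while the
# online weights are being trained toward it -> more stable learning.
gamma = 0.99
r, s_next = 1.0, rng.normal(size=4)
y = r + gamma * np.max(q_target(s_next))  # regression target for Q(s, a)

# ... gradient step on (Q_online(s)[a] - y)**2 would go here ...

copy_target_q_network()  # done every N training steps, not every step
```

Without the copy, the target y would move with every update of the online weights, which is exactly the instability the target network is meant to damp.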
I read the code [here](https://github.com/songrotek/DRL-FlappyBird/blob/master/BrainDQN_NIPS.py#L89-L108). Why does the program only use the current state and the next state? Why does using just those two states work? Thank you @songrotek
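The short answer is that one-step Q-learning only ever needs a single transition (s, a, r, s'): under the Markov assumption, s' summarizes everything the future depends on, so the Bellman target can be formed from that pair alone. A tabular sketch (state/action sizes and values here are made up for illustration):

```python
import numpy as np

# Tabular one-step Q-learning: each stored transition (s, a, r, s_next, done)
# is sufficient to form the Bellman target r + gamma * max_a' Q(s_next, a').
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma = 0.5, 0.9

def update(s, a, r, s_next, done):
    # If the episode ended, there is no future value to bootstrap from.
    target = r if done else r + gamma * Q[s_next].max()
    Q[s, a] += alpha * (target - Q[s, a])

update(s=0, a=1, r=1.0, s_next=2, done=False)
print(Q[0, 1])  # 0.5 * (1.0 + 0.9 * 0 - 0) = 0.5
```

The deep version in BrainDQN_NIPS.py does the same thing, with the table replaced by a network and the update replaced by a gradient step.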
I see [here](https://github.com/songrotek/DRL-FlappyBird/blob/master/BrainDQN_NIPS.py#L88) that all the rewards are added to the deque. We need to sample the +1 and -1 rewards from the deque in order to use them. So...
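A minimal sketch of how that replay deque works, assuming uniform minibatch sampling as in standard DQN (the reward values and sizes below are illustrative, not the repo's constants):

```python
import random
from collections import deque

CAPACITY = 1000   # illustrative; the repo uses its own REPLAY_MEMORY constant
BATCH_SIZE = 4

replay = deque(maxlen=CAPACITY)

# Every transition is appended, whatever its reward: the frequent small
# "survival" rewards as well as the sparse +1 (pipe passed) and -1 (crash).
for t in range(200):
    reward = random.choice([0.1, 0.1, 0.1, 1.0, -1.0])
    replay.append((f"s{t}", 0, reward, f"s{t+1}", False))

# Training then draws a uniform random minibatch; the sparse +1/-1 rewards
# are picked up in proportion to how often they occur in memory, with no
# special sampling step for them.
minibatch = random.sample(replay, BATCH_SIZE)
rewards = [transition[2] for transition in minibatch]
```

If the concern is that ±1 rewards are too rare under uniform sampling, that is the motivation for prioritized experience replay, which weights sampling by TD error rather than treating all transitions equally.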