simple_dqn
Simple deep Q-learning agent.
Hi, on Ubuntu 16.04.3 LTS the library is named `cv2.x86_64-linux-gnu.so` rather than `cv2.so`, so I believe the symlink should be created like this:
```
sudo apt-get install python-opencv
ln -s /usr/lib/python2.7/dist-packages/cv2.x86_64-linux-gnu.so NEON/.venv/lib/python2.7/site-packages/
```
...
I can get Breakout to play successfully, but Pong gives me the error below. Any suggestions?
```
./play.sh snapshots/pong_141.pkl --replay_size 10 --backend cpu
2017-11-27 13:08:31,419 Using old model serialization format. Serialized the model...
```
I just tried the latest code and found that training has slowed down significantly: it used to run at more than 200 steps_per_second, but now it is around 100. 2017-09-24 15:08:08,844...
Hi, thank you for the great project. While testing simple_dqn I found that its test scores differ from those reported in the DeepMind paper 'Prioritized Experience Replay' (http://arxiv.org/pdf/1511.05952v3.pdf)...
- Keep replay memory (screens, pre and post states) in GPU memory
- Use a transpose kernel to switch to CHWN format
- Training steps per second on the Breakout rom at...
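For the CHWN point above, here is a minimal NumPy sketch of the layout change; the array name, batch size, and frame shape are illustrative assumptions, not taken from simple_dqn's code, and a real GPU transpose kernel would do this on-device rather than via NumPy:

```python
import numpy as np

# Hypothetical batch of Atari screens in NCHW layout:
# (batch, channels/history, height, width)
batch = np.zeros((32, 4, 84, 84), dtype=np.uint8)

# Reorder axes to CHWN, the layout Neon's GPU backend prefers,
# and make the result contiguous in memory.
chwn = np.ascontiguousarray(batch.transpose(1, 2, 3, 0))

print(chwn.shape)  # (4, 84, 84, 32)
```

The batch dimension moving to the innermost position is what lets per-sample columns be accessed with unit stride on the GPU.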
./play.sh gets the rom name wrong. It invokes:
python src/main.py --play_games 1 --display_screen true --load_weights snapshots/space_invaders_200.pkl roms/space.bin
when it should use roms/space_invaders.bin
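One way to avoid this class of bug is to strip only the trailing epoch number from the snapshot filename instead of cutting at the first underscore, so multi-word game names survive. A hypothetical sketch (the helper name and path layout are mine, not from play.sh):

```python
import os

def rom_from_snapshot(snapshot_path):
    # "snapshots/space_invaders_200.pkl" -> "space_invaders_200"
    base = os.path.splitext(os.path.basename(snapshot_path))[0]
    # Drop only the trailing "_200" epoch suffix, keeping any
    # underscores that belong to the game name itself.
    name = base.rsplit("_", 1)[0]
    return "roms/%s.bin" % name

print(rom_from_snapshot("snapshots/space_invaders_200.pkl"))
# roms/space_invaders.bin
```

The equivalent fix in the shell script would use `${base%_*}` (remove the shortest trailing `_*` match) rather than `${base%%_*}`, which removes the longest.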