bombora
My experiments with reinforcement learning in PyTorch
Bombora
My ongoing experiments in RL, originally based on ikostrikov/pytorch-a3c.
Usage
OMP_NUM_THREADS=1 python main.py --env-name "Pong-v0" --algo a3c --num-processes 4
Note: PyTorch is still in beta, and older versions may have problems.
Dashboard
Logs are stored in the dblogs directory. To watch agents learn, run:
python dashboard.py --dbdir dblogs --env 'Pong-v0'
This starts a visdom server and shows the logs of the last two runs in the Pong-v0 environment.
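For readers curious how "the last two runs" could be picked out of a SQLite log store, here is a minimal sketch. It is not taken from dashboard.py: the table names (`runs`, `rewards`), their columns, and the helpers `last_runs` and `rewards_for` are all hypothetical, and the demo uses an in-memory database rather than the real files in dblogs.

```python
import sqlite3


def last_runs(conn, env_name, n=2):
    """Return the ids of the n most recent runs for env_name, newest first.

    Assumes a hypothetical `runs` table with a `started_at` timestamp.
    """
    rows = conn.execute(
        "SELECT run_id FROM runs WHERE env = ? "
        "ORDER BY started_at DESC LIMIT ?",
        (env_name, n),
    ).fetchall()
    return [row[0] for row in rows]


def rewards_for(conn, run_id):
    """Return (step, reward) pairs for one run, ordered by step."""
    return conn.execute(
        "SELECT step, reward FROM rewards WHERE run_id = ? ORDER BY step",
        (run_id,),
    ).fetchall()


# Demo on an in-memory database with a made-up schema and made-up values.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE runs (run_id INTEGER PRIMARY KEY, env TEXT, started_at INTEGER);
    CREATE TABLE rewards (run_id INTEGER, step INTEGER, reward REAL);
    """
)
conn.executemany(
    "INSERT INTO runs VALUES (?, ?, ?)",
    [(1, "Pong-v0", 100), (2, "Pong-v0", 200),
     (3, "Breakout-v0", 250), (4, "Pong-v0", 300)],
)
conn.executemany(
    "INSERT INTO rewards VALUES (?, ?, ?)",
    [(2, 0, -21.0), (2, 1, -19.0), (4, 0, -20.0), (4, 1, -18.0)],
)

recent = last_runs(conn, "Pong-v0")
for run_id in recent:
    print(run_id, rewards_for(conn, run_id))
```

Each series returned by `rewards_for` could then be handed to a plotting front end such as visdom. The actual schema used by this project may differ.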
Dependencies
- pytorch
- torchvision
- gym
- sqlite3
- visdom
Credits
Cheers to PyTorch and the authors of the following repos: