async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
In `python demo_a3c_ale.py [--use-lstm]`, I've been looking at what to put in for quite a while, but I've just found out it's related to ALE. I've tried putting in either 'breakout'...
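For reference, the missing positional arguments appear to be an ALE ROM file and a saved model snapshot; the invocation quoted in a later issue below shows the pattern (paths here are illustrative):

```bash
# First argument: an Atari ROM file for ALE; second: a saved model snapshot.
python demo_a3c_ale.py roms/breakout.bin trained_model/breakout_ff/80000000_finish.h5 [--use-lstm]
```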
`v_loss += (v - R) ** 2 / 2` But the original paper just calculates the derivative of (V - R)^2, right?
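For what it's worth, the two formulations differ only by a constant factor: d/dv [(v - R)^2] = 2(v - R), so halving the loss makes its gradient exactly (v - R), and the `/ 2` just absorbs that factor of 2. A minimal sketch checking this numerically (plain NumPy, not the repo's Chainer code; numbers are made up):

```python
import numpy as np

v, R = 0.8, 1.0            # predicted value and empirical return (made-up numbers)
v_loss = (v - R) ** 2 / 2  # the halved squared error accumulated in the code

# Analytic gradient of the halved loss w.r.t. v: d/dv [(v - R)^2 / 2] = v - R
grad = v - R

# Finite-difference check that differentiating v_loss reproduces the (v - R) term
eps = 1e-6
numeric = (((v + eps) - R) ** 2 / 2 - ((v - eps) - R) ** 2 / 2) / (2 * eps)
assert np.isclose(grad, numeric)
```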
I trained the model for 3,000,000 iterations and saved it as "3000000.h5". But when I try to evaluate it with demo_a3c_ale.py, an error appears saying...
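In case it helps anyone hitting a similar error: the snapshots are Chainer HDF5 files, and `load_hdf5` requires constructing a model with exactly the saved architecture first, so a mismatch (e.g. loading an LSTM snapshot without `--use-lstm`, or vice versa) is one plausible cause. A rough sketch of the round trip (the `Linear` model here is a stand-in, not the repo's actual network):

```python
import chainer
from chainer import serializers

# Stand-in model; the real architecture is built in demo_a3c_ale.py.
model = chainer.links.Linear(256, 4)

serializers.save_hdf5('3000000.h5', model)  # how a snapshot gets written
serializers.load_hdf5('3000000.h5', model)  # loading needs an identically-shaped model
```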
As titled. I found that the console prints out scores too fast; where can we find the score vs. training iteration records?
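One low-tech workaround, assuming the scores only go to stdout (the training script name and its arguments here are my assumption, not confirmed from the repo):

```bash
# Capture the console output while still watching it live, then search it later.
python a3c_ale.py <args> 2>&1 | tee training_log.txt
grep -n "score" training_log.txt
```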
Hi, I just noticed at https://github.com/muupan/async-rl/blob/master/ale.py#L115 that each training action is applied to the game environment 4 times? E.g. the user presses 'down' once, but in your simulated training the environment takes...
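That 4x repeat is the standard ALE frame-skip trick, also used in the DQN and A3C papers: each chosen action is repeated for several emulator frames, which speeds up simulation without changing the decision frequency the agent actually needs. A minimal sketch of the idea (not the repo's exact code; `ale.act` and `ale.game_over` are real ALE python interface calls):

```python
def step_with_frame_skip(ale, action, skip=4):
    """Repeat one agent action for `skip` emulator frames, summing the reward."""
    total_reward = 0
    for _ in range(skip):
        total_reward += ale.act(action)  # advance one emulator frame
        if ale.game_over():
            break
    return total_reward
```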
Hey, so the color transform that you use is incorrect (for example, in Seaquest it sometimes causes the fish to disappear). https://github.com/muupan/async-rl/blob/12dac595b2ad9d99c19e82e92adb825359a3a3eb/ale.py#L67-L68 You can get the correct one from the...
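For context, the usual fix is a luminance-weighted grayscale conversion rather than a plain average or a subset of the RGB channels: with the wrong weighting, sprites whose intensity happens to match the background can vanish. A sketch of the standard NTSC weighting (not necessarily the exact transform linked above; if I recall correctly, the ALE python interface also exposes `getScreenGrayscale()`, which sidesteps doing this by hand):

```python
import numpy as np

def to_grayscale(rgb_frame):
    """Convert an (H, W, 3) uint8 RGB frame to grayscale with NTSC luminance weights."""
    weights = np.array([0.299, 0.587, 0.114])
    return (rgb_frame @ weights).astype(np.uint8)
```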
When I try to run the saved model as:

```bash
python demo_a3c_ale.py ../roms/breakout.bin trained_model/breakout_ff/80000000_finish.h5
```

I get an error:

```bash
ImportError: No module named 'ale_python_interface'
```

...
Hi there - I forked your code to work on [Super Mario Bros](https://github.com/ehrenbrav/async-rl/tree/eb) :) I'm using a [Nintendo emulator that I modified](https://github.com/ehrenbrav/FCEUX_Learning_Environment) to allow for programmatic control by the agent...