DeepRL-Agents issues

A3C-Doom, is threading can make real parallelism?

Hi, your repo is very helpful for me to learn RL in action. In `A3C-Doom.ipynb`, I found the code spawns multi-threads to train the model, in Python threading cannot make...

GoingMyWay

apply_gradients need a lock?

A3C is multithread code and does it need a lock while excuting function apply_gradients?

michelliming

checkGoal() in gridworld.py

(Ref. line 97 in gridworld.py) Shouldn't the 'done' flag be TRUE when the 'hero' reaches either 'goal' or 'fire'? Currently checkGoal( ) always returns a FALSE 'done' status.

anjany

simple and odd python problem

2

I switched machines, tried this in python 2 and 3. DIdn't see this before. directory contains all 3 files but it simply won't load the other two modules. Any idea...

paulsimvient

Double-Dueling-DQN stops learning

Running the Double-Dueling-DQN code results in a network, that stops learning after about 2000 episodes., i.e. the game results do not get better. Run the GridWorld example now four different...

florath

Double-Dueling-DQN: question about the rate to update target network

I've encountered the thing that I can't understand while following up the Double-Dueling-DQN.ipynb. There's a def like below ``` def updateTargetGraph(tfVars,tau): total_vars = len(tfVars) op_holder = [] for idx, var...

heesu-pia

Adapting A3C LSTM for Pong

21

Did anyone managed to get the A3C LSTM of this repo to work for Pong (using the openai gym)? I have already tried several different optimizers, learning rates, network architectures,...

MatheusMRFM

A3C Doom: Typo in comment

File: https://github.com/awjuliani/DeepRL-Agents/blob/master/A3C-Doom.ipynb Where: A typo in a comment of the final gist What: "# Start the "work" process for each worker in a separate threat." Correction: threat => thread

choinker

DRQN-AttributeError: 'zip' object has no attribute 'append'

2

I run the code and meet a problem like this: ********************************************************* Target Set Success 5000 0.65 1 98%|█████████████████████████████████████████▏| 50/51 [00:00

silentobservers

DeepRL-Agents
DeepRL-Agents copied to clipboard

Metadata

A3C-Doom, is threading can make real parallelism?

apply_gradients need a lock?

checkGoal() in gridworld.py

Can't see the source code.

simple and odd python problem

Double-Dueling-DQN stops learning

Double-Dueling-DQN: question about the rate to update target network

Adapting A3C LSTM for Pong

A3C Doom: Typo in comment

DRQN-AttributeError: 'zip' object has no attribute 'append'

← Metadata

Owner

Metadata

DeepRL-Agents DeepRL-Agents copied to clipboard

Metadata

← Metadata

Owner

Metadata

DeepRL-Agents
DeepRL-Agents copied to clipboard