ptan
PyTorch Agent Net: reinforcement learning toolkit for pytorch
Hello, I am finding it very difficult to install against the previous PyTorch 0.4.0 now that 1.0 has been released. Is it possible to support PyTorch 1.0 and update...
Right now there is no way to fill the initial replay buffer with random actions.
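One workaround sketch, assuming the usual ptan pieces (`EpsilonGreedyActionSelector`, `DQNAgent`, `ExperienceSourceFirstLast`, `ExperienceReplayBuffer.populate`) behave as in the samples: drive the warm-up with epsilon = 1.0 so every action is random, then anneal epsilon once the buffer holds enough transitions. The buffer sizes and the final epsilon below are placeholders.

```python
import gym
import ptan
import torch.nn as nn

# Warm-up sketch: with epsilon == 1.0 the epsilon-greedy selector picks
# uniformly random actions, so populate() pre-fills the buffer with a
# random policy before training starts. Sizes below are placeholders.
env = gym.make("CartPole-v1")
net = nn.Sequential(
    nn.Linear(env.observation_space.shape[0], 128),
    nn.ReLU(),
    nn.Linear(128, env.action_space.n),
)

REPLAY_SIZE = 10_000
REPLAY_INITIAL = 1_000
GAMMA = 0.99

selector = ptan.actions.EpsilonGreedyActionSelector(epsilon=1.0)
agent = ptan.agent.DQNAgent(net, selector,
                            preprocessor=ptan.agent.float32_preprocessor)
exp_source = ptan.experience.ExperienceSourceFirstLast(env, agent, gamma=GAMMA)
buffer = ptan.experience.ExperienceReplayBuffer(exp_source, buffer_size=REPLAY_SIZE)

while len(buffer) < REPLAY_INITIAL:
    buffer.populate(1)      # collect one random-action transition per call

selector.epsilon = 0.1      # now anneal epsilon and start training as usual
```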
For example, when I run `a2c.py -r "runs/a2c/a2c_cartpole.ini"`, tons of errors pop up. Regardless, I like that you've implemented a lot of algorithms and put them here. It's very useful...
https://blog.openai.com/openai-baselines-dqn/ ... In the DQN Nature paper the authors write: “We also found it helpful to clip the error term from the update [...] to be between -1 and 1.” ...
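The usual way to implement that [-1, 1] error clipping in PyTorch is the Huber loss (smooth L1), which is the interpretation the linked post argues for. A minimal sketch with placeholder tensors:

```python
import torch
import torch.nn.functional as F

# Clipping the TD error to [-1, 1] is equivalent to the Huber loss: quadratic
# for |error| <= 1, linear outside, so the gradient magnitude stays bounded.
q_pred = torch.tensor([1.0, 2.5, -0.3], requires_grad=True)  # placeholder Q(s, a)
q_target = torch.tensor([1.2, 0.5, 3.0])                     # placeholder TD targets
loss = F.smooth_l1_loss(q_pred, q_target)
loss.backward()
```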
It seems that the examples under ptan/samples are outdated. For instance, the code for creating the agent in dqn_expreplay.py, `agent = ptan.agent.DQNAgent(model, action_selector, cuda=cuda_enabled)`, does not match the current definition. While...
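For comparison, a minimal sketch of the newer-style construction, assuming a ptan version whose `DQNAgent` takes a `device` argument instead of a `cuda` flag (worth checking against `ptan/agent.py` in the installed version):

```python
import torch
import torch.nn as nn
import ptan

# Sketch of the newer-style agent construction (assuming a ptan version in
# which DQNAgent takes `device` rather than a `cuda=` flag).
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
net = nn.Sequential(nn.Linear(4, 128), nn.ReLU(), nn.Linear(128, 2)).to(device)
selector = ptan.actions.ArgmaxActionSelector()
agent = ptan.agent.DQNAgent(net, selector, device=device)
```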
First, thanks for the great work! I tried the DQN speedup files and was able to get 01 and 02 to run (at about 50 fps on a GTX 1070), but...
Removed the deprecated `torch.autograd.Variable` usage from /samples/rainbow/lib/common and changed the loss function from nn.MSELoss() to nn.functional.mse_loss().
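A minimal before/after illustration of those two changes, with placeholder tensors:

```python
import torch
import torch.nn.functional as F

# Before (PyTorch <= 0.3 style):
#   from torch.autograd import Variable
#   q_pred_v = Variable(torch.FloatTensor(q_pred))
#   loss = nn.MSELoss()(q_pred_v, q_target_v)
#
# After: plain tensors track gradients themselves, and the functional form
# avoids constructing a loss module on every training step.
q_pred = torch.tensor([0.5, 1.0], requires_grad=True)   # placeholder values
q_target = torch.tensor([1.0, 1.0])
loss = F.mse_loss(q_pred, q_target)
loss.backward()
```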
https://pytorch.org/docs/stable/distributions.html Score functions and categorical sampling are already implemented in PyTorch, so using numpy for this should be discouraged. The policy network should output a probability distribution.
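A minimal sketch of the suggested approach using `torch.distributions.Categorical`; the logits and the return value are placeholders:

```python
import torch
from torch.distributions import Categorical

# Policy network outputs logits -> build a Categorical distribution,
# sample an action, and use log_prob() for the score-function (REINFORCE)
# gradient, all in torch with no numpy round-trips.
logits = torch.tensor([[0.2, 1.5, -0.7]], requires_grad=True)  # placeholder policy output
dist = Categorical(logits=logits)
action = dist.sample()
ret = 1.0                                                      # placeholder return
loss = -(dist.log_prob(action) * ret).mean()                   # score-function estimator
loss.backward()
```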
https://github.com/Shmuma/ptan/blob/84d349225f15a02164f28586b50cf94ee726eacc/ptan/experience.py#L497
https://github.com/Shmuma/ptan/blob/84d349225f15a02164f28586b50cf94ee726eacc/samples/rainbow/lib/common.py#L88