dreamer-pytorch issues

Does the walker run reproduce correctly?

3

Hi, thank you for the cool repository! I tried several tasks `walker walk`, `cheetah run`. They seem to work fine. But when I run `walker run`, the episode_reward cannot achieve...

letusfly85

Fixed typo in "channels:", updated "dm_control" to version 1.0.1

dm_control now automatically installs MuJoCo.

kaiu85

Input to imagine_ahead in utils.py

Input should contain only the initial belief and state right? Here, input contains the entire sequence of beliefs and states instead. Not sure how this works with the algorithm

aravindvenu7

Exploding KL Divergence Loss

I tried running the agent on the Walker Walk environment and the KL Divergence loss seems to be growing exponentially and causing nans. But I have not made any changes...

shivakanthsujit

Implementation time and device?

Hi, Thanks you for your sharing~ I've implemented it for a while. I have some questions about the time because I spend lots of time on reaching 500K steps. However,...

108618035guotingliao

A small question of implementation

Thank you for your sharing, but I have a small question. Why do you 1) use `F.softplus` for `variance(std_dev)` every time and 2) add a constant(min_std_dev). Is it to ensure...

TianQi-777

Hi, I believe the reward loss should be based on `rewards[1:]` instead of `rewards[:-1] `: https://github.com/yusukeurakami/dreamer-pytorch/blob/7e9050e8c454309de40bd0d1a4ec0256ef600147/main.py#L209 If not, can you please explain your reasoning? Thanks,

roggirg

dreamer-pytorch
dreamer-pytorch copied to clipboard

Metadata

Does the walker run reproduce correctly?

Fixed typo in "channels:", updated "dm_control" to version 1.0.1

Input to imagine_ahead in utils.py

Exploding KL Divergence Loss

Implementation time and device?

A small question of implementation

Reward loss timescale

← Metadata

Owner

Metadata

dreamer-pytorch dreamer-pytorch copied to clipboard

Metadata

Does the walker run reproduce correctly?

Fixed typo in "channels:", updated "dm_control" to version 1.0.1

Input to imagine_ahead in utils.py

Exploding KL Divergence Loss

Implementation time and device?

A small question of implementation

Reward loss timescale

← Metadata

Owner

Metadata

dreamer-pytorch
dreamer-pytorch copied to clipboard