awr issues

Why Normalization of vf

1

Hello, thanks for the code, while I tried to re-implement the program, I find that there is one step to normalize value function vf [here](https://github.com/xbpeng/awr/blob/831442fb8d4c24bd200667cbc5e458c7657effc2/learning/rl_agent.py#L230-L234) . It's implementated by `v_predict...

im-Kitsch

Offline version of AWR

1

Hi, I am trying to modify AWR into the offline version (or fully off-policy version). I find that the paper states that one can simply treat the dataset as the...

FineArtz

Train_Return vs Test_Return

3

Hi, Thank you for sharing the repo! I was wondering how the Train_Return and Test_Return is calculated and what the difference between the two. I see that one is using...

masonjar-source

Parameters used for motion imitation

6

Hello, I am trying to use this algorithm (rewritten in PyTorch with Gym vectorized envs) for motion imitation, starting with the PyBullet implementation of the DeepMimic environment. In the paper,...

ManifoldFR

awr
awr copied to clipboard

Metadata

Why Normalization of vf

Offline version of AWR

Train_Return vs Test_Return

Parameters used for motion imitation

← Metadata

Owner

Metadata

awr awr copied to clipboard

Metadata

Why Normalization of vf

Offline version of AWR

Train_Return vs Test_Return

Parameters used for motion imitation

← Metadata

Owner

Metadata

awr
awr copied to clipboard