Munchausen-RL

Published 20 hours ago

BY571


PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN

Issues

2 Munchausen-RL issues


munchausen_addon and action

Hello! Excuse me, I used your code in another environment, but I ran into difficulties. My action is an array of decimal (floating-point) values, so how should I rewrite the line `munchausen_addon = log_pi.gather(1, actions)`? ...

quyouyuan
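
For context on the question above: `torch.Tensor.gather` expects an integer (`int64`) index tensor, so it only works when each action is a discrete index into the action dimension of `log_pi`. The sketch below is not taken from the repository; it shows two possible workarounds under stated assumptions. The first assumes the actions are really discrete indices that merely arrive as floats. The second assumes a single continuous action in a known range and discretizes it into bins. The tensors, the range `low`/`high`, and the bin count are all hypothetical.

```python
import torch

# Hypothetical log-policy over 3 discrete actions, shape (batch, n_actions).
log_pi = torch.log_softmax(
    torch.tensor([[1.0, 2.0, 0.5], [0.2, 0.1, 3.0]]), dim=1
)

# Case 1 (assumption): actions are discrete indices stored as floats.
# gather needs int64 indices of shape (batch, 1), so cast them.
actions_float = torch.tensor([[1.0], [2.0]])
actions_idx = actions_float.long()
munchausen_addon = log_pi.gather(1, actions_idx)

# Case 2 (assumption): one continuous action per sample in [low, high].
# Discretize it into n_actions bins and use the bin index for gather.
low, high, n_actions = -1.0, 1.0, 3
continuous_actions = torch.tensor([[-0.9], [0.4]])
# Interior bin boundaries: [-1/3, 1/3] for three equal-width bins.
edges = torch.linspace(low, high, n_actions + 1)[1:-1]
binned_idx = torch.bucketize(continuous_actions, edges)  # int64 by default
addon_binned = log_pi.gather(1, binned_idx)
```

The second case only makes sense if the network also outputs one Q-value per bin and the chosen bin is mapped back to a continuous value before it is sent to the environment. For a genuinely multi-dimensional continuous action, a discrete-action method such as M-DQN may simply not be the right fit.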

Wrong value in call to F.softmax

Should `F.softmax(Q_targets_next, dim=1)` be `F.softmax(Q_targets_next / entropy_tau, dim=1)` instead?

marioyc
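
To make the difference raised in this issue concrete, here is a minimal sketch comparing the two calls. It is not the repository's code: the Q-values and the value of `entropy_tau` are made up for illustration, and the last lines assume the policy is meant to be the softmax of Q divided by the temperature, which is what the question proposes.

```python
import torch
import torch.nn.functional as F

entropy_tau = 0.03  # illustrative temperature, assumed for this sketch

# Hypothetical target-network Q-values, shape (batch, n_actions).
Q_targets_next = torch.tensor([[1.0, 1.2, 0.8], [0.5, 0.5, 2.0]])

# Softmax without temperature: a fairly flat distribution.
pi_plain = F.softmax(Q_targets_next, dim=1)

# Softmax with temperature: much sharper for small entropy_tau.
pi_scaled = F.softmax(Q_targets_next / entropy_tau, dim=1)

# Scaled log-policy, tau * log softmax(Q / tau), computed stably by
# subtracting the per-row maximum before the logsumexp.
v_max = Q_targets_next.max(dim=1, keepdim=True).values
shifted = Q_targets_next - v_max
tau_log_pi = shifted - entropy_tau * torch.logsumexp(
    shifted / entropy_tau, dim=1, keepdim=True
)
```

If the log-policy elsewhere in the code is computed with the temperature, as in `tau_log_pi` here, then using `pi_plain` for the expectation over next actions would mix two different policies, which seems to be the inconsistency the issue points at.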
