Randomized-Ensembled-Double-Q-learning-REDQ-
PyTorch implementation of Randomized Ensembled Double Q-learning (REDQ)
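REDQ's core idea is to keep an ensemble of N critics and, for each Bellman target, take the minimum over a small random subset of size M (M = 2 in the paper's default). Below is a minimal sketch of that target computation, not this repo's exact code; the network sizes, `gamma`, and the omission of the entropy term are simplifying assumptions.

```python
import numpy as np
import torch
import torch.nn as nn

N, M = 10, 2   # ensemble size and subset size (REDQ paper defaults)
gamma = 0.99   # discount factor (assumption for illustration)

# Toy target critics mapping a (state, action) vector to a scalar Q-value
target_critics = nn.ModuleList(
    [nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 1)) for _ in range(N)]
)

def redq_target(next_sa: torch.Tensor, reward: torch.Tensor, done: torch.Tensor) -> torch.Tensor:
    # Sample a random subset of M distinct critics from the ensemble
    idx = np.random.choice(N, size=M, replace=False)
    qs = torch.stack([target_critics[i](next_sa) for i in idx], dim=0)
    min_q = qs.min(dim=0).values  # in-subset minimum controls overestimation
    return reward + gamma * (1.0 - done) * min_q

batch = 8
y = redq_target(torch.randn(batch, 4), torch.zeros(batch, 1), torch.zeros(batch, 1))
print(y.shape)
```

Taking the min over only M of N critics (rather than all N) is what lets REDQ use high update-to-data ratios without the pessimism of a full-ensemble minimum.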
Issues
Should the actor update not use idx[0] and idx[1] for Q1 and Q2? Currently it just gets the same Q-value from the same critic. # ---------------------------- update actor...
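The bug described in this issue can be avoided by drawing the two indices without replacement, so Q1 and Q2 come from two different ensemble members. This is a hypothetical sketch of such a fix, not the repository's actual `update actor` code; the critic architecture and ensemble size are assumptions.

```python
import numpy as np
import torch
import torch.nn as nn

N = 10  # ensemble size (assumption)

critics = nn.ModuleList(
    [nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 1)) for _ in range(N)]
)

def actor_q_value(state_action: torch.Tensor) -> torch.Tensor:
    # replace=False guarantees idx[0] != idx[1], so the two Q-values
    # come from two distinct critics rather than the same one twice
    idx = np.random.choice(N, size=2, replace=False)
    q1 = critics[idx[0]](state_action)
    q2 = critics[idx[1]](state_action)
    return torch.min(q1, q2)

sa = torch.randn(8, 4)
q = actor_q_value(sa)
print(q.shape)
```

Note that the original REDQ paper's actor objective averages over all N critics rather than taking a two-critic minimum; either way, sampling with `replace=False` is what prevents the duplicate-critic degenerate case the issue points out.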
Hi, I was wondering whether you have applied this idea to environments with discrete action spaces, or have any sense of how it would perform there? Thanks