Dan Pandori

Results 1 issues of Dan Pandori

## Description Creates an entropy reward replay wrapper to support the unsupervised state entropy based pre-training of an agent, as described in the PEBBLE paper. https://sites.google.com/view/icml21pebble ## Testing Added unit...