allenact Properly resume training with offpolicy losses

Properly resume training with offpolicy losses

Open jordis-ai2 opened this issue 4 years ago • 0 comments

Currently, a new epoch will be started when resuming training (new Iterator will be instantiated). We should save the random seed used to shuffle the datasets (for all workers?) and the length of the remaining data, besides enforcing a resume API for iterators.

Sep 01 '20 15:09 jordis-ai2

allenact allenact copied to clipboard

Properly resume training with offpolicy losses

allenact
allenact copied to clipboard