allenact icon indicating copy to clipboard operation
allenact copied to clipboard

Properly resume training with offpolicy losses

Open jordis-ai2 opened this issue 4 years ago • 0 comments

Currently, a new epoch will be started when resuming training (new Iterator will be instantiated). We should save the random seed used to shuffle the datasets (for all workers?) and the length of the remaining data, besides enforcing a resume API for iterators.

jordis-ai2 avatar Sep 01 '20 15:09 jordis-ai2