Mava
[BUG] Trainer keeps sampling experience
Describe the bug
The trainer keeps sampling experience even after the executors have crashed. This means the trainer is somehow sampling data that the executors are no longer generating. The same behaviour might also be a problem when the trainer is faster than the executors.
Expected behaviour
The trainer should wait for the executors and not keep sampling if there is no data to sample.
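To make the expected behaviour concrete, here is a minimal sketch of a buffer whose `sample()` call blocks until enough experience has been inserted. It is only an illustration using the Python standard library, not Mava's code or its replay backend. The class name `RateLimitedBuffer` and the parameters `min_size` and `samples_per_insert` are hypothetical.

```python
import random
import threading
from collections import deque


class RateLimitedBuffer:
    """Toy replay buffer (hypothetical): sample() waits for executor inserts."""

    def __init__(self, max_size=1000, min_size=1, samples_per_insert=1.0):
        self._items = deque(maxlen=max_size)
        self._min_size = min_size
        self._samples_per_insert = samples_per_insert
        self._inserts = 0  # total items added by executors
        self._samples = 0  # total items handed to the trainer
        self._cond = threading.Condition()

    def insert(self, item):
        """Called by an executor; wakes up any trainer waiting in sample()."""
        with self._cond:
            self._items.append(item)
            self._inserts += 1
            self._cond.notify_all()

    def _can_sample(self):
        # Sampling is allowed only while the trainer has not used up the
        # budget earned by executor inserts.
        enough_data = len(self._items) >= self._min_size
        within_budget = self._samples < self._inserts * self._samples_per_insert
        return enough_data and within_budget

    def sample(self, timeout=None):
        """Called by the trainer; blocks until sampling is allowed."""
        with self._cond:
            if not self._cond.wait_for(self._can_sample, timeout=timeout):
                raise TimeoutError("no new experience arrived; executors may have stopped")
            self._samples += 1
            return random.choice(self._items)
```

With `samples_per_insert=1.0` the trainer can draw at most one sample per inserted item, so it stalls as soon as the executors stop inserting. Passing a `timeout` turns a silent stall into a `TimeoutError`, which the trainer could use to detect crashed executors.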