super-ml-pets icon indicating copy to clipboard operation
super-ml-pets copied to clipboard

Training seem to crash occasionally

Open andreped opened this issue 1 year ago • 3 comments

When training RL models using sapai-gym, different errors tend to occur.

I have tried to uses try-expect blocks, but the problem is that if this happens, training using standard baseline 3 crashes, and we will have to start all over again.

We should therefore either: 1) fix what is bugged in sapai/sapai-gym or 2) add a wrapper function that catches when this fails, and tries to generate a new one (if possible).

andreped avatar Aug 08 '22 16:08 andreped