kaos icon indicating copy to clipboard operation
kaos copied to clipboard

Resource exhaustion is not caught and prompted by the system to the user

Open DanielMorales9 opened this issue 6 years ago • 0 comments

What is the current behaviour?

When training a big model which exceeds container memory, the training pipeline never fails and keeps running.

What is the expected behaviour?

Handle the failure and alert the user.

DanielMorales9 avatar Oct 17 '19 16:10 DanielMorales9