clipper
clipper copied to clipboard
Tensorflow model container stops responding after predict function thrown exception
After passing incorrect input to predict function, I have to scale down and up the model in order to predict again.
I think the exceptions from predict function should be caught and logged, but they should not break the prediction loop.
Logs from model container:
Sent heartbeat!
Received heartbeat!
Sent heartbeat!
Received heartbeat!
Sent heartbeat!
Received heartbeat!
Sent heartbeat!
Received heartbeat!
Sent heartbeat!
Received heartbeat!
Got start of message 16
Traceback (most recent call last):
File "/container/tf_container.py", line 110, in
.. cannot predict anymore
Ahh this is an excellent point. Thanks for bringing this to our attention.
@dcrankshaw do you think it makes sense to actually link it with readiness probes
@simon-mo Has it been addressed?