clearml-serving
clearml-serving copied to clipboard
Cuda oom handler [continuation]
In https://github.com/allegroai/clearml-serving/pull/75 was mentioned that CLEARML_SERVING_NUM_PROCESS
has to be 1 (for k8s instance it can be set in https://github.com/allegroai/clearml-helm-charts/blob/main/charts/clearml-serving/templates/clearml-serving-inference-deployment.yaml#L77C21-L77C48), this PR set the same variable for docker-compose instance.
It also fixes requirements - add torch for attempts to clear GPU memory (it can be changed if served models require other version of torch)