clearml-serving

ClearML - Model-Serving Orchestration and Repository Solution

26 clearml-serving issues

Add the option to pass the exit-on-error parameter to tritonserver. Setting it to false prevents a single broken model from taking down the whole tritonserver instance. Fixes allegroai/clearml-serving#60
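For context, `tritonserver` exposes this as a command-line flag. A minimal sketch of the intended launch (the model repository path is a placeholder):

```bash
# Launch Triton so that one model failing to load does not abort the server.
# --exit-on-error=false keeps the instance serving the models that did load.
tritonserver \
  --model-repository=/models \
  --exit-on-error=false
```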

Hi, I built a self-hosted ClearML server following the [official tutorial](https://clear.ml/docs/latest/docs/clearml_serving/clearml_serving_setup). My config:

```bash
CLEARML_WEB_HOST="http://localhost:8080"
CLEARML_API_HOST="http://localhost:8008"
CLEARML_FILES_HOST="http://localhost:8081"
CLEARML_API_ACCESS_KEY="J1WS8MDJMI9MMMA9EMH6"
CLEARML_API_SECRET_KEY="5eEYYHq3VMgf9DmQjfLtCiBBV09TgxImQmz8WDckKB0h6t3CIE"
CLEARML_SERVING_TASK_ID="2f5e0563914e41b9a1334f841c39852b"
```

Because CLEARML_WEB_HOST is on port 8080, I have to...
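For reference, a minimal sketch of how these values are typically used, following the tutorial: export the server endpoints and credentials, then create the serving service (the printed task ID becomes `CLEARML_SERVING_TASK_ID`). Keys are elided here.

```bash
# Point the clearml SDK at the self-hosted server, then create the
# serving service task whose ID the docker-compose deployment needs.
export CLEARML_WEB_HOST="http://localhost:8080"
export CLEARML_API_HOST="http://localhost:8008"
export CLEARML_FILES_HOST="http://localhost:8081"
export CLEARML_API_ACCESS_KEY="..."
export CLEARML_API_SECRET_KEY="..."
clearml-serving create --name "serving example"
```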

We have set up clearml-serving on Kubernetes, including Triton support. Our Triton instance has no GPU, so deploying a model leads to the following error in the Triton instance: `E0718...
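For anyone hitting the same error: a possible workaround sketch, assuming the failure is Triton trying to allocate GPU instances, is to pin the model's instance group to CPU in its Triton config (the model directory name here is hypothetical):

```bash
# Hypothetical workaround: force CPU execution so Triton does not
# require a GPU for this model. Appended to the model's Triton config.
cat >> /models/my_model/config.pbtxt <<'EOF'
instance_group [
  { kind: KIND_CPU }
]
EOF
```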

### Describe the bug
Hi there, it seems that when adding a PyTorch model to the self-hosted clearml-serving, the platform...

bug

Restarting `clearml-serving-inference` on `torch.cuda.OutOfMemoryError: CUDA out of memory.` helps the inference container clear GPU memory. This would be useful for LLM inference on the `clearml-serving-inference` container (requires https://github.com/allegroai/clearml-helm-charts/blob/main/charts/clearml-serving/templates/clearml-serving-inference-deployment.yaml#L74 to be set to 1)...
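Until such automatic restarting exists, a manual workaround sketch (the deployment and namespace names follow the Helm chart's defaults and are assumptions):

```bash
# Bounce the inference deployment to release GPU memory after a CUDA OOM.
kubectl rollout restart deployment/clearml-serving-inference -n clearml-serving
```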

Hello. I have some helper functions that are shared across the `preprocess.py` files, so I'd like to refactor them. However, I'm not sure where I can put them, and how...
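One possible pattern, sketched under the assumption that the helpers can be published as a pip-installable package: install it into the inference container via the `CLEARML_EXTRA_PYTHON_PACKAGES` variable the docker-compose files expose, then import it from each `preprocess.py`. The package name below is hypothetical.

```bash
# Make shared helpers importable inside the serving container.
# "my-preprocess-helpers" is a hypothetical internal package.
export CLEARML_EXTRA_PYTHON_PACKAGES="my-preprocess-helpers==0.1.0"
docker-compose --env-file example.env -f docker-compose.yml up -d
```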

Hi, I set everything up step by step from the [official tutorial](https://github.com/allegroai/clearml-serving?tab=readme-ov-file#point_right-toy-model-scikit-learn-deployment-example). However, I see errors like this:

```
Retrying (Retry(total=239, connect=239, read=240, redirect=240, status=240)) after connection broken by 'NewConnectionError(': Failed to...
```
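A plausible cause, hedged since the log is truncated: `localhost` inside the serving containers resolves to the container itself, so API calls never reach the self-hosted server. A sketch of the fix, assuming Docker Desktop's `host.docker.internal` alias (otherwise use the host's LAN IP):

```bash
# Point the serving containers at an address they can actually reach,
# instead of localhost (which is the container itself).
export CLEARML_API_HOST="http://host.docker.internal:8008"
export CLEARML_WEB_HOST="http://host.docker.internal:8080"
export CLEARML_FILES_HOST="http://host.docker.internal:8081"
```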

Hello! I use the free ClearML tier (the one without the configuration vault) together with the clearml-serving module. When I spun up _docker-compose_ and tried to pull a model from our S3, I got an error...
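For reference, a minimal sketch of passing S3 credentials to the serving containers via the standard AWS environment variables, which the docker-compose files pass through (key, secret, and region values are placeholders):

```bash
# Provide S3 credentials to the serving stack before bringing it up.
export AWS_ACCESS_KEY_ID="<key>"
export AWS_SECRET_ACCESS_KEY="<secret>"
export AWS_DEFAULT_REGION="<region>"
docker-compose --env-file example.env -f docker-compose.yml up -d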

From the docs, I can see that there are commands to add a model to an endpoint, and also to add model monitoring via the auto-update command. I can't seem to...
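For anyone comparing the two, a sketch based on the README's sklearn example (service ID, endpoint, and project names are placeholders): `model add` pins one matching model to an endpoint, while `model auto-update` keeps the endpoint tracking newly published models.

```bash
# Pin one specific (already registered) model to an endpoint:
clearml-serving --id <service-id> model add \
  --engine sklearn --endpoint "test_model_sklearn" \
  --preprocess "preprocess.py" \
  --name "train sklearn model" --project "serving examples"

# Keep an endpoint following newly published models automatically:
clearml-serving --id <service-id> model auto-update \
  --engine sklearn --endpoint "test_model_sklearn_auto" \
  --preprocess "preprocess.py" \
  --name "train sklearn model" --project "serving examples" \
  --max-versions 2
```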

Hello, I see TorchServe engine support mentioned in the README but cannot find any way to actually use it. Is it available?