Iman Tabrizian

Results 132 comments of Iman Tabrizian

Unfortunately, Python

It is not possible to copy the tensor from GPU to CPU in Python backend directly. You need to use Pytorch (or any other framework that supports DLPack) to perform...

@ShuaiShao93 I understood your use-case and I have updated my comment above.

> @Tabrizian Thanks for approving. Can you help with merging as well? Should be able to merge it soon. Sorry for the delay.

Closing due to in-activity.

Are you asking for safe tensors support for a specific backend? It is not currently supported but you could create your custom Python-based backend that leverages safe-tensors: https://github.com/triton-inference-server/backend/blob/main/docs/python_based_backends.md#

Hi @harievg, let's keep the discussions in this GH issue if possible. Sorry, I didn't fully understand what you're asking for. I see two things mentioned in this GitHub issue:...

Closing due to in-activity.