Iman Tabrizian
Iman Tabrizian
We did an investigation about this feature and after doing some analysis we realized that the perf impact could be lower than it may seem initially. This optimization is most...
@GuanLuo Just to make sure that I'm understanding the problem correctly, is the issue that the integer prepended to the STRING could be interpreted differently depending on whether the system...
Hi @roger652, this PR is currently blocked on a few other changes that are needed in the server side. We'll merge these changes when the server side changes are complete.
@rmccorm4 Sorry for the delay. This must have fallen through the cracks. Yeah, I think we probably want to have this API at some point for creating custom traces. Right...
@HennerM I'm so sorry I was out of the office when I was mentioned and might have missed the notification for this PR. This looks good to me. Thanks for...
I’m interested in the same feature. Is there any way to do this in pybind11?
> I don't see the necessary of providing synchronized infer API, user can read from response iterator after async_infer return and it will block until response is ready. I think...
```npm install --save bak```
```npx bak start```
You can use KIND_MODEL and manually control the GPUs you want to use. One workaround would be to add an additional parameter (e.g. `gpu_device_ids`) in your model config that specifies...