Nimrod Rak issues

Repositories
Issues
Comments

Results 3 issues of


                                            Nimrod Rak

No-copy Tensor transfer in python backend-based ensemble

I am trying to build a pipelined inference server with a mainly python backend (it runs PyTorch models sometimes in the code itself). Originally I had the entire pipeline run...

question

performance

ONNX custom OP uses deprecated functions

# Ask a Question ### Question I am attempting to implement a custom op using CUDA kernels and started looking into existing guides and how-to's available. The simplest and easiest...

question

T5 not performing as expeceted

### Description ```shell I am trying to optimize T5-small inference using Fastertransformer. I am running on a single V100, I followed all the steps in `t5_guide.md` exactly and got a...

bug