Ziyuan Jiang

Results 1 issues of Ziyuan Jiang

I have a PyTorch stateful model which I successfully used in the triton server with sequence_batching direct scheduling strategy. In order to futher optimize the throughput, I want to use...