fangpings

Results: 4 comments by fangpings

I have the same problem. We have an ensemble model consisting of preprocessing, inference, and postprocessing steps. I observed that the preprocessing phase sometimes generates a request whose batch_size...

For our ensemble pipeline, the input is a web document. In the preprocessing model we tokenize the web document, but sometimes the number of tokens in the document will exceed...