InsightFace-REST

Thank you for excellent work. How about TRT batch inference?

Open tungdq212 opened this issue 1 year ago • 1 comment

Thank you for excellent work.

Detection models now can be exported to TRT engine with batch size > 1 - inference code doesn't support it yet, though now they could be used in Triton Inference Server without issues.

Is there any plan for this? Or how can I implement batch inference myself?

tungdq212 avatar Jan 03 '24 02:01 tungdq212

Hi! Batch inference is already supported for all recognition models and for SCRFD and YOLOv5 family detection models.

SthPhoenix avatar Jan 04 '24 23:01 SthPhoenix
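
For anyone wondering what the input side of batch inference can look like, here is a minimal sketch, not the InsightFace-REST implementation. It assumes the images are already resized to one common size and that the engine takes a float32 NCHW tensor. The helper names `to_nchw_batch` and `iter_batches` and the normalization constants are hypothetical, so check them against the model you actually export.

```python
import numpy as np


def to_nchw_batch(images, mean=127.5, scale=1 / 128.0):
    """Stack same-sized HxWx3 uint8 images into one float32 NCHW array.

    mean/scale are placeholder values (an assumption), not taken from the repo.
    """
    batch = np.stack(images).astype(np.float32)  # (N, H, W, 3)
    batch = (batch - mean) * scale               # per-pixel normalization
    # Reorder to (N, 3, H, W) and make the memory layout contiguous,
    # which is what a device copy of the input buffer usually expects.
    return np.ascontiguousarray(batch.transpose(0, 3, 1, 2))


def iter_batches(images, max_batch_size):
    """Yield NCHW batches no larger than the engine's maximum batch size."""
    for start in range(0, len(images), max_batch_size):
        yield to_nchw_batch(images[start:start + max_batch_size])


if __name__ == "__main__":
    # Five dummy images, max batch size 2: two full batches and one partial.
    dummy = [np.zeros((64, 64, 3), dtype=np.uint8) for _ in range(5)]
    for b in iter_batches(dummy, max_batch_size=2):
        print(b.shape, b.dtype)
```

The last batch can be smaller than `max_batch_size`, so the engine either needs a dynamic batch dimension or the final batch has to be padded and the extra outputs discarded.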