student_xiaoMing
student_xiaoMing
I encountered this problem in #85 https://github.com/mit-han-lab/efficientvit/issues/85#issuecomment-2002749929 `python trt_inference.py --model xl1 --encoder_engine ./xl1_encoder.engine --decoder_engine ./xl1_decoder.engine --mode point --point [[[510,356,1],[749,748,0],[100,100,1]]] --img_path ./images/test_3.JPEG --out_path ./vit_trt_img.png` I performed a 3-point inference using the...
hi Thank you so much for your code # Export Decoder `trtexec --onnx=assets/export_models/sam/onnx/xl1_decoder.onnx --minShapes=point_coords:1x1x2,point_labels:1x1 --optShapes=point_coords:16x2x2,point_labels:16x2 --maxShapes=point_coords:16x2x2,point_labels:16x2 --fp16 --saveEngine=assets/export_models/sam/tensorrt/xl1_decoder.engine` What does this parameter mean point_coords:1x1x2,point_labels:1x1 point_coords:16x2x2,point_labels:16x2