GTR
GTR copied to clipboard
RuntimeError: Detectron2 is not compiled with GPU support!
I am trying to run the demo command and saw this error
[04/05 14:46:45 detectron2]: Arguments: Namespace(confidence_threshold=0.5, config_file='configs/GTR_TAO_DR2101.yaml', cpu=False, input=None, opts=['MODEL.WEIGHTS', 'models/GTR_TAO_DR2101.pth'], output='output/demo_yfcc.mp4', video_input='docs/yfcc_v_acef1cb6d38c2beab6e69e266e234f.mp4', webcam=None) WARNING [04/05 14:46:45 d2.config.compat]: Config 'configs/GTR_TAO_DR2101.yaml' has no VERSION. Assuming it to be compatible with latest v2.
[04/05 14:47:30 fvcore.common.checkpoint]: [Checkpointer] Loading from models/GTR_TAO_DR2101.pth ... WARNING [04/05 14:47:37 fvcore.common.checkpoint]: Some model parameters or buffers are not found in the checkpoint: roi_heads.box_predictor.0.freq_weight roi_heads.box_predictor.1.freq_weight roi_heads.box_predictor.2.freq_weight WARNING [04/05 14:47:37 fvcore.common.checkpoint]: The checkpoint state_dict contains keys that are not used by the model: roi_heads.pos_emb.weight Could not find encoder for codec id 27: Encoder not found [ERROR:0] global /io/opencv/modules/videoio/src/cap.cpp (392) open VIDEOIO(CV_IMAGES): raised OpenCV exception:
OpenCV(4.1.2) /io/opencv/modules/videoio/src/cap_images.cpp:253: error: (-5:Bad argument) CAP_IMAGES: can't find starting number (in the name of file): /tmp/video_format_test3gfay7is/test_file.mkv in function 'icvExtractPattern'
Traceback (most recent call last):
File "demo.py", line 161, in
Hi,
Our model needs GPU for inference. Can check here and make sure your detectron2 is installed with GPU support?
Thanks. I have solved the problem. However, I noticed that when I tried a larger video file for the demo, it always says killed to stop the process. Is there a limit how big a video file can be handled byt GTR?
Hi, I have the same problem as you, how did you solve it?
Hi @hxue3 and @kele-git ,
The OOM issue for long videos is due to that we decode the entire compressed video into 3D tensors in preprocessing. There should be a (simple) way to only decode frames in a sliding window. I'll look into this hopefully in the following week(s), but feel free to play with it yourself.
Best, Xingyi
Hi, I have the same problem as you, how did you solve it?
Did you manage to solve it?
Is the detected model combined with the tracked model? I didn't find the output of the test. I look forward to your answer?