Cutie icon indicating copy to clipboard operation
Cutie copied to clipboard

Issues with long videos. (1530 frames). Video Quality (720p).

Open charchit7 opened this issue 9 months ago • 3 comments

Hey, @hkchengrex I was runnning the demo colab script to test out an object (sofa) in my case. And it failed after few frames. Do you know why is that the case? I am assuming because a human subject came into the picture and went away the memory propagation failed? ( issues with the original Xmem paper).

Please let me know your thoughts on it. (check_frame_human is the image when the human came into the frame). check_frame2 check_frame3 check_frame check_frame_human

charchit7 avatar May 22 '24 11:05 charchit7

I am not sure what I am looking at here. Most of these masks look decent -- or am I parsing the scene wrong? Secondly you mentioned that it fails after a few frames, but these frames do not look continuous.

hkchengrex avatar May 23 '24 16:05 hkchengrex

Yeah, I shared intermediate frames for reference. Failed case is if you look at the first image given above. Sharing result and input videos with you : Drive Link Input : obj1_720p.mp4, result : results_obj1.mp4

Please let me know whenever you get time to check. @hkchengrex Thanks :) Have two questions wrt results :

  • Why is the output video not segmenting only one object which is the sofa. ( you can see the color variations in the video frames).
  • You can see the segmentation getting distorted in the later frames. I used your colab tutorial, how can I segment the masks separately for visualisation.

charchit7 avatar May 24 '24 07:05 charchit7