segment-anything icon indicating copy to clipboard operation
segment-anything copied to clipboard

Can I run it in video/real-time?

Open rkz98 opened this issue 2 years ago • 7 comments

Can I run it in video/real-time? Or just in images for now?

rkz98 avatar Apr 18 '23 04:04 rkz98

Like OVIS: http://songbai.site/ovis/index.html#visualization

rkz98 avatar Apr 18 '23 04:04 rkz98

@rkz98 you can run it on realtime /video also, first you need to detect the object, use object detector & pass that detected object frame by frame .

akashAD98 avatar Apr 18 '23 08:04 akashAD98

I'd like to know this as well. dont care how small image or whatevre, just want it running real time. But right now for me swapping out the set image is taking 45 second no matter what size the image passed to it is

RugnirViking avatar Apr 20 '23 15:04 RugnirViking

Take a look at https://github.com/z-x-yang/Segment-and-Track-Anything

bhack avatar Apr 21 '23 00:04 bhack

No, this cannot be run in real time (at least not more than about 5-10 FPS). Every video frame needs to have a heavyweight image feature detector run first, before the segmenter runs. Even on an A100, the feature detector takes some 150ms per image.

dakoner avatar Apr 21 '23 22:04 dakoner

Yes, checkout https://github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once. they combine models together to solve problem

hungtooc avatar Apr 22 '23 04:04 hungtooc

I haven't test the fps. this also depends on the gpu you are using. On one V100 gpu, a 20s video will take 20s to process in total. Something like this. The hugging face demo takes longer because of upload video.

MaureenZOU avatar Apr 25 '23 07:04 MaureenZOU