ROMP icon indicating copy to clipboard operation
ROMP copied to clipboard

Strange Result on custom video

Open abhaygargab opened this issue 4 years ago • 4 comments

Hello @Arthur151, Congratulations on the great work and Thanks for making the code available for use.

I tried running the inference script on a custom video via: CUDA_VISIBLE_DEVICES=0 python core/test.py --gpu=0 --configs_yml=configs/video.yml

But the result is strange.. Is this because of the camera view ?? If yes then, Is there any way to tune the model for such camera views?? Screenshot from 2021-05-12 17-35-47

abhaygargab avatar May 12 '21 12:05 abhaygargab

In this case, it fails in detection. I am solving this problem by fine-tuning ROMP on detection datasets with small subjects. For now, you can crop the image into multiple small pieces and mix up the results.

Arthur151 avatar May 13 '21 01:05 Arthur151

Thank You so much for the response. So, do you that i should maybe crop the image into 4 sub-images and run the model on the 4 images separately?? In that case how should i combine the results??

Also, is there any way that i can feed the object detections or 2D pose estimations to your model by calculating them from external sources ??

abhaygargab avatar May 13 '21 12:05 abhaygargab

Yes, it is all about the scale of people in images. You may get the 3D mesh results of people with enough scale. It is complicate to get the rendering results. Currently, ROMP doesn't support the detection from the external sources.

Arthur151 avatar May 13 '21 13:05 Arthur151

@ayedaemon Sorry for accidently closing this issue. I am working on this problem too. I can't promise the exact time of releasing the related code. But the problem does allievated to some extend.

Arthur151 avatar May 14 '21 02:05 Arthur151