Total3DUnderstanding icon indicating copy to clipboard operation
Total3DUnderstanding copied to clipboard

RGB image or RGB-D for inference ?

Open Fizmath opened this issue 4 years ago • 2 comments

Hello

Can we use simple RGB images taken by mobile phones in prediction ?

Thanks

Fizmath avatar Feb 16 '21 07:02 Fizmath

You can but you will need to get the intrinsic matrix from your phone's camera system. You also need to finetune a 2d detector to generate 2d bounding boxes specific to the classes recognized by Total3DUnderstanding.

alando46 avatar Feb 25 '21 20:02 alando46

Thank you very much for the response.

I get camera intrinsic parameters Focal Length = 3.46 mm , Sensor Size=4.66*3.51 , Pixel Array Size=4160*3120 , Orintation = 90 degree in android by installing a device info app. So, from these we can construct the intrinsic matrix cam_K.txt, right ?

Now, I wonder how we build camera pose (extrinsic matrix) from this intrinsic matrix ? I searched the web but no straight forward answer. How did you do that in your work ?

Another question : after getting a result from your algorithm, what are the possible solutions for stitching room layouts from overlapping photos to build a complete room perimeter ? Is it possible ?

Thanks

Fizmath avatar Mar 16 '21 10:03 Fizmath