Total3DUnderstanding RGB image or RGB-D for inference ?

Open Fizmath opened this issue 4 years ago • 2 comments

Hello

Can we use plain RGB images taken with a mobile phone for prediction?

Thanks

Feb 16 '21 07:02 Fizmath

You can, but you will need to get the intrinsic matrix from your phone's camera system. You will also need to fine-tune a 2D detector to generate 2D bounding boxes specific to the classes recognized by Total3DUnderstanding.

Feb 25 '21 20:02 alando46

Thank you very much for the response.

I got the camera's intrinsic parameters on Android by installing a device info app: Focal Length = 3.46 mm, Sensor Size = 4.66 × 3.51, Pixel Array Size = 4160 × 3120, Orientation = 90 degrees. So, from these we can construct the intrinsic matrix cam_K.txt, right?
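For reference, here is a minimal sketch of how those numbers could be turned into a 3×3 pinhole intrinsic matrix. It rests on assumptions that are not confirmed anywhere in this thread: the sensor size is in millimetres, the principal point sits at the image centre, and there is no skew or lens distortion. The function names `intrinsic_matrix` and `format_matrix` are hypothetical helpers, and the plain-text layout (three whitespace-separated rows) is only a guess at what cam_K.txt should look like.

```python
def intrinsic_matrix(focal_mm, sensor_w_mm, sensor_h_mm, width_px, height_px):
    """Build a 3x3 pinhole intrinsic matrix as nested lists.

    Assumptions (not taken from the repository): sensor size is in mm,
    the principal point is the image centre, and skew is zero.
    """
    # Focal length in pixels = focal length in mm * (pixels per mm).
    fx = focal_mm * width_px / sensor_w_mm
    fy = focal_mm * height_px / sensor_h_mm
    # Principal point assumed to be at the centre of the image.
    cx = width_px / 2.0
    cy = height_px / 2.0
    return [
        [fx, 0.0, cx],
        [0.0, fy, cy],
        [0.0, 0.0, 1.0],
    ]


def format_matrix(K):
    """Render the matrix as three whitespace-separated text rows."""
    return "\n".join(" ".join(f"{value:.6f}" for value in row) for row in K)


# Values reported by the device info app above.
K = intrinsic_matrix(3.46, 4.66, 3.51, 4160, 3120)
print(format_matrix(K))
```

Two caveats: if the photo is stored in portrait (the 90 degree orientation), the width and height, together with the matching sensor dimensions, would need to be swapped; and if the image is resized before inference, fx, fy, cx and cy have to be scaled by the same factors. A proper calibration with a checkerboard would give more reliable values than the data-sheet numbers.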

Now I wonder how to build the camera pose (extrinsic matrix) from this intrinsic matrix. I searched the web but found no straightforward answer. How did you do that in your work?

Another question: after getting a result from your algorithm, what are the possible approaches for stitching room layouts from overlapping photos to build a complete room perimeter? Is that possible?

Thanks

Mar 16 '21 10:03 Fizmath
