3D Location prediction

Open silence-cho opened this issue 1 year ago • 1 comments

https://github.com/brjathu/PHALP/blob/master/PHALP.py#:~:text=y_p_%20%3D%20(y_p%2D0.5)np.exp(n_p)/5000.0256.0

Thanks for your great work. I am reading your code and paper, but I am confused about the meaning of 5000 and 256 in the code. Could you explain the location prediction in more detail?

silence-cho avatar Sep 05 '22 03:09 silence-cho

I think it is a perspective projection with focal_length = 5000 and an image width of 256, where np.exp(n_p) is the depth value in camera coordinates.

After that line, x_p_ would maybe be a coordinate in image space.

markkim1115 avatar Oct 28 '22 02:10 markkim1115

@silence-cho Yes, the focal length is 5000, and we compute the x, y distances in a normalized (256) space. We predict x and y in pixel space and project them back into 3D coordinates in this line (https://github.com/brjathu/PHALP/blob/1d49d1e5173fbf556e77da6d54c1540c505ba2e2/phalp/trackers/PHALP.py#LL500C19-L500C58). We have also fully updated the code to support better models; please feel free to try it, and let us know if you have any questions.

brjathu avatar Jun 13 '23 23:06 brjathu
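
For readers following along, here is a minimal sketch of the back-projection discussed above. It is not the PHALP implementation. It assumes that the operators dropped from the quoted expression are multiplications, that `x_p` and `y_p` are normalized to [0, 1] with 0.5 at the image center, and that `n_p` is a log-depth (as suggested in the earlier comment). The names `backproject_xy` and `project_xy` are hypothetical and exist only for illustration.

```python
import numpy as np

FOCAL_LENGTH = 5000.0  # focal length stated in the thread
CROP_SIZE = 256.0      # size of the normalized space stated in the thread


def backproject_xy(x_p, y_p, n_p):
    """Map normalized image coordinates and log-depth to camera-space X, Y.

    Assumption: n_p is log-depth, so np.exp(n_p) is the depth along the
    camera axis, and (x_p, y_p) lie in [0, 1] with 0.5 at the image center.
    """
    depth = np.exp(n_p)
    scale = depth / FOCAL_LENGTH * CROP_SIZE
    return (x_p - 0.5) * scale, (y_p - 0.5) * scale


def project_xy(x_c, y_c, n_p):
    """Hypothetical inverse: pinhole projection back to normalized coordinates."""
    depth = np.exp(n_p)
    scale = FOCAL_LENGTH / (depth * CROP_SIZE)
    return x_c * scale + 0.5, y_c * scale + 0.5


# Round trip: a point right of center, at an assumed depth of 50 units.
n_p = np.log(50.0)
x_c, y_c = backproject_xy(0.75, 0.5, n_p)
x_p, y_p = project_xy(x_c, y_c, n_p)
```

Under these assumptions, the offset from the image center is first converted to pixels in the 256 space and then scaled by depth / focal length, which is the standard pinhole relation solved for the camera-space coordinate.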

Closing due to inactivity; please reopen if you have any questions.

brjathu avatar Dec 30 '23 01:12 brjathu