LayoutNet icon indicating copy to clipboard operation
LayoutNet copied to clipboard

3D ground truth interpretation

Open teasherm opened this issue 6 years ago • 2 comments

Hello,

First, thank you for sharing this great work. A quick question about the panoContext_box_train.t7 tensor:

The paper mentions 6 ground truth 3D parameters: sw, sl, sh, tx, tz, r_theta. The first 6 elements in the box tensor above (box[{{1}{1}{1,6}}]), which I believe contain those parameters for the first example image, read:

sw = -0.5154072972870558
sl = -0.6748731674025037
sh = -1.316387492900166
tx = -0.24216556285261603
tz = -0.2114205765327388
r_theta = 0.08283438070600802

A naive interpretation would suggest that the room is almost 3x higher than it is wide? Is there a reason for the negative scale factors? Any guidance on interpretation would be much appreciated

teasherm avatar Aug 15 '18 15:08 teasherm

@teasherm The box parameter stored in "panoContext_box_train.t7" are normalized to be zero mean and standard deviation, causing those negative scale factors. I include the preprocessing script in "preprocessPano.m", you can refer to L94-239 for computing the box parameters.

zouchuhang avatar Aug 17 '18 00:08 zouchuhang

Ah, I see. Thanks @zouchuhang !

teasherm avatar Aug 18 '18 16:08 teasherm