zero123plus How to control the output image to be in those six viewpoints without showing the input camera position？

How to control the output image to be in those six viewpoints without showing the input camera position？

Open sandydf opened this issue 2 years ago • 5 comments

Thanks for releasing the code! I would like to know how to control the output image to be in those six viewpoints without showing the input camera position.

Nov 07 '23 03:11 sandydf

Sorry but I didn't quite get your question. Could you use an example or elaborate more? Thanks.

Nov 07 '23 18:11 eliphatfs

Thank you for your reply! I want to know how you control the output image to be the fixed six viewing angles without generating images outside the six viewing angles. In other issue I see you do not explicitly use any camera pose input during training or inference, but I'm not quite sure how you control the synthesis of the novel views from those six specific angles.

Nov 08 '23 01:11 sandydf

I have another question: why does the decoding of latents directly result in one large image that includes six novel views? I don’t quite understand how this is achieved. I look forward to your reply, thank you!

Nov 08 '23 02:11 sandydf

Because the model is trained like that; and the model is able to infer the fixed novel view angles from the given input image.

Nov 08 '23 21:11 eliphatfs

Thank you for your reply, could you tell me how to train to get the model to have this ability?

Nov 09 '23 01:11 sandydf

zero123plus zero123plus copied to clipboard

How to control the output image to be in those six viewpoints without showing the input camera position？

zero123plus
zero123plus copied to clipboard