zero123plus icon indicating copy to clipboard operation
zero123plus copied to clipboard

How to control the output image to be in those six viewpoints without showing the input camera position?

Open sandydf opened this issue 2 years ago • 5 comments

Thanks for releasing the code! I would like to know how to control the output image to be in those six viewpoints without showing the input camera position.

sandydf avatar Nov 07 '23 03:11 sandydf

Sorry but I didn't quite get your question. Could you use an example or elaborate more? Thanks.

eliphatfs avatar Nov 07 '23 18:11 eliphatfs

Thank you for your reply! I want to know how you control the output image to be the fixed six viewing angles without generating images outside the six viewing angles. In other issue I see you do not explicitly use any camera pose input during training or inference, but I'm not quite sure how you control the synthesis of the novel views from those six specific angles.

sandydf avatar Nov 08 '23 01:11 sandydf

I have another question: why does the decoding of latents directly result in one large image that includes six novel views? I don’t quite understand how this is achieved. I look forward to your reply, thank you!

sandydf avatar Nov 08 '23 02:11 sandydf

Because the model is trained like that; and the model is able to infer the fixed novel view angles from the given input image.

eliphatfs avatar Nov 08 '23 21:11 eliphatfs

Thank you for your reply, could you tell me how to train to get the model to have this ability?

sandydf avatar Nov 09 '23 01:11 sandydf