EscherNet icon indicating copy to clipboard operation
EscherNet copied to clipboard

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis

Results 11 EscherNet issues
Sort by recently updated
recently updated
newest added

Hi! really nice work I'm using Eschernet 6Dof and in my dataset I would need to use different intrinsics for different images, I guess it's not an issue for NeuS...

Hi, great work! Do you plan to release the code to do inference on real word objects?

Hello! Congratulations for the great work. I have one question about the training process. In Section 3.1 you say "It builds upon an existing 2D diffusion model, inheriting its strong...

I reduce the oputput views to 2 in the demo case and observe a significant performance drop. I wonder why performance excels with 25 views but suffers with 2. Could...

Hi, thanks for your amazing work. I am conducting a research about using one single top-view image to generate the entire object. I've tried many model, including zero-123 XL and...

Hi, I'd like to know how much memory is required for EscherNet? An A100 or just RTX3090 with 24GB will be enough?

Dear authors, first of all, thank you for the amazing work. I am trying to use the model on the Frianka16, but it's not clear to me how to do...

Thanks for your nice work. I'm still confused by the choice of ConvNeXt2 as the image encoder in this project. It is mentioned that the reason for employing ConvNeXt2 is...

Hi @fradif96 We don't have new modules for cross/self-attention. It's the same attention layers but just reshape the latent features from ((b t) l d) -> (b (t l) d)...