EscherNet

Training details

Open fradif96 opened this issue 1 year ago • 3 comments

Hello! Congratulations on the great work. I have a question about the training process. Section 3.1 says: "It builds upon an existing 2D diffusion model, inheriting its strong web-scale prior through large-scale training". However, the rest of the paper does not make clear whether the overall architecture is trained from scratch on the Objaverse dataset (rendered as in Zero123) or fine-tuned starting from pre-trained Stable Diffusion modules. Could you please clarify which is the case? Thanks in advance.

fradif96, Feb 08 '24 11:02