stable-dreamfusion icon indicating copy to clipboard operation
stable-dreamfusion copied to clipboard

Round renderings with dmtet finetuning

Open mvp18 opened this issue 1 year ago • 2 comments

Description

Hi @ashawkey , thanks for your great work!

I think this issue has been referenced before - here and here. And I have tried all solutions you have suggested - commenting out the torch.compile lines and running with different seeds, but the problem still persists with torch 2.x. Here are a few examples:

https://user-images.githubusercontent.com/24255665/233854048-c0119d73-576f-4d01-af57-5a3eb6166b58.mp4

generated with python main.py -O --text "a DSLR photo of an ice cream sundae" --workspace trial2_icecream --dmtet --iters 5000 --init_ckpt trial_icecream/checkpoints/df.pth

https://user-images.githubusercontent.com/24255665/233854110-f50e0ec3-324f-4815-ae83-40c5ad8deb04.mp4

generated with python main.py -O --text "a highly detailed stone bust of Theodoros Kolokotronis" --workspace trial2_stonehead --dmtet --iters 5000 --init_ckpt trial_stonehead/checkpoints/df.pth

What I noticed from the validation log that this rounding effect starts with the very initial images of the finetuning process. Like this:

df_ep0001_0005_rgb df_ep0004_0005_rgb

I get this warning that the weights of the stable diffusion model are getting randomly initialized before the finetuning starts, but don't think that's the issue since your comments say the guidance model is not required during finetuning.

Steps to Reproduce

python main.py --workspace trial2_icecream -O --test python main.py --workspace trial2_stonehead -O --test

Expected Behavior

Can you share any inputs on how to fix this? Is it only a torch 2.x issue you think?

Environment

Debian GNU/Linux 11, Torch 2.0, CUDA 11.7

mvp18 avatar Apr 23 '23 17:04 mvp18

@mvp18 Hi, do you mean the first stage already produces a sphere? The first two videos don't seem like produced by DMTet.

ashawkey avatar Apr 24 '23 01:04 ashawkey

@ashawkey thank you so much for getting back. Yes, the first 2 videos are generated after dmtet finetuning. I think the results are similar to the issues 218 and 204.

And the round images you see above are from the validation logs in trial2_icecream/validation and trial2_stonehead/validation respectively. The ones before dmtet finetuning look perfectly fine as below:

https://user-images.githubusercontent.com/24255665/233933242-c18c4775-e9ba-4f0f-bd6b-3795024b2a77.mp4

mvp18 avatar Apr 24 '23 07:04 mvp18