stable-dreamfusion Implementing DALLE2 for 2D Image Generation

Hi all!

I saw that LAION-AI has an open source DALLE2 model hosted on huggingface at: https://huggingface.co/laion/DALLE2-PyTorch The model is pretrained and ready for use. I am going to test it out compared to StableDiffusion, but from what I've seen online, it seems that the DALLE2 architecture and output does seem to be more advanced than StableDiffusion. LAION-AI created a Github repo with inference scripts to use with the model found here: https://github.com/LAION-AI/dalle2-laion Could this be implemented in DreamFusion and thus produce better 3D models?

Dec 10 '22 22:12 AngryChihuahua04

I implemented Laionide (GLIDE), it does seem to solve the Janus problem in the squirrel case

Dec 11 '22 21:12 neverix

Cool! I'm super new to ML still, so what exactly is Laionide (GLIDE)? I saw the Github Repo they have but still not super sure what it is. How does it compare to DALLE2?

Dec 11 '22 21:12 AngryChihuahua04

@neverix could you please provide your implementation in a forked repo if possible Would be really really helpfull

Dec 12 '22 05:12 Jainam2130

Sure, https://github.com/neverix/pixel-dreamfusion

Dec 12 '22 10:12 neverix

Cool! I'm super new to ML still, so what exactly is Laionide (GLIDE)? I saw the Github Repo they have but still not super sure what it is. How does it compare to DALLE2?

It's a smaller model trained on LAION by afiaka87. Pretty stable

Dec 12 '22 11:12 neverix

@neverix great work on adding the laionide(GLIDE). Do you have some sample outputs where the Janus problem seems to be gone as you mentioned. It will be really helpful for me. Thanks

Dec 12 '22 15:12 vishalghor

I just checked, there actually is a subtle Janus problem. Going to try dalle2-pytorch instead.

https://user-images.githubusercontent.com/46641404/208136968-340478c0-5932-4296-9c87-db7889e1f211.mp4

Dec 16 '22 17:12 neverix

@neverix Hi, have you implement the dreamfusion using the dalle2 model? Could you please provide your implementation in the forked repo if possible. thanks a lot.

Dec 25 '22 08:12 xuanyuzhang21

Yes, for now I'm trying https://github.com/kakaobrain/karlo, this is in the same repo

Dec 25 '22 16:12 neverix

Yes, for now I'm trying https://github.com/kakaobrain/karlo, this is in the same repo

Thanks a lot. Great work!

Dec 26 '22 01:12 xuanyuzhang21

Yes, for now I'm trying https://github.com/kakaobrain/karlo, this is in the same repo

How are the results with implementing Karlo into Dreamfusion? Is it producing better results than Stable-Diffusion and DALLE2? Thanks for giving the link to your repo. What additional command line arguments do I need to include to use main.py with Karlo?

Jan 03 '23 22:01 AngryChihuahua04

The flag is --guidance karlo, it technically works but results are even worse that GLIDE. I'm trying to figure out why the images are so incoherent.

Jan 03 '23 22:01 neverix

Closing this issue since it's old, and @ashawkey made tons of progress since then.

May 08 '23 17:05 claforte

stable-dreamfusion stable-dreamfusion copied to clipboard

Implementing DALLE2 for 2D Image Generation

stable-dreamfusion
stable-dreamfusion copied to clipboard