stable-dreamfusion icon indicating copy to clipboard operation
stable-dreamfusion copied to clipboard

Implementing DALLE2 for 2D Image Generation

Open AngryChihuahua04 opened this issue 2 years ago • 12 comments

Hi all!

I saw that LAION-AI has an open source DALLE2 model hosted on huggingface at: https://huggingface.co/laion/DALLE2-PyTorch The model is pretrained and ready for use. I am going to test it out compared to StableDiffusion, but from what I've seen online, it seems that the DALLE2 architecture and output does seem to be more advanced than StableDiffusion. LAION-AI created a Github repo with inference scripts to use with the model found here: https://github.com/LAION-AI/dalle2-laion Could this be implemented in DreamFusion and thus produce better 3D models?

AngryChihuahua04 avatar Dec 10 '22 22:12 AngryChihuahua04

I implemented Laionide (GLIDE), it does seem to solve the Janus problem in the squirrel case

neverix avatar Dec 11 '22 21:12 neverix

Cool! I'm super new to ML still, so what exactly is Laionide (GLIDE)? I saw the Github Repo they have but still not super sure what it is. How does it compare to DALLE2?

AngryChihuahua04 avatar Dec 11 '22 21:12 AngryChihuahua04

@neverix could you please provide your implementation in a forked repo if possible Would be really really helpfull

Jainam2130 avatar Dec 12 '22 05:12 Jainam2130

Sure, https://github.com/neverix/pixel-dreamfusion

neverix avatar Dec 12 '22 10:12 neverix

Cool! I'm super new to ML still, so what exactly is Laionide (GLIDE)? I saw the Github Repo they have but still not super sure what it is. How does it compare to DALLE2?

It's a smaller model trained on LAION by afiaka87. Pretty stable

neverix avatar Dec 12 '22 11:12 neverix

@neverix great work on adding the laionide(GLIDE). Do you have some sample outputs where the Janus problem seems to be gone as you mentioned. It will be really helpful for me. Thanks

vishalghor avatar Dec 12 '22 15:12 vishalghor

I just checked, there actually is a subtle Janus problem. Going to try dalle2-pytorch instead.

https://user-images.githubusercontent.com/46641404/208136968-340478c0-5932-4296-9c87-db7889e1f211.mp4

neverix avatar Dec 16 '22 17:12 neverix

@neverix Hi, have you implement the dreamfusion using the dalle2 model? Could you please provide your implementation in the forked repo if possible. thanks a lot.

xuanyuzhang21 avatar Dec 25 '22 08:12 xuanyuzhang21

Yes, for now I'm trying https://github.com/kakaobrain/karlo, this is in the same repo

neverix avatar Dec 25 '22 16:12 neverix

Yes, for now I'm trying https://github.com/kakaobrain/karlo, this is in the same repo

Thanks a lot. Great work!

xuanyuzhang21 avatar Dec 26 '22 01:12 xuanyuzhang21

Yes, for now I'm trying https://github.com/kakaobrain/karlo, this is in the same repo

How are the results with implementing Karlo into Dreamfusion? Is it producing better results than Stable-Diffusion and DALLE2? Thanks for giving the link to your repo. What additional command line arguments do I need to include to use main.py with Karlo?

AngryChihuahua04 avatar Jan 03 '23 22:01 AngryChihuahua04

The flag is --guidance karlo, it technically works but results are even worse that GLIDE. I'm trying to figure out why the images are so incoherent.

neverix avatar Jan 03 '23 22:01 neverix

Closing this issue since it's old, and @ashawkey made tons of progress since then.

claforte avatar May 08 '23 17:05 claforte