DALLE-pytorch icon indicating copy to clipboard operation
DALLE-pytorch copied to clipboard

Train with taming's VQGAN

Open WocMaker opened this issue 3 years ago • 8 comments

Hi, I was training on my own data set with taming's VQ-GAN. There's some error with the dimension with the vae model image image Can anyone help me with it? How can I change this part to let it work?

WocMaker avatar Jun 30 '21 18:06 WocMaker

@rom1504 could this be related to your recent PR?

lucidrains avatar Jun 30 '21 23:06 lucidrains

@rom1504 could this be related to your recent PR? Hi lucidrain, I had checked rom's dalle server respo before. I still have the PR. Do I need to modifiy more, besides adding the args --taming to use VAGAN as vae? image

WocMaker avatar Jul 01 '21 07:07 WocMaker

Since when I use open AI's vae with my own dataset, the loss is not decreasing. image

WocMaker avatar Jul 01 '21 07:07 WocMaker

@lucidrains maybe. I will check today (and fix if there's indeed a regression)

@WocMaker no taming option is indeed all that is needed.

The results with openAI vae are indeed as expected (and as I measured in my own experiments). It's about 4 to 8 times slower to converge when using openAI vae (mostly due to the 1024 seq length).

rom1504 avatar Jul 01 '21 10:07 rom1504

@rom1504 Thanks, Romain. Maybe there's some problem with my dataset. It's werid that my dataset can run with openAI vae, but it result in dim error for VQ-GAN. May I just ask for double check that, the dataset format needed is the same with these two vae setting up? orz

WocMaker avatar Jul 01 '21 10:07 WocMaker

hi @WocMaker , I don't think there is a problem with your dataset. There was a recently introduced bug in Dalle-pytorch code, that I fixed in https://github.com/lucidrains/DALLE-pytorch/pull/329 You can directly use this change or wait for it to be merged

rom1504 avatar Jul 01 '21 13:07 rom1504

@rom1504 thank you! merged 🙏

lucidrains avatar Jul 01 '21 15:07 lucidrains

@rom1504 @lucidrains Thank you so much!!!

WocMaker avatar Jul 01 '21 16:07 WocMaker