DiT icon indicating copy to clipboard operation
DiT copied to clipboard

The model could not be fitted if not predict xstart

Open JJLi0427 opened this issue 1 year ago • 5 comments

Thank you so much for sharing this great work.
I try to use DiT to my dataset, but I meet some problem when I train model using predict_xstart = False. The loss down a little and seem to stuck at a hight level and won't be down, when I use predict_xstart = True the loss down quickly.

JJLi0427 avatar May 11 '24 03:05 JJLi0427

So can you confirm that when using predict_xtart=True, the result would be much better?

santisy avatar May 14 '24 21:05 santisy

So can you confirm that when using predict_xtart=True, the result would be much better?

According to my dataset experiment predict_xtart=True can converge quickly

JJLi0427 avatar May 16 '24 02:05 JJLi0427

According to my dataset experiment predict_xtart=True can converge quickly

@JJLi0427 Hello, I encountered the same problem, have you found the reason?

XijiaWang avatar Jul 16 '24 14:07 XijiaWang

According to my dataset experiment predict_xtart=True can converge quickly

@JJLi0427 Hello, I encountered the same problem, have you found the reason?

This seems to be because there is a problem with the implementation of the denoising function, and I only used predict xstart to do my task later.

JJLi0427 avatar Jul 17 '24 01:07 JJLi0427

I had a similar problem. When I added additional modules, the loss didn't converge at all. May be I should modify predict_xtart.

OlliverYang avatar Feb 17 '25 02:02 OlliverYang