2kpr

Results 2 comments of 2kpr

> Same here, but only when using 8-bit Adam. Without it, it works perfectly fine. It seems like 8-bit Adam causes the model to overfit very quickly. I've...

> > Hi,
> >
> > Thanks a lot for your interest in our work. Currently we don't have a Google Colab. But we have a Hugging Face demo based on diffusers...