DwanZhang

Results 11 comments of DwanZhang

> @Turlan my curve is similar to yours. I think it is because the discriminator is still learning at the beginning. I ran into the same problem; will you freeze the discriminator for...

I ran into the same problem. Could we discuss this paper further?

I would also like to know which evaluation methods were used.

BTW, I have disabled the flash attention module.