Real-ESRGAN Very noisy training curves when trying to fine-tune with paired data, what am I doing wrong?

I'm trying to train Real-ESRGAN with 400 images. The training curves are very noisy, especially the perceptual loss (l_g_precep). I've tried to use various different learning rates. I'm using 4 GPUs with 5 images on each GPU

here is the config file I use:

output-onlineyamltools (1)

Apr 10 '22 07:04 roey1rg

The logged losses are only averages of the mini batch of the one iteration at the time of logging. They are not averaged over the iterations between logs, so they are more like regular samples and thus are prone to noise.

I've implemented averaging in my own code. This is the standard logging: Screenshot_20220410_131342 And this the averaged logging: Screenshot_20220410_131152

Note: I'm using a low batch size of 8 on a single GPU.

Apr 10 '22 11:04 JensDA

Thanks @JensDA, good point

Apr 11 '22 06:04 roey1rg

@roey1rg where to find the log file ?

Jul 04 '22 12:07 mindFlayer123

Can you explain better the concept? Which is the script that you used to obtain that plot?

Oct 11 '22 11:10 HoliSimo

I'm trying to train Real-ESRGAN with 400 images. The training curves are very noisy, especially the perceptual loss (l_g_precep). I've tried to use various different learning rates. I'm using 4 GPUs with 5 images on each GPU

here is the config file I use:

Can you explain how to obtain that plot? plz

Oct 28 '22 13:10 Tsouuu

I'm trying to train Real-ESRGAN with 400 images. The training curves are very noisy, especially the perceptual loss (l_g_precep). I've tried to use various different learning rates. I'm using 4 GPUs with 5 images on each GPU

here is the config file I use:

My l_g_percep loss have been increasing gradually, do you know why it haven't decreased?

Jun 28 '23 08:06 FengMu1995