Real-ESRGAN icon indicating copy to clipboard operation
Real-ESRGAN copied to clipboard

Very noisy training curves when trying to fine-tune with paired data, what am I doing wrong?

Open roey1rg opened this issue 4 years ago • 4 comments

I'm trying to train Real-ESRGAN with 400 images. The training curves are very noisy, especially the perceptual loss (l_g_precep). I've tried to use various different learning rates. I'm using 4 GPUs with 5 images on each GPU image

here is the config file I use:

output-onlineyamltools (1)

roey1rg avatar Apr 10 '22 07:04 roey1rg

The logged losses are only averages of the mini batch of the one iteration at the time of logging. They are not averaged over the iterations between logs, so they are more like regular samples and thus are prone to noise.

I've implemented averaging in my own code. This is the standard logging: Screenshot_20220410_131342 And this the averaged logging: Screenshot_20220410_131152

Note: I'm using a low batch size of 8 on a single GPU.

JensDA avatar Apr 10 '22 11:04 JensDA

Thanks @JensDA, good point

roey1rg avatar Apr 11 '22 06:04 roey1rg

@roey1rg where to find the log file ?

mindFlayer123 avatar Jul 04 '22 12:07 mindFlayer123

Can you explain better the concept? Which is the script that you used to obtain that plot?

HoliSimo avatar Oct 11 '22 11:10 HoliSimo

I'm trying to train Real-ESRGAN with 400 images. The training curves are very noisy, especially the perceptual loss (l_g_precep). I've tried to use various different learning rates. I'm using 4 GPUs with 5 images on each GPU image

here is the config file I use:

output-onlineyamltools (1) Can you explain how to obtain that plot? plz

Tsouuu avatar Oct 28 '22 13:10 Tsouuu

I'm trying to train Real-ESRGAN with 400 images. The training curves are very noisy, especially the perceptual loss (l_g_precep). I've tried to use various different learning rates. I'm using 4 GPUs with 5 images on each GPU image

here is the config file I use:

output-onlineyamltools (1)

My l_g_percep loss have been increasing gradually, do you know why it haven't decreased?

FengMu1995 avatar Jun 28 '23 08:06 FengMu1995