KataGo icon indicating copy to clipboard operation
KataGo copied to clipboard

Data lost in one katago network training is applied in the next net?

Open jaewoo1120 opened this issue 3 years ago • 2 comments

Even when a new network emerges, data from previous network training is uploaded. Will these data be applied when creating the next network?

jaewoo1120 avatar Aug 24 '21 01:08 jaewoo1120

Yes More specifically, training a new network make use of the newest available data (ie selfplay with the latest network) AND a small percentage of all the data from the last weeks/months: all these data are shuffled and used for training. Why doing that? Many reasons: mixing data from many nets ensures more diversity, it limits the risks of overfitting, if a net is bad then its data have a limited impact, the volume of data needed for training is high (hence, without using data from old nets, we would need to wait several days / weeks of selfplay before training a new net), etc...

Friday9i avatar Aug 24 '21 12:08 Friday9i

Thanks for the reply. I'm glad the data wasn't completely discarded. :+1:

jaewoo1120 avatar Aug 25 '21 13:08 jaewoo1120