
What is the meaning of mode_stu and model_tea?

Open yokings opened this issue 5 years ago • 2 comments

yokings • Aug 20 '19 02:08

Quoting the paper: "We also apply the strategy of moving average weights proposed in [45]". I suppose "tea" stands for teacher and "stu" for student. The idea is that the teacher's weights are kept as a moving average of the student's weights over training, which improves the generalisation of the learned model.

[45] Tarvainen, A., Valpola, H.: Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In: Advances in Neural Information Processing Systems, pp. 1195–1204 (2017)
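
For illustration, the moving-average idea can be sketched roughly as follows. This is not the repository's code, just a minimal sketch: plain dicts of floats stand in for model parameters, and the helper name `ema_update`, the `decay` value, and the fake "training step" are all assumptions made for the example.

```python
def ema_update(teacher, student, decay=0.99):
    """Move each teacher weight towards the matching student weight.

    teacher = decay * teacher + (1 - decay) * student
    (hypothetical helper; `decay` is an assumed hyperparameter)
    """
    for name, s_value in student.items():
        teacher[name] = decay * teacher[name] + (1.0 - decay) * s_value
    return teacher


# Toy "models": dicts mapping parameter names to scalar weights.
model_stu = {"w": 1.0, "b": 0.0}
model_tea = dict(model_stu)  # the teacher starts as a copy of the student

for step in range(3):
    # Stand-in for a gradient step on the student (not a real optimizer).
    model_stu["w"] += 1.0
    # The teacher is never trained directly; it only tracks the student.
    ema_update(model_tea, model_stu, decay=0.5)

print(model_stu["w"], model_tea["w"])  # 4.0 3.125
```

With a real network the same update would be applied to every parameter tensor after each optimizer step, and the smoothed teacher is typically the model used for evaluation.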

dominikandreas • Aug 20 '19 04:08

thank you

yokings • Aug 20 '19 05:08