zjutzyl
Same issue here. I set a large weight decay to avoid it. I suspect that with 'update = sign * lr', abs(parameter) keeps growing for as long as the sign does not change.
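A toy sketch of that guess, in case it helps anyone check it. It assumes the step is always `lr` times a sign of +1 or -1, and that weight decay is applied in decoupled form (`w -= lr * weight_decay * w`). The function `run` and all the numbers are made up for illustration; this is not taken from the optimizer's actual code.

```python
def run(steps, lr, weight_decay, sign=1.0, w0=0.0):
    """Apply a fixed-magnitude sign step whose sign never flips (assumed)."""
    w = w0
    for _ in range(steps):
        # Decoupled weight decay pulls w toward zero in proportion to w,
        # while the sign step always moves it by exactly lr.
        w = w - lr * weight_decay * w + lr * sign
    return w


# Without decay the parameter grows linearly: steps * lr.
no_decay = run(steps=1000, lr=0.01, weight_decay=0.0)

# With decay the growth saturates near sign / weight_decay.
with_decay = run(steps=1000, lr=0.01, weight_decay=1.0)

print(no_decay, with_decay)
```

In this toy setup the parameter without decay ends up near `steps * lr`, while the one with decay levels off near `sign / weight_decay`, so a larger weight decay gives a tighter bound. That would match what I saw, but it is only a model of the update, not a diagnosis.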