deepnet
deepnet copied to clipboard
About optimization
Hello. Are you quite sure that history of optimizer (e.g. moments) should be zeroed at the beginning of each epoch?