HIST
HIST copied to clipboard
Whether the training loss converges normally?
Hello, I have attempted to reproduce the Hist model using my dataset, but I am encountering oscillating and non-converging training loss. I was wondering if the training loss in your experiment converges normally.