VILA
Random shuffle before dropping the last few samples
I noticed a bug in the data sampler. In the original implementation, the same elements are dropped in every epoch.
For example, if the dataset size is 900 and the batch size is 200, the same 100 samples are dropped every epoch, which means they are never used in training.
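To illustrate the difference, here is a minimal sketch. It is not the actual sampler code: `epoch_indices` and `never_used` are hypothetical helpers, and the per-epoch seeding scheme is an assumption. It only models which samples survive the drop, not how the kept samples are batched or ordered afterwards.

```python
import random


def epoch_indices(dataset_size, batch_size, epoch, seed=0, shuffle_first=True):
    """Return the indices kept for one epoch after dropping the incomplete tail."""
    indices = list(range(dataset_size))
    if shuffle_first:
        # Assumption: the seed depends on the epoch, so every epoch gets a
        # different permutation and therefore a different dropped tail.
        random.Random(seed + epoch).shuffle(indices)
    # Keep the largest prefix whose length is a multiple of batch_size.
    kept = dataset_size - dataset_size % batch_size
    return indices[:kept]


def never_used(dataset_size, batch_size, num_epochs, shuffle_first):
    """Return the set of sample indices that no epoch ever keeps."""
    seen = set()
    for epoch in range(num_epochs):
        seen.update(
            epoch_indices(dataset_size, batch_size, epoch, shuffle_first=shuffle_first)
        )
    return set(range(dataset_size)) - seen
```

With a dataset size of 900 and a batch size of 200, `never_used(900, 200, 5, shuffle_first=False)` returns the indices 800 through 899, because the drop always removes the same tail. With `shuffle_first=True`, a different set of 100 samples is dropped in each epoch, so every sample is eventually seen over enough epochs.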