DarkRank
DarkRank copied to clipboard
Data sampling
Hello! There is an example of running darkrank distillation you provided in Readme, and there you use --even-iter option. This option means that for each example in mini-batch there always would be another example of the same class. Is such way of sampling a strong requirement for the darkrank loss? Can it successfully converge with totally random data sampling?