YUI comments

Results 15 comments of

YUI

Qustion about the update method of ema_model

Good questions. First, `param.data.mul_(1 - self.wd)` is weight decay.It is also used in the official code (https://github.com/google-research/mixmatch/blob/master/mixmatch.py#L92), but it is different from the normal weight decay of pytorch (which is...

Unlabelled batch size

In Sec. 3.1 of the paper, ''For each u in the batch of unlabeled data U, we generate K augmentations.'' And K is 2 in the paper, so u1 and...

Unlabelled batch size

They are from the same batch but with different augmentations. See here: https://github.com/YU1ut/MixMatch-pytorch/blob/master/dataset/cifar10.py#L7-L14

interleave function

See this: https://github.com/YU1ut/MixMatch-pytorch/issues/20#issuecomment-583770295

The question about batch_size/val_iteration/lr.

In my opinion, this method needs to see the same sample with different augmentations a lot of times and get enough mixup samples to improve the performance. So, it is...

The bias term of convolution in WideResNet

Did you try to initialize the bias in Conv2D to 0?

How to adjust α, λ u in our own data set? Any tips

First, try lambda-u = 0 and change alpha to get the best performance, I think alpha=0.2 is a good start. Then, try to change lambda-u. You also should try different...

Question about the efficiency of parameter T

Thank you for reporting this. I will investigate this.

scores in Beam search should be multiplied or added?

Because the score is logarithmic. https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning/blob/master/eval.py#L111

About the pre-training

No. Only base classes are used in pre-training. https://github.com/YU1ut/imprinted-weights/blob/master/loader.py#L27 And the network only classifies 100 classes. https://github.com/YU1ut/imprinted-weights/blob/master/models.py#L6