Vim
Vim copied to clipboard
Image Augmentation
Hi, I am aware that the authors utilized random cropping, random horizontal flipping, label-smoothing regularization, mixup, and random erasing as data augmentations. However, there hasn't been an ablation study on augmentations. Would the performance of vision mamba decrease if data augmentation was to be eliminated or modified?