ViT-cifar10-pruning
ViT-cifar10-pruning copied to clipboard
没有实现L1 loss。论文中“In order to enforce sparsity of importance scores, we apply ℓ1 regularization on the importance scores: 𝜆∥^a∥1 and optimize it by adding on the training objective, where 𝜆 is the sparsity hyper-parameter”