admm-pruning
admm-pruning copied to clipboard
Discussion about CNN pruning
I read the codes in it and find that they just set the connection weights as zeros. And then use the whole matrix to calculate the output. I assume that when training and testing the network, the memory footprint(gpu memory) would not change.
Therefore, how pruning benefit the CNN when applying in resource limited machine?