LightCompress
LightCompress copied to clipboard
How to extract input tensors for GEMM after pruning and quantization?
Hello, thank you for your work. I’m very interested in the distributions of activations and weights after quantizing a pruned model. Is there a way to extract the input tensors involved in the matrix multiplication operations? Thank you.