How to extract input tensors for GEMM after pruning and quantization?

Open RealJustinNi opened this issue 5 months ago • 0 comments

Hello, thank you for your work. I’m very interested in the distributions of activations and weights after quantizing a pruned model. Is there a way to extract the input tensors involved in the matrix multiplication operations? Thank you.

Aug 01 '25 01:08 RealJustinNi