neural-compressor
neural-compressor copied to clipboard
Detailed information about sparsity algorithm?
@airMeng Hi, can you provide some more detailed information about the sparsity method on this page ? what does the sparsity pattern mean? (ex, 2x1, 16x4 )
Hi wenjingk, we prefer tile-wise sparsity in INC, which is balance between accuracy and performance. Tile-wise sparsity means to divide the whole matrix into tiles, if there are non-zero elements in the tile, the tile is called non-zero tiles. the 2x1, 16x4 corresponds to shapes of tile, like 2x1 means the tile is in 2x1shape, and the below image demonstrates so-called 4x1

For more details like why we choose 2x1, you can refer to some of my gist, not totally the same but similar. We will give more details soon, and feel free to raise any issues!
cc @ftian1 @XinyuYe-Intel
Closed due to no further question.