intgemm
intgemm copied to clipboard
Multiple quantization weights
Allow parts of matrices to have different quantization multipliers: https://github.com/marian-nmt/marian-dev/blob/master/src/tensors/cpu/fbgemm/packed_gemm.cpp#L368