FlexGen
FlexGen copied to clipboard
Add support for symmetric quantization
This PR adds support for symmetric quantization when compressing/decompressing tensors. This is useful for comparing the performance of both symmetric and asymmetric quantization.
- supports storing compressed tensors without zero point
- adds some bitwise operations to support signed quantized values