brevitas
brevitas copied to clipboard
Feat: Support for Groupwise (MX) quantization
This implements:
- New GroupwiseQuantTensor for Int and Float
- Relevant Proxy classes
- MX Float based quantizers
- One notebook to test instantiation and execution
Missing:
- Export
- Default MXInt quantizers?