brevitas
brevitas copied to clipboard
Initial support for Q(C)DQ and Clip in QOps
Supports export to Q(C)DQ and extend QOps with clipping. (C) as in clipping allows to correctly low precision quantization (e.g. 4b) by clipping the output of QuantizeLinear.
@cmcgirr-amd