Daniel Firu
Results
3
comments of
Daniel Firu
this thread is cursed
are there any branches or forks of the 2 x 4bit packing?
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/python/tools/quantization/quant_utils.py#L71 QuantType still doesn't include it.