PoC: Block weight quantize tool for LLM [skip ci]
- Block quantization for LLM: FullyConnected, Gather
- Decide quantize type by circle-quantizer parameter: --block_quantize_weights (Q4_0, Q8_0)
- Skip quantization by circle-quantizer parameter: --skipsize_block_quantize (default: 0)
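
For readers unfamiliar with block quantization, here is a minimal sketch of what a Q8_0-style scheme does to a weight vector. It is not the code in this PR. It assumes Q8_0 follows the widely used ggml convention (blocks of 32 values, one scale per block, signed 8-bit quants), and it keeps the scale as a Python float where the real format would typically store fp16. The names `BLOCK_SIZE`, `quantize_q8_0` and `dequantize_q8_0` are hypothetical.

```python
BLOCK_SIZE = 32  # assumption: ggml-style Q8_0 block length


def quantize_q8_0(weights):
    """Split weights into blocks and quantize each block to int8 plus one scale.

    Returns a list of (scale, quants) tuples, one per block.
    """
    if len(weights) % BLOCK_SIZE != 0:
        raise ValueError("weight count must be a multiple of BLOCK_SIZE")

    blocks = []
    for start in range(0, len(weights), BLOCK_SIZE):
        chunk = weights[start:start + BLOCK_SIZE]
        # The largest magnitude in the block maps to +/-127.
        amax = max(abs(w) for w in chunk)
        scale = amax / 127.0
        inv_scale = 1.0 / scale if scale != 0.0 else 0.0
        # Note: Python's round() uses banker's rounding; C's roundf() differs on exact ties.
        quants = [int(round(w * inv_scale)) for w in chunk]
        blocks.append((scale, quants))
    return blocks


def dequantize_q8_0(blocks):
    """Rebuild approximate float weights from (scale, quants) blocks."""
    return [scale * q for scale, quants in blocks for q in quants]


if __name__ == "__main__":
    sample = [(i - 16) * 0.25 for i in range(BLOCK_SIZE)]
    restored = dequantize_q8_0(quantize_q8_0(sample))
    print(max(abs(a - b) for a, b in zip(sample, restored)))
```

Each block carries its own scale, so a single outlier weight only degrades the precision of the 32 values that share its block rather than the whole tensor.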
Caution: This is a PoC for the circle format and for test model generation, not for the compiler implementation.
#13742 #13743