peterjc123
@spacycoder I'm glad it works and it looks cleaner.
@spacycoder Yes, quantization for both `norm` and `RMSNorm` is unsupported at the moment. I wonder if you could actually do that using TFLite. But anyway, we should safely skip...
@spacycoder In terms of ops, yes, we could go through `MUL -> MEAN -> RSQRT -> MUL`. But I guess the quantization errors can't be ignored, especially for `pow` and `rsqrt`. Also,...
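For reference, a rough PyTorch sketch of that elementwise decomposition, assuming the usual RMSNorm formulation with a learnable weight and an `eps` term (names here are placeholders, not TinyNN APIs):

```python
import torch

def rms_norm_decomposed(x, weight, eps=1e-6):
    # MUL: square the input elementwise
    sq = x * x
    # MEAN: average over the last (feature) dimension
    ms = sq.mean(dim=-1, keepdim=True)
    # RSQRT: reciprocal square root of the mean (eps added for stability)
    scale = torch.rsqrt(ms + eps)
    # MUL: rescale the input, then apply the learned weight
    return x * scale * weight
```

Each of those intermediate ops would need its own quantization parameters, which is where the accumulated error mentioned above comes from.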
With #356, at least it won't throw an error for the models you provided. Quantization for those ops is still skipped.
Fixed via https://github.com/alibaba/TinyNeuralNetwork/pull/390
Just noticed that you are not using the quantized graph rewrite of TinyNN, as I can see the following options in your code: `"rewrite_graph": False` and `torch.quantization.prepare_qat`. Just FYI, the...
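For comparison, a minimal sketch of going through TinyNN's own QAT path instead, assuming the `QATQuantizer` API shown in the TinyNN examples; `model`, `dummy_input`, and the work dir are placeholders:

```python
from tinynn.graph.quantization.quantizer import QATQuantizer

quantizer = QATQuantizer(
    model,
    dummy_input,
    work_dir='out',
    config={'rewrite_graph': True},  # let TinyNN rewrite and fuse the graph before QAT
)
# quantize() returns the rewritten QAT model, replacing a manual
# torch.quantization.prepare_qat call on the original module
qat_model = quantizer.quantize()
```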
We will take a look.
It's an issue with our graph optimizer: it doesn't handle `fuse_quant_dequant=True` specially. For now, you can work around it with `optimize=4`.
You could try this option: https://github.com/alibaba/TinyNeuralNetwork/blob/main/tinynn/converter/base.py#L58
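Putting the two suggestions together, a rough sketch, assuming `fuse_quant_dequant` and `optimize` are constructor options of `TFLiteConverter` as in the linked `base.py`; the model and path names are placeholders:

```python
from tinynn.converter import TFLiteConverter

converter = TFLiteConverter(
    quantized_model,
    dummy_input,
    tflite_path='out/model_q.tflite',
    fuse_quant_dequant=True,  # the option the graph optimizer currently mishandles
    optimize=4,               # temporary workaround suggested above
)
converter.convert()
```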