cnn-quantization

Quantization of Convolutional Neural Networks.

Results 14 cnn-quantization issues

Hello, Thanks for the great codebase. I tried `Experiment W4A4 + ACIQ + Bit Alloc(A) + Bit Alloc(W) + Bias correction` by `python inference/inference_sim.py -a resnet50 -b 512 -pcq_w -pcq_a...

Hi, thank you for sharing the source code of your work, it's amazing. I would like to inquire about pure integer inference using bias or variance correction as indicated in...

What is the formula for per-channel activation quantization?

It seems like you will work out a new set of scales for activations on every batch, and use the new set of scales to quantize and dequantize activations?
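The behaviour this question describes, recomputing one scale per channel on every batch and then quantizing and dequantizing with it, can be sketched as below. This is only an illustrative sketch, not the repository's implementation: it assumes plain asymmetric min/max quantization (no ACIQ clipping), activations given as nested lists of shape `[N][C][K]`, and the function name `per_channel_fake_quantize` is hypothetical.

```python
def per_channel_fake_quantize(batch, num_bits=4):
    """Quantize and dequantize a batch with one (offset, scale) pair per channel.

    batch: nested lists shaped [N][C][K] (samples, channels, flattened values).
    Assumption: ranges come from the min/max of the current batch only.
    """
    levels = 2 ** num_bits - 1  # highest integer code, e.g. 15 for 4 bits
    num_channels = len(batch[0])

    # Step 1: per-channel range over every sample in this batch.
    ranges = []
    for c in range(num_channels):
        values = [v for sample in batch for v in sample[c]]
        lo, hi = min(values), max(values)
        # Guard against a constant channel to avoid division by zero.
        scale = (hi - lo) / levels if hi > lo else 1.0
        ranges.append((lo, scale))

    # Step 2: map to integer codes, clamp, then map back to floats.
    out = []
    for sample in batch:
        new_sample = []
        for c, channel in enumerate(sample):
            lo, scale = ranges[c]
            codes = [min(levels, max(0, round((v - lo) / scale))) for v in channel]
            new_sample.append([q * scale + lo for q in codes])
        out.append(new_sample)
    return out, ranges


batch = [
    [[0.0, 1.5], [-2.0, 2.0]],
    [[3.0, 0.75], [0.0, 1.0]],
]
dequantized, ranges = per_channel_fake_quantize(batch, num_bits=4)
```

Because the ranges are derived from the batch itself, the same input can be quantized differently depending on which other samples share its batch, which is the concern the question raises.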

I couldn't reproduce the results with the examples provided in the readme on my 4 GPUs. So I used batch 256 on only 1 GPU and it works. Adding one...

Using the example with `--device cpu` leads to a crash: `python inference/inference_sim.py -a resnet50 -b 512 -pcq_w -pcq_a -sh --qtype int4 -qw int4 --device cpu` The bug comes from calling torch.cuda.clear()....

I've seen this as a previous issue but it was closed. I have access to the required HW but I am still encountering the error message shown below: Traceback (most...

Can you please explain what needs to be changed to fix the following error? Thank you. `python inference/inference_sim.py -a resnet50 -b 512` /home/user/anaconda3/lib/python3.7/site-packages/yaml/constructor.py:126: DeprecationWarning: Using or importing the ABCs from...

Hi, when building the CUDA kernels for GEMMLOWP, the `./build_all.sh` step fails with an error: ``` ************************************************************** Building int quantization kernels ************************************************************** running install running bdist_egg running egg_info writing int_quantization.egg-info/PKG-INFO writing dependency_links to...