AojunZhou

@KangolHsu 0.007812? Maybe you truncated 0.0078125 (2^-7).
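
For the curious, a minimal C++ check of this: 2^-7 is exactly 0.0078125, and printing it at the default six decimal places yields 0.007812.

```cpp
#include <cmath>
#include <cstdio>

int main() {
  double w = std::ldexp(1.0, -7);  // 2^-7 = 0.0078125 exactly
  std::printf("%f\n", w);          // "0.007812" -- six decimals drop the final 5
  std::printf("%.7f\n", w);        // "0.0078125" -- the full value
  return 0;
}
```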

Yes, for 4 bits you can use 15 non-zero values + 0. Unbalanced quantization doesn't affect the INQ result; INQ is very flexible, and you can adjust the bit-width according to your task, ...
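
As a minimal sketch of what such an unbalanced 4-bit power-of-two codebook could look like: 0 plus 15 non-zero codes means one sign necessarily gets one more code than the other. The 8/7 split and the choice of n1 below are assumptions for illustration, not the repository's exact layout.

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Build a 4-bit codebook: 0 plus 15 non-zero powers of two.
// With 15 (odd) non-zero values, one sign gets one more code than
// the other -- hence "unbalanced" quantization.
std::vector<double> make_codebook(int n1) {
  std::vector<double> code = {0.0};
  for (int n = n1; n > n1 - 8; --n)    // 8 negative codes (assumed split)
    code.push_back(-std::ldexp(1.0, n));
  for (int n = n1; n > n1 - 7; --n)    // 7 positive codes
    code.push_back(std::ldexp(1.0, n));
  return code;                         // 16 entries total
}

int main() {
  for (double v : make_codebook(-1)) std::printf("%g ", v);
  std::printf("\n");
  return 0;
}
```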

@mynameischaos Yes, network quantization can accelerate inference, but your hardware must support low-precision bit-shift operations. You can pay close attention to the Intel Altera and Intel Movidius products.
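
To make the bit-shift point concrete, here is a sketch of why power-of-two weights need no multiplier: in fixed point, multiplying by ±2^-k reduces to an arithmetic shift. The Q-format and names are illustrative, not from any particular hardware.

```cpp
#include <cstdint>
#include <cstdio>

// Multiply a fixed-point activation by a weight of the form sign * 2^(-k)
// using only a shift -- no hardware multiplier needed.
int32_t mul_pow2(int32_t act, int sign, int k) {
  int32_t shifted = act >> k;          // act * 2^(-k) via arithmetic shift
  return sign >= 0 ? shifted : -shifted;
}

int main() {
  // Example: activation 1000 (in some Q-format) times weight -2^(-3) = -0.125
  std::printf("%d\n", mul_pow2(1000, -1, 3));  // prints -125
  return 0;
}
```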

@lamperouge11 For ImageNet, the weights of most ConvNet models (ResNet, GoogLeNet, AlexNet, VGG) lie within [-1, 1].

@TwistedfateKing Line 235 of /src/caffe/solvers/sgd_solver.cpp. For the case where only one weight is still float, I fixed `std::abs(data_vec[i]) > data_copy[partition]` (line 536) in src/caffe/blob.cpp.
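
For readers without the repo open, a sketch of the magnitude-partition idea behind that comparison: find the partition-th largest |w| with std::nth_element and quantize the weights whose magnitude is strictly above it. The function below is hypothetical, not the actual blob.cpp code.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <functional>
#include <vector>

// Decide which weights enter the quantized partition this INQ step:
// those whose magnitude is strictly above the value at index `partition`.
// Mirrors the quoted test std::abs(data_vec[i]) > data_copy[partition].
// Assumes partition < data_vec.size().
std::vector<bool> pick_quantized(const std::vector<float>& data_vec,
                                 std::size_t partition) {
  std::vector<float> data_copy(data_vec.size());
  for (std::size_t i = 0; i < data_vec.size(); ++i)
    data_copy[i] = std::abs(data_vec[i]);
  // Place the partition-th largest magnitude at index `partition`.
  std::nth_element(data_copy.begin(), data_copy.begin() + partition,
                   data_copy.end(), std::greater<float>());
  std::vector<bool> quantize(data_vec.size());
  for (std::size_t i = 0; i < data_vec.size(); ++i)
    quantize[i] = std::abs(data_vec[i]) > data_copy[partition];
  return quantize;
}
```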

@KangolHsu Sorry, I don't know; the INQ hardware implementation was completed by another team.

@TwistedfateKing Yes, the default value 7 corresponds to 5 bits in the paper. You can modify it: 3 for 4 bits, 1 for 3 bits, 0 for 2 bits.
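
Those pairs (7 → 5 bits, 3 → 4 bits, 1 → 3 bits, 0 → 2 bits) all fit the pattern 2^(b-2) - 1. Treating that pattern as an assumption rather than a documented rule, the parameter for a target bit-width b would be:

```cpp
// Assumed pattern inferred from the pairs above: value = 2^(b-2) - 1.
int param_for_bits(int b) {
  return (1 << (b - 2)) - 1;  // b=5 -> 7, b=4 -> 3, b=3 -> 1, b=2 -> 0
}
```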

n2 = n1 + 1 - 2^(b-1)/2. For instance, if b = 3 and n1 = -1, then n2 = -1 + 1 - 2^(3-1)/2 = -2; if b = 5, n2 = -1 + 1 - 2^(5-1)/2 = -8.
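
As a short sketch of both steps: n2 follows from n1 and the bit-width b exactly as above, and the INQ paper derives n1 from the largest absolute weight s as n1 = floor(log2(4s/3)). Function names here are illustrative.

```cpp
#include <cmath>

// n1 from the max absolute weight s, as in the INQ paper:
// n1 = floor(log2(4s/3)).
int compute_n1(float s) {
  return static_cast<int>(std::floor(std::log2(4.0f * s / 3.0f)));
}

// n2 from n1 and the bit-width b: n2 = n1 + 1 - 2^(b-1)/2.
int compute_n2(int n1, int b) {
  return n1 + 1 - (1 << (b - 1)) / 2;
}
// compute_n2(-1, 3) == -2 and compute_n2(-1, 5) == -8,
// matching the worked examples above.
```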

You can read the code at ./src/caffe/blob.cpp, line 480; I am writing the README and tutorial.