
Brevitas: neural network quantization in PyTorch

214 brevitas issues

Add support for quantized (Bi-)RNN/GRU/LSTM, implemented by leveraging TorchScript.
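A hypothetical sketch of what such a layer could look like from the user's side; `qnn.QuantLSTM` and its arguments are assumptions here, guessed to mirror the `torch.nn.LSTM` interface rather than a confirmed Brevitas API:

```python
import torch
import brevitas.nn as qnn

# Assumed API: qnn.QuantLSTM mirroring torch.nn.LSTM, not confirmed by the issue.
lstm = qnn.QuantLSTM(input_size=16, hidden_size=32, batch_first=True)
x = torch.randn(4, 10, 16)  # (batch, seq_len, input_size)
out, _ = lstm(x)
```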

For example, running the following code gives `nan` for the weights' grad:

```python
import torch
from torch import nn
import brevitas.nn as qnn

net = nn.Sequential(
    qnn.QuantIdentity(bit_width=1, return_quant_tensor=True),
    qnn.QuantLinear(10, 10, bias=False, weight_bit_width=1),
)
a = torch.rand((1, 10), requires_grad=True)
net(a).sum().backward()
```
...

Following issue #363, I have tried to do what you suggested, but I am a bit confused about how to do it. My confusion stems from the fact that...

Hello, I would like to use any of the quantizers that inherit from `WeightQuantProxyFromInjector`, but I would like to change the quantization formula, namely the [`impl`](https://github.com/Xilinx/brevitas/blob/master/src/brevitas/proxy/parameter_quant.py#L143) (or alternatively...
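For reference, the common customization route in Brevitas is to subclass an existing injector-driven quantizer, override attributes on the subclass, and pass it to a quant layer via `weight_quant`. This is a minimal sketch of that pattern, not necessarily the specific `impl` swap the issue asks about:

```python
import brevitas.nn as qnn
from brevitas.quant import Int8WeightPerTensorFloat

# Subclass an existing weight quantizer and override attributes on it;
# swapping the underlying implementation would follow the same
# subclass-and-override pattern.
class MyWeightQuant(Int8WeightPerTensorFloat):
    bit_width = 4  # example override of an injector attribute

layer = qnn.QuantLinear(10, 10, bias=False, weight_quant=MyWeightQuant)
```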

Hello, I have a use-case where I need to create two objects from the same class and pass them to one quantizer. In the standard dependencies tool that you use,...

This is the continuation of #351, but since the topic has changed a lot I decided to make this a new issue.

### Current state

I am at the state...

Example:

```python
from brevitas import nn as qnn

m1 = qnn.QuantIdentity()
m2 = qnn.QuantReLU(act_impl=m1.act_impl)
```

`m2` should be relu + `m1.act_impl`, but instead it's just `m1.act_impl`. Workaround to get the...

bug

Dear author, I would like to confirm one thing with you. When computing the convolution, does Brevitas's quantization layer use the dequantized (floating-point) values? Shouldn't...
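For context on the question above: quantization-aware training frameworks generally simulate quantization by rounding values to an integer grid and immediately dequantizing them back to float, so the convolution itself is computed on floating-point values. A minimal standalone sketch of that quantize-dequantize step in plain PyTorch (not Brevitas internals):

```python
import torch

w = torch.randn(8)
scale = w.abs().max() / 127                      # assumed symmetric int8 scale
w_int = torch.clamp(torch.round(w / scale), -128, 127)
w_fake_quant = w_int * scale                     # float values on the integer grid
# A conv/linear layer would then multiply-accumulate with w_fake_quant in float.
print(w_fake_quant)
```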

Dear author, I ran into a problem when trying to use DPUv1Manager.export and DPUv2Manager.export to export an ONNX file. The exported file is shown below. `from torch import nn import...
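For reference, a sketch of the manager-style export call; the import path, the model contents, and the exact `export` arguments below are assumptions that may differ between Brevitas versions:

```python
import torch
import brevitas.nn as qnn
from brevitas.export import DPUv1Manager  # import path assumed; varies by version

# Toy quantized model purely for illustration.
model = torch.nn.Sequential(
    qnn.QuantConv2d(3, 8, kernel_size=3, weight_bit_width=8),
    qnn.QuantReLU(bit_width=8),
)

# Manager-style ONNX export; signature assumed to mirror the documented
# FINNManager.export(module, input_shape=..., export_path=...) call.
DPUv1Manager.export(model, input_shape=(1, 3, 32, 32), export_path='model_dpuv1.onnx')
```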

Papers:
- Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization (https://arxiv.org/abs/1902.01917)
- Up or Down? Adaptive Rounding for Post-Training Quantization (http://proceedings.mlr.press/v119/nagel20a/nagel20a.pdf)
- Post training 4-bit...