brevitas issues

Results 214 brevitas issues

Sort by recently updated

Feat (quant-channel-splitting): support channel splitting after quantization

Until now, channel splitting was supported only before quantization. This PR adds support for applying channel splitting after basic quantization of a model.

fabianandresgrob

Update signature check

https://github.com/Xilinx/brevitas/blob/200456825f3b4b8db414f2b25b64311f82d3991a/src/brevitas/nn/equalized_layer.py#L19 Has to be done similarly to: https://github.com/Xilinx/brevitas/blob/200456825f3b4b8db414f2b25b64311f82d3991a/src/brevitas/graph/equalize.py#L639

Giuseppe5

Feat (gpfq): separate float and quant forward pass for speedup

Speeding up GPFQ with separate forward passes for quantized and float input. I avoided offloading the float input to disc and instead saved them under an attribute for GPFQ and...

fabianandresgrob

update dependencies=2.0.1 requirement

I'm trying to package `brevitas` on conda-forge and the requirement `dependencies==2.0.1` is an issue as it's old. Is it possible to update this requirement to `7.7.0`?

isuruf

List of all the avalable quantization strategies

Hi, First of all congratulation for the amazing work that you have done so far re-implementing all the Tensor operations in the quantized domain. I saw that the documentation is...

balditommaso

Feat (tests): extended minifloat unit tests

Adds the following changes in reference to Issue #975: - Updated type hints for `get_max_available_float` - Expanded `test_max_value` for FNUZ minifloat format and for minifloats with custom exponent bit-widths -...

alexredd99

Various SDXL quantization fixes

Adds the following fixes / enhancements: - [x] Set the device correct when checkpoints are loaded - [x] Fix issue setting the bitwidth for the output of linear layers when...

nickfraser

Missing minifloat testing

Expand our minifloat testing with the following: - Testing unscaled minifloat quantizers (https://github.com/Xilinx/brevitas/blob/df1a137aa089434e2bccb1cb87c9cb74c610d7a8/src/brevitas/quant/experimental/float_base.py#L30 and https://github.com/Xilinx/brevitas/blob/df1a137aa089434e2bccb1cb87c9cb74c610d7a8/src/brevitas/quant/experimental/float_base.py#L34) - Testing float proxies as done for int ones here https://github.com/Xilinx/brevitas/blob/dev/tests/brevitas/proxy/test_proxy.py - Add test for...

Giuseppe5

Feat: Support for Groupwise (MX) quantization

This implements: - New GroupwiseQuantTensor for Int and Float - Relevant Proxy classes - MX Float based quantizers - One notebook to test instantiation and execution Missing: - Export -...

Giuseppe5

Weights not quantized after using qnn.QuantConv2d layers for QAT

I am using a custom CNN architecture and trained my model using qnn.QuantConv2d layers for QAT. When I inspect the stored weights of model, to load it to hardware it...

Maya7991

brevitas
brevitas copied to clipboard

Metadata

Feat (quant-channel-splitting): support channel splitting after quantization

Update signature check

Feat (gpfq): separate float and quant forward pass for speedup

update dependencies=2.0.1 requirement

List of all the avalable quantization strategies

Feat (tests): extended minifloat unit tests

Various SDXL quantization fixes

Missing minifloat testing

Feat: Support for Groupwise (MX) quantization

Weights not quantized after using qnn.QuantConv2d layers for QAT

← Metadata

Owner

Metadata

brevitas brevitas copied to clipboard

Metadata

← Metadata

Owner

Metadata

brevitas
brevitas copied to clipboard