compute-engine issues

DoReFa quantizer with higher number of MACs/Ops, Grouped convs as custom ops on LCE 0.7.0

3

Hello, I have a couple of questions regarding quantizer options for Larq and LCE. I am designing a BNN using the DoReFa quantizer, however, I noticed a very high number...

lluevano

`convert_keras_model()` does not work as expected for BinaryDenseNet37 Dilated and XNORNet

1

I tried using python 3.6 + LCE 0.6.2 and python 3.7/3.8 + LCE 0.7.0 to run the following code, and the tflite file generated has unexpected sizes: For python 3.6...

ZhanqiuHu

bug

LCEInterpreter and converter design

1

The LCE interpreter from `lce.testing.Interpreter` is a standalone class and exposes different properties of the quantized model (scale and zero-point for example). the converter on the other hand is built...

simonmaurer

add binary fully connected operator

4

Binary fully connected operator is in essence doing binary matrix matrix multiplication (BGemm). Assume that the input is M × N , the weight is N×K (M is the batch...

arashb

feature

Add support for bitpacked activations in optimized ARM32 kernels

@honglh added optimized kernels for ARM32 in #432. It would be great if we could add support for bitpacked activations to them to match the AArch64 optimized bgemm kernels.

lgeiger

feature

good first issue

Add support for unipolar bconv2d inputs.

5

*This is very much a draft PR.* ## What do these changes do? This is a converter-only change that adds support for binary convolutions with unipolar inputs -- all credit...

AdamHillier

feature

Warn users when converting models which can't be optimally accelerated

1

We should work out a good way to raise warnings for models which won't convert in an 'optimal' way -- essentially any eggregious violation of our [model optimisation guide](https://docs.larq.dev/compute-engine/model_optimization_guide/). It's...

AdamHillier

Add MLIR verifier to BConv op

Since #384 we can make use of the [IR verification](https://mlir.llvm.org/docs/OpDefinitions/#custom-verifier-code) features of MLIR. Our BConv op has a few parameter combinations where [it can throw during `Init` or `Prepare`]( https://github.com/larq/compute-engine/blob/master/larq_compute_engine/tflite/kernels/bconv2d.cc)....

lgeiger

good first issue

internal-improvement

Int8 default ranges break when a bconv is followed by a normal conv.

5

## Observed behaviour When converting this model... ```python model = tf.keras.models.Sequential([ tf.keras.Input((32, 32, 3)), lq.layers.QuantConv2D( 32, (3, 3), input_quantizer="ste_sign", kernel_quantizer="ste_sign", padding="same", pad_values=1.0, use_bias=False ), tf.keras.layers.Conv2D(32, (3, 3)), ]) converted_model =...

AdamHillier

bug

Creating LCE compatible TFLite iOS apps

2

TFLite provides a [collection of example mobile apps](https://www.tensorflow.org/lite/examples) for specific DL applications. We already have a setup in place to build LCE Lite AAR to support Android (see #238) ....

arashb

good first issue

compute-engine
compute-engine copied to clipboard

Metadata

DoReFa quantizer with higher number of MACs/Ops, Grouped convs as custom ops on LCE 0.7.0

`convert_keras_model()` does not work as expected for BinaryDenseNet37 Dilated and XNORNet

LCEInterpreter and converter design

add binary fully connected operator

Add support for bitpacked activations in optimized ARM32 kernels

Add support for unipolar bconv2d inputs.

Warn users when converting models which can't be optimally accelerated

Add MLIR verifier to BConv op

Int8 default ranges break when a bconv is followed by a normal conv.

Creating LCE compatible TFLite iOS apps

← Metadata

Owner

Metadata

compute-engine compute-engine copied to clipboard

Metadata

← Metadata

Owner

Metadata

compute-engine
compute-engine copied to clipboard