tflite-android-transformers icon indicating copy to clipboard operation
tflite-android-transformers copied to clipboard

Support BERT finetuned on SQuAD

Open Pierrci opened this issue 4 years ago • 3 comments

Since the tokenizer is the same than MobileBERT/DistilBERT, would be pretty straightforward to add once this TensorFlow issue is solved: https://github.com/tensorflow/tensorflow/issues/34210

Pierrci avatar Nov 18 '19 17:11 Pierrci

@Pierrci is this closing now as 34210 is closed?

ucalyptus avatar Jan 03 '20 18:01 ucalyptus

The non-quantized TFLite version is around 1GB so way too big for a mobile app. I'll close this once the FP16 quantization works so we can use a model with a reduced size and good performance, but it's not the case for now (at least when I tried last Friday)

Pierrci avatar Jan 06 '20 15:01 Pierrci

Hi @Pierrci, were you able to quantize the BERT for TFLite? I tried a few options but failed to get the quantized model.

csarron avatar Mar 24 '20 22:03 csarron