
Quantization-aware training in TensorFlow Model Optimization

Open ardeal opened this issue 2 years ago • 3 comments

Hi,

I found post_training_quant in Model Optimization, but I didn't see a quantization-aware training tutorial or examples.

Is there quantization-aware training in Model Optimization, or in TensorFlow more broadly? Could you please share a link to the quantization-aware training documentation?

ardeal avatar Oct 09 '23 01:10 ardeal

Hi @ardeal, here are some links that might be helpful to start:

  • QAT: https://www.tensorflow.org/model_optimization/guide/quantization/training
  • Example: https://www.tensorflow.org/model_optimization/guide/quantization/training_example

Please take a look :)

tucan9389 avatar Oct 10 '23 06:10 tucan9389

Hi @tucan9389, many thanks for your reply!

I have already found those links you mentioned. The further questions are:

  1. If I apply q_aware_model = quantize_model(model), will the BatchNormalization layers be quantized?
  2. Where and how can I check which layers are quantized, and how can I configure which layers should (or should not) be quantized? Are there any examples of such configuration?
  3. I am using the following code for QAT. Am I using GradientTape correctly?
import tensorflow as tf
import tensorflow_model_optimization as tfmot

quantize_model = tfmot.quantization.keras.quantize_model
q_aware_model = quantize_model(model)

with tf.device('/GPU:0'):
    with tf.GradientTape() as tape:
        pred_score, pred_geo = q_aware_model(img)
        classify_loss, angle_loss, iou_loss, loss = loss_tf(
            gt_score, pred_score, gt_geo, pred_geo, ignored_map)

    # Compute and apply gradients outside the tape context.
    gradients = tape.gradient(loss, q_aware_model.trainable_variables)
    optimizer.apply_gradients(zip(gradients, q_aware_model.trainable_variables))

ardeal avatar Oct 10 '23 06:10 ardeal

@ardeal

Thanks for asking :-)

  • Q1. Typically yes, but I recommend checking yourself whether the FakeQuant nodes exist. As far as I know, if a ReLU follows a BatchNormalization, the BatchNormalization won't be quantized on its own. You can check the allowlisted layers here.
  • Q2-1. I believe you can inspect the FakeQuant nodes via TensorBoard's graph visualization.
  • Q2-2. Please check this link for a detailed example and guide: https://www.tensorflow.org/model_optimization/guide/quantization/training_comprehensive_guide
  • Q3. Once you can produce the quantized model (typically a quantized TFLite model) and it reaches the expected accuracy, your usage is correct.

Please let me know if you encounter additional questions.

tucan9389 avatar Oct 13 '23 02:10 tucan9389

@ardeal

I'll close this issue. Please let me know if you have additional questions :)

tucan9389 avatar Apr 01 '24 01:04 tucan9389