tflite-micro Dynamic range quantization support in TFLM

Dynamic range quantization support in TFLM

Open Doomski99 opened this issue 1 year ago • 2 comments

Hello,

I'm having trouble testing Dynamically quantized models on my DSP simulator. I figured it might be not supported on TFLM. I spent much time searching for an official statement about Dynamic Quantization support for TFLM to no avail. The PTQ page of tensorflow summarizes the different quantization options with recommended hardware. But does that also entail "supported hardware" too?

Can I get an official statement about DQ support on TFLM? Does it depend on the chip maker?

Thanks in advance.

Apr 16 '24 13:04 Doomski99

You are correct that TFLM does not currently support dynamic range quantization. This has primarily been because the microcontrollers we have supported lacked hardware floating point acceleration, so inference performance with DRQ would be poor. We've generally recommended full integer quantization either using PTQ or QAT.

I'm leaving this open for myself to add some documentation to highlight this requirement.

Apr 16 '24 15:04 rascani

"This issue is being marked as stale due to inactivity. Remove label or comment to prevent closure in 5 days."

May 12 '24 10:05 github-actions[bot]

"This issue is being closed because it has been marked as stale for 5 days with no further activity."

May 17 '24 10:05 github-actions[bot]

tflite-micro tflite-micro copied to clipboard

Dynamic range quantization support in TFLM

tflite-micro
tflite-micro copied to clipboard