PiDTLN
Quantized models are slower than the original models
Hello, when I run inference with the quantized models in a C++ project, I find they are slower than the original float32 models. Why is that?
A C++ project? Did you rewrite the function "def callback(indata, outdata, frames, buftime, status)" in C++?
When run with the tflite runtime, the quantised models are much faster on the intended Pi hardware.
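To check this on your own hardware, it may help to time both models the same way. Below is a minimal sketch of a timing harness, not code from this repo: `benchmark` and `dummy_model` are hypothetical names, and `dummy_model` is only a stand-in workload. In practice you would pass a function that runs one frame through your interpreter for each model and compare the numbers.

```python
import statistics
import time


def benchmark(run_inference, n_warmup=10, n_runs=100):
    """Time a zero-argument callable; return (median_ms, mean_ms)."""
    # Warm-up runs are not timed, so one-off setup costs do not skew the result.
    for _ in range(n_warmup):
        run_inference()

    timings_ms = []
    for _ in range(n_runs):
        start = time.perf_counter()
        run_inference()
        timings_ms.append((time.perf_counter() - start) * 1000.0)

    return statistics.median(timings_ms), statistics.mean(timings_ms)


def dummy_model():
    # Hypothetical stand-in for one inference call on a single audio frame.
    sum(i * i for i in range(1000))


median_ms, mean_ms = benchmark(dummy_model)
print(f"median {median_ms:.3f} ms, mean {mean_ms:.3f} ms")
```

Running the same harness on the float32 and the quantised model, on the same machine, should show whether the slowdown comes from the model itself or from the platform it is running on.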