esp-dsp icon indicating copy to clipboard operation
esp-dsp copied to clipboard

No optimized dsps_bit_rev_sc16 resulting slow sc16 S3 FFT performance overall (DSP-98)

Open ayavilevich opened this issue 1 year ago • 1 comments

It looks like there is just an ansi implementation of dsps_bit_rev_sc16 at this time. On the other hand, there is an optimized implementation of the dsps_bit_rev2r_fc32 but the fc32 FFT is not accelerated where the sc16 one is.

Considering the optimized dsps_fft2r_sc16 on the S3, it is a blocker for any effective, fast and fixed point FFT on the S3 to not have an optimized bit reversal function as well. Currently, if the first step of the FFT takes time T then just the final bit reversal (ansi) will take 2T.

Please consider adding. I am not sure about the effort, but maybe it is similar to the floating point version with just type changes?!

P.S. similar issue with the dsps_cplx2real_sc16, but lower priority.

ayavilevich avatar Feb 01 '23 07:02 ayavilevich