Nuo Mi
Nuo Mi
Did you check the hevc idct checkasm output? is it aligned with your result? thank you > vvc_inv_dct2_2_c: 16.0 > vvc_inv_dct2_2_avx2: 16.2 > vvc_inv_dct2_4_c: 19.7 > vvc_inv_dct2_4_avx2: 17.0 > vvc_inv_dct2_8_c:...
> but for now the 1D optimisations work alone and they provide the backbone needed for any future optimisation. how about dav1d, it has similar 1d function. or 2d only
>dav1d has over 10,000 lines of inverse transform assembly for AVX2 alone! It was worth it. dav1d is most fast decoder in we see so far. and the current vvc...
> I will look into this 👍, we can start with 2x2 or 4x4 block. zero the entire block and set the fireset coeff to 1 > some internal data...
Any machine support avx2 will have ssse3. Due to data length reasons. use avx2 may not get benefits. It's not a fault to use ssse3 in this case
great! you can check https://github.com/ffvvc/FFmpeg/blob/main/.github/workflows/makefile.yml for configuration and bitstream download. thank you
@light2802 is everything smooth? thank you
the code is not merged by ffmpeg yet. could you help send pr to https://github.com/ffvvc/FFmpeg instead? thank you
remove low priority since #64
Great! thank you.