Add AVX2 assembly code for inter predict
please add AVX for VVCInterDSPContext
| function | 8 bits | 10 bits | 12 bits |
|---|---|---|---|
| put | :white_check_mark: | :white_check_mark: | :white_check_mark: |
| put_uni | :white_check_mark: | :white_check_mark: | :white_check_mark: |
| put_uni_w | :white_large_square: | :white_large_square: | :white_large_square: |
| avg | :white_check_mark: | :white_check_mark: | :white_check_mark: |
| avg_w | :white_check_mark: | :white_check_mark: | :white_check_mark: |
| put_ciip | :white_large_square: | :white_large_square: | :white_large_square: |
| put_gpm | :white_large_square: | :white_large_square: | :white_large_square: |
| fetch_samples | :white_large_square: | :white_large_square: | :white_large_square: |
| bdof_fetch_samples | :white_large_square: | :white_large_square: | :white_large_square: |
| apply_prof | :white_large_square: | :white_large_square: | :white_large_square: |
| apply_prof_uni | :white_large_square: | :white_large_square: | :white_large_square: |
| apply_prof_uni_w | :white_large_square: | :white_large_square: | :white_large_square: |
| apply_bdof | :white_check_mark: | :white_check_mark: | :white_check_mark: |
| sad | :white_check_mark: | :white_check_mark: | :white_check_mark: |
| dmvr | :white_check_mark: | :white_check_mark: | :white_check_mark: |
please add functions based on usage 11.96% ffmpeg_g [.] put_vvc_luma_hv_10 5.88% ffmpeg_g [.] alf_get_coeff_and_clip_10 5.25% ffmpeg_g [.] ff_vvc_inv_dct2_64 4.30% [kernel] [k] __lock_text_start 4.22% ffmpeg_g [.] ff_vvc_alf_filter_luma_w16_16bpc_avx2 3.46% ffmpeg_g [.] put_vvc_luma_bi_hv_10 3.45% ffmpeg_g [.] alf_filter_luma_vb_10 3.13% ffmpeg_g [.] vvc_loop_filter_luma_10 2.81% ffmpeg_g [.] lmcs_filter_luma_10 2.46% ffmpeg_g [.] put_vvc_luma_uni_hv_10 2.27% ffmpeg_g [.] put_vvc_chroma_hv_10 2.21% libc-2.31.so [.] 0x000000000018b733 2.05% libc-2.31.so [.] 0x000000000018bb41 1.95% ffmpeg_g [.] put_vvc_chroma_uni_hv_10 1.84% ffmpeg_g [.] put_vvc_chroma_bi_hv_10 1.81% ffmpeg_g [.] vvc_deblock_bs 1.41% ffmpeg_g [.] ff_vvc_predict_inter 1.25% libpthread-2.31.so [.] __pthread_mutex_lock 1.24% libpthread-2.31.so [.] __pthread_mutex_unlock 1.22% ffmpeg_g [.] ff_vvc_residual_coding 1.08% ffmpeg_g [.] alf_filter_cc_10 1.03% ffmpeg_g [.] apply_prof_uni_10 0.99% ffmpeg_g [.] ff_vvc_alf_filter 0.98% ffmpeg_g [.] ff_vvc_inv_dct2_32 0.94% ffmpeg_g [.] vvc_deblock_bs_luma_vertical 0.92% ffmpeg_g [.] add_residual_10