Cross-Modal-BERT audio data

audio data

Open lileiooo opened this issue 3 years ago • 1 comment

Hello, I used Wav2Vec2 to extract audio features. Each audio sample has dimensions (50, 512), so I changed the conv1d input size to 512, but the accuracy is always zero. Do you have any suggestions?

Nov 29 '21 04:11 lileiooo

Can you tell us more about your approach?

Oct 13 '23 08:10 pretbc
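
One thing that may be worth checking, given the (50, 512) feature shape described above, is the tensor layout passed to conv1d. The thread does not show the actual model code, so the following is only a minimal sketch under assumptions: it assumes PyTorch, assumes the features are batched as (batch, frames, feature_dim), and uses placeholder values for `out_channels` and `kernel_size`. PyTorch's `nn.Conv1d` expects input as (batch, channels, length), so features with 512 dimensions per frame would need the last two axes swapped before the convolution.

```python
import torch
import torch.nn as nn

# Random stand-in for a batch of Wav2Vec2 features (assumed layout):
# (batch, frames, feature_dim) = (4, 50, 512).
batch_size, num_frames, feat_dim = 4, 50, 512
features = torch.randn(batch_size, num_frames, feat_dim)

# in_channels matches the 512-dim features from the question.
# out_channels=128 and kernel_size=3 are placeholder values, not taken from the repo.
conv = nn.Conv1d(in_channels=feat_dim, out_channels=128, kernel_size=3, padding=1)

# nn.Conv1d expects (batch, channels, length), so move the 512-dim
# feature axis into the channel position before calling the layer.
x = features.transpose(1, 2)   # shape: (4, 512, 50)
out = conv(x)                  # shape: (4, 128, 50)

print(x.shape, out.shape)
```

If the layout is already correct, an accuracy of exactly zero may instead point to the label encoding or the metric computation, which cannot be judged from the information in this thread.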
