Cross-Modal-BERT

audio data

Open: lileiooo opened this issue 3 years ago • 1 comment

Hello, I used wav2vec 2.0 to extract audio features. The features for each audio clip have shape (50, 512). I changed the input size of the conv1d layer to 512, but the accuracy is always zero. Do you have any suggestions?

lileiooo, Nov 29 '21 04:11

Can you tell us more about your approach?

pretbc, Oct 13 '23 08:10
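
One thing worth ruling out for the setup described in the question is the axis order of the features going into the convolution. The sketch below assumes PyTorch and that the (50, 512) features are 50 time frames with 512 values each. The batch size, `out_channels`, and `kernel_size` are made-up values for illustration, not taken from the Cross-Modal-BERT code. It only shows the shape convention that `nn.Conv1d` expects and is not a diagnosis of the zero accuracy.

```python
import torch
import torch.nn as nn

# Hypothetical batch: 4 clips, 50 frames each, 512 features per frame,
# matching the (50, 512) per-clip shape mentioned in the question.
features = torch.randn(4, 50, 512)

# nn.Conv1d expects input as (batch, channels, length), so the 512-dim
# feature axis has to sit at position 1 before the convolution.
# out_channels and kernel_size are illustrative assumptions.
conv = nn.Conv1d(in_channels=512, out_channels=64, kernel_size=3, padding=1)

x = features.transpose(1, 2)  # (4, 50, 512) -> (4, 512, 50)
out = conv(x)                 # time length is preserved by padding=1

print(x.shape, out.shape)
```

If the shapes already line up like this, the next things to check would probably be the label encoding and how accuracy is computed, since an accuracy of exactly zero often points to a mismatch there rather than in the features.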