How MOSI's audio features are obtained

Open xiaoxinchaoren56 opened this issue 1 year ago • 0 comments

I noticed that the audio feature length of the MOSI dataset is 5. May I ask how the audio features are extracted for the MOSI dataset.

Sep 25 '24 10:09 xiaoxinchaoren56