Cross-Modal-BERT icon indicating copy to clipboard operation
Cross-Modal-BERT copied to clipboard

What does the feature dimension represent?

Open hxl523 opened this issue 2 years ago • 2 comments

Hi, when I run the code train_audio,valid_audio,test_audio= pickle.load(open('Cross-Modal-BERT-master/data/audio/MOSI_cmu_audio_CLS.pickle','rb'))

I look at the size of the train_audio , valid_audio and test_audio: train_audio.shape=(1284, 50, 5),valid_audio.shape=(229, 50, 5), test_audio.shape=(686, 50, 5)

1284,229,686 is their data size,But what do 5 and 50 mean?

Looking forward to your reply. Thanks.

hxl523 avatar Mar 11 '22 12:03 hxl523

50 is timesteps,5 is feature dimensions. You can get more infomation from https://github.com/A2Zadeh/CMU-MultimodalSDK/issues/172

Kaicheng-Yang0828 avatar Mar 14 '22 01:03 Kaicheng-Yang0828

It seems that link expire - could You tell more ?

pretbc avatar Oct 13 '23 08:10 pretbc