MM-DFN
MM-DFN copied to clipboard
About the processed data
I am currently examining the processed dataset you have shared in your code. In MELD_features_raw1.pkl, I have noted that the second section pertains to speaker information. Each identifier corresponds to a speech segment, and for each segment, there are as many vectors as there are sentences. However, I observed that each vector has a dimension of 9. Could you kindly clarify if the value 9 holds any particular significance? Thank you very much for your assistance.