Xinrong Zhang
Results
2
issues of
Xinrong Zhang
I have trained Multimodal-Infomax model on CMU MOSI dataset. But I do not know how to predict a piece of my self data(a video with audio). Can anyone help me?
Is there any tutorials on how to release my dataset? And what is the format of computational sequences of MOSI and MOSEI