NExT-QA
NExT-QA copied to clipboard
Some confusions about extract my own pre-computed video features
Thank you for excellent work!I want to extract my own NExT-QA features. As the data preparation section of redeame, I download the the highlighted NExTVideo. When I unzip it,I don't know how to divide the raw video into train set, valid set and test set. The VidOR website also didn't provide the test set.
hello, could you see all the videos are valid? I also download the raw videos but some of them are not valid, which means I could not open the video.
@dongfengxijian
hello, could you see all the videos are valid? I also download the raw videos but some of them are not valid, which means I could not open the video.
I met the same problem. But when I extract frame with the scikit-video, it did‘t report error!Maybe the player is the reason,
@dongfengxijian Hello, sorry to bother you again, could I ask if you could abstract the same appearance feature that the author provided? For example, for the validation set, I set the clip as 16 and I abstract the 8th or 9th(actually I compare all the frames in one clip) frame appearance vector of the first clip and compare the first frame vector of the provide appearance vector, both dimension are (2048,), but I find the appearance vector is different. how about yours?
I now might could answer you question at the beginning. You could separate the video directly from the train.csv val.csv test.csv to make the video split
@dongfengxijian Hello, sorry to bother you again, could I ask if you could abstract the same appearance feature that the author provided? For example, for the validation set, I set the clip as 16 and I abstract the 8th or 9th(actually I compare all the frames in one clip) frame appearance vector of the first clip and compare the first frame vector of the provide appearance vector, both dimension are (2048,), but I find the appearance vector is different. how about yours?
Should the appearance vector be same? I don't quite grasp your question
I now might could answer you question at the beginning. You could separate the video directly from the train.csv val.csv test.csv to make the video split
Thank your for your guidance!
@dongfengxijian , thanks for your answer! My question is: the appearance vector that I abstract is different from what the author provided, and how about you? Because we follow the same method, the appearance vector should be the same I think? because the weight of the resnext is fixed.
@dongfengxijian , thanks for your answer! My question is: the appearance vector that I abstract is different from what the author provided, and how about you? Because we follow the same method, the appearance vector should be the same I think? because the weight of the resnext is fixed.
I didn't use the original model, so I can't answer your question. I‘m sorry about that.
hello, could you see all the videos are valid? I also download the raw videos but some of them are not valid, which means I could not open the video.
I have the same issue