sherlockfeng1995

Results 1 issues of sherlockfeng1995

There is a doubt in this get data function: why only obtain one caption in a video ? def __getitem__(self, idx): out={} if self.is_train: video_idx,cap_idx=self.pair_idxs[idx] video_name=self.video_names[video_idx] mp_feature=self.mp_features[video_idx] sent=self.captions[cap_idx] cap_ids,cap_len=self.process_sent(sent,self.max_words_embedding) out['captions_ids']=cap_ids...