Video-guided-Machine-Translation icon indicating copy to clipboard operation
Video-guided-Machine-Translation copied to clipboard

[Suggestion] Support/Provide global video features

Open LividWo opened this issue 5 years ago • 0 comments

@eric-xw @zzxslp So far, each video is represented by a NumPy array of size (1, num_of_segments, 1024). Since many of the original videos are no longer available, would it be possible for you to provide a pooled/global feature for each video (size of [1, D])?

Such a pooled representation is widely used in image-guided NMT such as Multi30K, and I believe it will also benefit research in VMT.

LividWo avatar Sep 02 '20 08:09 LividWo