cyclical-visual-captioning
inference on own video
Hi,
I am writing code to apply video captioning models to a single input video. Can you show me how to run your model on a single input video or a single input image? Is there a demo I can follow step by step? I'm new to this area and am trying to understand what is required to test video captioning models.
Hi again,
Would you have a description of how to run this on my own video or set of videos, for inference only?
Thanks