VC-R-CNN
VC-R-CNN copied to clipboard
How can I use pretrained VC-R-CNN for inference on a specify image?
As the title said, I would like to know the way to use pretrained model to generate caption for a specify image.