fairseq-image-captioning

Transformer-based image captioning extension for pytorch/fairseq

Results 15 fairseq-image-captioning issues

I want to train a new model using raw images (without bounding boxes) and their attached captions. Is there any way to do that?

Inspired by coordinate encoding done in https://arxiv.org/abs/2003.08934.
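
For readers unfamiliar with the referenced encoding, here is a minimal sketch of the sinusoidal coordinate encoding described in that paper, which maps a normalized scalar p to (sin(2^0 π p), cos(2^0 π p), ..., sin(2^(L-1) π p), cos(2^(L-1) π p)). The issue does not say how the encoding is applied in this project, so applying it to a normalized bounding box is an assumption, and the names `encode_coordinate`, `encode_box`, and `num_freqs` are hypothetical.

```python
import math


def encode_coordinate(p, num_freqs=4):
    """Encode one normalized coordinate with sin/cos at doubling frequencies.

    Returns 2 * num_freqs values: [sin(2^0*pi*p), cos(2^0*pi*p), ...].
    """
    features = []
    for k in range(num_freqs):
        angle = (2.0 ** k) * math.pi * p
        features.append(math.sin(angle))
        features.append(math.cos(angle))
    return features


def encode_box(box, num_freqs=4):
    """Encode each coordinate of a box and concatenate the results.

    Assumption: `box` is (x_min, y_min, x_max, y_max), already normalized
    by image width and height. This layout is illustrative only.
    """
    encoded = []
    for coordinate in box:
        encoded.extend(encode_coordinate(coordinate, num_freqs))
    return encoded


if __name__ == "__main__":
    features = encode_box((0.1, 0.2, 0.5, 0.9), num_freqs=4)
    print(len(features))
```

With four coordinates and `num_freqs=4`, each box becomes a vector of 4 × 2 × 4 = 32 values, which could then be concatenated with or added to the region features before the encoder.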

https://arxiv.org/abs/1912.08226

![image](https://user-images.githubusercontent.com/44115671/71583036-ab9bac80-2b47-11ea-89e9-6ca0267b183a.png)