Image-Captioning
Image-Captioning copied to clipboard
Implemented 3 different architectures to tackle the Image Caption problem, i.e, Merged Encoder-Decoder - Bahdanau Attention - Transformers
Image Captioning Overview
In recent years, neural networks have fueled dramatic advances in image captioning. Researchers are looking for more challenging applications for computer vision and Sequence to Sequence modeling systems. They seek to describe the world in human terms. I have implemented three different architectures from simple Encoder Decoders to Transformers with Multi-Head Attention.
Datasets :-