Project Request

Video Captioning with Deep Learning The Video Captioning with Deep Learning project focuses on developing a model that automatically generates descriptive captions for videos.

Field	Computer Vision, OpenCV, and NLP
About	Video captioning using deep learning
Github	aman-kumar29
Email	[email protected]
Label	Project Request

https://github.com/aman-kumar29

Define You

[x] GSSOC Participant
[ ] Contributor

Video Captioning with Deep Learning

Description

The project involves training a deep learning model on a labeled dataset of videos paired with corresponding captions. The model will learn to understand the visual content and temporal dynamics of videos and generate meaningful captions that describe the video content accurately. The project will also include the development of a user interface for real-time video caption generation and evaluation of the model's performance.

Scope

Objectives

Real-Time Caption Generation: Develop a user interface where users can upload videos, and the model generates captions in real time, providing a time-aligned description of the video content.
Content Discovery and Recommendation: Video captioning models can be integrated into video recommendation systems, enhancing personalized video recommendations based on user preferences and interests.

Deliverables

Will give a trained image captioning model which should be able to take an input video and generate a relevant and contextually appropriate caption that accurately describes the visual content.
A user-friendly interface to interact with the model
comprehensive documentation will be provided, including technical documentation and user guides which will clearly describe how to use the interface and how to generate the caption.

Timeline

Start Date: when assigned End Date: 10 August

Video Links or Support Links

https://www.microsoft.com/en-us/research/project/msr-vdc-iccv-2013-video-to-text-challenge/ for the dataset. There is also ActivityNet Captions dataset.
Also some research papers would be helpful in choosing and changing the architecture of the model

May 21 '23 15:05 aman-kumar29

Hello, I'm GSSoC'23 Contributor. Please assign me this issue. I want to work on this and contribute to this.

May 21 '23 18:05 reshma045

@reshma045 hello! would you also like to work on this?

Jul 11 '23 07:07 aman-kumar29

World-of-AI
World-of-AI copied to clipboard

[ML category based PROJECT PROPOSAL]

Project Request

https://github.com/aman-kumar29

Video Captioning with Deep Learning

Description

Scope

Objectives

Deliverables

Timeline

Video Links or Support Links

World-of-AI World-of-AI copied to clipboard

[ML category based PROJECT PROPOSAL]

Project Request

https://github.com/aman-kumar29

Video Captioning with Deep Learning

Description

Scope

Objectives

Deliverables

Timeline

Video Links or Support Links

World-of-AI
World-of-AI copied to clipboard