Deep Learning for Image Captioning

This capstone project proposes image and video captioning using deep learning. The student will implement a system that can automatically generate captions for images and videos by learning from sample data. This goes beyond existing models that only generate labels or basic captions, and will integrate image recognition with a model to predict emojis, stickers, and more descriptive captions. The project aims to fill research gaps in saving user time for captioning and providing more intelligence to automatic systems. Expected outcomes are automatic caption generation from images and videos, and predicting emojis and stickers.

Uploaded by

shivam5singh-25

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

151 views2 pages

Deep Learning for Image Captioning

Uploaded by

shivam5singh-25

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

CAPSTONE PROJECT

 TOPIC :- Image Captioning Using Deep Learning

 DESCRIPTION :-
In this project, we implemented an image and video captioning system, which
automatically generates informative captions, based on the learning from sample images,
used to train the program.

In this project we are using the concept of "Image recognition” to extract the image
features and to classify them. Previous attempts of “Image recognition” were only useful
in generating labels or creating basic captions example: “like for an image where a dog is
sitting on the floor it will generate simply about the object in the image and its position”,
which poses no outcome or any insights beneficial for humans.

Our Aim is to use this extracted features and integrate it with a model, which will predict
and generate “emojis”, “stickers” and captions for the given input. This model can be
used as an application for social media platforms where infinite numbers of images are
uploaded.

The source of dataset for this project is from “Labelme” and “Google Open Images”, which
contains millions of images (annotated and labeled). The vast range of dataset available
will allow us to analyze the data more easily, and will help in testing the data more
accurately. In future if required, more such sources for datasets can be explored like
“Kaggle” and other repositories.

 NATURE:- General

 What novelty do you see in the proposed research/ project work by the student?

1. Existing models of this topic generates only labels but in this project, integrated systems
of labels and auto generated caption form those labels, is one of the added feature
2. The real time image form the camera can also be utilized for the above mentioned
purposes.
3. All the possible outcomes will be done in the existing real time
 Is it feasible to carry out the proposed work with the facilities available in home? If yes,
Please mention how the project/ research work shall be carried out.

Yes, absolutely the proposed work can easily be carried out with the facilities available at
home and it would be quite feasible and in order to carry out such project/research the
knowledge of the following subjects is required.

1. Python :-
o OpenCV : used to analysis the Image .
o Numpy : used to handle the image array.
o Scipy : used for mathematical operation.
o Matplotlib : API used to plot .
o Tensorflow : used to implement standard machine learning algorithm .
2. Algorithm :-
o CNN( Convolution Neural Network)
o RNN( Recurrent Neural Network)
o Classification based algorithm
Programming Languages - Python,R,
Platforms - OpenCV,Tensor Flow,Jupyter Notebook,Anaconda,R studio
Libraries - Keras,Numpy,Scipy

 Mention the research gap that the proposed/research work intends to fill.

1. To save user time for the caption of image.

2. To providing the intelligence to the system.
3. Integrated system of Image processing and caption generation

 What are the expected research/project outcomes from this proposal submitted by the
students?

1. To generate automatic captions from the given input image.

2. To generate automatic captions from the given input Video.
3. To predict the “emojis” and “stickers” for the given input.

AI Image Captioning System Development
No ratings yet
AI Image Captioning System Development
25 pages
Title Aproval Page
No ratings yet
Title Aproval Page
1 page
Document From Deependra Singh
No ratings yet
Document From Deependra Singh
10 pages
Caption Credits
No ratings yet
Caption Credits
25 pages
Deep Learning Image Captioning
No ratings yet
Deep Learning Image Captioning
6 pages
Internship Report (Sanjay Final)
No ratings yet
Internship Report (Sanjay Final)
45 pages
Image Captioning Using Deep Learning Mait
No ratings yet
Image Captioning Using Deep Learning Mait
8 pages
Image Captioning with Deep Learning
100% (1)
Image Captioning with Deep Learning
5 pages
15 Report PDF
No ratings yet
15 Report PDF
35 pages
Image Caption Genrator Report
No ratings yet
Image Caption Genrator Report
45 pages
Mini Project Fln..
No ratings yet
Mini Project Fln..
51 pages
Project I - Image Captioning With Deep Learning
No ratings yet
Project I - Image Captioning With Deep Learning
3 pages
Voice-Based Image Information Retrieval
No ratings yet
Voice-Based Image Information Retrieval
5 pages
Autonomous Image Captioning Project
No ratings yet
Autonomous Image Captioning Project
35 pages
Report Contents Image Caption Generation-1
No ratings yet
Report Contents Image Caption Generation-1
42 pages
Image Detection with ChatGPT Vision
No ratings yet
Image Detection with ChatGPT Vision
27 pages
Mini Project Final
No ratings yet
Mini Project Final
27 pages
Image Captioning with CNN and LSTM
No ratings yet
Image Captioning with CNN and LSTM
56 pages
Automatic Image Captioning Proposal
No ratings yet
Automatic Image Captioning Proposal
3 pages
Mini Project Report Corrected
No ratings yet
Mini Project Report Corrected
44 pages
Project
No ratings yet
Project
10 pages
Image Caption Generator Project Report
No ratings yet
Image Caption Generator Project Report
27 pages
FYP CSEB Batch37 First Review (Final)
No ratings yet
FYP CSEB Batch37 First Review (Final)
13 pages
Sample Project doc-REC
No ratings yet
Sample Project doc-REC
66 pages
New PDF
No ratings yet
New PDF
48 pages
Cherukuri Varalakshmi-2
No ratings yet
Cherukuri Varalakshmi-2
21 pages
Final Year Project Proposal
No ratings yet
Final Year Project Proposal
3 pages
Building A Voice Based Image Caption Generator With Deep Learning
No ratings yet
Building A Voice Based Image Caption Generator With Deep Learning
6 pages
Mini Project Report
No ratings yet
Mini Project Report
31 pages
4 2final
No ratings yet
4 2final
34 pages
Final - Done (1) 2.0
No ratings yet
Final - Done (1) 2.0
16 pages
Poster 2
No ratings yet
Poster 2
1 page
Image Caption Generator Project Report
No ratings yet
Image Caption Generator Project Report
15 pages
Image Captioning Deep Learning Project
No ratings yet
Image Captioning Deep Learning Project
1 page
Image to Cartoon Conversion Project
No ratings yet
Image to Cartoon Conversion Project
39 pages
Image Captioning with PyTorch Guide
No ratings yet
Image Captioning with PyTorch Guide
3 pages
Image Caption Generator Report
No ratings yet
Image Caption Generator Report
27 pages
Image Caption
No ratings yet
Image Caption
16 pages
Report 1
No ratings yet
Report 1
34 pages
Streamlined Image Captioning Project
No ratings yet
Streamlined Image Captioning Project
3 pages
Prepare in Advance - 20250505 - 200620 - 0000 PDF
No ratings yet
Prepare in Advance - 20250505 - 200620 - 0000 PDF
1 page
Image Captioning Generator Using CNN and LSTM
No ratings yet
Image Captioning Generator Using CNN and LSTM
8 pages
Image Caption Generator Project Report
No ratings yet
Image Caption Generator Project Report
39 pages
AI Mini Project
No ratings yet
AI Mini Project
22 pages
Image Captioning and AI Projects Overview
No ratings yet
Image Captioning and AI Projects Overview
6 pages
Piyush FINAL2 Merged Removed
No ratings yet
Piyush FINAL2 Merged Removed
39 pages
Final Year Project Report
No ratings yet
Final Year Project Report
52 pages
Generative AI for EdTech Interns
No ratings yet
Generative AI for EdTech Interns
23 pages
IGNOU BCA Project Synopsis Handwritten Digit Recognition
No ratings yet
IGNOU BCA Project Synopsis Handwritten Digit Recognition
2 pages
Minor
No ratings yet
Minor
14 pages
Project Synopsis
No ratings yet
Project Synopsis
2 pages
Image Caption Generator PCL
No ratings yet
Image Caption Generator PCL
19 pages
RP Springer
No ratings yet
RP Springer
10 pages
Project Report
No ratings yet
Project Report
53 pages
Animal Image Recognition System
No ratings yet
Animal Image Recognition System
2 pages
Synopsis New 1
No ratings yet
Synopsis New 1
16 pages
Deep Learning for Image Captioning
No ratings yet
Deep Learning for Image Captioning
6 pages
Machine Learning for Object Captioning
No ratings yet
Machine Learning for Object Captioning
45 pages
Cartoonify Images with OpenCV Python
No ratings yet
Cartoonify Images with OpenCV Python
4 pages
Tissue Processors
No ratings yet
Tissue Processors
13 pages
LAW 546 Assignment Guidelines 2021
No ratings yet
LAW 546 Assignment Guidelines 2021
2 pages
Cyber Violence Against Women in India
No ratings yet
Cyber Violence Against Women in India
22 pages
Veterinary Oncology
No ratings yet
Veterinary Oncology
12 pages
Analyzing Wounds: Causes of Death
No ratings yet
Analyzing Wounds: Causes of Death
35 pages
Vol. (18) No. (1) Jan 2020: Suicidal, Homicidal and Accidental Hanging
No ratings yet
Vol. (18) No. (1) Jan 2020: Suicidal, Homicidal and Accidental Hanging
1 page
PDS as Social Safety Net in Allahabad
No ratings yet
PDS as Social Safety Net in Allahabad
169 pages
Jamia Millia Islamia: Course Teacher: Miss. AAKRITI MATHUR Signature of Teacher
No ratings yet
Jamia Millia Islamia: Course Teacher: Miss. AAKRITI MATHUR Signature of Teacher
13 pages
Jytyd
No ratings yet
Jytyd
2 pages
Citizens' Grievances in India: Redressal Mechanism
No ratings yet
Citizens' Grievances in India: Redressal Mechanism
33 pages
PDS Reforms in Sonbhadra
No ratings yet
PDS Reforms in Sonbhadra
14 pages
CA 2 Rubrics
No ratings yet
CA 2 Rubrics
3 pages
Compensation for Rape Victims in India
No ratings yet
Compensation for Rape Victims in India
27 pages
Cyber Violence Against Indian Women
No ratings yet
Cyber Violence Against Indian Women
17 pages
Cyber Victimization of Women in India
No ratings yet
Cyber Victimization of Women in India
11 pages
Sneha Mohanty Marital Rapepaper 1
No ratings yet
Sneha Mohanty Marital Rapepaper 1
10 pages
Child Abuse Issues in India
No ratings yet
Child Abuse Issues in India
15 pages
Amity Law School, Lucknow
No ratings yet
Amity Law School, Lucknow
9 pages
Duties To The Clients: & Casestudy of Shambhu Ram Yadav V. Hanuman Das Khatry
No ratings yet
Duties To The Clients: & Casestudy of Shambhu Ram Yadav V. Hanuman Das Khatry
23 pages
Excel 2010 Basics: User Interface & Navigation
No ratings yet
Excel 2010 Basics: User Interface & Navigation
86 pages
Lu 2023
No ratings yet
Lu 2023
20 pages
Mib Guide
No ratings yet
Mib Guide
191 pages
Design of Fly Wheel (Lecture-05)
No ratings yet
Design of Fly Wheel (Lecture-05)
36 pages
Essential Travel Items Guide
No ratings yet
Essential Travel Items Guide
3 pages
Anil Jaiswal: Contact Email Linkedin Address
No ratings yet
Anil Jaiswal: Contact Email Linkedin Address
3 pages
L860-GL Installation Guideline V1.0.0
No ratings yet
L860-GL Installation Guideline V1.0.0
15 pages
Naïve Bayes Classifier Tutorial Guide
No ratings yet
Naïve Bayes Classifier Tutorial Guide
23 pages
Rru 2238
No ratings yet
Rru 2238
24 pages
DPI Technology Large Format Media Prices
No ratings yet
DPI Technology Large Format Media Prices
3 pages
Experiment 6 - 2X1 Multiplier
No ratings yet
Experiment 6 - 2X1 Multiplier
4 pages
CA Deliver - Ref - ENU
No ratings yet
CA Deliver - Ref - ENU
365 pages
Web Design For Beginners
100% (2)
Web Design For Beginners
180 pages
Digital Counter Lab with Display Design
No ratings yet
Digital Counter Lab with Display Design
3 pages
Helical Vertical Axis Wind Turbine Report
No ratings yet
Helical Vertical Axis Wind Turbine Report
9 pages
Request for PGCIL Structural Steel Approval
No ratings yet
Request for PGCIL Structural Steel Approval
3 pages
DB2 Multiple Choice Questions
100% (3)
DB2 Multiple Choice Questions
16 pages
Week 6 - Impact of E Commerce
No ratings yet
Week 6 - Impact of E Commerce
11 pages
Advanced Actuators for Aerospace & Energy
No ratings yet
Advanced Actuators for Aerospace & Energy
2 pages
Disha MP REPORT
No ratings yet
Disha MP REPORT
6 pages
Electronic Journals Metadata Guide
No ratings yet
Electronic Journals Metadata Guide
2 pages
2020 Grid-Forming Inverters A Critical Asset For The Power Grid
No ratings yet
2020 Grid-Forming Inverters A Critical Asset For The Power Grid
11 pages
Assignment Cs 210
No ratings yet
Assignment Cs 210
4 pages
C++ Programming Lab Guide for MSc IT
No ratings yet
C++ Programming Lab Guide for MSc IT
45 pages
E-commerce Impact Questionnaire for Managers
No ratings yet
E-commerce Impact Questionnaire for Managers
6 pages
Excon 2022 Machine Status Overview
No ratings yet
Excon 2022 Machine Status Overview
19 pages
Changelog
No ratings yet
Changelog
4 pages
It (402) - 5Th Week Notes & Assignment Class: 9 A: Chapter:-Introduction To It and Ites Industry
No ratings yet
It (402) - 5Th Week Notes & Assignment Class: 9 A: Chapter:-Introduction To It and Ites Industry
4 pages
Energy-Efficient Elevators and Escalators in Europe: An Analysis of Energy Efficiency Potentials and Policy Measures
No ratings yet
Energy-Efficient Elevators and Escalators in Europe: An Analysis of Energy Efficiency Potentials and Policy Measures
8 pages
Install Cloudera Hadoop on VirtualBox
No ratings yet
Install Cloudera Hadoop on VirtualBox
33 pages

Deep Learning for Image Captioning

Uploaded by

Deep Learning for Image Captioning

Uploaded by

CAPSTONE PROJECT

 TOPIC :- Image Captioning Using Deep Learning

1. To save user time for the caption of image.

1. To generate automatic captions from the given input image.

You might also like