
Automatic Image Captioning Bot with CNN and RNN


Submitted By: Harkirat Singh
CSE-3
01976802717
INTRODUCTION
• Image Captioning is the process of generating a textual description of an image. It uses both Natural Language Processing and Computer Vision to generate the captions.
• The dataset will be in the form [image → captions]: it consists of input images and their corresponding output captions.
Image Captioning

 
Datasets
• Common Objects in Context (COCO): a collection of more than 120 thousand images with descriptions.
• Flickr 8K: a collection of 8 thousand described images taken from flickr.com.
• Flickr 30K: a collection of 30 thousand described images taken from flickr.com.
(Source: Exploring Image Captioning Datasets, 2016)
Data Collection
• There are many open-source datasets available for this problem, like Flickr 8k (containing 8k images), Flickr 30k (containing 30k images), MS COCO (containing 180k images), etc.
• But for the purpose of this case study, I have used the Flickr 8k dataset, which you can download by filling the form provided by the University of Illinois at Urbana-Champaign. Also, training a model with a large number of images may not be feasible on a system which is not a very high-end PC/laptop.
• This dataset contains 8000 images, each with 5 captions (as we have already seen in the Introduction section, an image can have multiple captions, all being relevant simultaneously).
(Figure: example image captioned “A white dog in a grassy area”)
These images are divided as follows (a sketch for loading the splits is given after this list):
• Training Set — 6000 images
• Dev Set — 1000 images
• Test Set — 1000 images
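A minimal sketch of reading these three splits, assuming the standard Flickr 8k text files; the file names (e.g. “Flickr_8k.trainImages.txt”) are assumptions based on the usual distribution of the dataset and may differ in your download:

```python
def load_image_ids(split_file):
    """Return the set of image file names listed in a split file (one per line)."""
    with open(split_file, "r") as f:
        return {line.strip() for line in f if line.strip()}

# File names below are assumed from the standard Flickr 8k distribution.
train_ids = load_image_ids("Flickr_8k.trainImages.txt")  # ~6000 images
dev_ids   = load_image_ids("Flickr_8k.devImages.txt")    # ~1000 images
test_ids  = load_image_ids("Flickr_8k.testImages.txt")   # ~1000 images
print(len(train_ids), len(dev_ids), len(test_ids))
```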
Data Preprocessing — Images
• Images are nothing but the input (X) to our model. As you may already know, any input to a model must be given in the form of a vector.
• We need to convert every image into a fixed-size vector which can then be fed as input to the neural network. For this purpose, we opt for transfer learning by using the InceptionV3 model (a Convolutional Neural Network) created by Google Research.
• This model was trained on the ImageNet dataset to perform image classification on 1000 different classes of images. However, our purpose here is not to classify the image but just to get a fixed-length informative vector for each image. This process is called automatic feature engineering.
• Hence, we just remove the last softmax layer from the model and extract a 2048-length vector (the bottleneck features) for every image, as shown in the sketch below.
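A minimal sketch of this feature-extraction step, assuming Keras (TensorFlow) is used; the helper name encode_image and the example path are illustrative, not from the original slides:

```python
import numpy as np
from tensorflow.keras.applications.inception_v3 import InceptionV3, preprocess_input
from tensorflow.keras.preprocessing import image
from tensorflow.keras.models import Model

# Load InceptionV3 pre-trained on ImageNet and drop the final 1000-way
# softmax layer, keeping the penultimate 2048-dimensional layer as output.
base = InceptionV3(weights="imagenet")
encoder = Model(inputs=base.input, outputs=base.layers[-2].output)

def encode_image(path):
    """Return the 2048-length bottleneck feature vector for one image."""
    img = image.load_img(path, target_size=(299, 299))  # InceptionV3 expects 299x299 input
    x = image.img_to_array(img)
    x = np.expand_dims(x, axis=0)     # add the batch dimension
    x = preprocess_input(x)           # scale pixels the way InceptionV3 was trained
    return encoder.predict(x).reshape(2048)

# Hypothetical usage (path is illustrative):
# Image_1 = encode_image("Flicker8k_Dataset/some_image.jpg")
```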
Data Preparation
• This is one of the most important steps in this case study. Here we will understand how to prepare the data in a manner that is convenient to give as input to the deep learning model.
• Hereafter, I will try to explain the remaining steps by taking a sample example as follows.
• Consider we have 2 images and their 2 corresponding captions as given:
(Train image 1) Caption -> The black cat sat on grass
(Train image 2) Caption -> The white cat is walking on road
• First, we need to convert both the images to their corresponding 2048-length feature vectors as discussed above. Let “Image_1” and “Image_2” be the feature vectors of the first two images respectively.
• Secondly, let’s build the vocabulary for the first two (train) captions by adding the two tokens “startseq” and “endseq” to both of them (assume we have already performed the basic cleaning steps); a sketch of this step follows the examples below:
• Caption_1 -> “startseq the black cat sat on grass endseq”
• Caption_2 -> “startseq the white cat is walking on road endseq”
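A minimal sketch of wrapping the (already cleaned) captions with the “startseq”/“endseq” tokens and building the vocabulary; the variable names are illustrative, not from the original slides:

```python
# Example captions keyed by the illustrative feature-vector names used above.
captions = {
    "Image_1": "the black cat sat on grass",
    "Image_2": "the white cat is walking on road",
}

# Wrap every caption with the start and end tokens.
wrapped = {img: "startseq " + cap + " endseq" for img, cap in captions.items()}
# wrapped["Image_1"] == "startseq the black cat sat on grass endseq"

# Build the vocabulary as the set of all unique words across the wrapped captions.
vocabulary = set()
for cap in wrapped.values():
    vocabulary.update(cap.split())
print(sorted(vocabulary))
# ['black', 'cat', 'endseq', 'grass', 'is', 'on', 'road', 'sat', 'startseq', 'the', 'walking', 'white']
```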
THANK YOU!
