
One Pager on: Show me a story: Towards Coherent Neural Story Illustration

Hareesh Ravi, Lezi Wang, Carlos M. Muniz, Leonid Sigal, Dimitris N. Metaxas,
Mubbasir Kapadia

Student Name: Shawkh Ibne Rashid

This paper proposes a method for the inverse of the image-captioning problem: retrieving a
correlated sequence of images for a paragraph of text. The authors refer to the input as a
story-in-sequence (SIS) and the output of their model as images-in-sequence (IIS). To map an
SIS to images, they propose an end-to-end neural architecture in the form of an encoder-decoder:
the model encodes the sentences into feature representations and decodes these into a
correlated set of images. To keep the objects and persons that recur across sentences
consistent across the retrieved images, they use a coherence vector. The model consists of a
two-stage GRU-RNN network along with a VGG-16 CNN. The first GRU-RNN encodes every word of a
sentence to form a feature vector, so a paragraph of n sentences yields n feature vectors. The
second stage introduces sequential context into these vectors. The corresponding image feature
vectors are obtained from a pre-trained VGG-16 model. The whole network is trained with an
order-embedding loss, which constrains story feature vectors to lie as close as possible to
their corresponding image feature vectors.
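The order-embedding idea can be sketched as follows. This is a minimal, illustrative
formulation: the function names, the margin value, and the contrastive mining over in-batch
negatives are my own assumptions, not the paper's exact hyper-parameters or code.

```python
def order_violation(story_vec, image_vec):
    # Order-embedding penalty: zero when the image embedding dominates the
    # story embedding coordinate-wise, positive otherwise.
    return sum(max(0.0, s - i) ** 2 for s, i in zip(story_vec, image_vec))


def contrastive_order_loss(stories, images, margin=0.05):
    """Margin-based contrastive loss over a batch of embeddings.

    stories, images: lists of equal-length vectors, where images[k] is the
    ground-truth match for stories[k]. Mismatched pairs serve as negatives.
    """
    n = len(stories)
    loss = 0.0
    for k in range(n):
        pos = order_violation(stories[k], images[k])
        for j in range(n):
            if j == k:
                continue
            # Hinge terms: the true pair should violate the order less than
            # any mismatched pair, by at least `margin`, in both directions.
            loss += max(0.0, margin + pos - order_violation(stories[k], images[j]))
            loss += max(0.0, margin + pos - order_violation(stories[j], images[k]))
    return loss / n
```

In this scheme a matched story/image pair incurs no penalty once the image embedding
dominates the story embedding, while hard negatives that also satisfy the order keep
contributing through the margin.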

The authors conducted a user study with AMT workers to compare the results of their proposed
model against two other models (a Baseline Network and a Network without Coherence) and
against the ground-truth images on the VIST dataset. The AMT workers preferred the results of
the proposed model over the other models and, in some cases, even over the ground truth. The
authors also proposed a visual-saliency-based metric to measure coherence among the output
images: the goal is to check whether the objects and people that recur in the paragraph are
also maintained across the images. As the number of considered images increases, the Baseline
Network performs better than the proposed network on this metric, but in terms of consistency
the proposed model outperforms the others.
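The intuition behind such a coherence metric can be sketched with a toy score. This
simplified formulation is my own, not the paper's actual saliency-based metric: it assumes
each image has already been reduced to a set of detected salient-object labels, and it
measures how consistently the entities mentioned in multiple sentences reappear.

```python
def sequence_coherence(image_objects, recurring_entities):
    """Toy coherence score for an illustrated story.

    image_objects: list of sets of object labels detected in each image.
    recurring_entities: entities mentioned in more than one sentence of
    the story (these are the ones coherence cares about).
    Returns the fraction of (entity, image) opportunities realised.
    """
    if not recurring_entities or not image_objects:
        return 0.0
    hits = sum(1 for objs in image_objects
               for e in recurring_entities if e in objs)
    return hits / (len(image_objects) * len(recurring_entities))
```

For example, if "dog" recurs in the story but appears in only two of three retrieved
images, the score is 2/3; a sequence that drops a recurring entity entirely scores lower.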

The authors pointed out the need for a better evaluation metric for this task and noted it as
future work. VIST was also the only dataset the authors could find with sequences of images
paired with corresponding text descriptions, and some of its paragraphs lack well-defined
coherence, so a more comprehensive dataset could be produced. Little prior work addresses this
paragraph-to-image-sequence problem, so there is considerable scope for improvement: different
RNN and CNN architectures, along with parameter tuning, could be tried to increase the model's
performance.
