[Figure: motivating applications. An RNN reads a document into hidden states h1 … h5; the final state h5 feeds a classifier that labels the document (e.g., Professional vs. Culture documents). Machine translation is another motivating example: https://translate.google.com/]
What makes Recurrent Networks so special?
Source: https://en.wikipedia.org/wiki/Recurrent_neural_network
How do RNNs work?
Source: https://en.wikipedia.org/wiki/Recurrent_neural_network
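To make the recurrence concrete, here is a minimal sketch of one step of a vanilla (Elman) RNN in numpy; the tanh nonlinearity, weight names, and dimensions are illustrative assumptions, not taken from the source.

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    """One step of a vanilla RNN: the new hidden state mixes
    the current input with the previous hidden state."""
    return np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)

# Illustrative dimensions: 8-dim inputs, 16-dim hidden state.
rng = np.random.default_rng(0)
W_xh = rng.normal(scale=0.1, size=(8, 16))
W_hh = rng.normal(scale=0.1, size=(16, 16))
b_h = np.zeros(16)

h = np.zeros(16)                            # h0
for x_t in rng.normal(size=(5, 8)):         # a length-5 input sequence
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)   # h1 ... h5
```

The same weights (W_xh, W_hh) are reused at every time step; that weight sharing is what makes the network "recurrent".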
Backpropagation Through Time and Vanishing Gradients
Source: http://www.wildml.com/2015/10/recurrent-neural-networks-tutorial-part-3-backpropagation-through-time-and-vanishing-gradients/
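A small numerical sketch of why gradients vanish: backpropagating through time multiplies the gradient by the recurrent Jacobian once per step, so with a small recurrent matrix its norm shrinks geometrically (ignoring the tanh derivative, which only shrinks it further). The matrix and sizes below are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
W_hh = rng.normal(scale=0.05, size=(16, 16))  # small random matrix, spectral radius well below 1

grad = np.ones(16)  # gradient arriving at the last time step
for t in range(50):
    # Each step back in time multiplies the gradient by W_hh^T
    # (the tanh' factor is dropped here; it is <= 1 and only shrinks it more).
    grad = W_hh.T @ grad
    if t % 10 == 9:
        print(f"after {t + 1:2d} steps back: ||grad|| = {np.linalg.norm(grad):.2e}")
```

With a recurrent matrix whose spectral radius exceeds 1, the same loop would show the opposite failure: exploding gradients.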
The Problem of Long-Term Dependencies
Slides: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Recurrent Neural Networks
Source: https://en.wikipedia.org/wiki/Recurrent_neural_network
Gated Recurrent Networks
Slides: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Long Short-Term Memory (LSTM) Networks
Slides: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
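A minimal sketch of a single LSTM step following the standard gate equations described in Olah's post linked above; the stacked weight layout, names, and dimensions are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM step. W maps [h_prev; x_t] to the four gate
    pre-activations stacked together; b is the matching bias."""
    z = np.concatenate([h_prev, x_t]) @ W + b
    f, i, o, g = np.split(z, 4)
    f, i, o = sigmoid(f), sigmoid(i), sigmoid(o)    # forget / input / output gates
    g = np.tanh(g)                                  # candidate cell values
    c = f * c_prev + i * g                          # cell state: gated memory
    h = o * np.tanh(c)                              # hidden state / output
    return h, c

rng = np.random.default_rng(2)
d_in, d_h = 8, 16
W = rng.normal(scale=0.1, size=(d_h + d_in, 4 * d_h))
b = np.zeros(4 * d_h)

h, c = np.zeros(d_h), np.zeros(d_h)
for x_t in rng.normal(size=(5, d_in)):
    h, c = lstm_step(x_t, h, c, W, b)
```

The additive update of the cell state c (rather than repeated matrix multiplication) is what lets gradients flow over long spans, addressing the vanishing-gradient problem above.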
Use Cases
[Figure: two sequence-classification setups. Left: intermediate hidden states h1 … hn-1 are ignored and only the final state hn feeds a linear classifier. Right: all hidden states are pooled, h = Sum(h1 … hn), before classification.]
http://deeplearning.net/tutorial/lstm.html
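A sketch of the two pooling choices in the figure: classify from the final hidden state alone, or from the sum of all hidden states. The hidden states here are random stand-ins for RNN outputs, and the linear classifier weights are illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)
d_h, n_classes = 16, 2
hs = rng.normal(size=(10, d_h))   # stand-in for h1 ... hn from an RNN
W_out = rng.normal(scale=0.1, size=(d_h, n_classes))
b_out = np.zeros(n_classes)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Option 1: ignore h1 ... h(n-1), classify from the final state hn.
p_last = softmax(hs[-1] @ W_out + b_out)

# Option 2: pool first, h = Sum(h1 ... hn), then classify.
p_sum = softmax(hs.sum(axis=0) @ W_out + b_out)
```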
Image Caption Generation
[Figure: a CNN encodes the image into the initial hidden state h0; the RNN then emits the caption one word at a time, “The” “dog” “is” “hiding” STOP, feeding each generated word back in as the next input, starting from a START token.]
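A sketch of the generation loop in the figure, assuming greedy decoding: the image feature initializes the hidden state, and the RNN feeds each emitted word back in until STOP. The tiny vocabulary, random weights, and step function are all illustrative stand-ins (the model is untrained, so the output is meaningless).

```python
import numpy as np

rng = np.random.default_rng(4)
vocab = ["START", "STOP", "The", "dog", "is", "hiding"]
d_h = 16
E = rng.normal(size=(len(vocab), d_h))          # word embeddings
W_hh = rng.normal(scale=0.1, size=(d_h, d_h))
W_out = rng.normal(size=(d_h, len(vocab)))

h = rng.normal(size=d_h)        # h0: stand-in for the CNN image encoding
word = vocab.index("START")
caption = []
for _ in range(20):             # hard cap on caption length
    h = np.tanh(E[word] + h @ W_hh)      # consume the previous word
    word = int(np.argmax(h @ W_out))     # greedy: pick the most likely next word
    if vocab[word] == "STOP":
        break
    caption.append(vocab[word])
print(" ".join(caption))
```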
Language Model and Sequence Generation
What is language modelling?
How likely is a given text to be generated? A good model assigns higher probability to fluent text:
P(I am going home.) > P(I am going house.)
Slides: http://users.umiacs.umd.edu/~jbg/teaching/CMSC_723/06a_lm_intro.pdf
Estimating P(w1, w2, …, wn)
Chain Rule:
P(w1, w2, …, wn) = P(w1) P(w2 | w1) P(w3 | w1, w2) ⋯ P(wn | w1, …, wn-1)
Markov Assumption: condition each word on only the previous k words; e.g., for k = 1 (a bigram model),
P(wn | w1, …, wn-1) ≈ P(wn | wn-1)
http://karpathy.github.io/2015/05/21/rnn-effectiveness/
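A minimal sketch of the chain-rule factorization under the bigram Markov assumption, using maximum-likelihood counts from a toy corpus (invented for illustration; no smoothing):

```python
from collections import Counter

corpus = "i am going home . i am home . i am going out .".split()

unigrams = Counter(corpus)
bigrams = Counter(zip(corpus, corpus[1:]))

def bigram_prob(sentence):
    """P(w1..wn) ~= P(w1) * prod_i P(wi | w(i-1)) under the Markov assumption."""
    words = sentence.split()
    p = unigrams[words[0]] / len(corpus)
    for prev, cur in zip(words, words[1:]):
        if unigrams[prev] == 0:
            return 0.0                  # unseen word: zero probability without smoothing
        p *= bigrams[(prev, cur)] / unigrams[prev]   # MLE estimate of P(cur | prev)
    return p

print(bigram_prob("i am going home ."))   # > 0: every bigram was observed
print(bigram_prob("i am going house ."))  # 0: unseen bigram, would need smoothing
```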
Example: character-level text generation (from Karpathy's post above)
Multi-layer RNNs
• We can of course design RNNs with multiple hidden layers (a minimal sketch follows below).
[Figure: a stacked RNN unrolled over six time steps; inputs x1 … x6 at the bottom, outputs y1 … y6 at the top.]
• Anything goes: skip connections across layers, across time, …
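A minimal sketch of stacking, assuming each layer's hidden-state sequence becomes the next layer's input sequence; the two-layer depth, shapes, and weights are illustrative.

```python
import numpy as np

def rnn_layer(xs, W_xh, W_hh):
    """Run one vanilla RNN layer over a whole sequence,
    returning the hidden state at every time step."""
    h = np.zeros(W_hh.shape[0])
    hs = []
    for x_t in xs:
        h = np.tanh(x_t @ W_xh + h @ W_hh)
        hs.append(h)
    return np.stack(hs)

rng = np.random.default_rng(5)
xs = rng.normal(size=(6, 8))                      # x1 ... x6
layer1 = rnn_layer(xs, rng.normal(scale=0.1, size=(8, 16)),
                       rng.normal(scale=0.1, size=(16, 16)))
# Layer 2 reads layer 1's hidden states as its input sequence.
layer2 = rnn_layer(layer1, rng.normal(scale=0.1, size=(16, 16)),
                           rng.normal(scale=0.1, size=(16, 16)))
ys = layer2                                       # y1 ... y6 (top layer)
```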
Word Representation
Distributional Hypothesis: Words that occur in the same contexts tend to have
similar meanings
Slides: https://cs224d.stanford.edu/lectures/CS224d-Lecture2.pdf
Word-based Co-occurrence Matrix
Slides: https://cs224d.stanford.edu/lectures/CS224d-Lecture2.pdf
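A sketch of building the co-occurrence matrix with a symmetric window of size 1, over a toy three-sentence corpus in the style of the linked CS224d lecture:

```python
import numpy as np

corpus = "i like deep learning . i like nlp . i enjoy flying .".split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}

window = 1                                  # symmetric context window
C = np.zeros((len(vocab), len(vocab)), dtype=int)
for i, w in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if j != i:
            C[idx[w], idx[corpus[j]]] += 1

print(vocab)
print(C)                                    # C[a, b] = times word b appears near word a
```

The matrix grows as vocab_size², and most entries are zero, which motivates the low-dimensional vectors on the next slide.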
Solution: Low-dimensional Vectors
Idea: Store “most” of the important information in a fixed, small number of
dimensions
Slides: https://cs224d.stanford.edu/lectures/CS224d-Lecture2.pdf
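One classic way to get such vectors, sketched below: truncated SVD of the co-occurrence matrix, keeping only the top-k singular directions. The matrix here is a random stand-in and k = 2 is illustrative.

```python
import numpy as np

rng = np.random.default_rng(6)
C = rng.poisson(1.0, size=(50, 50)).astype(float)  # stand-in co-occurrence matrix

U, S, Vt = np.linalg.svd(C)
k = 2                                   # keep a fixed, small number of dimensions
word_vectors = U[:, :k] * S[:k]         # each row: a k-dim word embedding

print(word_vectors.shape)               # (50, 2)
```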
Skip-Gram model
● Represent each word as a d-dimensional vector → W
● Represent each context as a d-dimensional vector → V
● Initialize both with random weights
Slides: https://cs224d.stanford.edu/lectures/CS224d-Lecture2.pdf
Skip-Gram model
● Generate probabilities for observing the surrounding (context) words given a center word
● The generated probability vector should match the true, empirically observed probabilities
Slides: https://cs224d.stanford.edu/lectures/CS224d-Lecture2.pdf
Skip-Gram model
Slides: https://cs224d.stanford.edu/lectures/CS224d-Lecture2.pdf
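Pulling the three slides together, a sketch of the skip-gram forward pass: word vectors W, context vectors V, random initialization, and a softmax over the vocabulary giving P(context word | center word). Vocabulary size and dimension d are illustrative.

```python
import numpy as np

rng = np.random.default_rng(7)
vocab_size, d = 1000, 50
W = rng.normal(scale=0.01, size=(vocab_size, d))   # word (center) vectors
V = rng.normal(scale=0.01, size=(vocab_size, d))   # context vectors

def context_probs(center):
    """P(context word | center word) for every word in the vocabulary."""
    scores = V @ W[center]              # one dot product per candidate context word
    scores -= scores.max()              # for numerical stability
    e = np.exp(scores)
    return e / e.sum()

p = context_probs(center=42)
# Training pushes p toward the empirical context distribution,
# e.g., via cross-entropy loss on observed (center, context) pairs.
```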
Negative Sampling
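The softmax above sums over the entire vocabulary, which is expensive; negative sampling instead trains a binary discriminator between one observed (center, context) pair and k randomly drawn "negative" words. A minimal sketch of the per-pair loss, with uniform negative sampling standing in for the unigram-based distribution used in practice:

```python
import numpy as np

rng = np.random.default_rng(8)
vocab_size, d, k = 1000, 50, 5
W = rng.normal(scale=0.01, size=(vocab_size, d))   # word (center) vectors
V = rng.normal(scale=0.01, size=(vocab_size, d))   # context vectors

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def neg_sampling_loss(center, context):
    """-log sigma(v_context . w_center) - sum_k log sigma(-v_neg . w_center)"""
    negatives = rng.integers(0, vocab_size, size=k)  # uniform here; unigram^0.75 in practice
    pos = np.log(sigmoid(V[context] @ W[center]))    # observed pair should score high
    neg = np.log(sigmoid(-(V[negatives] @ W[center]))).sum()  # negatives should score low
    return -(pos + neg)

print(neg_sampling_loss(center=42, context=7))
```

Each update now touches only k + 1 context vectors instead of all vocab_size of them.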