Natural Language Processing: Parsing

Uploaded by

ritesh sinha

Natural Language Processing

Parsing

Parsing

Parsing is the automatic analysis of a sentence with respect to its syntactic structure

Given a context-free grammar (CFG), this means deriving a phrase structure tree assigned to the sentence by the grammar

With ambiguous grammars, a sentence may have many valid parse trees

• Should we retrieve all of them or just one?

• If the latter, how do we know which one?
Ambiguity

I booked a flight from LA.

• This sentence is ambiguous. In what way?

• What should happen if we parse the sentence?
Ambiguity

PP attached to the nominal (a flight that departs from LA):

[S
  [NP [Pro I]]
  [VP [Verb booked]
      [NP [Det a]
          [Nom [Nom [Noun flight]]
               [PP from LA]]]]]
Ambiguity

PP attached to the verb phrase (the booking was made from LA):

[S
  [NP [Pro I]]
  [VP [Verb booked]
      [NP [Det a]
          [Nom [Noun flight]]]
      [PP from LA]]]
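The two readings differ only in where the PP "from LA" attaches. A minimal sketch in Python makes this explicit, assuming a nested-tuple encoding of trees; the encoding and the helper names `leaves` and `pp_parent` are illustrative choices, not part of the slides.

```python
# Each node is (label, child, child, ...); a leaf child is a plain string.
# The PP "from LA" is left unexpanded, as in the trees on the slides.
NOUN_ATTACHMENT = (
    "S",
    ("NP", ("Pro", "I")),
    ("VP",
        ("Verb", "booked"),
        ("NP",
            ("Det", "a"),
            ("Nom",
                ("Nom", ("Noun", "flight")),
                ("PP", "from", "LA")))),
)

VERB_ATTACHMENT = (
    "S",
    ("NP", ("Pro", "I")),
    ("VP",
        ("Verb", "booked"),
        ("NP", ("Det", "a"), ("Nom", ("Noun", "flight"))),
        ("PP", "from", "LA")),
)


def leaves(tree):
    """Return the words at the bottom of the tree, left to right."""
    if isinstance(tree, str):
        return [tree]
    words = []
    for child in tree[1:]:
        words.extend(leaves(child))
    return words


def pp_parent(tree):
    """Return the label of the node that directly dominates the PP, or None."""
    if isinstance(tree, str):
        return None
    for child in tree[1:]:
        if not isinstance(child, str) and child[0] == "PP":
            return tree[0]
        found = pp_parent(child)
        if found is not None:
            return found
    return None


# Both trees cover exactly the same words ...
print(leaves(NOUN_ATTACHMENT) == leaves(VERB_ATTACHMENT))  # True
# ... but the PP hangs from a different node in each.
print(pp_parent(NOUN_ATTACHMENT), pp_parent(VERB_ATTACHMENT))  # Nom VP
```

A parser that returns only one tree has to commit to one of these two attachment sites.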
Ambiguity

Combinatorial explosion

[Figure: growth of linear, cubic, and exponential functions for input sizes 1 to 8; vertical axis from 0 to 1600]
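To get a feel for the numbers behind the explosion, the sketch below counts the distinct binary bracketings of a sentence with n words. That count is the Catalan number C(n-1), which grows exponentially. This is a separate illustration and does not reproduce the data of the plot; the function names are assumptions made for the example.

```python
from math import comb


def catalan(n):
    """n-th Catalan number: C(n) = binomial(2n, n) / (n + 1)."""
    return comb(2 * n, n) // (n + 1)


def binary_bracketings(num_words):
    """Number of distinct binary trees over `num_words` leaves."""
    return catalan(num_words - 1)


# Compare linear, cubic, and Catalan growth for short sentences.
for n in range(1, 13):
    print(f"n={n:2d}  linear={n:3d}  cubic={n ** 3:5d}  "
          f"bracketings={binary_bracketings(n):6d}")
```

For very short inputs the cubic column is still larger, but the number of bracketings overtakes it by n = 10 and keeps pulling away.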
Phrase structure trees

Root (top): S

[S
  [NP [Pro I]]
  [VP [Verb prefer]
      [NP [Det a]
          [Nom [Nom [Noun morning]]
               [Noun flight]]]]]

Leaves (bottom): I prefer a morning flight
Basic concepts of parsing

• Two problems for grammar G and string w:

• Recognition: determine if G accepts w

• Parsing: retrieve (all or some) parse trees
assigned to w by G

• Two basic search strategies:

• Top-down: start at the root of the tree

• Bottom-up: start at the leaves
Top-down parsing

• Basic idea:

• Start at the root node, expand tree by matching the left-hand side of rules

• Derive a tree whose leaves match the input

• Potential problems:

• Uses rules that could never match the input

• May loop on left-recursive rules: VP → VP PP!
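The looping problem can be reproduced with a naive top-down, depth-first recognizer. The sketch below is one possible implementation under stated assumptions: the toy grammars, the lexicon, and the names `recognize_top_down` and `loops_on` are made up for illustration, and the "loop" shows up as Python's recursion limit being hit.

```python
def recognize_top_down(grammar, lexicon, start, words):
    """Naive top-down, depth-first recognizer (no guard against left recursion)."""

    def expand(symbol, pos):
        # Yield every input position reachable after deriving `symbol` from `pos`.
        if symbol in lexicon:  # part-of-speech symbol: must match the next word
            if pos < len(words) and words[pos] in lexicon[symbol]:
                yield pos + 1
            return
        for rhs in grammar[symbol]:  # try every rule with this left-hand side
            yield from expand_sequence(rhs, pos)

    def expand_sequence(rhs, pos):
        # Derive the symbols of a right-hand side one after another.
        if not rhs:
            yield pos
            return
        for mid in expand(rhs[0], pos):
            yield from expand_sequence(rhs[1:], mid)

    return any(end == len(words) for end in expand(start, 0))


def loops_on(grammar, lexicon, start, words):
    """True if the recognizer exhausts Python's recursion limit on this input."""
    try:
        recognize_top_down(grammar, lexicon, start, words)
    except RecursionError:
        return True
    return False


# Hypothetical toy lexicon and grammars for the example sentence.
LEXICON = {"Pro": {"I"}, "Verb": {"booked"}, "Det": {"a"}, "Noun": {"flight"},
           "Prep": {"from"}, "PropN": {"LA"}}

SAFE = {"S": [("NP", "VP")],
        "NP": [("Pro",), ("PropN",), ("Det", "Noun")],
        "VP": [("Verb", "NP"), ("Verb", "NP", "PP")],
        "PP": [("Prep", "NP")]}

# A variant in which the left-recursive rule VP -> VP PP is tried first.
LEFT_RECURSIVE = dict(SAFE, VP=[("VP", "PP"), ("Verb", "NP")])

sentence = "I booked a flight from LA".split()
print(recognize_top_down(SAFE, LEXICON, "S", sentence))  # True
print(loops_on(LEFT_RECURSIVE, LEXICON, "S", sentence))  # True
```

With VP → VP PP, expanding VP at a position immediately asks for VP at the same position again, so no input is ever consumed.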
Bottom-up parsing

• Basic idea:

• Start with the leaves, build tree by matching
the right-hand side of rules

• Build a tree with S at the root

• Potential problems:

• Builds structures that could never be in a tree

• May loop on epsilon productions: NP → ε!
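A bottom-up recognizer can be sketched as repeated reduction: replace any substring that matches the right-hand side of a rule by its left-hand side until only S remains. The version below is a simple backtracking sketch, not an efficient algorithm. It uses the rules of the sample grammar from these slides, while the word-to-category lexicon and the name `recognize_bottom_up` are assumptions for the example. The grammar has no epsilon productions, so the search always terminates.

```python
RULES = [
    ("S", ("NP", "VP")),
    ("NP", ("Pronoun",)),
    ("NP", ("Proper-Noun",)),
    ("NP", ("Det", "Nominal")),
    ("Nominal", ("Nominal", "PP")),
    ("Nominal", ("Noun",)),
    ("VP", ("Verb", "NP")),
    ("VP", ("Verb", "NP", "PP")),
    ("PP", ("Preposition", "NP")),
]

# Hypothetical lexicon: one category per word.
LEXICON = {"I": "Pronoun", "booked": "Verb", "a": "Det",
           "flight": "Noun", "from": "Preposition", "LA": "Proper-Noun"}


def recognize_bottom_up(rules, lexicon, start, words):
    """Backtracking bottom-up recognizer based on repeated reduction."""
    if any(word not in lexicon for word in words):
        return False
    seen = set()  # sentential forms already explored (dead ends included)

    def reduce_form(form):
        if form == (start,):
            return True
        if form in seen:
            return False
        seen.add(form)
        for lhs, rhs in rules:
            width = len(rhs)
            for i in range(len(form) - width + 1):
                # Match the right-hand side, then replace it by the left-hand side.
                if form[i:i + width] == rhs:
                    reduced = form[:i] + (lhs,) + form[i + width:]
                    if reduce_form(reduced):
                        return True
        return False

    return reduce_form(tuple(lexicon[word] for word in words))


print(recognize_bottom_up(RULES, LEXICON, "S", "I booked a flight from LA".split()))  # True
print(recognize_bottom_up(RULES, LEXICON, "S", "booked I a flight".split()))          # False
```

The `seen` set records every form that was tried, including reductions that build constituents which never end up in a complete tree.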
Dealing with ambiguity

• The number of possible parse trees can grow exponentially with sentence length

• A naive backtracking approach is too inefficient

• Key observation:

• Alternative parse trees share substructures

• We can use dynamic programming (again)
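One way to exploit shared substructure is to memoise results per (symbol, span), so that every sub-analysis is computed once and reused by all trees that contain it. The sketch below counts parse trees this way without enumerating them. It uses the rules of the sample grammar from these slides; the lexicon (including the extra words "city", "state", and "in") and the name `count_parses` are assumptions for illustration. It relies on the grammar having no epsilon productions and no cyclic unary rules.

```python
from functools import lru_cache

GRAMMAR = {
    "S": (("NP", "VP"),),
    "NP": (("Pronoun",), ("Proper-Noun",), ("Det", "Nominal")),
    "Nominal": (("Nominal", "PP"), ("Noun",)),
    "VP": (("Verb", "NP"), ("Verb", "NP", "PP")),
    "PP": (("Preposition", "NP"),),
}

# Hypothetical lexicon: one category per word.
LEXICON = {"I": "Pronoun", "booked": "Verb", "a": "Det", "flight": "Noun",
           "city": "Noun", "state": "Noun", "from": "Preposition",
           "in": "Preposition", "LA": "Proper-Noun"}


def count_parses(words, start="S"):
    """Count the parse trees of `words` with memoisation over (symbol, span)."""
    words = tuple(words)

    @lru_cache(maxsize=None)
    def count(symbol, i, j):
        # Number of trees rooted in `symbol` that cover words[i:j].
        total = 0
        if j - i == 1 and LEXICON.get(words[i]) == symbol:
            total += 1
        for rhs in GRAMMAR.get(symbol, ()):
            total += count_sequence(rhs, i, j)
        return total

    @lru_cache(maxsize=None)
    def count_sequence(rhs, i, j):
        # Number of ways the symbol sequence `rhs` can cover words[i:j].
        if len(rhs) == 1:
            return count(rhs[0], i, j)
        total = 0
        # The first symbol covers words[i:k]; every remaining symbol needs a word.
        for k in range(i + 1, j - len(rhs) + 2):
            left = count(rhs[0], i, k)
            if left:
                total += left * count_sequence(rhs[1:], k, j)
        return total

    return count(start, 0, len(words))


print(count_parses("I booked a flight from LA".split()))                 # 2
print(count_parses("I booked a flight from a city in a state".split()))  # 4
```

The counts are obtained by multiplying and adding cached sub-counts, so the work stays polynomial in the sentence length even when the number of trees does not.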
Probabilistic context-free grammar

• The number of possible parse trees grows rapidly with the length of the input.

• But not all parse trees are equally useful.

Example: I booked a flight from Los Angeles.

• In many applications, we want the ‘best’ parse tree, or the first few best trees.

• Special case: ‘best’ = ‘most probable’
Probabilistic context-free grammar

Probabilistic context-free grammars

A probabilistic context-free grammar (PCFG) is a context-free grammar where

• each rule r has been assigned a probability p(r) between 0 and 1

• the probabilities of rules with the same left-hand side sum up to 1
Probabilistic context-free grammar

A sample PCFG

Rule                       Probability
S → NP VP                  1
NP → Pronoun               1/3
NP → Proper-Noun           1/3
NP → Det Nominal           1/3
Nominal → Nominal PP       1/3
Nominal → Noun             2/3
VP → Verb NP               8/9
VP → Verb NP PP            1/9
PP → Preposition NP        1
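The two conditions in the definition of a PCFG can be checked mechanically. The sketch below encodes the sample grammar with exact fractions and verifies that every probability lies between 0 and 1 and that the rules for each left-hand side sum to 1. The list-of-triples encoding and the name `is_proper` are assumptions made for this example.

```python
from collections import defaultdict
from fractions import Fraction

# (left-hand side, right-hand side, probability), copied from the sample PCFG.
PCFG = [
    ("S", ("NP", "VP"), Fraction(1)),
    ("NP", ("Pronoun",), Fraction(1, 3)),
    ("NP", ("Proper-Noun",), Fraction(1, 3)),
    ("NP", ("Det", "Nominal"), Fraction(1, 3)),
    ("Nominal", ("Nominal", "PP"), Fraction(1, 3)),
    ("Nominal", ("Noun",), Fraction(2, 3)),
    ("VP", ("Verb", "NP"), Fraction(8, 9)),
    ("VP", ("Verb", "NP", "PP"), Fraction(1, 9)),
    ("PP", ("Preposition", "NP"), Fraction(1)),
]


def is_proper(rules):
    """Check both PCFG conditions: 0 <= p(r) <= 1 and per-LHS sums equal to 1."""
    totals = defaultdict(Fraction)  # Fraction() is 0
    for lhs, _rhs, prob in rules:
        if not 0 <= prob <= 1:
            return False
        totals[lhs] += prob
    return all(total == 1 for total in totals.values())


print(is_proper(PCFG))  # True
```

Using `Fraction` instead of floats avoids rounding issues when comparing the sums with 1.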
Probabilistic context-free grammar

The probability of a parse tree

The probability of a parse tree is defined as the product of the probabilities of the rules that have been used to build the parse tree.
Probabilistic context-free grammar

The probability of a parse tree

[S 1/1
  [NP 1/3 [Pro I]]
  [VP 8/9 [Verb booked]
      [NP 1/3 [Det a]
          [Nom 1/3 [Nom 2/3 [Noun flight]]
               [PP from LA]]]]]

Probability: 16/729
Probabilistic context-free grammar

The probability of a parse tree

[S 1/1
  [NP 1/3 [Pro I]]
  [VP 1/9 [Verb booked]
      [NP 1/3 [Det a]
          [Nom 2/3 [Noun flight]]]
      [PP from LA]]]

Probability: 6/729
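The two tree probabilities can be recomputed by multiplying the probabilities of the annotated rules. The sketch below assumes, as the trees on the slides do, that the PP "from LA" is treated as an unexpanded unit and that word-level rules contribute no factor. The dictionary encoding and the name `tree_probability` are illustrative.

```python
from fractions import Fraction
from math import prod

# Rule probabilities from the sample PCFG (only the rules used in the two trees).
P = {
    ("S", ("NP", "VP")): Fraction(1),
    ("NP", ("Pronoun",)): Fraction(1, 3),
    ("NP", ("Det", "Nominal")): Fraction(1, 3),
    ("Nominal", ("Nominal", "PP")): Fraction(1, 3),
    ("Nominal", ("Noun",)): Fraction(2, 3),
    ("VP", ("Verb", "NP")): Fraction(8, 9),
    ("VP", ("Verb", "NP", "PP")): Fraction(1, 9),
}

# Rules used when the PP attaches to the nominal.
NOMINAL_ATTACHMENT = [
    ("S", ("NP", "VP")),
    ("NP", ("Pronoun",)),
    ("VP", ("Verb", "NP")),
    ("NP", ("Det", "Nominal")),
    ("Nominal", ("Nominal", "PP")),
    ("Nominal", ("Noun",)),
]

# Rules used when the PP attaches to the verb phrase.
VP_ATTACHMENT = [
    ("S", ("NP", "VP")),
    ("NP", ("Pronoun",)),
    ("VP", ("Verb", "NP", "PP")),
    ("NP", ("Det", "Nominal")),
    ("Nominal", ("Noun",)),
]


def tree_probability(rules_used):
    """Product of the probabilities of all rules used in the tree."""
    return prod((P[rule] for rule in rules_used), start=Fraction(1))


print(tree_probability(NOMINAL_ATTACHMENT))  # 16/729
print(tree_probability(VP_ATTACHMENT))       # 2/243, the same value as 6/729
```

Under this grammar the nominal attachment is the more probable reading, mainly because VP → Verb NP (8/9) is far more likely than VP → Verb NP PP (1/9).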
Independence assumption

How can we make sense of this in terms of probability theory?

• The probability of a rule expansion depends only on the left-hand side symbol

• Is this a reasonable independence assumption?
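In probabilistic terms, the assumption can be written out as follows. This is a sketch in standard PCFG notation; the symbols A, β, t, and r_i are introduced here for illustration and do not appear on the slides.

```latex
% A rule probability is the conditional probability of an expansion given its left-hand side:
p(A \to \beta) = P(\beta \mid A), \qquad \sum_{\beta} p(A \to \beta) = 1

% If every expansion is independent of everything except its left-hand side symbol,
% the probability of a tree t built with the rules r_1, \dots, r_n factorises into a product:
P(t) = \prod_{i=1}^{n} p(r_i)
```

The factorisation is what justifies multiplying rule probabilities, and it is also what prevents the model from taking the surrounding words or the position in the tree into account.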
