
CS159 - Absolute Discount Smoothing Handout

David Kauchak - Fall 2020

To help understand the absolute discounting computation, below is a walkthrough of the probability calculations on a very small corpus.
Given the following corpus (where we only have one letter words):

a a a b
a b b a
c a a a

We would like to calculate an absolute discounted model with D = 0.5. We’ll ignore the begin
and end sentence tokens, assume that our vocabulary is all three “words” and not worry about
handling out of vocabulary words (i.e. <UNK>).
We first calculate the unigram MLE probabilities as:

MLE prob
p(a) 8/12
p(b) 3/12
p(c) 1/12

and the bigram MLE probabilities as:

MLE prob
p(a|a) 4/6
p(b|a) 2/6
p(c|a) 0
p(a|b) 1/2
p(b|b) 1/2
p(c|b) 0
p(a|c) 1
p(b|c) 0
p(c|c) 0
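These numbers are small enough to check by hand, but as a sanity check, here is a minimal Python sketch that reproduces both tables from the corpus above (the corpus literal and the variable names are illustrative, not part of the handout):

from collections import Counter

corpus = [["a", "a", "a", "b"],
          ["a", "b", "b", "a"],
          ["c", "a", "a", "a"]]

# Unigram MLE: count of each word divided by the total number of tokens.
tokens = [w for sent in corpus for w in sent]
unigram_counts = Counter(tokens)                      # a: 8, b: 3, c: 1
unigram_mle = {w: c / len(tokens) for w, c in unigram_counts.items()}
# p(a) = 8/12, p(b) = 3/12, p(c) = 1/12

# Bigram MLE: count of each adjacent pair divided by the count of its context
# (contexts only include non-sentence-final occurrences, e.g. 6 for "a").
bigram_counts = Counter((prev, cur)
                        for sent in corpus
                        for prev, cur in zip(sent, sent[1:]))
# (a,a): 4, (a,b): 2, (b,a): 1, (b,b): 1, (c,a): 1
context_totals = Counter()
for (prev, _cur), c in bigram_counts.items():
    context_totals[prev] += c                         # a: 6, b: 2, c: 1
bigram_mle = {(prev, cur): c / context_totals[prev]
              for (prev, cur), c in bigram_counts.items()}
# p(a|a) = 4/6, p(b|a) = 2/6, p(a|b) = p(b|b) = 1/2, p(a|c) = 1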
Using this information, we can now calculate the reserved mass and the αs for each of the
words in our vocabulary (i.e. things that we’re conditioning on); a short sketch after the list reproduces these numbers:

• a
The numerator for our α is the reserved mass:

reserved mass(a) = (2 ∗ 0.5)/6 = 1/6

and the denominator is:

1 − Σ_{x: count(ax) > 0} p(x) = 1 − (p(a) + p(b)) = 1 − (8/12 + 3/12) = 1/12

giving us an α of:

α(a) = (1/6) / (1/12) = 2

• b
reserved mass(b) = (2 ∗ 0.5)/2 = 1/2

1 − Σ_{x: count(bx) > 0} p(x) = 1 − (p(a) + p(b)) = 1 − (8/12 + 3/12) = 1/12

α(b) = (1/2) / (1/12) = 6

• c
reserved mass(c) = (1 ∗ 0.5)/1 = 1/2

1 − Σ_{x: count(cx) > 0} p(x) = 1 − p(a) = 1 − 8/12 = 4/12 = 1/3

α(c) = (1/2) / (1/3) = 3/2
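Here is the sketch promised above. It reproduces the reserved mass and α for each context; it hard-codes the bigram counts and unigram probabilities worked out earlier and uses exact fractions so the output matches the handout's numbers (the variable names are mine, not part of the handout):

# Sketch: reserved mass and alpha for each context, with D = 0.5.
# Counts and unigram probabilities are the ones computed above.
from fractions import Fraction

D = Fraction(1, 2)
bigram_counts = {("a", "a"): 4, ("a", "b"): 2,
                 ("b", "a"): 1, ("b", "b"): 1,
                 ("c", "a"): 1}
unigram_mle = {"a": Fraction(8, 12), "b": Fraction(3, 12), "c": Fraction(1, 12)}

for ctx in ["a", "b", "c"]:
    seen = {w: c for (prev, w), c in bigram_counts.items() if prev == ctx}
    total = sum(seen.values())                     # count of the context
    reserved_mass = D * len(seen) / total          # mass freed by discounting
    denom = 1 - sum(unigram_mle[w] for w in seen)  # unigram mass of unseen words
    alpha = reserved_mass / denom
    print(ctx, reserved_mass, alpha)
# prints: a 1/6 2    b 1/2 6    c 1/2 3/2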

Finally, now that we have the αs, we can calculate the smoothed bigram probabilities. For
bigrams that occurred, we simply discount the count: p(w|v) = (count(vw) − D)/count(v). For
bigrams that did not occur, we calculate the probability as α times the unigram probability of
the word: p(w|v) = α(v) ∗ p(w).
         calculation      prob
p(a|a)   (4 − 0.5)/6      3.5/6
p(b|a)   (2 − 0.5)/6      1.5/6
p(c|a)   2 ∗ 1/12         1/6
p(a|b)   (1 − 0.5)/2      1/4
p(b|b)   (1 − 0.5)/2      1/4
p(c|b)   6 ∗ 1/12         1/2
p(a|c)   (1 − 0.5)/1      1/2
p(b|c)   3/2 ∗ 3/12       3/8
p(c|c)   3/2 ∗ 1/12       1/8
Notice that after the smoothing, the three distributions all still sum to 1. In this case
discounting by 0.5 may be a bit aggressive.
Note that for the first two distributions (p(·|a) and p(·|b)) we didn’t actually need to go through
the exercise of calculating α, since there was only one “word” to back off to and it therefore
gets all of the reserved mass. However, I included the computation here so you can see that
the math still works out.
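To see that the math really does work out, the following sketch (same illustrative setup and exact fractions as the sketch above, D = 0.5) computes every smoothed bigram probability and checks that each conditional distribution sums to exactly 1:

# Sketch: absolute-discounted bigram probabilities for the toy corpus,
# verifying that each conditional distribution sums to 1.
from fractions import Fraction

D = Fraction(1, 2)
vocab = ["a", "b", "c"]
bigram_counts = {("a", "a"): 4, ("a", "b"): 2,
                 ("b", "a"): 1, ("b", "b"): 1,
                 ("c", "a"): 1}
unigram_mle = {"a": Fraction(8, 12), "b": Fraction(3, 12), "c": Fraction(1, 12)}

def p_absolute(word, ctx):
    seen = {w: c for (prev, w), c in bigram_counts.items() if prev == ctx}
    total = sum(seen.values())
    if word in seen:                                # seen bigram: discount its count
        return (seen[word] - D) / total
    reserved_mass = D * len(seen) / total           # mass freed by the discounting
    alpha = reserved_mass / (1 - sum(unigram_mle[w] for w in seen))
    return alpha * unigram_mle[word]                # unseen bigram: back off

for ctx in vocab:
    dist = {w: p_absolute(w, ctx) for w in vocab}
    print(ctx, dist, sum(dist.values()))            # each sum prints as 1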
