

Algorithm 1.1 Naive Bayes


Train(X, Y) {reads documents X and labels Y}
  Compute dictionary D of X with n words.
  Compute m, m_ham and m_spam.
  Initialize b := log c + log m_ham − log m_spam to offset the rejection threshold
  Initialize p ∈ R^{2×n} with p_{ij} = 1, w_spam = n, w_ham = n.
  {Count occurrence of each word}
  {Here x_i^j denotes the number of times word j occurs in document x_i}
  for i = 1 to m do
    if y_i = spam then
      for j = 1 to n do
        p_{0,j} ← p_{0,j} + x_i^j
        w_spam ← w_spam + x_i^j
      end for
    else
      for j = 1 to n do
        p_{1,j} ← p_{1,j} + x_i^j
        w_ham ← w_ham + x_i^j
      end for
    end if
  end for
  {Normalize counts to yield word probabilities}
  for j = 1 to n do
    p_{0,j} ← p_{0,j}/w_spam
    p_{1,j} ← p_{1,j}/w_ham
  end for

Classify(x) {classifies document x}
  Initialize score threshold t := −b
  for j = 1 to n do
    t ← t + x^j (log p_{0,j} − log p_{1,j})
  end for
  if t > 0 return spam else return ham
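
To see Algorithm 1.1 in running form, the following is a minimal Python sketch. It assumes documents are represented as dictionaries mapping words to counts, labels are the strings "spam" and "ham", and the constant c of the decision rule defaults to 1; the function names train and classify are our own choices for the example, not part of the algorithm as stated.

import math

# Sketch of Algorithm 1.1 (Naive Bayes). Documents are {word: count} dicts,
# labels are "spam"/"ham", and c plays the role of the threshold constant.
def train(docs, labels, c=1.0):
    vocab = sorted({w for doc in docs for w in doc})
    n = len(vocab)
    m_spam = sum(1 for y in labels if y == "spam")
    m_ham = len(labels) - m_spam
    b = math.log(c) + math.log(m_ham) - math.log(m_spam)
    # Start every count at 1 (the p_{ij} = 1, w_spam = w_ham = n initialization).
    counts = {"spam": dict.fromkeys(vocab, 1.0), "ham": dict.fromkeys(vocab, 1.0)}
    totals = {"spam": float(n), "ham": float(n)}
    for doc, y in zip(docs, labels):
        for word, x in doc.items():
            counts[y][word] += x
            totals[y] += x
    # Normalize counts to word probabilities p(word | class).
    p = {y: {w: v / totals[y] for w, v in counts[y].items()} for y in counts}
    return p, b

def classify(doc, p, b):
    # t = -b + sum_j x^j (log p_{spam,j} - log p_{ham,j}); t > 0 means spam.
    t = -b
    for word, x in doc.items():
        if word in p["spam"]:  # words outside the training dictionary are ignored
            t += x * (math.log(p["spam"][word]) - math.log(p["ham"][word]))
    return "spam" if t > 0 else "ham"

On toy data such as train([{"cheap": 2, "pills": 1}, {"lunch": 1, "tomorrow": 1}], ["spam", "ham"]), the resulting classify({"cheap": 1}, p, b) returns "spam", since the word "cheap" is more probable under the spam counts than under the ham counts.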

1.3.2 Nearest Neighbor Estimators


An even simpler estimator than Naive Bayes is nearest neighbors. In its most
basic form it assigns to an observation x the label of its nearest neighbor
(see Figure 1.17). Hence, all we need to implement it is a distance measure
d(x, x′) between pairs of observations. Note that this distance need not even
be symmetric. This means that nearest neighbor classifiers can be extremely
flexible.
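
As a rough Python sketch of this idea, the classifier below takes the distance function as a parameter; the function names and the Euclidean distance used in the example are illustrative assumptions, not fixed by the text.

# Minimal 1-nearest-neighbor sketch with a pluggable distance d(x, x').
def nearest_neighbor(x, train_points, train_labels, d):
    # Return the label of the training point closest to x under d.
    best = min(range(len(train_points)), key=lambda i: d(x, train_points[i]))
    return train_labels[best]

def euclidean(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b)) ** 0.5

points = [(0.0, 0.0), (3.0, 3.0)]
labels = ["ham", "spam"]
print(nearest_neighbor((0.5, 0.2), points, labels, euclidean))  # prints "ham"

Because the distance argument is arbitrary, the same code works with any measure of dissimilarity between observations, symmetric or not.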
