
Bayesian Classification Methods

GROUP 6
TINASHE MAKOSA R216135C
IKE CHIWARA R2113321F
AMANDA MATINHURE R21887F
RONALD ZAKA R217989T
NYASHA ZINGWE R2118175X
ADMIRE MVERECHA R2111208N
RAY CHIDHUNGWANA R214923G
PETER MUTISWA R2115999V
RODNEY CHAKADYA R218200W
KEVIN SIMANGO R2111815P
DEFINITION OF TERMS

• Bayesian Classification: the use of statistical classifiers to determine the probability that an item belongs to a given class. These classifiers implement Bayes' theorem.
• Bayes' Theorem: provides a way to revise existing predictions or theories given new or additional evidence.
Why we use Bayesian classifiers

• Favourable performance: they exhibit high accuracy and speed when applied to large databases.
• Incremental: each training example can incrementally increase or decrease the probability that a hypothesis is correct, so prior knowledge can be combined with observed data.
• Standard: they provide a standard of optimal decision making against which other methods can be measured.
Naive Bayes Algorithm
• It is a classification technique based on Bayes' theorem with an assumption of independence among predictors.
• In simple terms, a Naïve Bayes classifier assumes that the presence of a particular feature in a class is unrelated to the presence of any other feature.
• For instance, a fruit may be considered to be an apple if it is red, round and about 3 inches in diameter.
• Even if these features depend on each other or upon the existence of the other features, all of these properties independently contribute to the probability that this fruit is an apple, and that is why it is known as "Naive" (the factorization is sketched below).
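In symbols, the independence assumption means the class-conditional probability of all the features factorizes into a product of per-feature probabilities (our restatement of the assumption above, written in the same P(·|·) notation used throughout these slides):

    P(x1, x2, …, xn | c) = P(x1 | c) × P(x2 | c) × … × P(xn | c)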
Continuation…

• The Naïve Bayes model is easy to build and particularly useful for very large data sets. Along with its simplicity, Naïve Bayes is known to outperform even highly sophisticated classification methods.
• Bayes' theorem provides a way of calculating the posterior probability P(c|x) from P(c), P(x) and P(x|c).
Let's take a look at the equation below:

• P(c|x) = P(x|c) P(c) / P(x)
• In this equation:
• P(c|x) is the posterior probability of the class (c, target) given the predictor (x, attributes)
• P(c) is the prior probability of the class
• P(x|c) is the likelihood, i.e. the probability of the predictor given the class
• P(x) is the prior probability of the predictor
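A minimal sketch of this equation in Python (the function name and the example numbers, taken from the worked example later in these slides, are our additions):

    # Bayes' theorem: posterior = likelihood * prior / evidence
    def posterior(p_x_given_c, p_c, p_x):
        return p_x_given_c * p_c / p_x

    # P(Yes | Sunny) = P(Sunny | Yes) * P(Yes) / P(Sunny)
    print(posterior(3 / 9, 9 / 14, 5 / 14))  # ~0.60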
How the Naïve Bayes algorithm works

• Below is a training data set of weather and the corresponding target variable 'Play' (suggesting the possibility of playing). We need to classify whether players will play or not based on the weather condition, following these steps:
• Step 1: Convert the data set into a frequency table.
• Step 2: Create a likelihood table by finding the probabilities, e.g. the probability of Overcast is 0.29 and the probability of playing is 0.64.
Continuation…

Training data:

Weather    Play
Sunny      No
Overcast   Yes
Rainy      Yes
Sunny      Yes
Sunny      Yes
Overcast   Yes
Rainy      No
Rainy      No
Sunny      Yes
Rainy      Yes
Sunny      No
Overcast   Yes
Overcast   Yes
Rainy      No

Frequency table:

Weather       No    Yes
Overcast            4
Rainy         3     2
Sunny         2     3
Grand Total   5     9

Likelihood table:

Weather    No          Yes         P(Weather)
Overcast               4           4/14 = 0.29
Rainy      3           2           5/14 = 0.36
Sunny      2           3           5/14 = 0.36
All        5/14 = 0.36 9/14 = 0.64
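A minimal sketch of Steps 1 and 2 in Python, assuming the 14-row data set above (the variable names are ours):

    from collections import Counter

    # Training data: (weather, play) pairs from the table above
    data = [("Sunny", "No"), ("Overcast", "Yes"), ("Rainy", "Yes"),
            ("Sunny", "Yes"), ("Sunny", "Yes"), ("Overcast", "Yes"),
            ("Rainy", "No"), ("Rainy", "No"), ("Sunny", "Yes"),
            ("Rainy", "Yes"), ("Sunny", "No"), ("Overcast", "Yes"),
            ("Overcast", "Yes"), ("Rainy", "No")]

    # Step 1: frequency tables
    freq = Counter(data)                             # freq[("Sunny", "Yes")] == 3
    play_counts = Counter(play for _, play in data)  # {"Yes": 9, "No": 5}
    weather_counts = Counter(w for w, _ in data)     # {"Sunny": 5, "Rainy": 5, "Overcast": 4}

    # Step 2: likelihood table entries
    n = len(data)
    print(weather_counts["Overcast"] / n)  # P(Overcast) = 4/14, about 0.29
    print(play_counts["Yes"] / n)          # P(Yes)      = 9/14, about 0.64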
Continuation…

• Step 3: Now use the Naïve Bayes equation to calculate the posterior probability for each class. The class with the highest posterior probability is the outcome of the prediction.
• Problem: Players will play if the weather is sunny. Is this statement correct?
• We can solve it using the method of posterior probability discussed above:
P(Yes | Sunny) = P(Sunny | Yes) * P(Yes) / P(Sunny)
• Here we have P(Sunny | Yes) = 3/9 or 0.33, P(Sunny) = 5/14 or 0.36, and P(Yes) = 9/14 or 0.64.
• Now P(Yes | Sunny) = 0.33 * 0.64 / 0.36 = 0.60. By the same equation, P(No | Sunny) = P(Sunny | No) * P(No) / P(Sunny) = 0.40 * 0.36 / 0.36 = 0.40, so Yes has the higher posterior probability and the statement is correct.
• Naïve Bayes uses a similar method to predict the probability of different classes based on various attributes. This algorithm is mostly used in text classification and in problems having multiple classes. A code sketch of this prediction follows below.
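A minimal sketch of Step 3 for this problem (the variable names are ours; the counts come from the frequency table above):

    # Posteriors for each class given Weather = Sunny:
    # Sunny appears in 3 of 9 "Yes" rows, 2 of 5 "No" rows, and 5 of 14 rows overall
    p_sunny = 5 / 14

    posteriors = {
        "Yes": (3 / 9) * (9 / 14) / p_sunny,   # P(Yes | Sunny) = 0.60
        "No":  (2 / 5) * (5 / 14) / p_sunny,   # P(No  | Sunny) = 0.40
    }

    # The class with the highest posterior probability is the prediction
    print(max(posteriors, key=posteriors.get))  # -> "Yes"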
Advantages & Disadvantages

Pros
• Easy and fast to implement, even in multiclass prediction.
• Good results are found in most cases.

Cons
• The assumption of class-conditional independence reduces accuracy when features are in fact dependent.
• It cannot model dependencies among variables; this limitation is addressed by Bayesian Belief Networks.
Applications of Naïve Bayes
