
Assignment No 1:

Submitted by:

1- Muhammad Ali (Reg.#BSSE07163044)

Class:

BSSE-8.

Submitted to:

MISS SAIRA MOIN.

Subject:

NATURAL LANGUAGE PROCESSING.

Department of Computer Sciences

The University of Lahore, Sargodha Campus


Text Classification (Naive Bayes)

The Naive Bayes classifier is a simple classifier that classifies based on probabilities of events. It is commonly applied to text classification. Though it is a simple algorithm, it performs well in many text classification problems.

Other advantages include shorter training time and the need for less training data, which in turn means lower CPU and memory consumption.

As with any machine learning model, we need to have an existing set of examples (training set)
for each category (class).

Let us consider a text classification task in which a sentence is classified as either 'question' or 'statement'. In this case, there are two classes ("question" and "statement"). With a training set, we can train a Naive Bayes classifier which we can then use to automatically categorize a new sentence.
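For illustration, a tiny hand-labelled training set for this task could look like the following Python sketch (the sentences and labels here are invented purely for the example, not taken from any real dataset):

# Hypothetical training set: each entry is (sentence, class).
# The sentences are invented only to illustrate the two classes.
training_set = [
    ("what is the price of this pen", "question"),
    ("where is the book shop", "question"),
    ("what is your name", "question"),
    ("the book is on the table", "statement"),
    ("the price of the pen is ten rupees", "statement"),
    ("this is my name", "statement"),
]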

We need to find out if a new sentence, say, ‘what is the price of the book’ is a question or not.

Bayes’ Theorem:

We need to find the probability of the class 'question' given the new sentence, and the probability of the class 'statement' given the new sentence. In other words, we need to determine which class has the higher probability for the new sentence, i.e., which of the two quantities below is larger.
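By Bayes' theorem, these two quantities can be written as:

\[
P(\text{question} \mid \text{sentence}) = \frac{P(\text{sentence} \mid \text{question}) \, P(\text{question})}{P(\text{sentence})}
\]
\[
P(\text{statement} \mid \text{sentence}) = \frac{P(\text{sentence} \mid \text{statement}) \, P(\text{statement})}{P(\text{sentence})}
\]

where 'sentence' stands for the new sentence to be classified.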

Since the denominator is the same in both equations, we can ignore it and only need to compare the numerators.

The problem here is that the new sentence need not appear anywhere in the training set, in which case its estimated probability is zero. That is, since 'what is the price of the book' did not appear in any of the classes in the training set, the probability is zero for both classes, which is not useful.
So let us split the sentence into words and assume that every word in a sentence is independent of the others. That is, we are no longer looking at the entire sentence, but rather at individual words.
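Under this naive independence assumption, writing the new sentence as the words w_1, ..., w_n, the quantities to compare become (dropping the common denominator as before):

\[
P(\text{question} \mid w_1, \ldots, w_n) \propto P(\text{question}) \prod_{i=1}^{n} P(w_i \mid \text{question})
\]
\[
P(\text{statement} \mid w_1, \ldots, w_n) \propto P(\text{statement}) \prod_{i=1}^{n} P(w_i \mid \text{statement})
\]

Here P(class) is the fraction of training sentences belonging to that class, and P(w_i | class) is estimated from how often the word w_i occurs among all the words of that class in the training set.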

The next step is just to calculate every probability in the above equations.

Now that we have the frequency of each word in each class, we can calculate the probability of each word given a class. Knowing the probability of occurrence of each word in a class, we can substitute these values into the equations above.
Therefore, the new sentence will be classified into the class that yields the higher value, i.e., the class in which its words occur more frequently according to the results.
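A minimal Python sketch of this calculation, assuming the small invented training set from earlier and plain relative word frequencies (no smoothing, exactly as described above, so a word never seen in a class simply drives that class's score to zero):

from collections import Counter

# Hypothetical training set: each entry is (sentence, class).
training_set = [
    ("what is the price of this pen", "question"),
    ("where is the book shop", "question"),
    ("what is your name", "question"),
    ("the book is on the table", "statement"),
    ("the price of the pen is ten rupees", "statement"),
    ("this is my name", "statement"),
]

# Count word frequencies and the number of sentences per class.
word_counts = {"question": Counter(), "statement": Counter()}
class_counts = Counter()
for sentence, label in training_set:
    class_counts[label] += 1
    word_counts[label].update(sentence.split())

def score(sentence, label):
    # P(class) multiplied by the product of P(word | class) for each word,
    # using raw relative frequencies from the training set.
    prior = class_counts[label] / sum(class_counts.values())
    total_words = sum(word_counts[label].values())
    result = prior
    for word in sentence.split():
        result *= word_counts[label][word] / total_words
    return result

new_sentence = "what is the price of the book"
scores = {label: score(new_sentence, label) for label in word_counts}
print(scores)
print("predicted class:", max(scores, key=scores.get))

Running this on 'what is the price of the book' gives a non-zero score only for the 'question' class with this toy data (the word 'what' never occurs in the 'statement' sentences), so the sentence is classified as a question.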
