CS4248 AY 2021/22 Semester 1 Tutorial 2: C C C C C P C P

Uploaded by

Caiyi Xu

0% found this document useful (0 votes)

14 views3 pages

Original Title

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

14 views3 pages

CS4248 AY 2021/22 Semester 1 Tutorial 2: C C C C C P C P

Uploaded by

Caiyi Xu

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

CS4248

AY 2021/22 Semester 1
Tutorial 2

1. Text classification is the task of assigning a class ci to a text t based on the content of
t , where ci is chosen from a predefined set of classes. The task can be formulated as a
supervised learning task from labeled training texts, using Bayesian classification.

Suppose the following 4 labeled training texts have been collected for 2 classes:
c1 : usa is the champion in tennis
c1 : wimbledon is the ultimate in tennis
c2 : brazil is the champion in soccer
c2 : soccer is popular

(a) Compute the maximum likelihood estimate for P(c1 ) and P(c2 ) , where P( ci ) is the
proportion of training texts in class ci .

w P( w | c1 ) P ( w | c2 )
MLE add-one MLE add-one
brazil
champion
in
is
popular
soccer
tennis
the
ultimate
usa
wimbledon

(b) Let V be the set of words occurring in the training texts. For each word w ∈ V , compute
the maximum likelihood estimate (MLE) for P( w | c1 ) and P( w | c2 ) , where P( w | ci ) is
the probability of selecting word w from the bag of words that constitute all the training
texts belonging to ci . That is, provide the probabilities in the MLE columns in the above
table.
(c) Use add-one smoothing to compute P( w | c1 ) and P( w | c2 ) such that P( w | ci ) > 0 for
each word w ∈ V . That is, provide the probabilities in the add-one columns in the above
table.
(d) For text classification, each text t can be represented as the bag of words occurring in
t . The naïve Bayes assumption states that the words w1 ,, wn in a text t are conditionally
independent given the class c of t . That is,

1
P( w1 ,, wn | c) = P( w1 | c) ⋅  ⋅ P( wn | c)

Give the formula for determining the class that a new test text t belongs to, using Bayesian
classification and making the naïve Bayes assumption.
(e) Given the following two texts:

germany is the champion in soccer

wimbledon is played in the uk

determine the class to which each text is assigned. (Ignore any word in a test text that does
not belong to V )

2. Give a trace of the minimum edit distance algorithm (a dynamic programming

algorithm) to compute the minimum cost of transforming the string “cheap” to “help”, by
filling out every cell entry in the following table, where each cell entry denotes the
minimum cost of transforming the associated substrings. Assume that the cost of inserting
a character is 1, the cost of deleting a character is 1, and the cost of substituting a character
by a different character is 2. Add appropriate backtrace pointer(s) to each cell entry, and
trace the optimal paths in the table.

p 5

a 4

e 3

h 2

c 1

0 1 2 3 4

h e l p

2
3. Consider a language with only two symbols I and O. The probability pI of the symbol I
occurring in the language is unknown. Suppose one particular sample text consists of a
sequence of N symbols drawn randomly and independently from the language, out of which
the symbol I occurs NI times and the symbol O occurs N – NI times in this particular sample
text.
(a) Write an expression for the probability of the sample text in terms of pI, NI, and N.
(b) Derive mathematically the value of pI that maximizes the probability of the sample text,
in terms of NI and N. This value serves as the maximum likelihood estimate of pI.

End Semester Examination: Thapar University, Patiala
Document2 pages
End Semester Examination: Thapar University, Patiala
KSHITIJ CHAUDHARY
No ratings yet
Mid Semester Examination (Sep 2018) B. E. - III Year, Semester V
Document2 pages
Mid Semester Examination (Sep 2018) B. E. - III Year, Semester V
KSHITIJ CHAUDHARY
No ratings yet
Inf2b Learn Note07 2up
Document5 pages
Inf2b Learn Note07 2up
NiravSinghDabhi
No ratings yet
Cse 22 Cse 211 2018-19
Document4 pages
Cse 22 Cse 211 2018-19
ashik
No ratings yet
Assignment 1
Document2 pages
Assignment 1
SuneelKumarChauhan
No ratings yet
CS60057 Speech and Natural Language Processing MA 2016
Document2 pages
CS60057 Speech and Natural Language Processing MA 2016
Nirmalendu Kumar
No ratings yet
Computer Programming - Seminary Work
Document15 pages
Computer Programming - Seminary Work
IONUȚ-DRAGOȘ NEREMZOIU
No ratings yet
L-LFF - l/CSE Date: 31/10/2019: For (N 1 N 5 N++)
Document23 pages
L-LFF - l/CSE Date: 31/10/2019: For (N 1 N 5 N++)
Sumaiya Binte Yousuf
No ratings yet
Portfolio FLA - Syllabus PDF
Document35 pages
Portfolio FLA - Syllabus PDF
Kashish Verma
No ratings yet
CBSE Board QP
Document81 pages
CBSE Board QP
JNV DHAR PRINCIPAL
No ratings yet
ComputerScience SQP
Document13 pages
ComputerScience SQP
Luv Sharma
No ratings yet
2019 Dec. CS301-E - Ktu Qbank
Document3 pages
2019 Dec. CS301-E - Ktu Qbank
Sowmya k.s.
No ratings yet
IE506 Assignment1
Document2 pages
IE506 Assignment1
010Denish Lakhani
No ratings yet
Homework 3 (MATH 5490-01) Name (Print) : Due Date: Friday, Nov. 4, 2016
Document1 page
Homework 3 (MATH 5490-01) Name (Print) : Due Date: Friday, Nov. 4, 2016
Vasanth Aradhya
No ratings yet
Problem Set 12
Document3 pages
Problem Set 12
akdjf
No ratings yet
Home Assignment 1
Document3 pages
Home Assignment 1
anujkumarsharma276
No ratings yet
Problem Set 04
Document2 pages
Problem Set 04
satishjena
No ratings yet
Automata Theory Problems and Exercises PDF Free
Document11 pages
Automata Theory Problems and Exercises PDF Free
Lokman Azrai
No ratings yet
Assignment 2 Math 205 Linear Algebra
Document6 pages
Assignment 2 Math 205 Linear Algebra
laraib Aftab zedie
No ratings yet
Programming and Data Structures Using Python Final Exam, Dec 2020-Mar 2021
Document3 pages
Programming and Data Structures Using Python Final Exam, Dec 2020-Mar 2021
Ishan Singh
No ratings yet
BCS 042
Document16 pages
BCS 042
Mr. Janardhana V ECE
No ratings yet
Toc
Document6 pages
Toc
Karina Rathore
No ratings yet
2011 Daa End-Regular
Document4 pages
2011 Daa End-Regular
8207dayaan Bashir
No ratings yet
Automata Theory Assignment
Document4 pages
Automata Theory Assignment
Shyam Bathvar
No ratings yet
Assignment No. 1 Class: T.E. Computer Subject: Theory of Computation
Document6 pages
Assignment No. 1 Class: T.E. Computer Subject: Theory of Computation
Vinit Tribhuvan
No ratings yet
Comp Sample
Document13 pages
Comp Sample
Abhinav Mukherjee
No ratings yet
TOC Questions 012210030237 1
Document4 pages
TOC Questions 012210030237 1
Prabhu Dhandapani
No ratings yet
Smis Mid Term - Add Maths
Document6 pages
Smis Mid Term - Add Maths
YuanWei Siow
No ratings yet
Toc 2018 - Merged
Document11 pages
Toc 2018 - Merged
atulo1308
No ratings yet
Computer Science Theory Final Exam
Document8 pages
Computer Science Theory Final Exam
Cassie Marie
No ratings yet
IR&TM Review II: F A S T
Document5 pages
IR&TM Review II: F A S T
navid.basiry960
No ratings yet
18696.assignment 3 For IEC 2
Document6 pages
18696.assignment 3 For IEC 2
vikrant
No ratings yet
BEET 405 Theory of Automata & Formal Languages
Document2 pages
BEET 405 Theory of Automata & Formal Languages
jatinshaarmaa
No ratings yet
CS 162 - Automata Theory - Fall 2015 - Goodrich - Final Exam
Document10 pages
CS 162 - Automata Theory - Fall 2015 - Goodrich - Final Exam
Syed Fahad Bukhari
No ratings yet
TOA FinalAssignment Part01
Document6 pages
TOA FinalAssignment Part01
Aqib Raza
No ratings yet
UCS701 Theory of Computation MST
Document2 pages
UCS701 Theory of Computation MST
KSHITIJ CHAUDHARY
No ratings yet
DM (4th) May 2022
Document3 pages
DM (4th) May 2022
princevirsingh3
No ratings yet
SOLUTIONS: Homework Assignment 2: CSCI 2670 Introduction To Theory of Computing, Fall 2018 September 18, 2018
Document8 pages
SOLUTIONS: Homework Assignment 2: CSCI 2670 Introduction To Theory of Computing, Fall 2018 September 18, 2018
Orijit Nandy
No ratings yet
Ee 793 Class 15121
Document11 pages
Ee 793 Class 15121
techy
No ratings yet
Tut 1
Document2 pages
Tut 1
Phan Minh Nhut
No ratings yet
Question Bank: Unit 1: Introduction To Finite Automata
Document8 pages
Question Bank: Unit 1: Introduction To Finite Automata
Bhavani Paturi
No ratings yet
Model Python Paper
Document3 pages
Model Python Paper
Yashraj Bhadauria
0% (1)
Theory of Computation Question Paper 2015 - Tutorialsduniya
Document7 pages
Theory of Computation Question Paper 2015 - Tutorialsduniya
vanshika
No ratings yet
02qbank PDF
Document4 pages
02qbank PDF
kjhdskjhfkjfh
No ratings yet
2021 B.tech. 5st Semester (Cse) Formal Languages & Automata
Document4 pages
2021 B.tech. 5st Semester (Cse) Formal Languages & Automata
Kapil Kumar
No ratings yet
Final Al-Mahdi11thg 13-14
Document3 pages
Final Al-Mahdi11thg 13-14
api-253679034
No ratings yet
CENG 223: Discrete Computational Structures
Document3 pages
CENG 223: Discrete Computational Structures
Ali Berkcan Boylu
No ratings yet
MATH1005 Final Exam From 2021
Document14 pages
MATH1005 Final Exam From 2021
Spamy Spam
No ratings yet
Assignment 2: Information Security Spring 2022
Document3 pages
Assignment 2: Information Security Spring 2022
Chika A.
No ratings yet
Python Programming KNC 302
Document2 pages
Python Programming KNC 302
Digvijoy Ranjan
No ratings yet
Coursework Submission Form
Document5 pages
Coursework Submission Form
Shakeel Bhatti
No ratings yet
AOT Question Bank
Document34 pages
AOT Question Bank
Lina Das
No ratings yet
Math1081 2016 S1
Document6 pages
Math1081 2016 S1
Christopher
No ratings yet
DMCS (5th) Dec2017
Document2 pages
DMCS (5th) Dec2017
ayushsharma.blp001
No ratings yet
2017 CSN523 T1 Solution PDF
Document3 pages
2017 CSN523 T1 Solution PDF
saiavinash duddupudi
No ratings yet
AT Questions
Document4 pages
AT Questions
Parth Shah
No ratings yet
EE230 2014 hw2
Document2 pages
EE230 2014 hw2
Burakcan Guvenir
No ratings yet
hw1 PDF
Document3 pages
hw1 PDF
何明涛
No ratings yet
Spring 2004 VITEC Software Design and Development Engineer Examination (Morning)
Document35 pages
Spring 2004 VITEC Software Design and Development Engineer Examination (Morning)
Hữu Hưởng Nguyễn
No ratings yet
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
Rating: 3 out of 5 stars
3/5 (1)
CS4248 AY 2021/22 Semester 1 Tutorial 1: Rexpr S
Document2 pages
CS4248 AY 2021/22 Semester 1 Tutorial 1: Rexpr S
Caiyi Xu
No ratings yet
4225 5425Quiz19S2 PaperV1 Answer
Document16 pages
4225 5425Quiz19S2 PaperV1 Answer
Caiyi Xu
No ratings yet
Course Project Guideline - New
Document6 pages
Course Project Guideline - New
Caiyi Xu
No ratings yet
CS2108 L1 Introduction v8
Document25 pages
CS2108 L1 Introduction v8
Caiyi Xu
No ratings yet
CS2108 L2 SP Basis Fourier Series v12
Document61 pages
CS2108 L2 SP Basis Fourier Series v12
Caiyi Xu
No ratings yet
Lecture 1 - Introduction To Concurrency: CS3211 Parallel and Concurrent Programming
Document32 pages
Lecture 1 - Introduction To Concurrency: CS3211 Parallel and Concurrent Programming
Caiyi Xu
No ratings yet
CS3211 - Parallel and Concurrent Programming: Admin Information
Document10 pages
CS3211 - Parallel and Concurrent Programming: Admin Information
Caiyi Xu
No ratings yet
CS3211 TentativeSchedule AY2122 S2
Document2 pages
CS3211 TentativeSchedule AY2122 S2
Caiyi Xu
No ratings yet
XT1C 160 TMD 25-450 4p F F: General Information
Document3 pages
XT1C 160 TMD 25-450 4p F F: General Information
Henrics Mayores
No ratings yet
Lecture 2
Document33 pages
Lecture 2
eng fourm
No ratings yet
Model 4WI 100 - 800 HP Boilers: 1.4 Submittals
Document4 pages
Model 4WI 100 - 800 HP Boilers: 1.4 Submittals
sebaversa
No ratings yet
Turbine Control System
Document8 pages
Turbine Control System
Zakariya
50% (2)
Peter and Wolf Resource Guide - Boston Philharmonic Orchestra
Document32 pages
Peter and Wolf Resource Guide - Boston Philharmonic Orchestra
Emily Sharratt
100% (5)
Energies 14 04973
Document19 pages
Energies 14 04973
roméo legue
No ratings yet
MQXUSBDEVAPI
Document32 pages
MQXUSBDEVAPI
wonderx
No ratings yet
DO Request
Document3 pages
DO Request
Ahmed Ismail
No ratings yet
Goldscmidt Algo
Document4 pages
Goldscmidt Algo
chayanpathak
No ratings yet
Lesson 5 - Web Development
Document25 pages
Lesson 5 - Web Development
Joshua Omolewa
No ratings yet
Surpac Introduction
Document207 pages
Surpac Introduction
Krist Jan Jimenez Separa
0% (1)
Sae Summer Keps
Document6 pages
Sae Summer Keps
Pritesh Kumar
No ratings yet
R S Aggarwal Solutions Class 11 Maths Chapter 15 Trigonometric or Circular Functions
Document131 pages
R S Aggarwal Solutions Class 11 Maths Chapter 15 Trigonometric or Circular Functions
Saswat Sinha
No ratings yet
Excel Cheatsheet
Document1 page
Excel Cheatsheet
Boring Bland
No ratings yet
Science Geography
Document133 pages
Science Geography
sampcant212
No ratings yet
Course STKO+OpenSees FEUP-UM
Document1 page
Course STKO+OpenSees FEUP-UM
anon_897435228
No ratings yet
Segger Eval Software
Document24 pages
Segger Eval Software
fercho573
No ratings yet
Basic Calculus Module 4
Document26 pages
Basic Calculus Module 4
Settei Hainah Ampuan
No ratings yet
EMV Multi Cable Transit Modular System (EMC-System) : PDF
Document7 pages
EMV Multi Cable Transit Modular System (EMC-System) : PDF
bakien-can
No ratings yet
Oracle Database JDBC Developer Guide and Reference
Document432 pages
Oracle Database JDBC Developer Guide and Reference
api-25919427
100% (1)
OCS Call Detail Record Reference (V100R002C02LG0237 - 26) (Pakistan, Ufone) PDF
Document354 pages
OCS Call Detail Record Reference (V100R002C02LG0237 - 26) (Pakistan, Ufone) PDF
Muhammad Saad Farooq
0% (1)
878
Document47 pages
878
IulianCiobanu
No ratings yet
Contaminants in Landfill Soils
Document8 pages
Contaminants in Landfill Soils
Choi Khoiron
No ratings yet
Ict Igcse Paper 2 Revision Database
Document9 pages
Ict Igcse Paper 2 Revision Database
Indianagrofarms
No ratings yet
Lipid Analysis: Melisa Intan Barliana
Document38 pages
Lipid Analysis: Melisa Intan Barliana
Chantique Maharani
0% (1)
Advanced Higher Music Listening Booklet 2020 To Accompany Sways On Weebly
Document6 pages
Advanced Higher Music Listening Booklet 2020 To Accompany Sways On Weebly
api-259438750
No ratings yet
Python Lab 1
Document4 pages
Python Lab 1
vaskore
No ratings yet
APIU CARMT Syllabus Version 1.0 - Confidential For Review Committee
Document49 pages
APIU CARMT Syllabus Version 1.0 - Confidential For Review Committee
Fuad Hassan
No ratings yet
RD Sharma Solutions For Class 10 Chapter 2 Polynomials Exercise 2.1
Document10 pages
RD Sharma Solutions For Class 10 Chapter 2 Polynomials Exercise 2.1
Ruturaj Parida
No ratings yet
Effect of MG Content On Microstructure and
Document12 pages
Effect of MG Content On Microstructure and
Mohamed Ramadan
No ratings yet