V MML (18G5B17) - Test - 3 - Merged QPS

Academic year 2021-2022 (Odd semester 2021)
DEPARTMENT OF MATHEMATICS
Date 27 January 2021 Time 2:00 PM to 4:00 PM
Test II (OPEN BOOK_Online) Maximum Marks 50
Course Title MATHEMATICS FOR MACHINE LEARNING Course Code 18G5B17
Semester V Programs Global Elective
Instructions: i) Answer all questions. Use of Scientific calculator is allowed.
Sl. No. Questions M BT CO

1 The datasets given below contains the details of the policy holders at an insurance
company. The attributes (i.e., descriptive features) included in the table describe each
policy holder’s ID, gender, age, the type of insurance policy they hold and their
preferred contact channel. The preferred contact channel attribute is the class label
attribute (i.e., target feature) that has two different values (phone, email).
ID Gender Age Policy Classification (Channel)
1 Female young Plan C phone
2 Male young Plan A email
3 Male young Plan A email
4 Female Middle-aged Plan C email 10 3 3
5 Female Middle-aged Plan B phone
6 Male Middle-aged Plan C phone
7 Male senior Plan B email
8 Male senior Plan C phone
9 Female senior Plan A phone
10 Female young Plan B email
a) Construct maximum likelihood probability tables for the attributes: (gender, age and
policy).
b) Set up a Naïve Bayes model to classify the policy holder into which category would
you classify a FEMALE YOUNG choosing PLAN A.
2. The joint probability distribution of a pair of random variables (x, y), where x is scaled
temperature and y is difference in temperature as given by the following table:
P X, Y (x, y) x = 0 x = 1 x = 2 x = 3
y=0 0.05 0.07 0.12 0.08 10 2 3
y=1 0.13 0.21 0.08 0.15
y=2 0.07 0.02 0.01 0.01
a) Determine the marginal PMFs of the random variables X and Y.
b) Compute the covariance matrix and correlation matrix for the above data.
3. The assessment data were gathered from 8 homes of an apartment in Bengaluru is
provided in the following table:
x 57.3 63.8 65.4 57.0 63.8 63.2 60.2 57.7
y 74.8 74.0 72.9 70.0 74.9 76.0 72.0 73.5 10 2 2
a) Fit a regression model ŷ i = β0 + β1 x, where x = assessed value (in thousands of
rupees) and y = selling price (in thousands of rupees) using the method of least squares.
b) Calculate the fitted values ŷ i and residuals ê i ; i = 1, 2… 8.
1|Page
4. Consider the following two-dimensional data.

X 1 2 2 3 3 4
Y 2 1 3 2 4 3 10 3 4
Apply principal component analysis technique to reduce the dimensions for the above
data. Note: For covariance calculations, the denominator is the total number of data
points in the set.
5. At a ticket counter, T, the time it takes to issue a ticket is often modelled as
k e(-t/5) , t  0
fT (t)=  10 2 1
. Compute (i) the value of k, (ii) E [T], the expected duration of
 0, otherwise
a ticket being issued? (iii) the variance and standard deviation of T (iv) P [0  T 10].
BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks

COS / BT CO1 CO2 CO3 CO4 L1 L2 L3 L4 L5 L6
Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****
2|Page
Sl. No. TEST M BT CO

1 The following data indicates the various attributes of cars.
ID Colour Type Origin Stolen?

1 Red Sports Domestic Yes
2 Red Sports Domestic No
4 Yellow Sports Domestic No
5 Yellow Sports Imported Yes
6 Yellow SUV Imported No
7 Yellow SUV Imported Yes 10 3 3
8 Yellow SUV Domestic No
9 Red SUV Imported No
10 Red Sports Imported Yes
a) Construct maximum likelihood probability tables for the attributes: (colour, sports
and origin).
b) Set up a Naïve Bayes model to classify the cars into which category would you
classify a RED DOMESTIC SUV?
2. Random variables X and Y have the joint PDF
c x + y , x = -2, 0, 2 ; y = -1, 0, 1
PX, Y (x, y) =  10 2 3
 0, otherwise
a) Determine the value of the constant c and P [Y=X].
3.
Students’ scores in the advanced statistical methods course examination, x, and on the
cumulative grade point average (CGPA), y are given below:
x 4 2 5 3 2 3 4 3 5 2 10 2 2
y 3.12 3.00 4.5 4.75 3 3.5 3.75 4.12 4.54 3.1
1|Page

X 1 2 4 5 7 6
Y 2 5 9 8 5 4 10 3 4
points in the set.
5. The Random variable X has a probability density function
0, x  1
 x + 1, -1  x  0

f X (x)=  10 2 1
1 - x, 0  x  1
0, x>1
Compute (i) Sketch the PDF, (ii) the expected value, variance and standard deviation of
X (iv) P [-0.5  x  0.5].

Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****
2|Page

1 Ten training datasets for animals are given below:
Training Animal Size of animal Body color Classification
example (can we pet them)
1 Dog Medium Black Yes
2 Dog Big White No
3 Rat Small White Yes
4 Cow Big White Yes
5 Cow Small Brown No
6 Cow Big Black Yes 10 3 3
7 Rat Big Black No
8 Dog Small Brown Yes
9 Dog Medium Brown Yes
10 Cow Medium White No
a) Construct maximum likelihood probability tables for the attributes (animal, size of
animal and body color).
b) Set up a Naïve Bayes model to classify the animals into which category would you
classify a SMALL BLACK DOG?
2. Random variables X and Y have the joint PDF
k x - y , x = -1, 0, 1 ; y = -2, 0, 2
PX, Y (x, y) =  10 2 3
 0, otherwise
a) Determine the value of the constant k and P [Y=X].
3.
The following data are chloride concentration y (in milligrams per liter) and roadway
area in the water shed x (in percentage) reported by an article in the Journal of
Environmental Engineering assessment data were gathered from 8 homes of an
apartment in Bengaluru is provided in the following table:
10 2 2
x 0.19 0.15 0.57 0.70 0.67 0.63 0.47 0.70
y 4.4 6.6 9.7 10.6 10.8 10.9 11.8 12.1
1|Page
4. The following are the data on the variables x1 = sales (in billions) and x2 = profits (in
billions) for the four companies:
x1 9 2 6 5 7 6 2
x2 12 8 6 4 5 4 5 10 3 4
points in the set.
5. The Random variable X has a probability density function

  x
k 1 -
f X (x)=    , -4 < x  4
4 10 2 1
0,
 otherwise
Compute (i) the value of k, (ii) the expected value E [X] and E [-2X+3] (iii) P [-0.5  x
 0.5] and P [0 x  3].

Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****
2|Page

1 Suppose that the given data consists of N data-points (xn, yn), where xn and yn are
scalars.
n 1 2 3 4 5 6 7 8
xn -5 -4 -3 -1 2 3 4 5
10 2 2
yn -10 -8 -6 -4 4 6 8 10
a) Fit a regression model ŷ i = w0 + w1 x, using the method of least squares.

2 The Random variable X has a probability density function
0, x  1
 x + 1, -1  x  0

f X (x)= 
1 - x, 0  x  1 10 2 1
0, x>1
Compute (i) Sketch the PDF, (ii) the expected value, variance and standard deviation of
X (iv) P [-0.5  x  0.5].
3 The following data indicates the various attributes of cars.
ID Colour Type Origin Stolen?

2 Red Sports Domestic No
4 Yellow Sports Domestic No
5 Yellow Sports Imported Yes
6 Yellow SUV Imported No
10 3 3
7 Yellow SUV Imported Yes
8 Yellow SUV Domestic No
9 Red SUV Imported No
10 Red Sports Imported Yes
a) Construct maximum likelihood probability tables for the attributes: (colour, sports
and origin).
b) Set up a Naïve Bayes model to classify the cars into which category would you
classify a RED DOMESTIC SUV?
1|Page
4.
The joint probability distribution of a pair of random variables (x, y), where x is scaled
temperature and y is difference in temperature as given by the following table:
P X, Y (x, y) x = 0 x = 1 x = 2 x = 3
y=0 0.05 0.07 0.12 0.08 10 2 3
y=1 0.13 0.21 0.08 0.15
y=2 0.07 0.02 0.01 0.01
a) Determine the marginal PMFs of the random variables X and Y.
X 1 2 4 5 7 6
Y 2 5 9 8 5 4 10 3 4
points in the set.

Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****
2|Page

V MML (18G5B17) - Test - 3 - Merged QPS

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

V MML (18G5B17) - Test - 3 - Merged QPS

Uploaded by

Copyright:

Available Formats

Academic year 2021-2022 (Odd semester 2021)

Sl. No. Questions M BT CO

4. Consider the following two-dimensional data.

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks

Sl. No. TEST M BT CO

ID Colour Type Origin Stolen?

4. Consider the following two-dimensional data.

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks

Sl. No. Questions M BT CO

5. The Random variable X has a probability density function

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks

Sl. No. Questions M BT CO

a) Fit a regression model ŷ i = w0 + w1 x, using the method of least squares.

ID Colour Type Origin Stolen?

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks

You might also like