You are on page 1of 8

Academic year 2021-2022 (Odd semester 2021)

DEPARTMENT OF MATHEMATICS
Date 27 January 2021 Time 2:00 PM to 4:00 PM
Test II (OPEN BOOK_Online) Maximum Marks 50
Course Title MATHEMATICS FOR MACHINE LEARNING Course Code 18G5B17
Semester V Programs Global Elective
Instructions: i) Answer all questions. Use of Scientific calculator is allowed.

Sl. No. Questions M BT CO


1 The datasets given below contains the details of the policy holders at an insurance
company. The attributes (i.e., descriptive features) included in the table describe each
policy holder’s ID, gender, age, the type of insurance policy they hold and their
preferred contact channel. The preferred contact channel attribute is the class label
attribute (i.e., target feature) that has two different values (phone, email).
ID Gender Age Policy Classification (Channel)
1 Female young Plan C phone
2 Male young Plan A email
3 Male young Plan A email
4 Female Middle-aged Plan C email 10 3 3
5 Female Middle-aged Plan B phone
6 Male Middle-aged Plan C phone
7 Male senior Plan B email
8 Male senior Plan C phone
9 Female senior Plan A phone
10 Female young Plan B email
a) Construct maximum likelihood probability tables for the attributes: (gender, age and
policy).
b) Set up a Naïve Bayes model to classify the policy holder into which category would
you classify a FEMALE YOUNG choosing PLAN A.
2. The joint probability distribution of a pair of random variables (x, y), where x is scaled
temperature and y is difference in temperature as given by the following table:
P X, Y (x, y) x = 0 x = 1 x = 2 x = 3
y=0 0.05 0.07 0.12 0.08 10 2 3
y=1 0.13 0.21 0.08 0.15
y=2 0.07 0.02 0.01 0.01
a) Determine the marginal PMFs of the random variables X and Y.
b) Compute the covariance matrix and correlation matrix for the above data.
3. The assessment data were gathered from 8 homes of an apartment in Bengaluru is
provided in the following table:
x 57.3 63.8 65.4 57.0 63.8 63.2 60.2 57.7
y 74.8 74.0 72.9 70.0 74.9 76.0 72.0 73.5 10 2 2
a) Fit a regression model ŷ i = β0 + β1 x, where x = assessed value (in thousands of
rupees) and y = selling price (in thousands of rupees) using the method of least squares.
b) Calculate the fitted values ŷ i and residuals ê i ; i = 1, 2… 8.

1|Page
Academic year 2021-2022 (Odd semester 2021)

4. Consider the following two-dimensional data.


X 1 2 2 3 3 4
Y 2 1 3 2 4 3 10 3 4
Apply principal component analysis technique to reduce the dimensions for the above
data. Note: For covariance calculations, the denominator is the total number of data
points in the set.
5. At a ticket counter, T, the time it takes to issue a ticket is often modelled as
k e(-t/5) , t  0
fT (t)=  10 2 1
. Compute (i) the value of k, (ii) E [T], the expected duration of
 0, otherwise
a ticket being issued? (iii) the variance and standard deviation of T (iv) P [0  T 10].

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks


COS / BT CO1 CO2 CO3 CO4 L1 L2 L3 L4 L5 L6
Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****

2|Page
Academic year 2021-2022 (Odd semester 2021)
DEPARTMENT OF MATHEMATICS
Date 27 January 2021 Time 2:00 PM to 4:00 PM
Test II (OPEN BOOK_Online) Maximum Marks 50
Course Title MATHEMATICS FOR MACHINE LEARNING Course Code 18G5B17
Semester V Programs Global Elective
Instructions: i) Answer all questions. Use of Scientific calculator is allowed.

Sl. No. TEST M BT CO


1 The following data indicates the various attributes of cars.

ID Colour Type Origin Stolen?


1 Red Sports Domestic Yes
2 Red Sports Domestic No
3 Red Sports Domestic Yes
4 Yellow Sports Domestic No
5 Yellow Sports Imported Yes
6 Yellow SUV Imported No
7 Yellow SUV Imported Yes 10 3 3
8 Yellow SUV Domestic No
9 Red SUV Imported No
10 Red Sports Imported Yes

a) Construct maximum likelihood probability tables for the attributes: (colour, sports
and origin).
b) Set up a Naïve Bayes model to classify the cars into which category would you
classify a RED DOMESTIC SUV?
2. Random variables X and Y have the joint PDF
c x + y , x = -2, 0, 2 ; y = -1, 0, 1
PX, Y (x, y) =  10 2 3
 0, otherwise
a) Determine the value of the constant c and P [Y=X].
b) Compute the covariance matrix and correlation matrix for the above data.
3.
Students’ scores in the advanced statistical methods course examination, x, and on the
cumulative grade point average (CGPA), y are given below:

x 4 2 5 3 2 3 4 3 5 2 10 2 2
y 3.12 3.00 4.5 4.75 3 3.5 3.75 4.12 4.54 3.1
a) Fit a regression model ŷ i = β0 + β1 x, where x = assessed value (in thousands of
rupees) and y = selling price (in thousands of rupees) using the method of least squares.
b) Calculate the fitted values ŷ i and residuals ê i ; i = 1, 2… 8.

1|Page
Academic year 2021-2022 (Odd semester 2021)

4. Consider the following two-dimensional data.


X 1 2 4 5 7 6
Y 2 5 9 8 5 4 10 3 4
Apply principal component analysis technique to reduce the dimensions for the above
data. Note: For covariance calculations, the denominator is the total number of data
points in the set.
5. The Random variable X has a probability density function
0, x  1
 x + 1, -1  x  0

f X (x)=  10 2 1
1 - x, 0  x  1
0, x>1
Compute (i) Sketch the PDF, (ii) the expected value, variance and standard deviation of
X (iv) P [-0.5  x  0.5].

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks


COS / BT CO1 CO2 CO3 CO4 L1 L2 L3 L4 L5 L6
Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****

2|Page
Academic year 2021-2022 (Odd semester 2021)
DEPARTMENT OF MATHEMATICS
Date 27 January 2021 Time 2:00 PM to 4:00 PM
Test II (OPEN BOOK_Online) Maximum Marks 50
Course Title MATHEMATICS FOR MACHINE LEARNING Course Code 18G5B17
Semester V Programs Global Elective
Instructions: i) Answer all questions. Use of Scientific calculator is allowed.

Sl. No. Questions M BT CO


1 Ten training datasets for animals are given below:
Training Animal Size of animal Body color Classification
example (can we pet them)
1 Dog Medium Black Yes
2 Dog Big White No
3 Rat Small White Yes
4 Cow Big White Yes
5 Cow Small Brown No
6 Cow Big Black Yes 10 3 3
7 Rat Big Black No
8 Dog Small Brown Yes
9 Dog Medium Brown Yes
10 Cow Medium White No
a) Construct maximum likelihood probability tables for the attributes (animal, size of
animal and body color).
b) Set up a Naïve Bayes model to classify the animals into which category would you
classify a SMALL BLACK DOG?
2. Random variables X and Y have the joint PDF
k x - y , x = -1, 0, 1 ; y = -2, 0, 2
PX, Y (x, y) =  10 2 3
 0, otherwise
a) Determine the value of the constant k and P [Y=X].
b) Compute the covariance matrix and correlation matrix for the above data.
3.
The following data are chloride concentration y (in milligrams per liter) and roadway
area in the water shed x (in percentage) reported by an article in the Journal of
Environmental Engineering assessment data were gathered from 8 homes of an
apartment in Bengaluru is provided in the following table:
10 2 2
x 0.19 0.15 0.57 0.70 0.67 0.63 0.47 0.70
y 4.4 6.6 9.7 10.6 10.8 10.9 11.8 12.1
a) Fit a regression model ŷ i = β0 + β1 x, where x = assessed value (in thousands of
rupees) and y = selling price (in thousands of rupees) using the method of least squares.
b) Calculate the fitted values ŷ i and residuals ê i ; i = 1, 2… 8.
1|Page
Academic year 2021-2022 (Odd semester 2021)

4. The following are the data on the variables x1 = sales (in billions) and x2 = profits (in
billions) for the four companies:
x1 9 2 6 5 7 6 2
x2 12 8 6 4 5 4 5 10 3 4
Apply principal component analysis technique to reduce the dimensions for the above
data. Note: For covariance calculations, the denominator is the total number of data
points in the set.

5. The Random variable X has a probability density function


  x
k 1 -
f X (x)=    , -4 < x  4
4 10 2 1
0,
 otherwise
Compute (i) the value of k, (ii) the expected value E [X] and E [-2X+3] (iii) P [-0.5  x
 0.5] and P [0 x  3].

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks


COS / BT CO1 CO2 CO3 CO4 L1 L2 L3 L4 L5 L6
Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****

2|Page
Academic year 2021-2022 (Odd semester 2021)
DEPARTMENT OF MATHEMATICS
Date 27 January 2021 Time 2:00 PM to 4:00 PM
Test II (OPEN BOOK_Online) Maximum Marks 50
Course Title MATHEMATICS FOR MACHINE LEARNING Course Code 18G5B17
Semester V Programs Global Elective
Instructions: i) Answer all questions. Use of Scientific calculator is allowed.

Sl. No. Questions M BT CO


1 Suppose that the given data consists of N data-points (xn, yn), where xn and yn are
scalars.
n 1 2 3 4 5 6 7 8
xn -5 -4 -3 -1 2 3 4 5
10 2 2
yn -10 -8 -6 -4 4 6 8 10

a) Fit a regression model ŷ i = w0 + w1 x, using the method of least squares.


b) Calculate the fitted values ŷ i and residuals ê i ; i = 1, 2… 8.
2 The Random variable X has a probability density function
0, x  1
 x + 1, -1  x  0

f X (x)= 
1 - x, 0  x  1 10 2 1
0, x>1
Compute (i) Sketch the PDF, (ii) the expected value, variance and standard deviation of
X (iv) P [-0.5  x  0.5].
3 The following data indicates the various attributes of cars.

ID Colour Type Origin Stolen?


1 Red Sports Domestic Yes
2 Red Sports Domestic No
3 Red Sports Domestic Yes
4 Yellow Sports Domestic No
5 Yellow Sports Imported Yes
6 Yellow SUV Imported No
10 3 3
7 Yellow SUV Imported Yes
8 Yellow SUV Domestic No
9 Red SUV Imported No
10 Red Sports Imported Yes

a) Construct maximum likelihood probability tables for the attributes: (colour, sports
and origin).
b) Set up a Naïve Bayes model to classify the cars into which category would you
classify a RED DOMESTIC SUV?

1|Page
Academic year 2021-2022 (Odd semester 2021)

4.
The joint probability distribution of a pair of random variables (x, y), where x is scaled
temperature and y is difference in temperature as given by the following table:
P X, Y (x, y) x = 0 x = 1 x = 2 x = 3
y=0 0.05 0.07 0.12 0.08 10 2 3
y=1 0.13 0.21 0.08 0.15
y=2 0.07 0.02 0.01 0.01
a) Determine the marginal PMFs of the random variables X and Y.
b) Compute the covariance matrix and correlation matrix for the above data.
5. Consider the following two-dimensional data.
X 1 2 4 5 7 6
Y 2 5 9 8 5 4 10 3 4
Apply principal component analysis technique to reduce the dimensions for the above
data. Note: For covariance calculations, the denominator is the total number of data
points in the set.

BT-Blooms Taxonomy, CO-Course Outcomes, M-Marks


COS / BT CO1 CO2 CO3 CO4 L1 L2 L3 L4 L5 L6
Marks Distribution
Max Marks 10 10 20 10 00 30 20 00 - -
*****

2|Page

You might also like