Data Mining Spring 2018 HW2

1. From your textbook, page 80

2. A football coach measured the age and body fat of 18 randomly selected players, with the
following results:

2.1. Calculate the mean, median, and standard deviation of age and body fat.
2.2. Draw the box-plots for age and body fat.
2.3. Draw a scatter plot and a q-q plot based on these two variables.
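
As a rough sketch of the quantities behind 2.1 to 2.3, the Python below computes the mean, median, and standard deviation, the five-number summary that a box plot draws, and the paired sorted values that a q-q plot of two equal-sized samples displays. The lists `age` and `fat` are made-up placeholder values, not the assignment's data, and the helper names `summarize` and `qq_pairs` are hypothetical. Whether the population or the sample standard deviation is expected is not stated in the assignment, so both are shown.

```python
import statistics


def summarize(values):
    """Return basic descriptive statistics for a list of numbers."""
    q1, q2, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "pstdev": statistics.pstdev(values),  # population form (divide by N)
        "stdev": statistics.stdev(values),    # sample form (divide by N - 1)
        # Five-number summary: the values a box plot is drawn from.
        "five_number": (min(values), q1, q2, q3, max(values)),
    }


def qq_pairs(xs, ys):
    """Pair the sorted values of two equal-sized samples.

    Plotting these pairs gives a q-q plot; plotting the unsorted
    (x, y) pairs instead gives the scatter plot.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal sample sizes")
    return list(zip(sorted(xs), sorted(ys)))


# Placeholder values only (NOT the table from the assignment).
age = [20, 30, 40, 50]
fat = [10.0, 20.0, 25.0, 30.0]

print(summarize(age))
print(summarize(fat))
print(qq_pairs(age, fat))
```

Substituting the 18 observations from the table for the placeholder lists gives the numbers to report. The quartile convention used by `statistics.quantiles` may differ slightly from the one in the textbook.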

3. Given two samples denoted by the rows (22, 1, 42, 10) and (20, 0, 36, 8):
3.1. Compute the Euclidean distance between the two samples.
3.2. Compute the Manhattan distance between the two samples.
3.3. Compute the Minkowski distance between the two samples, using h = 3.
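
All three distances in question 3 are instances of the Minkowski distance, with h = 1 giving Manhattan and h = 2 giving Euclidean. The sketch below shows one way to check a hand calculation. The function name `minkowski` is a hypothetical helper, not something defined in the assignment.

```python
def minkowski(x, y, h):
    """Minkowski distance of order h between two equal-length samples."""
    if len(x) != len(y):
        raise ValueError("samples must have the same number of attributes")
    return sum(abs(a - b) ** h for a, b in zip(x, y)) ** (1 / h)


x = (22, 1, 42, 10)
y = (20, 0, 36, 8)

euclidean = minkowski(x, y, 2)   # sqrt(2^2 + 1^2 + 6^2 + 2^2) = sqrt(45)
manhattan = minkowski(x, y, 1)   # 2 + 1 + 6 + 2 = 11
minkowski3 = minkowski(x, y, 3)  # (8 + 1 + 216 + 8)^(1/3) = 233^(1/3)

print(euclidean, manhattan, minkowski3)
```

The attribute-wise absolute differences are 2, 1, 6, and 2, so the results are approximately 6.708, exactly 11, and approximately 6.153.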
4. Briefly outline how to compute the dissimilarity between objects described by the following:
(a) Nominal attributes
(b) Asymmetric binary attributes
(c) Numeric attributes
(d) Term-frequency vectors
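
One commonly taught measure for each of the four cases can be sketched as follows. This assumes the usual textbook conventions: the mismatch ratio for nominal attributes, Jaccard-style dissimilarity that ignores 0-0 matches for asymmetric binary attributes, a Minkowski distance for numeric attributes (normally after normalizing the attributes), and cosine similarity for term-frequency vectors. All function names are hypothetical, and other measures may be equally acceptable answers.

```python
import math


def nominal_dissimilarity(a, b):
    """(p - m) / p: p attributes in total, m of them matching."""
    p = len(a)
    m = sum(1 for u, v in zip(a, b) if u == v)
    return (p - m) / p


def asymmetric_binary_dissimilarity(a, b):
    """(r + s) / (q + r + s); negative (0-0) matches are ignored."""
    q = sum(1 for u, v in zip(a, b) if u == 1 and v == 1)
    r = sum(1 for u, v in zip(a, b) if u == 1 and v == 0)
    s = sum(1 for u, v in zip(a, b) if u == 0 and v == 1)
    if q + r + s == 0:
        return 0.0  # assumption: two all-zero objects count as identical
    return (r + s) / (q + r + s)


def numeric_dissimilarity(a, b, h=2):
    """Minkowski distance of order h (h=2 is Euclidean, h=1 is Manhattan)."""
    return sum(abs(u - v) ** h for u, v in zip(a, b)) ** (1 / h)


def cosine_similarity(a, b):
    """Dot product divided by the product of the Euclidean norms."""
    dot = sum(u * v for u, v in zip(a, b))
    norm_a = math.sqrt(sum(u * u for u in a))
    norm_b = math.sqrt(sum(v * v for v in b))
    return dot / (norm_a * norm_b)


print(nominal_dissimilarity(("red", "small", "A"), ("red", "large", "B")))
print(asymmetric_binary_dissimilarity([1, 0, 1, 0, 0, 0], [1, 1, 0, 0, 0, 0]))
print(numeric_dissimilarity([22, 1, 42, 10], [20, 0, 36, 8]))
print(1 - cosine_similarity([5, 0, 3, 0], [3, 0, 2, 1]))
```

Cosine similarity is a similarity rather than a distance. Taking 1 minus its value, as in the last line, is one common convention for turning it into a dissimilarity.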
