Cse
CAT – I
Answer the following questions. 10 X 2 = 20 Marks
(a) TRUE TRUE TRUE TRUE TRUE (b) TRUE FALSE TRUE FALSE TRUE
(c) “” “” “” “” “” (d) FALSE FALSE FALSE FALSE FALSE
x <- c('one','two','three','four')
y <- c(1,2,3,4)
z <- c(4.5,8.9,3.3,4.5)
lst <- list(x,y,z)
lst[x]
(a) one two three four (b) ‘one’ ‘two’ ‘three’ ‘four’
(c) 1 2 3 4 (d) NULL NULL NULL NULL
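As a hedged illustration of how `[` behaves in the snippet above: a character subscript looks list elements up by name, and `lst` was built without names, so none of the four strings can match. The sketch reuses the vectors from the question; `res` is an illustrative name that is not in the original.

```r
x <- c('one','two','three','four')
y <- c(1,2,3,4)
z <- c(4.5,8.9,3.3,4.5)
lst <- list(x,y,z)

# Character subscripts are matched against names(lst); lst is unnamed,
# so each of the four lookups finds nothing.
res <- lst[x]

length(res)                  # one slot per subscript
all(sapply(res, is.null))    # TRUE when every slot is NULL
lst[[1]]                     # positional indexing does return the vector x
```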
x <- c('one','two','three','four')
y <- c(1,2,3,4)
z <- c(4.5,8.9,3.3,4.5)
arr <- array(c(x,y,z))
arr
(a) Error (b)
(c) (d) 'one' 'two' 'three' 'four' '1' '2' '3' '4' '4.5' '8.9' '3.3' '4.5'
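A minimal sketch of the coercion involved in the snippet above, assuming only base R: `c()` must return one atomic type, so combining a character vector with numeric vectors yields characters before `array()` is called. The name `combined` is introduced here for illustration.

```r
x <- c('one','two','three','four')
y <- c(1,2,3,4)
z <- c(4.5,8.9,3.3,4.5)

# Mixing character and numeric input coerces every element to character.
combined <- c(x, y, z)

# With no dim argument, array() builds a one-dimensional array
# whose length equals length(combined).
arr <- array(combined)

class(combined)   # type after coercion
dim(arr)          # a single dimension
arr[5]            # first element of y, now stored as a string
```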
Consider the following data and answer questions 11-15. 5 X 2 = 10 Marks
This famous (Fisher's or Anderson's) iris data set gives the measurements in centimeters of the variables
sepal length and width and petal length and width, respectively, for 10 flowers from each of 3 species of
iris. The species are Iris setosa, versicolor, and virginica.
14. What is the normalized value of 2.5 in the column “Sepal.Width” using min-max normalization
with a new minimum of 11 and a new maximum of 13?
(a) (b)
(c) (d)
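Min-max normalization maps a value v to (v - min) / (max - min) * (new_max - new_min) + new_min. The sketch below shows the arithmetic under stated assumptions: the function name `min_max` and the `sepal_width` values are hypothetical and are not the data table the question refers to, so the result is not the answer to the question.

```r
# Rescale a single value v from the range of `column` to [new_min, new_max].
min_max <- function(v, column, new_min = 0, new_max = 1) {
  old_min <- min(column)
  old_max <- max(column)
  (v - old_min) / (old_max - old_min) * (new_max - new_min) + new_min
}

# Hypothetical Sepal.Width values (NOT the exam's data).
sepal_width <- c(2.0, 2.5, 3.0, 3.5, 4.0)

# (2.5 - 2.0) / (4.0 - 2.0) = 0.25; 0.25 * (13 - 11) + 11 = 11.5
min_max(2.5, sepal_width, new_min = 11, new_max = 13)
```

With the real column, only `old_min` and `old_max` change; the old minimum always maps to 11 and the old maximum to 13.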
15. What is the normalized value of 2.5 in the column “Petal.Length” using z-score normalization?
(a) (b)
(c) (d)
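Z-score normalization maps a value v to (v - mean) / standard deviation. The sketch below is one way to compute it, with assumptions flagged: `z_score` and the `petal_length` values are hypothetical (not the exam's data), and the `population` switch exists because R's `sd()` divides by n - 1 while some textbooks divide by n.

```r
# Standardize a single value v against the mean and spread of `column`.
z_score <- function(v, column, population = FALSE) {
  m <- mean(column)
  s <- if (population) sqrt(mean((column - m)^2)) else sd(column)
  (v - m) / s
}

# Hypothetical Petal.Length values (NOT the exam's data).
petal_length <- c(1.5, 2.5, 3.5, 4.5, 5.5)

z_score(2.5, petal_length)                     # sample sd (n - 1)
z_score(2.5, petal_length, population = TRUE)  # population sd (n)
```

Which denominator is expected depends on the course convention, so it is worth stating the choice alongside the answer.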
16. Suppose that the data for analysis include the attribute age. The age values for the data tuples are (in
increasing order): 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45,
46, 52, 70
Smooth the data using the following methods, with a bin depth of 3:
(a) smoothing by bin means (3 Marks)
(b) smoothing by bin medians (3 Marks)
(c) smoothing by bin boundaries (4 Marks)
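The three smoothing methods can be sketched in base R as follows, assuming equal-depth (equal-frequency) partitioning of the sorted values into consecutive bins of 3. The variable names are illustrative, and sending a value that is equidistant from both boundaries to the lower one is an assumption, since the question does not fix a tie rule.

```r
age <- c(13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30,
         33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70)
depth <- 3

# Equal-depth partitioning: sort, then cut into consecutive bins of `depth`.
age <- sort(age)
bin_id <- ceiling(seq_along(age) / depth)
bins <- split(age, bin_id)

# (a) replace every value by the mean of its bin
by_mean <- unlist(lapply(bins, function(b) rep(mean(b), length(b))),
                  use.names = FALSE)

# (b) replace every value by the median of its bin
by_median <- unlist(lapply(bins, function(b) rep(median(b), length(b))),
                    use.names = FALSE)

# (c) replace every value by the closer of the bin minimum and maximum;
#     ties go to the lower boundary (assumption)
by_boundary <- unlist(lapply(bins, function(b) {
  lo <- min(b)
  hi <- max(b)
  ifelse(b - lo <= hi - b, lo, hi)
}), use.names = FALSE)

# One row per bin, one column per position within the bin.
matrix(by_boundary, ncol = depth, byrow = TRUE)
```

For the first bin (13, 15, 16) this gives a mean of 44/3, a median of 15, and boundaries 13, 16, 16, because 15 lies closer to 16 than to 13.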
17. In real-world data, tuples with missing values for some attributes are a common occurrence.
Fill in the missing values in the following data with an appropriate method and justify your choice of method. (10 Marks)
Id no Inst time status age sex ph.Ecog ph.karno pat.karno meal.Cal wt.loss
2 3 455 2 68 1 0 90 90 1225 15
3 3 1 56 1 0 90 90 15
4 5 210 57 1 1 90 60 1150 11
5 1 883 2 60 1 0 90 0
6 12 1022 1 1 1 50 80 513 0
7 7 310 2 68 2 2 70 60 384 10
8 11 2 71 2 2 60 80 1
9 1 218 2 53 1 1 70 80 825 16
10 7 166 2 61 2 70 271 34
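One common approach, sketched below under stated assumptions, is to impute numeric attributes with the column median and categorical codes with the column mode. The data frame `df` uses column names from the table header but its values are hypothetical, because the positions of the blank cells in the table above cannot be recovered from the flattened rows. `mode_value` is a helper introduced here, not a base R function.

```r
# Hypothetical data (values invented for illustration, NOT the exam table).
df <- data.frame(
  age      = c(68, 56, NA, 60, 71),
  sex      = c(1, 1, 2, NA, 1),
  meal.cal = c(1225, NA, 1150, NA, 513)
)

# Most frequent value of a numeric-coded vector, ignoring NA.
# table() sorts its labels, so ties resolve to the smallest value.
mode_value <- function(v) {
  counts <- table(v)
  as.numeric(names(counts)[which.max(counts)])
}

imputed <- df

# Numeric attributes: the median is less sensitive to outliers than the mean.
imputed$age[is.na(imputed$age)] <- median(df$age, na.rm = TRUE)
imputed$meal.cal[is.na(imputed$meal.cal)] <- median(df$meal.cal, na.rm = TRUE)

# Categorical codes such as sex: averaging codes is meaningless, so use the mode.
imputed$sex[is.na(imputed$sex)] <- mode_value(df$sex)

imputed
```

The justification asked for in the question would then rest on the attribute type: median (or mean) for continuous measurements, mode for coded categories.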