You are on page 1of 4

E C

B6 6403 3A4 4A6 CB4 480 00E 133 3714 4CC 1B


4 A C 0 1 7 1 6
64 03AA4A 6C B48 8000 0E1 337 14C CC1 B64 4A6
0 B 3 1 B
40 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64AA6AAB6
3A A CB 48 00 13 37 4C 1 64 6 B 40
03 4A 6C 4 00 E1 3 14 C B6 A AB 64 3A
A 6 B 80 0 3 71 C 1B 4 6A 6 03 4
A4 4A6 CB 480 00E E133 371 4CC C1B 64AA6A B6 403 A4AA6C
A C 4 0 1 4 6 B 4 A
4A 6C B48 8000 0E1 337 714C CC1 1B64 4A6 6AB 640 03A 4A6 6CB B480
6 B 0 E 3 3 1 C B A A 6 4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 3A4 A6C CB4 4800 00E
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133
T.E.(Information
6C 4 00 E1 37 4 C1 6 A6 B 4 3A A CB 48 00 13 7
B 80 0E 33 1 CC B 4A A 6 03 4 6C 4 00 E 3 14

5.
4.
3.
2.
1.
B4 4800 00E 133 714 4CC 1B6 64A 6A B64 403AA4AA6C B48 800 0E1 1337 714C CC1

Q.1
8 1 7 C 1 6 B 0 B 0 3 1 B
48 000 0E13 337 14C C1 B64 4A6 AB 640 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64A

Option C:
Option B:
Option C:
Option B:
Option C:
Option B:
Option C:
Option B:
Option C:
Option B:

Option D:
Option D:
Option D:
Option D:
Option D:

Option A:
Option A:
Option A:
Option A:
Option A:
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A

45%
40%
30%
20%
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E

data

Clustering
Time: 2 hour 30 minutes

64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13

Regression
& Business Intelligence DATE: 18/5/2022

Actual
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14

Classification
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B

Bin 1: 4, 4, 4, 15
Bin 1: 4, 4, 4, 15
data matrix_____

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A

Bin 1: 4, 4, 15, 15
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6

Bin 1: 4, 15, 15, 15

Association mining
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A

No
Yes
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A

Total
Classes
Cancer
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
has reduced number of rows
QP CODE: 91760

A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
has reduced number of columns

64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13


03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14

Yes
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
has same number of rows and columns

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A

Non Trivial process of choosing dataset


00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB

No

90 210
Bin 2: 21, 25, 25, 25
Bin 2: 21, 21, 21, 25
Bin 2: 21, 21, 25, 25
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
after smoothing the data by Bin Boundaries.

Bin 2: 21, 25, 25, 25


37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03

Predicted data
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A

For the given confusion matrix compute recall


has reduced number of both rows and columns

CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A

Non Trivial process of creating patterns in data


Knowledge discovery in databases is referred to
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
University of Mumbai
Examinations Summer 2022

64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4

300
140 9560 9700
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80

Total

230 9770 10000

64A6AB6403A4A6CB48000E133714CC1B
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
compulsory and carry equal marks (2 marks each)

64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13


03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14

Non Trivial process for identifying useful patterns in data


A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC

Non Trivial process for identifying invalid patterns in data


CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6
Bin 3: 26, 26, 26, 34
Bin 3: 26, 26, 26, 34

Bin 3: 26, 26, 26, 34


Bin 3: 26, 26, 34, 34

64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6

Poor. Finding reviews of a new restaurant is an example of____________


AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 80
Consider the following data: 4, 8, 9, 15, 21, 21, 24, 25, 26, 28, 29, 34.

CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80
If dimensionality reduction is performed on a record data matrix, the transformed
Choose the correct option for following questions. All the Questions are

You are given reviews of food quality of few restaurants as Good, Average or
Partition the given data with Bin size: 4. What is the output obtained

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00


Max. Marks: 80
=====================================================================

00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00

1 | Page
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E
37 4C 1 64 6 B 40 A A6 B 80 0E 1
14 C B6 A AB 64 3A 4A C 48 00 1
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33
Technology Engineering)(SEM-VI)(Choice Base Credit Grading System ) (R-2020-21) (C Scheme) / 89381 - Data Mining

1B 64A 6A B6 403 A4 6C B4 800 0E 33


64 6A B6 40 A4 A6 B 80 0E 133 7
A6 B 40 3A A CB 48 00 13 71
A 64 3A 4A 6C 48 00 E1 37 4
E C
B6 6403 3A4 4A6 CB4 480 00E 133 3714 4CC 1B
4 A C 0 1 7 1 6
64 03AA4A 6C B48 8000 0E1 337 14C CC1 B64 4A6
0 B 3 1 B
40 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64AA6AAB6
3A A CB 48 00 13 37 4C 1 64 6 B 40
03 4A 6C 4 00 E1 3 14 C B6 A AB 64 3A
A 6 B 80 0 3 71 C 1B 4 6A 6 03 4
A4 4A6 CB 480 00E E133 371 4CC C1B 64AA6A B6 403 A4AA6C
A C 4 0 1 4 6 B 4 A
4A 6C B48 8000 0E1 337 714C CC1 1B64 4A6 6AB 640 03A 4A6 6CB B480
6 B 0 E 3 3 1 C B A A 6 4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 3A4 A6C CB4 4800 00E
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133
6C 4 00 E1 37 4 C1 6 A6 B 4 3A A CB 48 00 13 7
B 80 0E 33 1 CC B 4A A 6 03 4 6C 4 00 E 3 14

9.
8.
7.
6.

10.
B4 4800 00E 133 714 4CC 1B6 64A 6A B64 403AA4AA6C B48 800 0E1 1337 714C CC1
8 1 7 C 1 6 B 0 B 0 3 1 B
48 000 0E13 337 14C C1 B64 4A6 AB 640 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64A

Option C:
Option B:
Option C:
Option B:
Option C:
Option B:
Option C:
Option B:
Option C:
Option B:

Option D:
Option D:
Option D:
Option D:
Option D:

Option A:
Option A:
Option A:
Option A:
Option A:

00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A

60%
40%
50%
20%
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13

K1= {2,3 }
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
K1= {2,3,4}

Data Mining
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14

Data dredging
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC

K1= {2,3,4,10}
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
Partitioning approach

K1= {2,3,4,10,11,12}
Hierarchical approach

37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
Density-based approach

Decision support system


CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
Distribution based approach

Artificial Intelligence system


A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
K2= {20,30,25}

14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
BIRCH falls under which clustering approach

{Milk} is antecedent and {eggs} is consequent

CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
K2={11,12,20,30,25}

A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80

64A6AB6403A4A6CB48000E133714CC1B
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13
K2= {10,11,12,20,30,25}

03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
K2={4,10,11,12,20,30,25}

A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
association rule among the given set of items, it is inferred

CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6

managing the public and private enterprises and organizations.


0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
{Milk} is antecedent and the item set {bread, eggs} is consequent

37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
The item set {milk, bread} is antecedent and {eggs} is consequent
The item set {milk, bread} is consequent and {eggs} is antecedent

14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 80
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80
m2=11. Apply k -means clustering technique and find its output after 1st iteration

For the given transactional database compute confidence for the rule Milk ⇒ Beer
In one of the frequent item-set examples, it is observed that if milk and bread are

mathematical models to help decision makers solve complex problems faced in


___________ is an interactive computer-based application that combines data and
Given {2,4,3,10,11,12,20,25,30}, Assume k=2 and initial means are m1=4,

bought then eggs are also purchased by the customers. After generating an

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00


00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00

2 | Page
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E
37 4C 1 64 6 B 40 A A6 B 80 0E 1
14 C B6 A AB 64 3A 4A C 48 00 1
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33
1B 64A 6A B6 403 A4 6C B4 800 0E 33
64 6A B6 40 A4 A6 B 80 0E 133 7
A6 B 40 3A A CB 48 00 13 71
A 64 3A 4A 6C 48 00 E1 37 4
E C
B6 6403 3A4 4A6 CB4 480 00E 133 3714 4CC 1B
4 A C 0 1 7 1 6
64 03AA4A 6C B48 8000 0E1 337 14C CC1 B64 4A6
0 B 3 1 B
40 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64AA6AAB6
3A A CB 48 00 13 37 4C 1 64 6 B 40
03 4A 6C 4 00 E1 3 14 C B6 A AB 64 3A
A 6 B 80 0 3 71 C 1B 4 6A 6 03 4
A4 4A6 CB 480 00E E133 371 4CC C1B 64AA6A B6 403 A4AA6C
A C 4 0 1 4 6 B 4 A
4A 6C B48 8000 0E1 337 714C CC1 1B64 4A6 6AB 640 03A 4A6 6CB B480
6 B 0 E 3 3 1 C B A A 6 4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 3A4 A6C CB4 4800 00E
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133

Q.3
Q.2

6C 4 00 E1 37 4 C1 6 A6 B 4 3A A CB 48 00 13 7
B 80 0E 33 1 CC B 4A A 6 03 4 6C 4 00 E 3 14
B4 4800 00E 133 714 4CC 1B6 64A 6A B64 403AA4AA6C B48 800 0E1 1337 714C CC1
8 1 7 C 1 6 B 0 B 0 3 1 B
48 000 0E13 337 14C C1 B64 4A6 AB 640 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64A

B
B

C
A
C
A

00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A

Y
Y
N
N
N
N
Y
Y
Y
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C

chills
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80 chills
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13

45,46,52,70
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
their diagnosis)

CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B

N
Y
Y
Y
N
Y
N
Y
N

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A


00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64

F
E
B
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03

D
C
A
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C

runny nose
runny nose

64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4

v) Show box plot of the data


A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80

iii) What is mid range of data?


AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
No
No

64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13


03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37

Mild
Mild
Mild
Mild
classify the unknown data sample?

A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14

Solve any Two Questions out of Three


Solve any Two Questions out of Three

Strong
Strong
Strong

A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC


CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B

3
4
3
5
1
X

1.5

for data tuples are (in increasing order):


headache
headache

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A


00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64

iv) Give the five point summary of the data.


37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
Y
Y
N
Y
N
Y
Y
N
Y

14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
fever
fever

64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4

i) What is mean of data? What is median of data?


A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80

64A6AB6403A4A6CB48000E133714CC1B
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E

4
4
5
1
Y
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13

3.5
1.5
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37

ii) What is mode of data? Comment on data's modality.


A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
?

CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
Y
N
Y
N
Y
Y
Y
N

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A


Do I believe that patient with following symptoms has the flu?

What is Business Intelligence (BI)? Explain BI architecture in detail


00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
flu ?
flu ?

0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03 single linkage clustering and draw dendrogram for the given data.
Suppose we have six objects with name A, B, C, D, E and F. Apply
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
Define data warehouse. Describe different OLAP operations in detail

CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6

13,15,16,16,19,20,20,21,22,22,25,25,25,25,30,33,33,35,35,35,35,36,40,
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C
Explain multi-level and multidimensional association rules with example

A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6
Given all the previous patients I’ve seen(below are their symptoms and
Apply Naive Bayes classifier algorithm to the dataset given below, and

Suppose the data for analysis includes the attribute age. The age values
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 80

10
10
10
10
10
10

CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80
Marks

48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00


00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00

3 | Page
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E
37 4C 1 64 6 B 40 A A6 B 80 0E 1
14 C B6 A AB 64 3A 4A C 48 00 1
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33
1B 64A 6A B6 403 A4 6C B4 800 0E 33
64 6A B6 40 A4 A6 B 80 0E 133 7
A6 B 40 3A A CB 48 00 13 71
A 64 3A 4A 6C 48 00 E1 37 4
E C
B6 6403 3A4 4A6 CB4 480 00E 133 3714 4CC 1B
4 A C 0 1 7 1 6
64 03AA4A 6C B48 8000 0E1 337 14C CC1 B64 4A6
0 B 3 1 B
40 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64AA6AAB6
3A A CB 48 00 13 37 4C 1 64 6 B 40
03 4A 6C 4 00 E1 3 14 C B6 A AB 64 3A
A 6 B 80 0 3 71 C 1B 4 6A 6 03 4
A4 4A6 CB 480 00E E133 371 4CC C1B 64AA6A B6 403 A4AA6C
A C 4 0 1 4 6 B 4 A
4A 6C B48 8000 0E1 337 714C CC1 1B64 4A6 6AB 640 03A 4A6 6CB B480
6 B 0 E 3 3 1 C B A A 6 4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 3A4 A6C CB4 4800 00E
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133
Q.4

6C 4 00 E1 37 4 C1 6 A6 B 4 3A A CB 48 00 13 7
B 80 0E 33 1 CC B 4A A 6 03 4 6C 4 00 E 3 14
B4 4800 00E 133 714 4CC 1B6 64A 6A B64 403AA4AA6C B48 800 0E1 1337 714C CC1
8 1 7 C 1 6 B 0 B 0 3 1 B
48 000 0E13 337 14C C1 B64 4A6 AB 640 3A4 4A6 6CB 480 000E E13 371 4CC C1B 64A
B

C
A

00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC
CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6

05
04
03
02
01

0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
TID

13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
Minimum confidence of 70%.

1,4

AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E


64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13
Items

03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
2,3,5,7
1,3,4,6

A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
Solve any Two Questions out of Three

2,5,9,10
1,2,3,5,8

A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC


CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
************
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6C
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C B4
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6 B 80
Briefly explain Bagging and Boosting of classifiers

64A6AB6403A4A6CB48000E133714CC1B
AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB 480 00E
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB 480 00E 13
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48 00 1 37
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14
What is an outlier? Describe methods used for outlier analysis.

A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC


CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80 0E 133 71 CC 1B
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00E 13 71 4CC 1B 64A
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00 13 37 4C 1 64 6
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E1 37 14 C1 B6 A AB
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E 33 14 CC B 4A 6A 64
37 4C 1 64 6 B 40 A A6 B 80 0E 13 71 C 1B 64 6A B6 03
14 C B6 A AB 64 3A 4A C 48 00 1 37 4C C1 64 A6 B 40 A
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A
1B 64A 6A B6 403 A4 6C B4 800 0E 33 714 CC 1B6 4A 6A 64 03A 4A 6
64 6A B6 40 A4 A6 B 80 0E 133 714 CC 1B 4A 6A B6 03 4A 6C
A6 B 40 3A A CB 48 00 13 71 C 1B 64 6A B6 40 A4 6
and strong association rules. Assume Minimum Support of 30% and
For the table given, apply Apriori algorithm and show frequent item set

AB 64 3A 4A 6CB 48 000 E1 37 4C C1 64 A6 B 40 3A A6 CB
64 03A 4A 6C 48 000 E1 337 14C C1 B64 A6 AB 640 3A 4A6 CB
03 4A 6C B4 00 E1 33 14 C B6 A AB 64 3A 4A C 48
A4 6 B 80 0E 33 71 CC 1B 4A 6A 6 03 4A 6C B4
A6 CB 480 00E 13 71 4CC 1B 64A 6A B6 403 A4 6C B4 80
10
10
10

CB 48 00 13 371 4C 1B 64 6A B6 40 A4 A6 B 80
48 000 E13 37 4C C1 64 A6 B 40 3A4 A6 CB 480 00
00 E1 37 14 C1 B6 A6 AB 64 3A A CB 48 00

4 | Page
0E 33 14 CC B 4A A 64 03 4A 6C 4 00 E
13 71 CC 1B 64A 6A B6 03 A4 6C B4 800 0E
37 4C 1 64 6 B 40 A A6 B 80 0E 1
14 C B6 A AB 64 3A 4A C 48 00 1
CC 1B 4A 6A 6 03 4A 6C B4 00 E1 33
1B 64A 6A B6 403 A4 6C B4 800 0E 33
64 6A B6 40 A4 A6 B 80 0E 133 7
A6 B 40 3A A CB 48 00 13 71
A 64 3A 4A 6C 48 00 E1 37 4

You might also like