BTDSE613N Data Mining and Visualization

Assignment-1
Last Date of submission: 01 March 2024

1. Discuss two major technologies commonly used in data mining and their respective
roles in the data mining process.
2. Explain the concept of data objects and attribute types in data mining, providing
examples for clarification.
3. Describe the basic statistical descriptions used in data mining and discuss their
significance in analysing datasets.
4. Explore three distinct applications of data mining in various domains such as
healthcare, finance, and e-commerce. For each application, describe the data mining
techniques employed and the benefits they provide.
5. Consider the following dataset, in which each object is described by six binary attributes:
Gender = {M, F}
Caste = {H, M}
Education = {L, I}
Food = {V, N}
Hobby = {T, C}
Job = {Y, N}

Calculate the Jaccard coefficient between Ram and Hari, assuming that all binary attributes
are asymmetric and that, for each attribute, the first value of the pair is more
frequent than the second.
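
As a rough illustration of the calculation asked for in question 5, the sketch below assumes the usual convention for asymmetric binary attributes: the more frequent value is coded 0 and the less frequent value is coded 1, and the Jaccard coefficient is q / (q + r + s), where q counts attributes that are 1 for both objects and r and s count attributes that are 1 for only one of them. The two records `person_a` and `person_b` are hypothetical and are not the rows for Ram and Hari from the assignment table.

```python
# Value pairs from question 5; the first value in each pair is the more frequent one.
ATTRIBUTES = {
    "Gender": ("M", "F"),
    "Caste": ("H", "M"),
    "Education": ("L", "I"),
    "Food": ("V", "N"),
    "Hobby": ("T", "C"),
    "Job": ("Y", "N"),
}


def encode(record):
    """Code each value as 0 (more frequent) or 1 (less frequent)."""
    return [ATTRIBUTES[name].index(record[name]) for name in ATTRIBUTES]


def jaccard(a, b):
    """Jaccard coefficient for asymmetric binary attributes: q / (q + r + s)."""
    x, y = encode(a), encode(b)
    q = sum(1 for i, j in zip(x, y) if i == 1 and j == 1)  # 1 in both
    r = sum(1 for i, j in zip(x, y) if i == 1 and j == 0)  # 1 only in a
    s = sum(1 for i, j in zip(x, y) if i == 0 and j == 1)  # 1 only in b
    if q + r + s == 0:
        # Undefined when both objects are all zeros; returning 0.0 is an assumption.
        return 0.0
    return q / (q + r + s)


# Hypothetical records, NOT the rows from the assignment table.
person_a = {"Gender": "M", "Caste": "M", "Education": "I",
            "Food": "V", "Hobby": "C", "Job": "N"}
person_b = {"Gender": "F", "Caste": "M", "Education": "L",
            "Food": "V", "Hobby": "C", "Job": "Y"}

print(jaccard(person_a, person_b))
```

Negative matches (attributes that are 0 for both objects) are deliberately left out of the denominator, which is what distinguishes the asymmetric case from simple matching.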

6. Consider the following set of records, where each record is defined by two ordinal
attributes, Size = {S, M, L} and Quality = {Ex, A, B, C}, such that S < M < L and
Ex > A > B > C.

Normalized values are shown in brackets.


Find the dissimilarity matrix when each object is defined by only one ordinal
attribute, say Size (or Quality).
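
One common way to approach question 6, sketched here under assumptions, is to replace each ordinal value by its rank r in 1..M, normalize it to z = (r - 1) / (M - 1), and take the absolute difference of the normalized values as the dissimilarity. The list `sizes` below is hypothetical and does not reproduce the records from the assignment table.

```python
def normalize(value, ordered_levels):
    """Map an ordinal value to [0, 1] using z = (rank - 1) / (M - 1)."""
    rank = ordered_levels.index(value) + 1
    return (rank - 1) / (len(ordered_levels) - 1)


def dissimilarity_matrix(values, ordered_levels):
    """Pairwise |z_i - z_j| for objects described by a single ordinal attribute."""
    z = [normalize(v, ordered_levels) for v in values]
    return [[abs(zi - zj) for zj in z] for zi in z]


# Levels listed from lowest to highest, as in S < M < L.
SIZE_LEVELS = ["S", "M", "L"]

# Hypothetical objects, NOT the records from the assignment table.
sizes = ["S", "L", "M"]

matrix = dissimilarity_matrix(sizes, SIZE_LEVELS)
for row in matrix:
    print(row)
```

For Quality, the same functions would apply with the levels listed from lowest to highest, that is `["C", "B", "A", "Ex"]`.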
7. Consider the following set of objects. Obtain the similarity matrix.
