You are on page 1of 1

Yeshwantrao Chavan College of Engineering

(An Autonomous Institution Affiliated to Rashtrasant Tukadoji Maharaj Nagpur University)

Hingna Road, Wanadongri, Nagpur

Department of Computer Technology M. Tech. I Semester (CSE)

Assignment No: 1

1. Discuss whether or not each of the following activities is a data mining task. (a) Dividing the customers of a company according to their gender. (b) Dividing the customers of a company according to their profitability. (c) Computing the total sales of a company. (d) Sorting a student database based on student identification numbers. (e) Predicting the outcomes of tossing a (fair) pair of dice. (f) Predicting the future stock price of a company using historical records. (g) Monitoring the heart rate of a patient for abnormalities. (h) Monitoring seismic waves for earthquake activities. (i) Extracting the frequencies of a sound wave. 2. Explain what is meant by Data Cleaning and why it may be required before mining a large data set. 3. Describe three commonly used data cleaning operations. //4. Explain three methods for handling missing data in dataset. 5. Given the following data set [4, 8, 9, 15, 21, 21, 24, 25, 26, 28, 29, 34] a. Divide the data set into 3 bins that are smoothed by their means. 6. How is a data warehouse different from a database? How are they similar? 7. What are the values ranges of the following normalization methods? (a) min-max normalization (b) z-score normalization (c) normalization by decimal scaling

You might also like