Data Warehousing and Data Mining JNTU previous years question papers

Time: 3 hours
Max. Marks: 80
Answer any FIVE questions. All questions carry equal marks.

1. a) Explain the storage models of OLAP.
   b) How do data warehousing and data mining work together? [8+8]
2. Suppose that the data for analysis includes the attribute age. The age values for the data tuples, in increasing order, are:
   13, 16, 16, 23, 23, 25, 25, 25, 25, 30, 30, 30, 30, 35, 35, 35, 40, 40, 45, 45, 45, 70
   a) How might you determine the outliers in the data?
   b) What other methods are there for data smoothing? [16]
3. List and describe the primitives for a data mining task. [16]
4. Why perform attribute relevance analysis? Explain its various methods. [16]
5. a) How are association rules mined from large databases?
   b) Describe the different classifications of association rule mining. [8+8]
6. How will you solve a classification problem using decision trees? [16]
7. a) What are the fields in which clustering techniques are used?
   b) What are the major requirements of clustering analysis? [8+8]
8. Write short notes on:
   i) Discriminating between different classes
   ii) Statistical measures in large databases. [8+8]

Data Warehousing and Data Mining JNTU previous years question papers

Time: 3 hours
Max. Marks: 80
Answer any FIVE questions. All questions carry equal marks.

1. Briefly compare and explain the following, using an example to illustrate your point(s):
   a) Snowflake schema, fact constellation
   b) Data cleaning, data transformation. [8+8]
2. a) Discuss various issues in data integration.
   b) Explain concept hierarchy generation for categorical data. [8+8]
3. a) Why is it important to have a data mining query language?
   b) Define schema hierarchies and operation-derived hierarchies. [8+8]
4. Outline a data cube-based incremental algorithm for mining analytical class comparisons. [16]
5. List and explain the five techniques to improve the efficiency of the Apriori algorithm. [16]
6. What is backpropagation? Explain classification by backpropagation. [16]
7. Why is outlier mining important? Discuss the different outlier detection approaches. Briefly discuss any two hierarchical clustering methods with suitable examples. [16]
8. Write short notes on:
   i) Mining spatial databases
   ii) Mining the World Wide Web. [16]

Data Warehousing and Data Mining JNTU previous years question papers
Time: 3 hours
Max. Marks: 80
Answer any FIVE questions. All questions carry equal marks.

1. a) Differentiate between OLAP and OLTP.
   b) Draw and explain the star schema for a data warehouse. [8+8]
2. What is data compression? How would you compress data using principal component analysis (PCA)? [16]
3. List and describe the various types of concept hierarchies. [16]
4. List the statistical measures for the characterization of data dispersion, and discuss how they can be computed efficiently in large databases. [16]
5. What is divide and conquer? How could it help the FP-growth method generate frequent itemsets without candidate generation? [16]
6. Can we get classification rules from decision trees? If so, how? What are the enhancements to the basic decision tree? [16]
7. What are the different types of data used in cluster analysis? Briefly explain each one with an example. [16]
8. Write short notes on:
   i) Data objects
   ii) Sequence data mining
   iii) Mining text databases. [16]

Time: 3 hours
Max. Marks: 80
Answer any FIVE questions. All questions carry equal marks.

1. What are the various issues in data mining? Explain each one in detail. [16]
2. Why preprocess the data? Explain in brief. [16]
3. Write short notes on GUI and DMQL. How would you design a GUI based on DMQL? [16]
4. How is class comparison performed? Can class comparison mining be implemented efficiently using data cube techniques? If yes, explain. [16]
5. Describe an example of a data set for which the Apriori check would actually increase the cost. [16]
6. Explain the various preprocessing steps to improve the accuracy, efficiency, and scalability of the classification or prediction process. [16]
7. a) What are the differences between clustering and nearest neighbor prediction?
   b) Define nominal, ordinal, and ratio-scaled variables. [8+8]
8. a) What are the various issues relating to the diversity of database types?
   b) Explain how data mining is used in health care analysis. [8+8]