Code No: RR410509

Set No. 1

IV B.Tech I Semester Supplementary Examinations, February 2007 DATA MINING AND DATA WAREHOUSING ( Common to Computer Science & Engineering and Information Technology) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. (a) What is partitioning data? Discuss with an example of a partitioned retail sales fact table (b) Discuss about the summary information relating to the data warehouse. [10+6] 2. (a) Discuss when a data mart is appropriate. (b) Explain designing data marts. (c) Discuss costs of data marting. [6+6+4]

3. (a) Describe the server management features of a data warehouse system. (b) “Management tools are required to manage a large, dynamic and complex system such as data warehouse syste“ offer your explanation with justification. [8+8] 4. (a) Describe the role of security restrictions once the data warehouse has gone live (b) What are the audit requirements to impose security restrictions at the beginning of data Warehouse. [8+8] 5. Estimate the Disk space required for a data warehouse. [16]

6. (a) Describe the class histogram, count matrix and AVC sets. Are they similar in some respect? [6+2] (b) Compare ID3 and C4.5 DECISION TREE construction algorithms. 7. (a) Discuss the principles underlying text clustering. (b) Discuss about i. Transverse & Intrinsic Links, ii. Reference Nods & Index nodes. 8. (a) What is spatial trend? Explain about the spatial trend detection algorithm. [3+5] (b) What is spatial clustering? Write about spatial characterization. ⋆⋆⋆⋆⋆ [3+5] [8+8] [8]

1 of 1

Code No: RR410509

Set No. 2

IV B.Tech I Semester Supplementary Examinations, February 2007 DATA MINING AND DATA WAREHOUSING ( Common to Computer Science & Engineering and Information Technology) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. Explain the role of extract and load process with an exphasis on the following topics. (a) Controlling the process (b) Snapshot and initiating the extract. (c) Loading the data. 2. (a) Explain design of summary tables. (b) Explain load manager architecture.

[16] [8+8]

3. (a) Discuss the issues involved in the design of server environments in a data warehouse system. (b) Describe the design issues involved in the selection of user-front end hardware of a data Warehouse system. [10+6] 4. (a) Describe the role and importance of backup strategy of a data warehouse. (b) Explain the role of hardware to implement backup strategy of a data warehouse. [8+8] 5. (a) What is the significance of performance assessment before tuning the warehouse. (b) Explain the AD hoc query tuning mechanism in a data warehouse. [10+6] 6. (a) Explain about the Three basic levels of Testing. (b) Write in detail about the stages in Developing the Test Plan. [8+8]

7. (a) Explain about the Scatter/Gather interface. In what way it is useful in text clustering. (b) Discuss about [8+8] i. Transverse & Intrinsic Links, ii. Reference Nods & Index nodes. 8. (a) What is Episode Discovery? In what way, it is similar to sequence mining. [3+5] (b) What is the event-prediction problem? Propose one algorithm to solve this problem. [3+5] ⋆⋆⋆⋆⋆ 1 of 1

Code No: RR410509

Set No. 3

IV B.Tech I Semester Supplementary Examinations, February 2007 DATA MINING AND DATA WAREHOUSING ( Common to Computer Science & Engineering and Information Technology) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. (a) Discuss Business Care analysis, Education and Prototyping. (b) Explain Business requirements analysis and technical blueprint. (c) Describe the requirements Evolution and History load. [6+6+4]

2. (a) Explain difference between designing a Data Warehouse and an OLTP system. (b) Explain fact table identification process. [8+8]

3. (a) Explain the massively parallel processing architecture and its capabilities. (b) Describe the advantages and disadvantages of massively parallel processing architecture. [8+8] 4. Describe the of day to day operations of a data warehouse system. [16]

5. (a) Discuss with a neat sketch dataflow through data warehouse with reference to tuning the data load. (b) What are fixed queries? 6. What is splitting criteria? With an example explain about the (a) Class Histogram, and (b) Count Matrix. [12+4] [2] [7] [7]

7. (a) Explain about the Scatter/Gather interface. In what way it is useful in text clustering. (b) Discuss about i. Transverse & Intrinsic Links, ii. Reference Nods & Index nodes. 8. (a) What is Temporal DATA MINING? Explain about the types of Temporal data. [3+5] (b) Write in detail about the Temporal DATA MINING tasks. ⋆⋆⋆⋆⋆ [8] [8+8]

1 of 1

Code No: RR410509

Set No. 4

IV B.Tech I Semester Supplementary Examinations, February 2007 DATA MINING AND DATA WAREHOUSING ( Common to Computer Science & Engineering and Information Technology) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. Explain the role of extract and load process with an exphasis on the following topics. (a) Controlling the process (b) Snapshot and initiating the extract. (c) Loading the data. [16]

2. (a) Explain difference between designing a Data Warehouse and an OLTP system. (b) Explain fact table identification process. [8+8]

3. Describe the operational design issues involved in the data warehouse system. Explain with the help of an example situation. [16] 4. (a) Describe the role access hierarchy of a data warehouse. (b) The tighter the security the more person oriented. Do you accept the statement? Offer your remarks with justification. [8+8] 5. How much CPU bandwidth is required and explain why? [16]

6. (a) What is a Decision Tree? What are the advantages and disadvantages of DECISION TREE classifications? [3+5] (b) For the given data set create a Decision Tree? And explain about the knowledge obtained from it. [4+4] OUTLOOK sunny sunny sunny sunny overcast overcast overcast Rain Rain Rain TEMP(F) 79 56 79 60 88 63 88 78 66 68 HUMIDITY(%) 90 70 75 90 88 75 95 60 70 60 WINDY True Flase True True False True False False False True CLASS play play no play no play no play play play play no play play

7. (a) What are the different types of web mining? How is web usage mining different from web structure mining and web content mining? [3+5] 1 of 2

Code No: RR410509

Set No. 4
[8]

(b) What is the underlying principles of “The Hidden Web”? How is text mining related to web mining? What are the techniques of text mining? [3+2+3] 8. (a) Discuss the major algorithms of the sequence mining problem.

(b) What is the event-prediction problem? Propose one algorithm to solve this problem. [3+5] ⋆⋆⋆⋆⋆

2 of 2