Code No: RR410509

Set No. 1

IV B.Tech I Semester Supplementary Examinations, February 2008
DATA MINING AND DATA WAREHOUSING
(Common to Computer Science & Engineering and Information Technology)
Time: 3 hours    Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
⋆⋆⋆⋆⋆

1. (a) Explain metadata and data marting concepts briefly with reference to data warehousing.
   (b) Draw the three-tier decision support information diagram depicting the summary and detailed information. [8+8]

2. (a) Discuss data transformation and load.
   (b) Explain Query Generation. [8+8]

3. Describe the operational design issues involved in the data warehouse system. Explain with the help of an example situation. [16]

4. (a) “A data warehouse is a changing environment and you should perform regular backup testing.” Justify.
   (b) Write detailed notes on “disaster recovery”. [8+8]

5. Estimate the disk space required for a data warehouse. [16]

6. (a) What is a Decision Tree? What are the advantages and disadvantages of Decision Tree classification? [3+5]
   (b) For the given data set, create a Decision Tree and explain the knowledge obtained from it. [4+4]

   OUTLOOK    TEMP(F)  HUMIDITY(%)  WINDY  CLASS
   sunny      79       90           True   play
   sunny      56       70           False  play
   sunny      79       75           True   no play
   sunny      60       90           True   no play
   overcast   88       88           False  no play
   overcast   63       75           True   play
   overcast   88       95           False  play
   Rain       78       60           False  play
   Rain       66       70           False  no play
   Rain       68       60           True   play

7. (a) What are the different types of web mining? How is web usage mining different from web structure mining and web content mining? [3+5]
   (b) What is concept hierarchy? How is it related to web mining? [3+5]


8. (a) What is the “Constrained Sequence Mining Problem”? In which situations is constrained sequence mining used? [8]
   (b) Discuss the SPIRIT algorithm. In what way is it different from WUM? [5+3]

⋆⋆⋆⋆⋆


Code No: RR410509

Set No. 2

IV B.Tech I Semester Supplementary Examinations, February 2008
DATA MINING AND DATA WAREHOUSING
(Common to Computer Science & Engineering and Information Technology)
Time: 3 hours    Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
⋆⋆⋆⋆⋆

1. (a) What is data partitioning? Discuss with an example of a partitioned retail sales fact table.
   (b) Discuss the summary information relating to the data warehouse. [10+6]

2. (a) Explain horizontal partitioning.
   (b) Explain vertical partitioning. [10+6]

3. Describe the operational design issues involved in the data warehouse system. Explain with the help of an example situation. [16]

4. (a) Describe the role of software in implementing the backup strategy of a data warehouse system.
   (b) What are the distinct features of software used to implement a backup strategy? [8+8]

5. (a) Is daily processing different from overnight processing for the load estimation process?
   (b) What are the system administration requirements of database siting? [10+6]

6. (a) What is a Decision Tree? What are the advantages and disadvantages of Decision Tree classification? [3+5]
   (b) For the given data set, create a Decision Tree and explain the knowledge obtained from it. [4+4]

   OUTLOOK    TEMP(F)  HUMIDITY(%)  WINDY  CLASS
   sunny      79       90           True   play
   sunny      56       70           False  play
   sunny      79       75           True   no play
   sunny      60       90           True   no play
   overcast   88       88           False  no play
   overcast   63       75           True   play
   overcast   88       95           False  play
   Rain       78       60           False  play
   Rain       66       70           False  no play
   Rain       68       60           True   play

7. (a) Discuss the principles underlying text clustering.
   (b) Discuss:
       i. Transverse & Intrinsic Links,
       ii. Reference Nodes & Index Nodes. [8+8]

8. (a) What is time series analysis? What is n-series? Write in detail about the similarity function. [3+2+3]
   (b) What is spatial mining? Explain the spatial mining tasks. [3+5]

⋆⋆⋆⋆⋆


Code No: RR410509

Set No. 3

IV B.Tech I Semester Supplementary Examinations, February 2008
DATA MINING AND DATA WAREHOUSING
(Common to Computer Science & Engineering and Information Technology)
Time: 3 hours    Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
⋆⋆⋆⋆⋆

1. (a) Explain ad hoc query and automation in the data warehouse delivery process.
   (b) Explain the idea “Can we do without an enterprise data warehouse?” [8+8]

2. (a) Discuss when a data mart is appropriate.
   (b) Explain designing data marts.
   (c) Discuss costs of data marting. [6+6+4]

3. (a) Explain the massively parallel processing architecture and its capabilities.
   (b) Describe the advantages and disadvantages of massively parallel processing architecture. [8+8]

4. (a) Describe the role of security restrictions once the data warehouse has gone live.
   (b) What are the audit requirements to impose security restrictions at the beginning of a data warehouse? [8+8]

5. How much CPU bandwidth is required? Explain why. [16]

6. (a) Explain the three basic levels of testing. [8]
   (b) Explain the GUILLOTINE CUT phenomenon. What is the advantage of this method compared with others? [4+4]

7. (a) What are the different types of web mining? How is web usage mining different from web structure mining and web content mining? [4+4]
   (b) What is Page Rank? How is it computed? [3+5]

8. (a) What is the “Constrained Sequence Mining Problem”? In which situations is constrained sequence mining used? [8]
   (b) Discuss the SPIRIT algorithm. In what way is it different from WUM? [5+3]

⋆⋆⋆⋆⋆


Code No: RR410509

Set No. 4

IV B.Tech I Semester Supplementary Examinations, February 2008
DATA MINING AND DATA WAREHOUSING
(Common to Computer Science & Engineering and Information Technology)
Time: 3 hours    Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
⋆⋆⋆⋆⋆

1. (a) What is a Data Warehouse? Discuss in detail.
   (b) Describe with the help of a figure the typical process flow within a Data Warehouse. [8+8]

2. (a) Discuss data transformation and load.
   (b) Explain Query Generation. [8+8]

3. (a) Describe the issues involved in the selection of hardware architecture for a data warehouse system.
   (b) Discuss the issues involved in the client side of a data warehouse system. [8+8]

4. (a) Explain the need for security and its effect on the performance of a data warehouse.
   (b) Describe the impact of security on the design of the data warehouse. [8+8]

5. Explain various query tuning methods in a data warehouse. [16]

6. (a) What is a Decision Tree? What are the advantages and disadvantages of Decision Tree classification? [3+5]
   (b) For the given data set, create a Decision Tree and explain the knowledge obtained from it. [4+4]

   OUTLOOK    TEMP(F)  HUMIDITY(%)  WINDY  CLASS
   sunny      79       90           True   play
   sunny      56       70           False  play
   sunny      79       75           True   no play
   sunny      60       90           True   no play
   overcast   88       88           False  no play
   overcast   63       75           True   play
   overcast   88       95           False  play
   Rain       78       60           False  play
   Rain       66       70           False  no play
   Rain       68       60           True   play

7. (a) What are the different types of web mining? How is web usage mining different from web structure mining and web content mining? [4+4]
   (b) What is Page Rank? How is it computed? [3+5]

8. (a) How do you distinguish spatial mining from temporal mining? [8]
   (b) Is it possible to use BIRCH for spatial clustering? If so, devise a method for spatial clustering using BIRCH. [2+6]

⋆⋆⋆⋆⋆
