P. 1
Data Mining Important Questions

Data Mining Important Questions

|Views: 697|Likes:
Published by Prakash Chandra

More info:

Published by: Prakash Chandra on Jan 21, 2012
Copyright:Attribution Non-commercial


Read on Scribd mobile: iPhone, iPad and Android.
download as DOCX, PDF, TXT or read online from Scribd
See more
See less





Data Mining Important Questions

Q1. What are the characteristics of data in data ware house? Q2.Explain data warehouse cycle? Q3.What are the uses of data ware house? Q4.What is data architecture of data warehouse operations? Q5. What is a data warehouse? How does it differ from a database? Q6.What are the steps involved in the acquisition of data for a data ware house? Q7.What are the difficulties in implementing a data warehouse? Q8.What is a multidimensional data model? How is it used in data warehouse? Q9.Define the terms : (a) OLAP (b)ROLAP (c)MOLAP (d)DSS (e)Data marts. Q10. Describe the characteristics of data warehouse. How is the concept of relational view related to Data Warehouse? Q11.What is data mining? In your answer address the following: (a) Is it another type? (b) Is it a simple transformation of technology developed from database, statistics and machine learning? (c) Explain how the evolution database technologies lead to data mining? (d) Describe the steps involved in the data mining when viewed as a process of knowledge discovery. Present an example where data mining is crucial to success of business. What data mining functions does this business need? Can they be performed alternatively by data query processing or simple statistics analysis? Q12.How is a data warehouse differing from a database? How are they similar to each other? Describe different challenges regarding data mining methodologies and user interactions. Q13.In both data mining and data warehousing, it is important to have some hierarchical information associated with each dimension. If such a hierarchy is not given, discuss how to generate such hierarchy automatically for the first case of dimension containing only numeric data and also for the second case of a dimension containing only categorical data. Q14. What do you mean by data mining? Differentiate between data mining techniques and data mining strategy. Q15. Define the term Data Cleaning with example.

The following are a list of prices of commonly sold items at a company.5.15.20. (b)Data quality can be assessed in the terms of accuracy.If your dataset contains missing value. analytical processing and data mining? Discuss the motivation behind OLAP Mining. How would you differentiate between Data warehouse and Views? Q29. Q31.15. Explain clustering and regression with example.20. Define the term data generalization and analytical characterization with examples. Q28. Identify and describe the phases in the KDD process. Q21. Q20. Q24.What are the differences between three main types of data ware house usage: information processing.What are the typical functionalities of a data warehouse. Q33. Q25. What is Z-Score normalization? Q23.Make a histogram for price using singleton buckets. two other dimensions of the data quality. Distinguish between dimensionality reduction and numerosity reduction. Explain Histogram. discuss the basic analysis and the corresponding decisions you will take in the preprocessing phase of the data mining process. (a)Describe mining association rules in large databases.5.Describe the benefits and drawbacks of a source-driven architecture for gathering of data at a datawarehouse as compared to a destination-driven architecture.10.1.20.Q16. Q27.8.5. Define KDD. Q30. Q32.Write short notes on the following: (a) Data mining metrices (b) Social implications of data mining Q17.20.8.Propose an algorithm. Explain data mining process with neat diagram. Q22.Write short notes on dimensionality reduction.. the automatic generation of a concept hierarchy for numerical database on the equi-depth partitioning. Propose .Differentiate between the following: (a) Data warehouse and operational databases (b) Intrinsic and actual value Q19. Q18.15. Develop a software tool for the detection of outliers if the data for preprocessing are given in the form of a flat file with n-dimensional samples.Describe the structure of data warehouse with the help of a diagram Q26. completeness and consistency. The number have been stored 1. in pseudo code or in your favourite language you know.15.10.

Explain the concept of data cube and where it is used for visualization of large data sets.Discuss the most commonly used techniques in data mining.State 12 ±guidelines/rules for evaluating OLAP products developed by E. (b) The Aproiri Algorithm: Finding frequent item sets using candidate generation. status.What do you mean by association rules. Discuss the advantage and disadvantage of data mining.List out the reasons why we perform attribute relevance analysis? Q38. Q45. Q41.Suppose that university course database for UPTU contains the following attributes: name. Write short notes on: (a) Bayesian Classification (b) Back Propagation Algorithm Q48. Discuss the key features of Data warehouse with example. Q55. major GPA and address. Q54. Q37.F. What do you understand by neural network? Explain multilayer Feed Forward Neural network.Describe the following: (a) Mining single dimensional Boolean association rule from transactional databases. Q52. Q42. address. What do you understand by outliers? Q40. Q56. Q50.propose a concept hierarchy for the attributes status.Codd.What is the role of Artificial Intelligence in Data Mining. Q47. Give examples of main task that are solved by a data mining system. Write short note on Divisive hierarchical clustering. State the goals and tasks of data mining. Q46. What do you understand by the terms data characterization in the content to concept description? Q36. Describe various issues regarding Classification and Prediction.What are the main purposes of statistics used in data mining? Q39. With the help of an example explain data discriminations in brief.Q34. Q35. major of each student and their cumulative grade point average(GPA). Q43. . Q51. Q53. What do you mean by clustering? Explain Data Types in Clustering Q49. Explain Decision tree? Give the algorithm for Decision Tree Induction. Describe the capabilities of OLAP. Differentiate between Feed-forward and feed-backward system. Q44. for what purpose it is being used? Explain with example.

What do you mean by aggregation? Explain in brief. (i) Concept Hierarchy (ii) 3-tier architecture Q58.Write short note on: (i) Testing data warehouse (ii) Backup and Recovery (iii) Data mining interfaces (iv) Neural Networks (v) OLAP Queries «««««««««««««««««««««««««««««««««««««««««« ««««««««««««««««««««««««««««««« !! ALL THE BEST !! . What is multidimensional data model? How we convert tables and spreadsheets to data Convert 2-D tables into 3-D data cubes. Describe the following with example. Explain Star.What are the main features of OLAP servers. Q65.What is clustering and how is it different from classification? Q63. how the OLAP handles aggregation? Write the differences between MOLAP and HOLAP. Q64. Define data warehousing with suitable example. Q61.Write short note on: (a) Decision Tree (b) Genetic algorithm Q62.Q57. Q60. cubes? Q59. why we need a separate data warehouse? Differentiate between OLAP and OLTP. Explain OLAP functions and tools in brief . Snow Flake and Fact Constellation schemas.

You're Reading a Free Preview

/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->