You are on page 1of 3

Q1. What is OLTP? Compare OLTP and Data Warehouses.

[3 +7] 10
Answer: Transaction processing is a type of computer processing that takes place
in the presence of a computer user. It provides for an immediate response to a
user request (or transaction). When a large number of transactions are taken and
stored to be dealt with at a later time (without the presence of a user), the p
rocess is known as batch processing. Different examples of transaction processin
g include automated teller machines, credit card authorizations, online bill pay
ments, self-checkout stations at grocery stores, the trading of stocks over the
Internet, and various other forms of electronic commerce.
Q2. What are the major components of DWH Architecture [5] 05
Answer: The major components of DWH Architecture are:
·Source Data Component
1. Production Data
2. Internal Data
3. Archived Data
Q3. What is OLAP? Explain. [2 +3] 05
Answer: On-Line Analytical Processing (OLAP) is a category of software technolog
y that enables analysts, managers and executives to gain insight into data throu
gh fast, consistent, interactive access in a wide variety of possible views of i
nformation that has been transformed from raw data to reflect the real dimension
ality of the enterprise as understood by the user.
OLAP allows business users to slice and dice data at will. Normally data in an o
rganization is distributed in multiple data sources and are incompatible with ea
ch other. A retail
Q4. What is Data mining? Explain. [2 +3] 05
Answer:Data mining is concerned with finding hidden relationships present in bus
iness data to allow businesses to make predictions for future use. It is the pro
cess of data-driven extraction of not so obvious, but useful information from la
rge databases. Data mining has emerged as a key business intelligence technology
Data mining streamlines the transformation of masses of information into meaning
ful knowledge, which is essential or bottom-line of Business intelligence.
Typical techniques for data mining involve decision trees, neural networks, near
est neighbor
Q5. What are the Objectives of using data mining in business? [10] 10
Answer: Data Mining and Statistics
Statistics is a branch of Mathematics. Statistics techniques are incorporated in
to Data mining methods. Data mining methods or techniques find the relations bet
ween variables or data in the given data base and express these relations using
statistical nomenclature. Without statistics, there would be no data mining, as
statistics is the foundation of most technologies on which data mining is built.
Classical statistics embrace concepts such as Regression Analysis, Standard Dis
tribution, Standard Deviation, Standard Variance, Discriminant Analysis, Cluster
Analysis, and Confidence Intervals, all of which are used to study data and dat
a relationships. These are
Q6. What is Clustering? Explain in detail. [3 +7] 10
Answer: Clustering is the method by which like records are grouped together. Usu
ally this is done to give the end user a high level view of what is going on in
which most marke
the database. Clustering is sometimes used to mean segmentation
ting people will tell you is useful for coming up with a birds eye view of the b
usiness. Clustering is a form of learning by observation rather than learning by

click. List its advantages and drawbacks. · Web information integration and schema matching: Although the Web contains a hug e amount of data. each web site (or even page) represents similar information di fferently. 6 Describe about Multilevel Databases and Web Query Systems MCA5043 Data Warehousing and Data Mining 1 Explain the Top-Down and Bottom-up Data Warehouse development Methodologies. machine learning and auto matic extraction are used to solve this problem. List its advantages and drawbacks. Two main types of techniques. Identifying or matching semantically similar 1 What is a Data Warehouse? Mention its advantages 2 What is Metadata? Describe about Data Marts. 4 Explain the relation between data warehousing and data mining. Extracting such data allows one to provide services. Define the following terminologies (a) User (b) page view [2 +3] 05 Answer: To provide some industry level consistency. (Ex: alerts) Q8. 3 Differentiate between E-R modeling and Dimensional Modeling. since they are using the same baselines and language. 2 Explain the Functionalities and advantages of Data Warehouses 3 Describe about Hyper Cube and Multicube 4 List and explain the Strategies for data reduction. A user is defined as a single individual who is accessing files from one or more Web servers Web Agent: A software agent is a computer program which runs on anagent interact ion machine. What are the Web content mining problems/challenges? Explain.streams and page v iews amongst others. such as products and search results is a difficult task. comparisons are easier t o make. 5 Describe K-means method for clustering.Q7. With an agreed set of definitions. . [10] 10 Answer: Web content mining problems/challenges: · Data/Information extraction: Extraction of structured data from Web pages. the World Wide Web Committee (W3C) has produced definition of terms such as users. 5 Describe K-means method for clustering. 5 Differentiate between K-means and Hierarchical clustering 6 Differentiate between Web content mining and Web usage .DATA WAREHOUSING & DATA MINING 1 Differentiate between Data Mining and Data Warehousing 2 Describe the key features of a Data Warehouse 3 Differentiate between Data Integration and Transformation 4 Differentiate between database management systems (DBMS) and data mining.6 Describe about Multilevel Databases and Web Query Systems MC0088.