What is ETL – ETL Vs ELT – Types of Data warehouses - Data warehouse Design and Modeling -
Delivery Process - Online Analytical Processing (OLAP) - Characteristics of OLAP - Online
Transaction Processing (OLTP) Vs OLAP - OLAP operations- Types of OLAP- ROLAP Vs
MOLAP Vs HOLAP
PART B – 16 Marks
1 OLAP Vs OLTP
Category | OLAP (Online Analytical Processing) | OLTP (Online Transaction Processing)
Definition | It is well known as an online database query management system. | It is well known as an online database modifying system.
Data source | Consists of historical data from various databases. | Consists of only current operational data.
Method used | It makes use of a data warehouse. | It makes use of a standard database management system (DBMS).
Application | It is subject-oriented. Used for data mining, analytics, decision-making, etc. | It is application-oriented. Used for business tasks.
Normalized | In an OLAP database, tables are not normalized. | In an OLTP database, tables are normalized (3NF).
Usage of data | The data is used in planning, problem-solving, and decision-making. | The data is used to perform day-to-day fundamental operations.
Task | It provides a multi-dimensional view of different business tasks. | It reveals a snapshot of present business tasks.
Purpose | It serves the purpose to extract information for analysis and decision-making. | It serves the purpose to insert, update, and delete information from the database.
Volume of data | A large amount of data is stored, typically in TB or PB. | The size of the data is relatively small, typically in MB or GB, as the historical data is archived.
Queries | Relatively slow as the amount of data involved is large. Queries may take hours. | Very fast as the queries operate on 5% of the data.
Update | The OLAP database is not often updated. As a result, data integrity is unaffected. | The data integrity constraint must be maintained in an OLTP database.
Backup and Recovery | It only needs backup from time to time as compared to OLTP. | The backup and recovery process is maintained rigorously.
Processing time | The processing of complex queries can take a lengthy time. | It is comparatively fast in processing because of simple and straightforward queries.
Types of users | This data is generally used by executives such as the CEO, MD, and GM. | This data is managed by clerks and managers.
Operations | Mostly read and rarely write operations. | Both read and write operations.
Updates | With lengthy, scheduled batch operations, data is refreshed on a regular basis. | The user initiates data updates, which are brief and quick.
Nature of audience | The process is focused on the market. | The process is focused on the customer.
Database Design | Design with a focus on the subject. | Design that is focused on the application.
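The contrast in the comparison above can be sketched in a few lines of Python. This is only an illustration, not a recommended warehouse design: the in-memory SQLite database, the `sales` table, and all rows in it are hypothetical. The first part issues short write transactions that each touch one or a few rows (OLTP style); the second runs a read-only query that scans and aggregates the whole table (OLAP style).

```python
import sqlite3

# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales ("
    "id INTEGER PRIMARY KEY, region TEXT, year INTEGER, amount REAL)"
)

# OLTP-style work: brief write transactions on individual rows.
with conn:  # commits on success, rolls back on error
    conn.executemany(
        "INSERT INTO sales (region, year, amount) VALUES (?, ?, ?)",
        [
            ("North", 2023, 120.0),
            ("North", 2024, 80.0),
            ("South", 2023, 50.0),
            ("South", 2023, 25.0),
        ],
    )
with conn:
    conn.execute("UPDATE sales SET amount = ? WHERE id = ?", (130.0, 1))

# OLAP-style work: a read-only query that aggregates over many rows.
totals = conn.execute(
    "SELECT region, year, SUM(amount) FROM sales "
    "GROUP BY region, year ORDER BY region, year"
).fetchall()
print(totals)
```

In practice the analytical query would run against a separate data warehouse loaded in batches, not against the live transactional table as in this sketch.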
2 ETL VS ELT
3 Which OLAP type is a combination of multidimensional and relational OLAP? Explain it in
detail and list the differences between all three types.
4 Apply the OLAP operations to a two-dimensional data cube and explain them in detail.
Refer to the Technical Publications book.
5 Analyze the approach that creates small data warehouses and then merges them all
to create a large data warehouse.
List the differences between the Inmon and Kimball approaches in detail.
6 Analyze the OLAP type that is used in BI to provide comprehensive insights through dashboards
and reports, enabling data-driven decision-making, and list the differences between
multidimensional and relational OLAP.
7 Apply Codd's 12 rules for OLAP to the financial sector and explain them in
detail.
OLAP was introduced by Dr. E. F. Codd in 1993, and he presented 12 rules regarding OLAP:
1. Multidimensional Conceptual View:
A multidimensional data model is provided that is intuitively analytical and easy to
use. The multidimensional data model reflects how users perceive business
problems.
2. Transparency:
It makes the technology, underlying data repository, computing architecture, and
the diverse nature of source data totally transparent to users.
3. Accessibility:
Access should be provided only to the data that is actually needed to perform the
specific analysis, presenting a single, coherent, and consistent view to the users.
4. Consistent Reporting Performance:
Users should not experience any significant degradation in reporting performance
as the number of dimensions or the size of the database increases. Users should
also perceive consistent run time, response time, and machine utilization
every time a given query is run.
5. Client/Server Architecture:
The system should conform to the principles of client/server architecture for optimum
performance, flexibility, adaptability, and interoperability.
6. Generic Dimensionality:
It should be ensured that every data dimension is equivalent in both structure and
operational capabilities. There should be one logical structure for all dimensions.
7. Dynamic Sparse Matrix Handling:
The physical schema should adapt to the specific analytical model being
created and loaded, so that sparse matrix handling is optimized.
8. Multi-user Support:
Support should be provided for end users to work concurrently with either the
same analytical model or to create different models from the same data.
9. Unrestricted Cross-dimensional Operations:
The system should be able to recognize dimensional hierarchies and automatically perform
roll-up and drill-down operations within a dimension or across dimensions.
10. Intuitive Data Manipulation:
Consolidation path reorientation, drill-down, roll-up, and other manipulations
should be accomplished intuitively and directly via point-and-click
actions.
11. Flexible Reporting:
Business users should be able to arrange columns, rows, and cells in a
manner that facilitates easy manipulation, analysis, and synthesis of
information.
12. Unlimited Dimensions and Aggregation Levels:
The system should support at least fifteen to twenty data dimensions within a common
analytical model.
From the above axes, explain the OLAP operations in a detailed manner.
Refer to the Technical Publications book.
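As a study aid, the OLAP operations named in these questions (roll-up, drill-down, slice, dice, and pivot) can be sketched on a tiny cube in plain Python. Everything here is an assumption made for illustration: the fact records, the dimension names in `DIMS`, and the helper functions `aggregate`, `slice_`, and `dice` are hypothetical and do not come from any OLAP product.

```python
from collections import defaultdict

# Hypothetical fact records: (year, quarter, region, product, units sold).
FACTS = [
    (2023, "Q1", "North", "Bolt", 10),
    (2023, "Q1", "South", "Bolt", 4),
    (2023, "Q2", "North", "Nut", 7),
    (2024, "Q1", "North", "Bolt", 5),
    (2024, "Q1", "South", "Nut", 6),
]
DIMS = ("year", "quarter", "region", "product")


def aggregate(facts, keep):
    """Sum units over every dimension that is not listed in keep."""
    idx = [DIMS.index(d) for d in keep]
    totals = defaultdict(int)
    for row in facts:
        totals[tuple(row[i] for i in idx)] += row[-1]
    return dict(totals)


def slice_(facts, dim, value):
    """Slice: fix one dimension to a single value."""
    i = DIMS.index(dim)
    return [row for row in facts if row[i] == value]


def dice(facts, **allowed):
    """Dice: restrict several dimensions to sets of allowed values."""
    return [
        row for row in facts
        if all(row[DIMS.index(d)] in vals for d, vals in allowed.items())
    ]


by_quarter = aggregate(FACTS, ("year", "quarter"))  # finer level (drill-down)
by_year = aggregate(FACTS, ("year",))               # coarser level (roll-up)
north = slice_(FACTS, "region", "North")
subcube = dice(FACTS, year={2023}, region={"North", "South"})
pivot = aggregate(FACTS, ("product", "region"))     # same data, other axes
print(by_year, by_quarter, pivot, sep="\n")
```

Moving from `by_quarter` to `by_year` is a roll-up along the time dimension, and the reverse direction is a drill-down. Slice and dice only filter the facts, while pivot changes which dimensions form the axes of the view.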
9 The store has to process everyday transactions, such as purchases, inventory, and so on to
collect data for analysis. This data is put through the systems to analyze previous purchases,
fast-moving products, price ranges, and other custom attributes to understand which products
might work best in a discount sale.
Explain the methodology used here and list the differences between OLAP and OLTP.
10 Apply the OLAP operations to a three-dimensional data cube and explain them in detail.
Refer to the Technical Publications book.
11
12 If two customers simultaneously place an order for the last item in stock, the customer whose
transaction processes first will receive the product. The system will update the inventory and
cash transactions regularly to ensure orders are placed only for products in stock.
Analyze the data processing method used here and explain the procedure and list out the
differences between analytical and transactional processing methods.
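The last-item-in-stock scenario can be sketched as a transactional update. This is a minimal illustration under stated assumptions: the `inventory` and `orders` tables, the customer names, and the `place_order` helper are hypothetical, and the two orders run one after the other in a single process instead of truly concurrently. The point it shows is that the stock check and the decrement happen in one atomic statement inside a transaction, so only the first order to be processed can succeed.

```python
import sqlite3

# Hypothetical schema: one item with a single unit left in stock.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE inventory ("
    "item TEXT PRIMARY KEY, stock INTEGER NOT NULL CHECK (stock >= 0))"
)
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, item TEXT)"
)
conn.execute("INSERT INTO inventory VALUES ('widget', 1)")
conn.commit()


def place_order(conn, customer, item):
    """Try to buy one unit; return True on success, False if out of stock."""
    with conn:  # one transaction: commits on success, rolls back on error
        # Check and decrement in a single statement so that two orders
        # can never both take the last unit.
        cur = conn.execute(
            "UPDATE inventory SET stock = stock - 1 "
            "WHERE item = ? AND stock > 0",
            (item,),
        )
        if cur.rowcount == 0:
            return False  # nothing was updated: the item is sold out
        conn.execute(
            "INSERT INTO orders (customer, item) VALUES (?, ?)",
            (customer, item),
        )
        return True


first = place_order(conn, "alice", "widget")
second = place_order(conn, "bob", "widget")
stock = conn.execute(
    "SELECT stock FROM inventory WHERE item = 'widget'"
).fetchone()[0]
print(first, second, stock)
```

The order that is processed first receives the product, and the second is rejected because the conditional update matches no row.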
13 A team or project manager makes decisions, which then filter down through a hierarchical
structure. Managers gather knowledge, analyze it, and draw actionable conclusions. They then
develop processes that are communicated to and implemented by the rest of the team. You may
hear this style of management referred to as “command and control” or “autocratic leadership.”
Explain the Inmon approach for the above scenario in detail
14 Which OLAP type is a combination of multidimensional and relational OLAP? Explain it in
detail and list the differences between all three types.
15
From the above data, explain the OLAP operations in a detailed manner
Refer to the Technical Publications book.
16 You are a Senior Analyst in the IT department of a company manufacturing automobile parts.
The marketing VP is complaining about the poor response by IT in providing strategic
information. Draft a proposal to him explaining the reasons for the problems and why a data
warehouse would be the only viable solution.