You are on page 1of 18

DATA WAREHOUSING

MANAGEMENT
DATA LAKE
DATA WAREHOUSE DATA MART

Data lakes are designed to Data warehouses primarily Data marts are subsets of
store both structured and store structured data that has data warehouses or data
unstructured data, including been cleaned, transformed, lakes that contain data
raw, unprocessed data. and integrated for analysis. specific to a particular
business unit or department.
DATA WAREHOUSE
A data warehouse is a central repository of information that can be
analyzed to make more informed decisions. Data flows into a data
warehouse from transactional systems, relational databases, and
other sources, typically on a regular cadence.
source
system
source source
system system

source
system
DATA WAREHOUSE
source
source system
system
REPOSITORY
A repository, or repo, is computer storage for
maintaining data or software packages.
Data Warehouse Architecture

01
BOTTOM TIER
The bottom tier is responsible for storing
raw data from various source systems. It
acts as the foundation of the data
warehouse.

02
MIDDLE TIER
The middle tier is responsible for preparing
and organizing data for analytical purposes.

03
TOP TIER
The top tier is where end-users interact with
the data warehouse to extract insights and
perform analytical tasks.
Data Business
Warehousing Intelligence (BI)
refers to the procedural and technical
infrastructure that collects, stores, and
analyzes the data produced by a
company's activities. BI is a broad term
that encompasses data mining, process
analysis, performance benchmarking, and
descriptive analytics.
END.
Building a Data Warehouse
Data-Driven Approach

01
STARTING WITH DATA carefully designing how the data will be organized and stored.This
STRUCTURE: involves considering how different pieces of information relate to
each other and how they can be efficiently accessed and analyzed.
Building a Data Warehouse
Data-Driven Approach

02
OPTIMIZATION FOR The primary goal of structuring the data is to optimize it for
ANALYSIS AND analysis and reporting purposes.
REPORTING
Building a Data Warehouse
Data-Driven Approach

03
BUILDING APPLICATIONS you can then develop applications and tools that leverage this
AND TOOLS AROUND well-organized data.
DATA:
Building a Data Warehouse
Data-Driven Approach

04
LIBRARY ANALOGY This allows for greater flexibility in how the data can be stored,
accessed, and analyzed.
DATA ANALYSIS
TECHNIQUES
The techniques of data analysis can
impact the type of data model
selected and its content.
Data Analysis. Several methods of data analysis are in common use
NORMALIZED DATA
MODEL:

Increases redundancy: This approach involves duplicating


Emphasized for providing query and reporting capability. data, which can lead to increased storage requirements and
Organizes data in a standardized, normalized fashion. potentially more maintenance effort.
Well-suited for tasks that involve selecting associated Enhances query performance: By reducing the need for
data elements, summarizing them, and grouping them by complex joins, denormalized tables can speed up query
processing.
categories.
Suitable for scenarios where read performance is critical and
May involve direct table scans for efficient access to
data modification is relatively infrequent.
data. Commonly used in data warehousing and reporting
applications.
DIMENSIONAL A dimensional data model is a type of data model that is
specifically designed for supporting multidimensional analysis
DATA MODEL: in data warehouses and business intelligence systems. It's
optimized for querying and reporting on large volumes of data.

Fact tables are connected to dimension tables through


relationships. These relationships are established using keys
(foreign keys) that link the dimensions to the fact table. For
example, a sales fact table might be linked to a product
dimension table through a product ID.

It is designed to facilitate efficient querying and reporting of


large volumes of data for analytical purposes.
MULTIDIMENSIONAL
ANALYSIS
This type of analysis involves looking at data from
multiple perspectives, or dimensions, such as
product, day, store, and price range

Drilling Down and Rolling Up:

Multidimensional analysis allows users to


navigate through data by drilling down to lower
levels of detail or rolling up to higher levels of
summarization. This flexibility helps in gaining
deeper insights.
QUERY AND QUERY DEFINITION

REPORTING DATA ACCESS AND


RETRIEVAL
Query and reporting analysis is the process of
formulating questions, retrieving relevant data
from a data warehouse, contextualizing it, and CALCULATION
presenting it in a readable format. MANIPULATION

REPORT
PREPARATION

REPORT DELIVERY

Query and Reporting. The process of query and reporting starts with query
definition and ends with
report delivery

You might also like