DATA WAREHOUSE CONSTRUCTS AND COMPONENTS
Group 2
A data warehouse is a centralized repository that contains historical and
cumulative data from single or multiple sources that employees of an
organization can use for analysis, drawing insights, and data-driven
decision making.
Single-tier architecture
The single-tier (single-layer) architecture is known as the simplest DBMS
architecture. Architectures are classified into tiers according to how the
database is implemented; here, a single database server houses all of the
data in a single database.
The objective of a single layer is to minimize the amount of data stored by
removing data redundancy. Small businesses benefit from this type of
architecture because it is simple to manage and inexpensive to construct. It
is not suitable for businesses with complex data requirements and numerous
data streams, and it is not frequently used in practice.
Two-tier architecture
Three-tier architecture
Three-tier architecture is the most common type of modern data warehouse
(DWH) design, as it produces a well-organized data flow from raw information
to valuable insights. It consists of the Top, Middle and Bottom Tier.
Bottom Tier:
External Sources
An external source is any source from which data is collected, irrespective
of the type of data. Data can be structured, semi-structured, or
unstructured.
Staging Area
Since the data extracted from the external sources does not follow a
particular format, it needs to be validated before it is loaded into the
data warehouse. For this purpose, it is recommended to use an ETL tool.
Top-down approach: External Sources, Staging area, Data warehouse, Data Marts
These Extract, Transform, and Load (ETL) tools may generate cron jobs,
background jobs, COBOL programs, shell scripts, etc. that regularly update
data in data warehouses. They also help maintain the metadata.
ETL tools have to deal with the challenges of database and data
heterogeneity.
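As a minimal sketch of the extract, transform, and load steps described above, the following Python example reads records from two differently shaped sources, validates and normalizes them, and loads them into a warehouse table. The source contents, the `sales` table, and all function names are assumptions made for illustration, and an in-memory SQLite database stands in for the warehouse; it does not represent the output of any particular ETL tool.

```python
import csv
import io
import json
import sqlite3

# Hypothetical source data: a CSV export and a JSON feed with different field names.
CSV_SOURCE = "order_id,amount,region\n1,19.99,north\n2,not-a-number,south\n"
JSON_SOURCE = '[{"id": 3, "total": "5.50", "area": "North"}]'


def extract():
    """Read raw rows from both sources without changing them."""
    csv_rows = list(csv.DictReader(io.StringIO(CSV_SOURCE)))
    json_rows = json.loads(JSON_SOURCE)
    return csv_rows, json_rows


def transform(csv_rows, json_rows):
    """Map both layouts to (order_id, amount, region) and drop invalid rows."""
    candidates = [(r["order_id"], r["amount"], r["region"]) for r in csv_rows]
    candidates += [(r["id"], r["total"], r["area"]) for r in json_rows]
    clean = []
    for order_id, amount, region in candidates:
        try:
            clean.append((int(order_id), float(amount), str(region).strip().lower()))
        except (TypeError, ValueError):
            continue  # validation step: reject rows that cannot be parsed
    return clean


def load(conn, rows):
    """Insert validated rows into the (assumed) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(order_id INTEGER PRIMARY KEY, amount REAL, region TEXT)"
    )
    conn.executemany("INSERT OR REPLACE INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(conn):
    """Run one full extract-transform-load pass."""
    load(conn, transform(*extract()))


conn = sqlite3.connect(":memory:")
run_etl(conn)
loaded = conn.execute(
    "SELECT order_id, amount, region FROM sales ORDER BY order_id"
).fetchall()
print(loaded)
```

A scheduler such as cron could call `run_etl` at regular intervals, which is the kind of recurring update job the text mentions. The malformed CSV row is rejected during the transform step, so only two rows reach the table.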
Metadata :
In a typical data warehouse architecture, metadata describes the data warehouse database
and offers a framework for data. It helps in building, maintaining and managing the data
warehouse.
Technical Metadata: comprises information that can be used by developers and
managers when executing warehouse development and administration tasks.
Business Metadata: includes information that offers an easily understandable
standpoint of the data stored in the warehouse.
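To make the distinction between the two kinds of metadata concrete, here is a small Python sketch that records both views of one warehouse column. The class names, fields, and sample values are all hypothetical and chosen only for illustration; real metadata repositories hold far more detail.

```python
from dataclasses import dataclass


@dataclass
class TechnicalMetadata:
    """Details developers and administrators need (values are hypothetical)."""
    table: str
    column: str
    data_type: str
    source_system: str


@dataclass
class BusinessMetadata:
    """Plain-language description of the same column for business users."""
    table: str
    column: str
    business_name: str
    description: str


technical = TechnicalMetadata("sales", "amount", "REAL", "orders_csv_export")
business = BusinessMetadata(
    "sales", "amount", "Order value", "Total value of one customer order"
)


def describe(tech, biz):
    """Combine both views of one column into a single catalog entry."""
    return (
        f"{biz.business_name} ({tech.table}.{tech.column}, {tech.data_type}): "
        f"{biz.description}"
    )


entry = describe(technical, business)
print(entry)
```

The technical record answers questions such as where a column comes from and how it is typed, while the business record explains what the column means to someone reading a report.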
Data Warehouse Access Tools :
A data warehouse uses a database or group of databases as a foundation. Data warehouse
corporations generally cannot work with databases without the use of tools unless they have
database administrators available. However, that is not the case with all business units. This is
why they use the assistance of several no-code data warehousing tools, such as:
The following are the four database types that you can use
The reporting layer in the data warehouse gives end users access to the
BI interface or BI database architecture. Its purpose is to act as a
dashboard for visualizing data, creating reports, and extracting any
required information.
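The kind of query a reporting layer might run on behalf of an end user can be sketched as follows. The `sales` table, its rows, and the `sales_by_region` function are assumptions for illustration, with an in-memory SQLite database standing in for the warehouse; an actual BI tool would typically generate such queries itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (order_id INTEGER, amount REAL, region TEXT)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [(1, 20.0, "north"), (2, 5.0, "south"), (3, 10.0, "north")],
)


def sales_by_region(connection):
    """Return (region, order_count, total_amount) rows for a simple report."""
    query = (
        "SELECT region, COUNT(*), SUM(amount) "
        "FROM sales GROUP BY region ORDER BY region"
    )
    return connection.execute(query).fetchall()


report = sales_by_region(conn)
for region, orders, total in report:
    # Fixed-width columns give a plain-text table a dashboard could render.
    print(f"{region:<8}{orders:>3}{total:>10.2f}")
```

The end user only sees the summarized rows; the underlying tables and SQL stay hidden behind the reporting layer.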
We will learn about the Data Warehouse
Components and Architecture of Data Warehouse
with Diagram as shown below:
The Data Warehouse is based on an RDBMS server
which is a central information repository that is
surrounded by some key Data Warehousing
components to make the entire environment
functional, manageable and accessible.
Summary
- A data warehouse is an information system that contains
historical and cumulative data from single or multiple
sources. These sources can be traditional Data
Warehouse,
- The main components of a data warehouse are:
1) Database
2) ETL Tools
3) Metadata
4) Query Tools
5) Data Marts
- There are four main categories of query tools:
1. Query and reporting tools
2. Application development tools
3. Data mining tools
4. OLAP tools
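Of the four categories listed above, OLAP tools are the ones built around aggregating facts along dimensions. As a rough sketch of that one idea, the Python example below rolls a small set of fact rows up to coarser levels. The fact rows, the dimension names, and the `roll_up` function are hypothetical; real OLAP tools work on far larger data and offer many more operations.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, region, product, amount).
FACTS = [
    (2023, "north", "widget", 10.0),
    (2023, "south", "widget", 4.0),
    (2024, "north", "gadget", 6.0),
    (2024, "north", "widget", 5.0),
]
DIMENSIONS = ("year", "region", "product")


def roll_up(facts, keep):
    """Sum amounts over every dimension that is not listed in `keep`."""
    indexes = [DIMENSIONS.index(name) for name in keep]
    totals = defaultdict(float)
    for row in facts:
        key = tuple(row[i] for i in indexes)
        totals[key] += row[-1]  # the last field of each row is the amount
    return dict(totals)


by_year = roll_up(FACTS, ["year"])
by_year_region = roll_up(FACTS, ["year", "region"])
print(by_year)
print(by_year_region)
```

Keeping fewer dimensions gives a coarser summary, and adding a dimension back moves to a more detailed view of the same facts.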
- The data sourcing, transformation, and migration tools
are used for performing all the conversions and
summarizations.