Data Warehousing
▪ Subject oriented.
-Data are organized by detailed subject, such
as sales, products, or customers, containing
only information relevant for decision support.
-Subject orientation enables users to determine
not only how their business is performing, but
why.
-A data warehouse differs from an operational
database in that most operational databases
have a product orientation and are tuned to
handle transactions that update the database.
Dr. John O. Oredo, PhD-University of Nairobi
Characteristics of Data Warehousing
▪ Non-volatile.
-After data are entered into a data warehouse,
users cannot change or update the data.
-Obsolete data are discarded, and changes are
recorded as new data.
▪ Web based.
-Warehouses are typically designed to provide
an efficient computing environment for Web-
based applications.
▪ Relational/multidimensional.
-A data warehouse uses either a relational
structure or a multidimensional structure.
▪ Client/server.
-A data warehouse uses the client/server
architecture to provide easy access for end
users.
▪ Real time.
-Newer data warehouses provide real-time, or
active, data-access and analysis capabilities.
▪ Include metadata.
-A data warehouse contains metadata (data
about data) about how the data are organized
and how to effectively use them.
▪ Comprehensive database.
-Essentially, this is the enterprise data
warehouse (EDW), which supports all decision
analysis by providing relevant summarized and
detailed information originating from many
different sources.
▪ Metadata.
-Metadata are maintained so that they can be
accessed by IT personnel and users.
-Metadata include software about data and rules
for organizing data summaries that are easy to
index and search, especially with Web tools.
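The non-volatile characteristic listed above (changes are recorded as new data rather than as updates) can be illustrated with a minimal sketch. The names `PriceRecord` and `AppendOnlyStore` and the sample values are hypothetical, chosen only for illustration; a real warehouse would enforce this in its loading process, not in application code.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)  # frozen: a loaded record cannot be modified
class PriceRecord:
    product: str
    price: float
    loaded_on: date


class AppendOnlyStore:
    """Hypothetical store that accepts new rows but never updates old ones."""

    def __init__(self):
        self._rows = []

    def load(self, record):
        # A change in the source system arrives as a new row.
        self._rows.append(record)

    def history(self, product):
        # Every version ever loaded for this product, in load order.
        return [r for r in self._rows if r.product == product]

    def current(self, product):
        # The most recently dated row is treated as the current value.
        return max(self.history(product), key=lambda r: r.loaded_on)


store = AppendOnlyStore()
store.load(PriceRecord("Tea", 2.50, date(2024, 1, 1)))
store.load(PriceRecord("Tea", 2.75, date(2024, 6, 1)))  # price change, new row
```

Because the earlier row is kept, `store.history("Tea")` still shows the old price next to the new one, which is what makes analysis over time possible.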
Data Warehousing Process
a) Star Schema
-A star schema contains a central fact table
surrounded by and connected to several
dimension tables.
-The fact table contains a large number of rows
that correspond to observed facts and external
links (i.e., foreign keys).
-A fact table contains the attributes needed to
perform decision analysis and query reporting,
typically numeric measures such as sales amounts,
and foreign keys are used to link to the dimension
tables, which hold the descriptive attributes.
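A star schema as described above can be sketched with Python's built-in `sqlite3` module. The table names (`fact_sales`, `dim_product`, `dim_store`), their columns, and the sample rows are assumptions made for illustration and do not come from the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: descriptive context for each fact.
CREATE TABLE dim_product (
    product_id INTEGER PRIMARY KEY,
    name       TEXT,
    category   TEXT
);
CREATE TABLE dim_store (
    store_id INTEGER PRIMARY KEY,
    city     TEXT
);
-- Central fact table: one row per observed sale, with foreign keys
-- pointing to the surrounding dimension tables.
CREATE TABLE fact_sales (
    sale_id    INTEGER PRIMARY KEY,
    product_id INTEGER REFERENCES dim_product(product_id),
    store_id   INTEGER REFERENCES dim_store(store_id),
    amount     REAL
);
INSERT INTO dim_product VALUES (1, 'Tea', 'Beverage'), (2, 'Bread', 'Bakery');
INSERT INTO dim_store   VALUES (1, 'Nairobi'), (2, 'Mombasa');
INSERT INTO fact_sales  VALUES (1, 1, 1, 100.0), (2, 1, 2, 50.0), (3, 2, 1, 30.0);
""")


def sales_by_category(connection):
    # One join from the fact table to a dimension answers the question.
    return connection.execute("""
        SELECT p.category, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        GROUP BY p.category
        ORDER BY p.category
    """).fetchall()


star_result = sales_by_category(conn)
# star_result == [('Bakery', 30.0), ('Beverage', 150.0)]
```

Each dimension is a single table, so any analysis needs at most one join per dimension.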
Data Warehousing Schemas
b) Snowflake Schema
-The snowflake schema is a logical arrangement
of tables in a multidimensional database in such a
way that the entity-relationship diagram
resembles a snowflake in shape.
-Related to the star schema, the snowflake
schema is represented by centralized fact tables
(usually only one) that are connected to multiple
dimensions.
-In the snowflake schema, dimension tables are
normalized into multiple related tables.
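The normalization of dimension tables can be sketched as follows, again with Python's built-in `sqlite3` module. The product dimension is split into two related tables; all table names, columns, and sample rows are hypothetical and used only for illustration.

```python
import sqlite3

snow = sqlite3.connect(":memory:")
snow.executescript("""
-- The category attribute is moved out of the product dimension
-- into its own table, giving the schema its snowflake shape.
CREATE TABLE dim_category (
    category_id   INTEGER PRIMARY KEY,
    category_name TEXT
);
CREATE TABLE dim_product (
    product_id  INTEGER PRIMARY KEY,
    name        TEXT,
    category_id INTEGER REFERENCES dim_category(category_id)
);
CREATE TABLE fact_sales (
    sale_id    INTEGER PRIMARY KEY,
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_category VALUES (1, 'Beverage'), (2, 'Bakery');
INSERT INTO dim_product  VALUES (1, 'Tea', 1), (2, 'Coffee', 1), (3, 'Bread', 2);
INSERT INTO fact_sales   VALUES (1, 1, 100.0), (2, 2, 40.0), (3, 3, 30.0);
""")

# Reaching the category name now takes two joins instead of one.
snowflake_result = snow.execute("""
    SELECT c.category_name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product  AS p ON p.product_id  = f.product_id
    JOIN dim_category AS c ON c.category_id = p.category_id
    GROUP BY c.category_name
    ORDER BY c.category_name
""").fetchall()
# snowflake_result == [('Bakery', 30.0), ('Beverage', 140.0)]
```

The trade-off in this sketch is that each category name is stored only once, at the cost of an extra join in every query that needs it.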