You are on page 1of 5

Prateek Agarwal

G-33
A0101909422
MIS
Assignment on Data warehouse
Data Warehouse

A single, complete and consistent store of data obtained from a variety of different sources made

available to end users in a way they can understand and use in a business context.

• a data warehouse is data management and data analysis.

• data webhouse is a distributed data warehouse that is implement over the web with no

central data repository.

• goal: is to integrate enterprise wide corporate data into a single reository from which

users can easily run queries.

Data Warehousing – It is a process

A process of transforming data into information and making it available to users in a timely

enough manner to make a difference.

 Technique for assembling and managing data from various sources for the purpose of

answering business questions. Thus making decisions that were not previous possible

 A decision support database maintained separately from the organization’s operational

database

• data warehousing is subject-oriented, integrated, time-variant, and non-volatile collection

of data in support of management’s decision-making process.

• Subject-orientedWH is organized around the major subjects of the enterprise..rather

than the major application areas.. This is reflected in the need to store decision-support

data rather than application-oriented data


• Integratedbecause the source data come together from different enterprise-wide

applications systems. The source data is often inconsistent using..The integrated data

source must be made consistent to present a unified view of the data to the users

• Time-variantthe source data in the WH is only accurate and valid at some point in

time or over some time interval. The time-variance of the data warehouse is also shown

in the extended time that the data is held, the implicit or explicit association of time with

all data, and the fact that the data represents a series of snapshots

• Non-volatiledata is not update in real time but is refresh from OS on a regular basis.

New data is always added as a supplement to DB, rather than replacement. The DB

continually absorbs this new data, incrementally integrating it with previous data

The main problem addressed by a data warehouse is that end-users have a difficult time

producing ad-hoc or other specialized queries and reports. This is due to several factors:

 Most of the data is stored in ADABAS, which is difficult for end-users to access.

 The data stores were designed for transaction processing not ad-hoc reporting.

 Obtaining the data or a report usually requires waiting for a programmer to either develop the

report or provide a customized download program.

 All of the data may not be consistent as of the same point in time.

 There may not be enough copies of the data kept for historical reporting in the operational

systems.

 End-users do not have the knowledge of what is kept in the existing data stores.
Advantages

The data warehouse addresses these factors and provides many advantages to the end-users of

the University including:

 Improved end-user access to a wide variety of University data

 Increased data consistency

 Additional documentation of the data

 Potentially lower computing costs and increased productivity

 Providing a place to combine related data from separate sources

 Creation of a computing infrastructure that can support changes in computer systems and

business structures

 Empowering end-users to perform any level of ad-hoc queries or reports without impacting the

performance of the operational systems

Architecture

One possible simple conceptualization of data warehouse architecture consists of the following

interconnected layers:

Operational database layer

The source data for the data warehouse — an organization's Enterprise Resource

Planning systems fall into this layer.

Data access layer


The interface between the operational and informational access layer — Tools to extract,

transform, load data into the warehouse fall into this layer.

Metadata layer

The data directory - This is usually more detailed than an operational system data

directory. There are dictionaries for the entire warehouse and sometimes dictionaries for

the data that can be accessed by a particular reporting and analysis tool.

Informational access layer

The data accessed for reporting and analyzing and the tools for reporting and analyzing

data — Business intelligence tools fall into this layer.

You might also like