Professional Documents
Culture Documents
INTRODUCTION There exists an information gap amongst organizations. Organizations have plenty of data, but little information. Most data is gathered in a fragmented manner from many sources (both internal and external). Additionally, most systems are designed for transactional purposes, which doesnt lend itself to information gathering. Think about how limits of the operation of an ATM machine.
Information
Data
Transaction Processing
OLAP Servers
Analysis Query
Reporting
Data Warehouse
Serve
Data Mining
organization Also can come from outside sources Examples: Weather Reports Demographic information by Zip Code
1. 2. 3.
Extraction Phase
Source systems export data via files or
Combines related data Removes redundancies Common Codes (Commercial Customer) Spelling Mistakes (Lozenges) Consistency (PA,Pa,Penna,Pennsylvania) Formatting (Addresses)
Loading Phase
Places the cleaned data into the DBMS in
users
Data Warehouse
Data Marts
OLAP
Data Warehouse
Data Mining
Data Marts
Physical storage capacity of the DBMS Loading, indexing, and processing speed Availability Handle your data needs Operational integrity, reliability, and manageability
OLAP
Data Warehouse
Data Marts
Metadata
Metadata is Data about Data. Users and Developers often need a way to find information on the data they use. Information can include: Source System(s) of the Data, contact information Related tables or subject areas Programs or Processes which use the data Population rules (Update or Insert and how often) Status of the Data Warehouses processing and condition
OLAP - Online Analytical Processing. It works by aggregating detail data and looks at it by dimensions
Gives the ability to Drill Down in to the detail data Decision Support Analysis Tool Multidimensional DB focusing on retrieval of precalculated data Ends the big reports with large amounts of detailed data These tools are often graphical and can run on a thin client such
as a web browser
Answers the questions you didnt know to ask Analyzes great amounts of data (usually contained in a Data
Most famous example is the Huggies - Heineken case Used in retail sector to analyze buying habits Used in financial areas to detect fraud Used in the stock market to find trends Used in scientific research Used in national security
Creates a single point for all data System is optimized and designed specifically for analysis Access data without impacting the operational systems Users can access the data directly without the direct help from IT
dept
staffing)
Lack of compatibility between components Data from many sources are hard to combine, data integrity issues Bad designs and practices can lead to costly failures