This action might not be possible to undo. Are you sure you want to continue?
1) Data Warehousing Overview
• The purpose of Data Warehousing • The history of Data Warehousing
2) The Techniques, Technology and Tools 3) Terms and Definitions
Data Warehousing Basics
Section 1 Data Warehousing Overview Data Warehousing Basics .
(EIS. Data Mining.) Data Warehousing Basics .) Make better decisions!! Turn data into information Create competitive advantage Methods to support the decision making process. etc. Analysis. DSS.The Purpose of Data Warehousing Realize the value of data!! Data / information is an asset Data / information can be sold Methods to realize the value. etc. (Reporting.
The Bill Inmon Definition In 1993. Bill Inmon. the "father of data warehousing". gave this definition of a data warehouse: Data Warehousing Basics .
diverse sources.Huh . Nonvolatile: The data is static. rather than typical systems which generally provide just detailed current information. one version of the truth regardless of when the question is asked. Data Warehousing Basics . Integrated: Data is collected and consistently stored from multiple. which is more intuitive for users to navigate.What does this mean? Subject oriented: Data is arranged by subject area rather than by application. Time variant: Allows for access to and analysis of data over time.
current . Operational Database Data Warehouse subject oriented integrated nonvolatile time-variant Data Warehousing Basics Operational Database application oriented multiple diverse sources updateable real-time.Data Warehouse vs.
processing and storage advancements • Tools . Kimball.client server.The History We’ve always managed historical data . The buzz words and hype The evolution of tools Success stories Data Warehousing Basics . web. and other desktop tools Evolution of the techniques: • • • • Inmon. etc. Technology advancements in the 90’s: • Hardware . . .
Section 2 Data Warehousing Overview Data Warehousing Basics .
DOLAP) Data Movement Data Quality Meta data management Data Warehousing Basics . MOLAP. (ROLAP.The Techniques Multi-Dimensional Modeling Dimension and Fact tables Star Schema / Snowflake Schema Specialized tools and knowledge DBMS OLAP.
Relational OLAP MD OLAP .The Technology & Tools OLAP . DBMS DB2.Desktop OLAP Hardware MVS. . IDMS. Sybase. . Red Brick. Oracle. Informix. Transform. Extract. Load (ETL) Data Warehousing Basics .Online Analytical Processing ROLAP/MOLAP/DOLAP ROLAP . SQL Server . . Unix.Multi Dimensional OLAP DOLAP . . NT .
Section 3 Terms and Definitions Data Warehousing Basics .
Data Warehousing Basics . Note that a business model exists independently of any data or database.A family of advanced indexing algorithms that optimize RDBMS query performance by maximizing the search capability of the index per unit of memory and per CPU instruction.An object-oriented model that captures the kinds of things in a business or a business area and the relationships associated with those things (and sometimes associated business rules. A data warehouse should be designed to match the underlying business models or else no tools will fully unlock the data in the warehouse. Business Model . too).Terms and Definitions Bitmapped Indexing . bitmapped indices eliminate all table scans in query and join processing. Properly implemented.
A collection of Meta Data.Terms and Definitions .continued Corporate Data . old and new transaction systems. middleware. data warehouses and data marts. This includes legacy systems. general business systems. including database management systems. Data Warehousing Basics . and query tools. Data Dictionary .All the databases of the company. Many kinds of products in the data warehouse arena use a data dictionary. client/server databases. modeling tools.
and the relationships between the tables. The data usually is extracted from the data warehouse and further denormalized and indexed to support intense usage by targeted customers. Data Mining .A subset of a data warehouse that focuses on one or more specific subject areas. the meanings of the keys.continued Data Mart .Techniques for finding patterns and trends in large data sets. Data Model .Terms and Definitions .The road map to the data in a database. See also Data Visualization. This includes the source of tables and columns. Data Warehousing Basics .
There are many specialized techniques designed to make particular kinds of visualization easy. Data Warehousing Basics . purchasing. human resources. Data Warehouse . and analyses. etc.Terms and Definitions .continued Data Visualization . The data needs to be cleaned and restructured to support queries. management. summaries.A database built to support information access.Data access targeted to provide the information needed by business decision makers. Typically a data warehouse is fed from one or more transaction databases. Examples include pricing. Decision Support .Techniques for turning data into information by using the high capacity of the human brain to recognize visually recognize patterns and trends. manufacturing.
"data about data. Methodology .The steps followed to guarantee repeatability of success.continued Decision Support System (DSS) . Meta Data . how it is encoded." More usefully. descriptions of what kind of information is stored where. For example. how it is related to other information. A good methodology is built on top of real world experience. A hot topic right now is standardizing meta data across products from different vendors. warehouse(s). where it comes from. and how it is related to your business.Literally.Terms and Definitions . see The Hughes-Vollum Methodology.Database(s). and/or mart(s) in conjunction with reporting and analysis software optimized to support timely business decision making. Data Warehousing Basics .
Terms and Definitions . Data Warehousing Basics . to move and structure data. Users that need a lot of slices and dices might appreciate a multidimensional database. The best systems support standard RDBMS functionality and add high-bandwith support for multidimensional data and queries. Multidimensional database (MDD) .A DBMS optimized to support multidimensional data. and/or to pre-summarize data for use by queries and reports.Hardware and software used to connect clients and servers.continued Middleware .
class Order has a relationship Customer to class Customer.continued Object Oriented Analysis (OOA) .g. or less commonly.) Object Oriented Design (OOD) . specialization/generalization).A design methodology that uses Object Oriented Analysis to promote object reusability and interface clarity. subclass/superclass. Also identified for each class are its attributes (e.g.Terms and Definitions . class Person has attribute Hair Color) and its conventional relationships to other classes(e. Data Warehousing Basics . is-a relationships as subtype/supertype.A process of abstracting a problem by identifying the kinds of entities in the problem domain. the is-a relationships between the kinds (kinds are known as classes. and the has-a relationships between the classes.
continued OLAP . and analyses need to be delivered in a timely fashion. Performance .A specific atomic request for information from a database.Terms and Definitions .Data. A common use of a data warehouse that involves real time access and analysis of multidimensional data such as order information. Query . Data Warehousing Basics .An acronym for On Line Analytical Processing. summaries. Performance is often a key issue with data warehouses: the right answer isn't worth much if it shows up after the decisions have been made.
For performance and reliability several independent copies are often created of each data warehouse.OLAP based on conventional relational databases rather than specialized multidimensional databases. Replicator . Replication .A standard technique in data warehousing.continued Relational On-Line Analytic Processing (ROLAP) . Even data marts can require replication on multiple servers to meet performance and reliability standards. Data Warehousing Basics .Terms and Definitions . Often these tools use special load and unload database APIs and have scripting languages that support automation.Any of a class of product that supports replication.
Security . Data Warehousing Basics . Note that a business analyst may need access to summaries of data s/he should not see.continued Report . formatted. Security systems need to make this easy to implement while making sure outsiders or rogue employees do not see data they should not see. Usually a report formats and combines several related queries.Terms and Definitions .The right data for the right person.A repeatable. Snowflake Schema .A layering of Star Schema that scales that technique to handle an entire warehouse. nonatomic request for information from a database.
"Fact" tables each join to a larger number of independent "dimension" tables.A standard technique for designing the summary tables of a data warehouse. The tables may be partially denormalized for performance.continued Star Schema . Data Warehousing Basics .Terms and Definitions . but most queries will still need to join in one or more of the star tables.
stonebridgegroup. MN 55416 (763) 923-7900 (763) 923-7901 fax www. Suite 170 Minneapolis.StoneBridge Group 701 Xenia Ave.com Data Warehousing Basics . South.
This action might not be possible to undo. Are you sure you want to continue?
We've moved you to where you read on your other device.
Get the full title to continue reading from where you left off, or restart the preview.