Demo 3
Business intelligence (BI) is a set of skills, technologies and practices for managing business data, including
its security and quality risks, and for gaining a better understanding of that data. BI can be seen as the
collective information of the business. It supports predictions about business operations using data gathered
in a warehouse. BI applications handle sales, financial, production and other business data. They support
better decision making, so BI can also be considered a decision support system.
Data mining is a method for analyzing large amounts of data in order to find patterns. It is normally used
for building models and forecasting. Data mining is the process of discovering correlations and patterns by
sifting through large data repositories using pattern recognition techniques.
Data warehousing provides a central repository for the data of several business systems in an enterprise.
Data from various sources is extracted and selectively organized in the data warehouse for analysis and
accessibility.
E.g. dimension tables include employee, projects and status. The status table can be further broken into
status_weekly and status_monthly.
A tracking process, such as collecting status, can be recorded using factless fact tables. A factless fact
table has no numeric values that can be aggregated, hence the name. It holds only the key values that
reference the dimensions from which the status is collected.
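A factless fact table for status tracking can be sketched as follows. This is a minimal illustration using Python's built-in sqlite3 module; the table name status_report, the column names and the sample rows are assumptions made for the example, not part of the original notes.

```python
import sqlite3

# In-memory database with three dimensions and one factless fact table.
# All names and rows here are hypothetical, chosen only for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employee (employee_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE project  (project_id  INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE status   (status_id   INTEGER PRIMARY KEY, label TEXT);

    -- Factless fact table: only keys that reference dimensions, no measures.
    CREATE TABLE status_report (
        employee_id INTEGER REFERENCES employee(employee_id),
        project_id  INTEGER REFERENCES project(project_id),
        status_id   INTEGER REFERENCES status(status_id)
    );

    INSERT INTO employee VALUES (1, 'Asha'), (2, 'Ben');
    INSERT INTO project  VALUES (10, 'Billing');
    INSERT INTO status   VALUES (100, 'on track'), (101, 'delayed');
    INSERT INTO status_report VALUES (1, 10, 100), (2, 10, 101), (1, 10, 101);
""")


def count_reports_by_status(connection):
    """Count events per status; counting rows is the only 'measure' available."""
    rows = connection.execute("""
        SELECT s.label, COUNT(*)
        FROM status_report r
        JOIN status s ON s.status_id = r.status_id
        GROUP BY s.label
    """).fetchall()
    return dict(rows)


report_counts = count_reports_by_status(conn)
```

Since the table carries no numeric column, analysis is done by counting the rows that match a combination of dimension keys.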
A dimension table has no parent table in a star schema, whereas in a snowflake schema a dimension table
has one or more parent tables.
In a star schema, the dimension table itself contains the hierarchies of the dimension, whereas in a
snowflake schema the hierarchies are split into separate tables. In both cases, data can be drilled down
from the topmost level of a hierarchy to the lowest.
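The difference between the two schemas can be sketched with a small product hierarchy. The example below uses Python's built-in sqlite3 module; the tables product_star, product_snow, category and sales, and all rows in them, are hypothetical and exist only to show the shape of the queries.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Star: the hierarchy (product -> category) lives in one dimension table.
    CREATE TABLE product_star (
        product_id INTEGER PRIMARY KEY, product TEXT, category TEXT);

    -- Snowflake: the category level is split out into a parent table.
    CREATE TABLE category (category_id INTEGER PRIMARY KEY, category TEXT);
    CREATE TABLE product_snow (
        product_id INTEGER PRIMARY KEY, product TEXT,
        category_id INTEGER REFERENCES category(category_id));

    CREATE TABLE sales (product_id INTEGER, amount REAL);

    INSERT INTO product_star VALUES
        (1, 'pen', 'stationery'), (2, 'ink', 'stationery'), (3, 'mug', 'kitchen');
    INSERT INTO category VALUES (1, 'stationery'), (2, 'kitchen');
    INSERT INTO product_snow VALUES (1, 'pen', 1), (2, 'ink', 1), (3, 'mug', 2);
    INSERT INTO sales VALUES (1, 5.0), (2, 7.0), (3, 4.0), (1, 5.0);
""")

# Top of the hierarchy in the star schema: one join.
star_totals = dict(conn.execute("""
    SELECT p.category, SUM(f.amount)
    FROM sales f JOIN product_star p ON p.product_id = f.product_id
    GROUP BY p.category"""))

# Same question in the snowflake schema: one extra join to the parent table.
snow_totals = dict(conn.execute("""
    SELECT c.category, SUM(f.amount)
    FROM sales f
    JOIN product_snow p ON p.product_id = f.product_id
    JOIN category c ON c.category_id = p.category_id
    GROUP BY c.category"""))

# Drill down from the category level to the product level.
drill_down = dict(conn.execute("""
    SELECT p.product, SUM(f.amount)
    FROM sales f JOIN product_star p ON p.product_id = f.product_id
    WHERE p.category = 'stationery'
    GROUP BY p.product"""))
```

Both schemas give the same totals; the snowflake version trades the repeated category text for an additional join.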
Real time Data Warehouse: In this stage, the data warehouse is updated on a transaction or event basis,
every time an operational system performs a transaction.
Integrated Data Warehouse: In this stage, the data warehouse generates activity or transactions that are
passed back into the operational systems, where they are used in the daily activity of the
organization.
What are data modeling and data mining? What are they used for?
Data modeling aims to identify all entities that have data and then defines the relationships between these
entities. Data models can be conceptual, logical or physical. Conceptual models are typically used to
explore high level business concepts with stakeholders. Logical models are used to explore domain
concepts, while physical models are used to explore database design.
Data mining is used to examine or explore the data using queries. These queries can be run against the data
warehouse. Data mining helps in reporting, planning strategies, finding meaningful patterns etc. It can be
used to convert a large amount of data into a sensible form.
A degenerate dimension does not have its own dimension table. It is derived from a fact table: it is a column
(dimension) that is part of the fact table but does not map to any dimension table.
E.g. employee_id
Direct or Faster load: The data is loaded directly, without checking any constraints.
Describe the foreign key columns in fact table and dimension table.
The primary keys of entity tables are the foreign keys of dimension tables.
The primary keys of dimension tables are the foreign keys of fact tables.
Describe the foreign key columns in fact table and dimension table.
A foreign key of a fact table references a dimension table. A dimension table, on the other hand, is itself a
referenced table, with foreign key references from one or more tables.
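The relationship between a fact table's foreign key and a dimension table's primary key can be sketched as follows, using Python's built-in sqlite3 module. The tables employee_dim and hours_fact and their rows are hypothetical. Note that SQLite only enforces foreign keys once the foreign_keys pragma is switched on for the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite checks foreign keys only when this pragma is enabled.
conn.execute("PRAGMA foreign_keys = ON")

# Dimension table: the referenced table, identified by its primary key.
conn.execute(
    "CREATE TABLE employee_dim (employee_id INTEGER PRIMARY KEY, name TEXT)")

# Fact table: its foreign key column points at the dimension's primary key.
conn.execute("""
    CREATE TABLE hours_fact (
        employee_id INTEGER NOT NULL REFERENCES employee_dim(employee_id),
        hours REAL
    )""")

conn.execute("INSERT INTO employee_dim VALUES (1, 'Asha')")
conn.execute("INSERT INTO hours_fact VALUES (1, 7.5)")  # key exists: accepted

try:
    # No employee 99 in the dimension, so the constraint check fails.
    conn.execute("INSERT INTO hours_fact VALUES (99, 8.0)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

fact_rows = conn.execute("SELECT COUNT(*) FROM hours_fact").fetchone()[0]
```

A load that skips such constraint checks would accept the second row as well, which is why a direct load is faster but relies on the data already being consistent.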
Facts that cannot be summed up across the dimensions present in the fact table are called non-additive
facts. Such facts can still be useful if there are changes in dimensions. For example, profit margin is a
non-additive fact, because adding margins up at the account level or the day level has no meaning.
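A small worked example shows why profit margin cannot be summed. The figures below are invented for illustration; the point is that the margin for a period has to be recomputed from the additive facts (profit and revenue) rather than added up from daily margins.

```python
# Hypothetical daily figures; revenue and profit are additive facts.
daily = [
    {"day": "Mon", "revenue": 100.0, "profit": 10.0},
    {"day": "Tue", "revenue": 300.0, "profit": 90.0},
]


def margin(profit, revenue):
    """Profit margin as a ratio of profit to revenue."""
    return profit / revenue


# Daily margins: 10% on Monday, 30% on Tuesday.
daily_margins = [margin(row["profit"], row["revenue"]) for row in daily]

# Summing the ratios gives a number with no business meaning.
summed_margins = sum(daily_margins)

# Correct roll-up: sum the additive facts first, then take the ratio.
total_profit = sum(row["profit"] for row in daily)
total_revenue = sum(row["revenue"] for row in daily)
overall_margin = margin(total_profit, total_revenue)
```

Here the two-day margin is 100 / 400 = 0.25, which is neither the sum nor the plain average of the daily margins.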
Data cleaning is performed by reading all records in a set and verifying their accuracy. Typos and spelling
errors are rectified. Mislabeled data, if any, is relabeled and filed. Incomplete or missing entries are
completed. Unrecoverable records are purged so that they do not take up space or cause inefficient operations.
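The cleaning steps above can be sketched as a single pass over the records. This is only an illustration under stated assumptions: the field names, the KNOWN_SPELLINGS lookup, the DEFAULT_PROJECT value and the rule that a record without an employee_id is unrecoverable are all hypothetical choices, not rules from the original notes.

```python
# Hypothetical lookup of known misspellings and a hypothetical default value.
KNOWN_SPELLINGS = {"delyed": "delayed", "on trak": "on track"}
DEFAULT_PROJECT = "unassigned"


def clean(records):
    """Return cleaned copies of the records, dropping unrecoverable ones."""
    cleaned = []
    for record in records:
        # Assumption: a record with no key cannot be repaired, so purge it.
        if record.get("employee_id") is None:
            continue

        # Normalize case and whitespace, then fix known spelling errors.
        status = (record.get("status") or "").strip().lower()
        status = KNOWN_SPELLINGS.get(status, status)

        # Complete a missing entry with a default value.
        project = record.get("project") or DEFAULT_PROJECT

        cleaned.append({
            "employee_id": record["employee_id"],
            "status": status,
            "project": project,
        })
    return cleaned


raw_records = [
    {"employee_id": 1, "status": " Delyed ", "project": "Billing"},
    {"employee_id": 2, "status": "on track", "project": None},
    {"employee_id": None, "status": "delayed", "project": "Billing"},
]
cleaned_records = clean(raw_records)
```

In practice the rules for what counts as unrecoverable, and which defaults are acceptable, depend on the business data being loaded.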