Professional Documents
Culture Documents
Systems
Time to Think
Databases and Business Decision Making
Tools for Business Intelligence
Time to Think
How is the data organized?
Time to Think?
How is the data organized?
Time to Think?
How is the data organized?
Time to Think?
How is the data organized?
File organization concepts
–Database: Group of related files
Attributes of an entity
A computer system
organizes data in a
hierarchy that starts with
the bit, which represents
either a 0 or a 1. Bits can be
grouped to form a byte to
represent one character,
number, or symbol. Bytes
can be grouped to form a
field, and related fields can
be grouped to form a record.
Related records can be
collected to form a file, and
related files can be
organized into a database.
DBMS:Issues with traditional Data Organization
◼ Program-data dependence
◼ Lack of flexibility
◼ Poor security
◼ Database
◼ Collection of data organized to serve many applications by
centralizing data and controlling redundant data
◼ Relational DBMS
◼ Represent data as two-dimensional tables called relations or files
◼ Each table contains data on entity and attributes
A relational database
organizes data in the
form of two-
dimensional tables.
Illustrated here are
tables for the entities
SUPPLIER and PART
showing how they
represent each entity
and its attributes.
Supplier Number is a
primary key for the
SUPPLIER table and a
foreign key for the
PART table.
A DBMS has three important capabilities:
◼ 2)
the data dictionary stores definitions of data elements
and their characteristics;
The select, project, and join operations enable data from two different tables
to be combined and only selected attributes to be displayed.
Designing Databases
–you must understand the relationships among the data,
–how the data will be used, and how the organization will need to change to
manage data from a company-wide perspective.
–Normalization
–Entity-relationship diagram
•To derive business value from these data, organizations need new
technologies and tools capable of managing and analyzing non-
traditional data along with their traditional enterprise data.
Databases and Business Decision
Making
◼ Databases store historical data so the information about
trends, changes across entire company cannot be obtained
from a database
◼ Data marts:
◼ Subset of data warehouse
◼ Summarized or highly focused portion of firm’s data for use by
specific population of users
◼ Typically focuses on single subject or line of business
Hadoop
–is an open source software framework managed by the Apache Software
Foundation
–breaks a big data problem down into sub-problems, distributes them among
up to thousands of inexpensive computer processing nodes,
– and then combines the result into a smaller data set that is easier to analyze.
–Facebook announced the data gathered in the warehouse grows by roughly half
a PB per day. / PB is 1000⁵
In-Memory Computing
A series of
analytical tools
works with data
stored in
databases to find
patterns and
insights for
helping managers
and employees
make better
decisions to
improve
organizational
performance.
Tools for Business Intelligence
Online analytical processing (OLAP)
◼ Predictive analysis
◼ Uses data mining techniques, historical data, and assumptions
about future conditions to predict outcomes of events
◼ E.g., Probability a customer will respond to an offer or purchase
a specific product
◼ Text mining
◼ Extracts key elements from large unstructured data sets (e.g.,
stored e-mails)
Data as service model-Gartner
hype cycle