Professional Documents
Culture Documents
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Database system System where all files are integrated, meaning information can be linked
Data hierarchy The structure and organization of data, which involves fields, records, and files
Databases
Database management Software for creating, storing, maintaining and accessing database files.
system (DBMS) A DBMS makes using databases more efficient.
Internal
Types of data
External.
Organized and processed in numerical or sequential order, typically the order in which they were entered.
Sequential access file
Because access speed usually is not critical, these records are typically stored on magnetic tape.
structure Normally used for backup and archive files because they rarely need updating.
Records can be accessed in any order, regardless of their physical locations in storage media.
Methods for Random access file This method of access is fast and very effective when a small number of records need to be processed
daily or weekly.
accessing files structure
To achieve this speed, these records are often stored on magnetic disks. Disks are random access devices.
Records can be accessed sequentially or randomly, depending on the number being accessed
Indexed sequential For a small number, random access is used.
access method (ISAM) For a large number, sequential access is used.
Access speed with this method is fast, so it is recommended when records must be accessed frequently.
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Operation Data stored in a relational model is retrieved from tables by using operations that pick and combine data from one or more tables.
Examples:
Select : searches data in a table and retrieves records based on certain criteria (also called conditions)
Project: pares down a table by eliminating columns (fields) according to certain criteria
Join: (combines two tables based on a common field e.g., the primary key in the first table and the foreign key in the second table
Intersect/ Union/ Difference
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Used to Examples:
Create the data dictionary Any changes to a database's structure are Adding a field
Data definition Maintain the data dictionary made with this component
Deleting a field
Changing a field's size
Define the structure of files in a database Changing the data type stored in a field
Used to add, delete, modify, retrieve Structured Query Language (SQL) Query by example (QBE) A graph database is a database that
data Standard fourth-generation query Request data from a database by uses graph structures for query
language used by many DBMS constructing a statement made up operation with nodes, edges, and
Use packages of query forms. properties to represent and store
SQL Consists of key words specifying With current graphical databases, data.
Data QBE actions to take simply click to see query forms A typical relational database stores
Example: SELECT field FROM table instead of having to remember entities and their properties in
manipulation or file WHERE conditions keywords tables, whereas a graph database in
Can add AND, OR, NOT operators to addition stores relations between
the QBE form to fine-tune the entities.
query. It focuses on connections between
entities and navigates and manages
connected data.
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Encapsulation Inheritance
Both data and the relationships Refers to the grouping into a class of various Refers to new objects being created faster and more
easily by entering new data in attributes.
Object-oriented are contained in a single object. objects along with their attributes and methods –
An object consists of attributes meaning, grouping related items into a single unit
databases and methods that can be This helps handle more complex types of data,
performed on the object's data such as images and graphs
Methods
Interaction with an object-oriented database takes place via methods (not query languages), which are
called by sending a message to an object.
Messages are usually generated by an event of some kind, such as pressing Enter or clicking the mouse
button.
Natural
language
processing
(later chapter)
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
ETL
Components Input Storage Output
(Extractrion,Transformation, Loading)
External sources Enterprise resource planning (ERP) systems collect,
Customer relationship management (CRM) systems
Internal sources integrate and process data that can be used by all
collect and process customer data to provide
Input Databases functional areas in an organization.
information for improving customer service.
ERP systems
CRM systems
As raw data
Collected information is organized in a data
Storage warehouse
As summary data (subtotals of category),
As or metadata (information about the data).
Complex queries for all types of information as Online analytical processing (OLAP) is used to Data-mining analysis is used to discover patterns and
well as reports used for decision can be generated generates business intelligence relationships
faster and easier that with databases OLAP uses multiple sources of information and
Can use a variety of sources in different formats provides multidimensional analysis, such as
stored in different locations viewing data based on
Can cross-feference segments of an organization's time,
Output operations for comparison purposes product, and
Can find patterns and trends location.
Can analyze large amounts of historical data This is the “slicing and dicing” of hypercubes,
quickly permitting “drilling down” and “drilling up”.
Can assist management in making well-informed
business decisions.
Uses OLAP and Data-Mining
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Disadvantages Data marts usually have more limited scope than data warehouses
compared to Consolidating information from different departments or functional
databases
areas is more difficult.
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
What happened? What was the problem? What decisions must be Why did it happen? What will happen if the trend continues?
Compared to BI made based on the available data? What actions should be taken?
Business analytics (BA)
BI uses dashboards, scorecards, OLAP, and query reports to BA uses statistical analysis, data-mining tools, and predictive
support decision-making activities. modeling
Components of a
Big data and predictive
analysis Chapter 3 database management
system
Volume
Variety (data is structured and unstructured)
Five dimensions
Big data and predictive analysis
Velocity (speed with which date needs to be processes not to miss window of opportunity)
of big data (5Vs) Veracity (social media posts, abbreviations, typos, colloquial speech make this V important)
Value (i.e. most important V).
Industries to
Many industries could benefit from big data analytics and gain a competitive advantage.
benefit
Technologies
Mobile and wireless technology
and apps Popularity of social networks
contributing to Enhanced power and sophistication of smartphones and handheld devices
Significant improvements in storage technology and substantial cost reduction
its growth and
Improved capabilities and affordability of analytics tools.
popularity
Big data analytics could reveal and expose certain information that puts some people's privacy at risk.
It also may create some legal and ethical concerns. These include:
Discrimination
Risks Privacy breaches and embarrassments
Unethical (although legal) actions based on interpretations
Loss of anonymity
Few legal protections exist for the involved individuals.
The IoT (Internet of Things) adds structured and unstructured data to Big Data.
The future IIoT (Industrial Internet of Things) big data analytics will improve nearly all operations of industrial devices.
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
The goal of a
successful The goal of any organization is to generate the highest possible revenue for the organization.
marketing
campaign
Calculating customer lifetime value (CLTV) (estimate what the lifetime relationship of a typical customer will
Tasks usually be worth to a business)
Recency, frequency, and monetary analysis (RFM) (80 percent of business revenue comes from 20 percent of
performed by a
its customers)
successful
Customer communications (different techniques to communicate effectively with customers including e-
marketing
mail, Web sites, a portal, and the intranet)
campaign
Analytical software monitoring behavour (using different techniques in order to monitor customers'
behavior across a number of retail channels, including Web sites, mobile apps, and social media)
Tableau and Power BI Databases
Components of a
Big data and predictive
analysis Chapter 3 database management
system
formats such as Excel and PDF) and relational databases, as well as big data
sources.