Professional Documents
Culture Documents
Quang Duong
Queen’s College
LATEX
September 16, 2023 BDM 1043 Fall 2023 Week 2 1 / 21
Part 1: Business Motivation and Drivers
1 Marketplace Dynamics
2 Business Architecture
The following ICT development that have accelerated the pace of Big
Data adoption in business:
Data analytic and Data Science: statistical techniques, data
warehousing, machine learning
Digitization: online banking, online shopping, streaming video
Affordable Technology and Commodity Hardware: open sources
software
Social Media: customer interaction
Hyper-Connected Communities and Devices: increase in the number
of available data streams
Cloud Computing: external datasets, scalable processing, and vast
amount of storage
Internet of Everything
4 Organization Prerequisites
5 Privacy
6 Security
7 Governance Requirement
8 Clouds
Securing Big Data involves ensuring that the data networks and
repositories are sufficiently secured via authentication and
authorization mechanism
Big Data security involves establishing data access levels for different
categories of users
Gather data from sources then filter to remove bad quality data or no
relevant data
Need to store the original copy of the datasets
Adding metadata to improve classification and querying
What is metadata ?
– Time and date of creation
– Creator or author of the data
– File size
– Source of the data
– Process used to create the data
Data input into Big Data analyses can be unstructured without any
indication of validity
Remove duplicate or irrelevant observations
Fix structural errors
Filter unwanted outliers and handle missing data
Figure: Heatmap
Figure: Faceted logistic regression