Professional Documents
Culture Documents
Course Overview
Data science
Exploratory data analysis
R Tableau
Schedule
Day 1
Morning
(9am 12pm)
Afternoon
(1pm 3pm)
Day 2
Module 1
Introduction to
data science
Module 3
Exploratory data
analysis
Module 2 Data
modeling and
visualization
tool
Module 4 Lab:
visualization
using Tableau
Day 3
Day 4
Module 5
Statistical
analysis and
DOE
Module 7
Predictive
modeling
Module 9
Similarities, and
clustering
Module 8
Fitting a model
to data
Module 10
Data analytic
thinking
Module 6 Lab:
Data preparation
using R
Day 5
Current Positions
Assistant Professor
Department of Computer Engineering
King Mongkuts University of Technology Thonburi
Data Scientist
Big Data Experience Center
Senior Consultant
InsightEra Co. Ltd.
Module I Overview
Learning Outcome
Understand the relationships
between big data and data science
Explain the basic concepts of data
science and the roles of data
scientists
Identify common tasks in data
sciences from the problem
Recognize the problems that can be
solved by the data science process
Agenda
Big data and data science
Data scientists
Case studies in data sciences
Social Analytics
Business Analytics
Predictive
(Proactive)
Prescriptive
(Proactive)
What happen?
What is happening?
Business reporting
Dashboards
Scorecards
Data warehousing
Behavior analysis
Cause and effect
analysis
Correlation
Well-defined
business problems
and opportunities
Accurate projections
of the future states
and conditions
Outcomes
Questions
Diagnostic
(Reactive)
Enablers
Descriptive
(Reactive)
Data mining
Text mining
Internet mining
Forecasting
Optimization
Simulation
Decision modeling
Expert systems
Best possible
business decision
and transaction
The Synopsis
A set of fundamental concepts/principles that underlie techniques
for extracting useful knowledge from data.
How data science fits in the organization
General ways of thinking data analytically
General concepts for actually extracting knowledge from data.
The Synopsis
The science
Extracting useful knowledge from data to solve business
problems can be treated systematically by following a
process with reasonably well-defined stages.
The technology
From a large mass of data, IT can be used to find
informative descriptive attributes of entities of interest
Data-Analytic Thinking
Every aspect of business is open to data
collection
Operations/Manufacturing
Supply-chain management
Customer behavior
Marketing campaign performance
Marketing trends
Industry news
Competitors movement
Targeted marketing
Campaign combinations for effective up-selling
Recommendations for cross-selling
Customer behavior analysis: the key to marketing in this digital
era is contacting customers just
Accessing and
processing of
massive-scale data
flexibly and
efficiently with Big
Data technologies
Target
Apply analytical skills to help the store enhance revenues,
forecast trends, improve process and much more.
Business Impact
Customer Churn
Segmentation
Risk Profiling
Subscription fraud
PBX hacking
Wangiri fraud
Phishing
Abuse of service term and conditions
SMS faking
Batch real-time
Thresholds anomaly
detection
Rules machine learning
SQL SQL and graph
analysis
Silos of data data lakes
Scale-up hardware
commodity Hadoop
architectures
Unsupervised learning:
Find hidden structure in unlabeled data (no training data)
Clustering, co-occurrence grouping, profiling
Might include the same set of examples but would not include the target
information
Do our customers fall into different groups ?
Similarity matching, link prediction, data reduction can be either
Iterative Process
End of Module I
Question?