Data Science Process Overview
[Figure: process diagram; surviving stage labels: Data Exploration, Data Modeling, Presentation and Automation]
The Tip
Spend time understanding the goals and context of your research.
Setting the Research Goal
Create Project Charter
Acquiring All the Data You Need
Internal Data
External Data
Retrieving Data
Internal Data
Don’t be afraid to shop around.
https://archive.ics.uci.edu/ml/index.php https://www.kaggle.com/datasets
The Tip
Data Cleansing
Combining Data
Data Preparation
Cleansing Data
Data cleansing focuses on removing errors in your data.
Cleansing Data
Redundant Whitespace
Outliers
Cleansing Data
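The two cleansing problems named on this slide can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the sample values are made up, and the outlier check uses the common 1.5 x IQR rule of thumb, which the slides do not prescribe. The helper name `iqr_outliers` is hypothetical.

```python
from statistics import quantiles

# Hypothetical raw values; the stray spaces are deliberate.
raw_cities = ["  London", "Paris ", " London ", "Paris"]

# Redundant whitespace: strip it so identical values compare equal.
clean_cities = [city.strip() for city in raw_cities]


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles.

    Assumption: the 1.5 * IQR fence is one common convention,
    not the method named in the slides.
    """
    q1, _, q3 = quantiles(values, n=4)  # three quartile cut points
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < low or v > high]


# Hypothetical ages; 120 is a suspicious entry.
ages = [22, 25, 27, 29, 31, 33, 35, 120]
suspect = iqr_outliers(ages)
```

A flagged value is a candidate for inspection rather than automatic deletion; whether it is an error depends on the context of the research.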
Missing Values
Cleansing Data
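Two common ways of handling missing values are omitting the incomplete observations and imputing a substitute value. The sketch below shows both, assuming missing entries are represented as `None` and that the mean of the observed values is an acceptable substitute; the sample readings are hypothetical.

```python
from statistics import mean

# Hypothetical sensor readings; None marks a missing value.
readings = [21.5, None, 19.0, None, 23.5]

# Option 1: omit the observations that have no value.
complete = [r for r in readings if r is not None]

# Option 2: impute the missing entries with the mean of the observed ones.
fill_value = mean(complete)
imputed = [fill_value if r is None else r for r in readings]
```

Omitting loses information from the dropped rows, while imputing keeps the row count but can introduce a false estimate, so the choice depends on how much data is missing and why.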
Certain models require their data to be in a certain shape.
Data Transformation
Turning Variables into Dummies
Data Transformation
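Turning a categorical variable into dummies means creating one 0/1 column per category. A minimal sketch in plain Python follows; the helper `to_dummies`, the `is_` column prefix, and the sample values are all illustrative assumptions.

```python
def to_dummies(values):
    """Encode a categorical column as one 0/1 indicator column per category."""
    categories = sorted(set(values))
    return [
        {f"is_{category}": int(value == category) for category in categories}
        for value in values
    ]


# Hypothetical categorical column.
colors = ["red", "green", "red", "blue"]
dummy_rows = to_dummies(colors)
# Each row has exactly one indicator set to 1.
```

In practice a data-frame library usually does this in one call (for example `pandas.get_dummies`); the sketch only shows what that transformation produces.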
Combining data focuses on integrating sources that come from different places.
Combining Data
Joining Tables
Combining Data
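Joining tables enriches each observation in one table with information from another table that shares a key. The sketch below shows an inner join in plain Python; the tables, the `client_id` key, and the helper `inner_join` are hypothetical.

```python
def inner_join(left, right, key):
    """Combine rows from two tables wherever their key values match."""
    # Index the right-hand table by key for quick lookup.
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    # Keep only left rows that have at least one match on the right.
    return [
        {**left_row, **right_row}
        for left_row in left
        for right_row in index.get(left_row[key], [])
    ]


# Hypothetical tables sharing the key "client_id".
clients = [
    {"client_id": 1, "name": "Ana"},
    {"client_id": 2, "name": "Ben"},
    {"client_id": 3, "name": "Cy"},
]
regions = [
    {"client_id": 1, "region": "North"},
    {"client_id": 2, "region": "South"},
]

joined = inner_join(clients, regions, "client_id")
```

Client 3 has no matching region, so an inner join drops it; other join types (left, outer) would keep it with an empty region.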
Appending Tables
Combining Data
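Appending tables stacks the observations of one table under those of another with the same columns. A minimal sketch, assuming each table is a list of row dictionaries; the monthly sales data and the helper `append_tables` are illustrative only.

```python
def append_tables(first, second):
    """Stack the rows of two tables that share the same columns."""
    if first and second and set(first[0]) != set(second[0]):
        raise ValueError("tables must share the same columns")
    return first + second


# Hypothetical monthly tables with identical columns.
january = [{"client": "Ana", "sales": 120}, {"client": "Ben", "sales": 80}]
february = [{"client": "Ana", "sales": 95}]

all_sales = append_tables(january, february)
```

Unlike a join, appending adds rows rather than columns, so the result here has three observations and the same two columns.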
Graphical
Non-Graphical
Data Exploration
Graphical
Link and Brush
Tabulation, clustering, and building simple models
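Tabulation, the simplest of these non-graphical techniques, is just a count of how often each value occurs. A short sketch using the standard library; the customer segments are made-up sample data.

```python
from collections import Counter

# Hypothetical categorical column to explore.
segments = ["retail", "corporate", "retail", "retail", "public", "corporate"]

# Frequency table: how many observations fall into each category.
counts = Counter(segments)

# Print the table from most to least frequent category.
for segment, n in counts.most_common():
    print(f"{segment:<10} {n}")
```

A table like this is often enough to spot a dominant category or an unexpected label before any plotting or modeling.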
Stage Five
Data Modeling
At a Glance
Selection
Diagnostic
Model and Variable Selection
Automating
Summary
Summary of the Course