Section: 2

What is Data Science?
2.
Intro (what you will learn in this section)
0:44
3.
Profession of the future
6:58
4.
Areas of Data Science
5:58
5.
IMPORTANT: Course Pathways
5:52
Section: 3
--------------------------- Part 1: Visualisation ---------------------------
6.
Welcome to Part 1
1:57
Section: 4
Introduction to Tableau
7.
Intro (what you will learn in this section)
0:28
8.
Installing Tableau Desktop and Tableau Public (FREE)
6:08
9.
Challenge description + view data in file
2:32
10.
Connecting Tableau to a Data file - CSV file
5:17
11.
Navigating Tableau - Measures and Dimensions
8:42
12.
Creating a calculated field
6:14
13.
Adding colours
7:37
14.
Adding labels and formatting
11:00
15.
Exporting your worksheet
7:40
16.
Section Recap
3:34
Quiz 1:
Tableau Basics
0:00
Section: 5
How to use Tableau for Data Mining
17.
Intro (what you will learn in this section)
0:44
18.
Get the Dataset + Project Overview
7:12
19.
Connecting Tableau to an Excel File
3:56
20.
How to visualise an ad-hoc A-B test in Tableau
6:29
21.
Working with Aliases
4:05
22.
Adding a Reference Line
4:53
23.
Looking for anomalies
8:35
24.
Handy trick to validate your approach / data
9:13
25.
Section Recap
5:04
Section: 6
Advanced Data Mining With Tableau
26.
Intro (what you will learn in this section)
0:44
27.
Creating bins & Visualizing distributions
9:55
28.
Creating a classification test for a numeric variable
4:25
29.
Combining two charts and working with them in Tableau
8:31
30.
Validating Tableau Data Mining with a Chi-Squared test
10:29
31.
Chi-Squared test when there are more than 2 categories
8:15
32.
Visualising Balance and Estimated Salary distribution
11:04
33.
Bonus: Chi-Squared Test (Stats Tutorial)
19:12
34.
Bonus: Chi-Squared Test Part 2 (Stats Tutorial)
9:10
35.
Section Recap
5:44
36.
Part Completed
1:38
Section: 7
--------------------------- Part 2: Modelling ---------------------------
37.
Welcome to Part 2
3:54
Section: 8
Stats Refresher
38.
Intro (what you will learn in this section)
0:29
39.
Types of variables: Categorical vs Numeric
5:26
40.
Types of regressions
8:09
41.
Ordinary Least Squares
3:11
42.
R-squared
5:11
43.
Adjusted R-squared
9:56
Section: 9
Simple Linear Regression
44.
Intro (what you will learn in this section)
0:37
45.
Introduction to Gretl
2:34
46.
Get the dataset
4:03
47.
Import data and run descriptive statistics
4:25
48.
Reading Linear Regression Output
6:48
49.
Plotting and analysing the graph
4:22
Section: 10
Multiple Linear Regression
50.
Intro (what you will learn in this section)
1:15
51.
Caveat: assumptions of a linear regression
1:47
52.
Get the dataset
4:12
53.
Dummy Variables
8:05
54.
Dummy Variable Trap
2:10
55.
Ways to build a model: BACKWARD, FORWARD, STEPWISE
15:41
56.
Backward Elimination - Practice time
16:08
57.
Using Adjusted R-squared to create Robust models
10:17
58.
Interpreting coefficients of MLR
12:47
59.
Section Recap
4:15
Section: 11
Logistic Regression
60.
Intro (what you will learn in this section)
1:34
61.
Get the dataset
4:13
62.
Binary outcome: Yes/No-Type Business Problems
9:09
63.
Logistic regression intuition
17:03
64.
Your first logistic regression
8:04
65.
False Positives and False Negatives
8:01
66.
Confusion Matrix
4:57
67.
Interpreting coefficients of a logistic regression
10:03
Section: 12
Building a robust geodemographic segmentation model
68.
Intro (what you will learn in this section)
1:01
69.
Get the dataset
7:32
70.
What is geo-demographic segmentation?
5:05
71.
Let's build the model - first iteration
8:26
72.
Let's build the model - backward elimination: STEP-BY-STEP
11:11
73.
Transforming independent variables
10:09
74.
Creating derived variables
6:09
75.
Checking for multicollinearity using VIF
8:11
76.
Correlation Matrix and Multicollinearity Intuition
8:20
77.
Model is Ready and Section Recap
6:27
Section: 13
Assessing your model
78.
Intro (what you will learn in this section)
0:37
79.
Accuracy paradox
2:11
80.
Cumulative Accuracy Profile (CAP)
11:16
81.
How to build a CAP curve in Excel
14:47
82.
Assessing your model using the CAP curve
7:11
83.
Get my CAP curve template
6:20
84.
How to use test data to prevent overfitting your model
3:34
85.
Applying the model to test data
8:09
86.
Comparing training performance and test performance
11:33
87.
Section Recap
3:33
Section: 14
Drawing insights from your model
88.
Intro (what you will learn in this section)
0:34
89.
Power insights from your CAP
13:52
90.
Coefficients of a Logistic Regression - Plan of Attack (advanced topic)
3:47
91.
Odds ratio (advanced topic)
8:29
92.
Odds Ratio vs Coefficients in a Logistic Regression (advanced topic)
7:08
93.
Deriving insights from your coefficients (advanced topic)
13:15
94.
Section Recap
3:26
Section: 15
Model maintenance
95.
Intro (what you will learn in this section)
0:37
96.
What does model deterioration look like?
4:36
97.
Why do models deteriorate?
15:26
98.
Three levels of maintenance for deployed models
8:21
99.
Section Recap
1:38
Section: 16
--------------------------- Part 3: Data Preparation ---------------------------
100.
Welcome to Part 3
2:24
Section: 17
Business Intelligence (BI) Tools
101.
Intro (what you will learn in this section)
0:23
102.
Working with Data
1:15
103.
What is a Data Warehouse? What is a Database?
3:28
104.
Setting up Microsoft SQL Server 2014 for practice
8:05
105.
Important: Practice Database
9:44
106.
ETL for Data Science - what is Extract Transform Load (ETL)?
2:01
107.
Microsoft BI Tools: What is SSDT-BI and what are SSIS/SSAS/SSRS?
4:04
108.
Installing SSDT with MSVS Shell
4:24
Section: 18
ETL Phase 1: Data Wrangling before the Load
109.
Intro (what you will learn in this section)
0:48
110.
Preparing your folder structure for your Data Science project
2:20
111.
Download the dataset for this section
1:27
112.
Two things you HAVE to do before the load
4:56
113.
Notepad++
1:00
114.
EditPad Lite
1:11
Section: 19
ETL Phase 2: Step-by-step guide to uploading data using SSIS
115.
Intro (what you will learn in this section)
0:50
116.
Starting and navigating an SSIS Project
1:46
117.
Creating a flat file source task and OLE DB destination
1:53
118.
Setting up your flat file source connection
6:08
119.
Setting up your database connection and creating a RAW table
7:43
120.
Run the Upload & Disable
2:39
121.
Due Diligence: Upload Quality Assurance
2:02
Section: 20
Handling errors during ETL (Phases 1 & 2)
122.
Intro (what you will learn in this section)
0:50
123.
Download the dataset for this section
0:46
124.
How Excel can mess up your data
3:46
125.
Bulletproof Blueprint for Data Wrangling before the Load
7:13
126.
SSIS Error: Text qualifier not specified
7:15
127.
What do you do when your source file is corrupt? (Part 1)
18:01
128.
What do you do when your source file is corrupt? (Part 2)
6:09
129.
SSIS Error: Data truncation
15:56
130.
Handy trick for finding anomalies in SQL
3:45
131.
Automating Error Handling in SSIS: Conditional Split
8:20
132.
Automating Error Handling in SSIS: Conditional Split (Level 2)
9:03
133.
How to analyze the error files
16:40
134.
Due Diligence: the one thing you HAVE to do every time
4:57
135.
Types of Errors in SSIS
4:00
136.
Summary
19:06
137.
Homework
3:39
Section: 21
SQL Programming for Data Science
138.
Intro (what you will learn in this section)
0:31
139.
Download the dataset for this section
0:38
140.
Getting To Know MS SQL Management Studio
2:14
141.
Shortcut to upload the data
4:20
142.
SELECT * Statement
5:52
143.
Using the WHERE clause to filter data
5:50
144.
How to use Wildcards / Regular Expressions in SQL (% and _)
4:38
145.
Comments in SQL
2:43
146.
Order By
5:49
147.
Data Types in SQL
7:54
148.
Implicit Data Conversion in SQL
3:35
149.
Using Cast() vs Convert()
3:51
150.
Working with NULLs
5:03
151.
Understanding how LEFT, RIGHT, INNER, and OUTER joins work
6:18
152.
Joins with duplicate values
2:32
153.
Joining on multiple fields
5:21
154.
Practicing Joins
5:00
Section: 22
ETL Phase 3: Data Wrangling after the load
155.
Intro (what you will learn in this section)
0:57
156.
RAW, WRK, DRV tables
5:54
157.
Download the dataset for this section
1:32
158.
Create your first Stored Proc in SQL
3:30
159.
Executing Stored Procedures
2:49
160.
Modifying Stored Procedures
8:25
161.
Create table
9:30
162.
Insert INTO
5:42
163.
Check if table exists + drop table + Truncate
5:59
164.
Intermediate Recap - Procs
4:16
165.
Create the proc for the second file
11:36
166.
Adding leading zeros
7:29
167.
Converting data on the fly
10:21
168.
How to create a proc template
7:52
169.
Archiving Procs
4:38
170.
What you can do with these tables going forward [drv files etc.]
13:50
Section: 23
Handling errors during ETL (Phase 3)
171.
Intro (what you will learn in this section)
0:53
172.
Download the dataset for this section
0:46
173.
Upload the data to RAW table
11:02
174.
Create Stored Proc
5:09
175.
How to deal with errors using the isnumeric() function
7:45
176.
How to deal with errors using the len() function
7:36
177.
How to deal with errors using the isdate() function
7:40
178.
Additional Quality Assurance check: Balance
3:51
179.
Additional Quality Assurance check: ZipCode
3:17
180.
Additional Quality Assurance check: Birthday
4:08
181.
Part Completed
9:52
182.
ETL Error Handling "Vehicle Service" Project
7:45
Section: 24
--------------------------- Part 4: Communication ---------------------------
183.
Welcome to Part 4
1:31
Section: 25
Working with people
184.
Intro (what you will learn in this section)
0:44
185.
Cross-departmental Work
4:13
186.
Come to me with a Business Problem
2:10
187.
Setting expectations and pre-project communication
3:30
188.
Go and sit with them
5:20
189.
The art of saying "No"
5:24
190.
Sometimes you have to go to the top
2:37
191.
Building a data culture
5:07
Section: 26
Presenting for Data Scientists
192.
Intro (what you will learn in this section)
1:42
193.
Case study
2:00
194.
Analysing the intro
3:33
195.
Intro dissection - recap
9:26
196.
REAL Data Science Presentation Walkthrough - Make Your Audience Say "WOW"
16:29
197.
My brainstorming method
3:03
198.
How to present to executives
5:27
199.
The truth is not always pretty
2:45
200.
Passion and the Wow-factor
1:59
201.
Bonus: my full presentation | LIVE 2015
16:20
202.
Bonus: links to other examples of good storytelling
0:00
Section: 27
Homework Solutions
203.
Advanced Data Mining with Tableau: Visualising Credit Score & Tenure
5:44
204.
Advanced Data Mining with Tableau: Chi-Squared Test for Country
4:18
205.
ETL Error Handling (Phases 1 and 2)
19:51
206.
ETL Error Handling "Vehicle Service" Project (Part 1 of 3)
19:09
207.
ETL Error Handling "Vehicle Service" Project (Part 2 of 3)
10:41
208.
ETL Error Handling "Vehicle Service" Project (Part 3 of 3)