the best-curated content available on the internet that would allow you to have a structured learning
path)
Live sessions: There will be 2-3 live classes every week by Data Science experts.
... doubts on a real-time basis if they are stuck somewhere. In addition, this will also allow learners to interact with the mentors and fellow learners.
Live doubt-clearing and mentorship sessions will be organized every week based on the
requirements of the learners.
This week is ideal for beginners to get started with a programming language. Learners on
other tracks are also welcome to join.
Intro to Data Science - its prominence and use-cases 5/20/2020
Environment setup - Python installation - Anaconda distribution
Python for Data Science - why it is important
Basics of Python
Print a string "Hello World"
Python basic syntax
Data structures and types
Python Lists & Strings
Intro to Functions and Packages
LO: loops, importing, numpy 5/24/2020
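As a rough illustration of the Python basics listed above (printing a string, lists and strings, functions, loops, importing), a minimal sketch might look like the following. The names `greet` and `learners` and the sample values are made up for illustration and are not taken from the course material. Only the standard library is used.

```python
import math  # importing a module from the standard library


def greet(name):
    """Return a greeting string for the given name."""
    return "Hello " + name


# Print a string
print(greet("World"))

# A list of strings and a loop over it
learners = ["asha", "ben", "chen"]  # hypothetical sample data
name_lengths = []
for learner in learners:
    name_lengths.append(len(learner))

# String methods and list slicing
capitalized = [learner.capitalize() for learner in learners]
first_two = learners[:2]

# Using a function from the imported module
root = math.sqrt(16)
```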
Week #1 - Data Analysis and Data Visualization (27th May - 2nd June)
Dive deep into Pandas 5/27/2020
Intro to Data Visualization and Dive Deep into Matplotlib library 5/30/2020
Exploratory Data Analysis
Week #2 - Advanced Exploratory Analysis and Data Pre-Processing (3rd June - 9th June)
(Data Cleaning, Outlier detection etc.)
Data Science Processes - setting up the base - process and we are starting with data cleaning - maybe explain about regression and classification 6/3/2020
Basic Statistics
Charts and Visualization
Outlier Analysis
Handling Missing Values 6/6/2020
Handling Imbalanced datasets, Oversampling - SMOTE
Standardization/Normalization of data - what, why and when?
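To make the difference between standardization and normalization concrete, here is a small sketch using only the standard library. It assumes "standardization" means z-score scaling and "normalization" means min-max scaling, which is a common reading but not confirmed by this outline. The function names and the `ages` column are hypothetical.

```python
import statistics


def standardize(values):
    """Z-score scaling: subtract the mean, divide by the standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation
    # Note: a constant column has std == 0 and would need special handling.
    return [(v - mean) / std for v in values]


def normalize(values):
    """Min-max scaling: map the smallest value to 0 and the largest to 1."""
    lo, hi = min(values), max(values)
    # Note: a constant column has hi == lo and would need special handling.
    return [(v - lo) / (hi - lo) for v in values]


ages = [20, 30, 40, 50, 60]  # hypothetical feature column
z_scores = standardize(ages)  # mean 0, standard deviation 1
scaled = normalize(ages)      # values between 0 and 1
```

Standardized values are centered on zero with unit spread, while min-max values are squeezed into a fixed range, so the second is more sensitive to a single extreme value.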
Week #3 - Feature Selection and Building ML Models (10th June - 16th June) 6/10/2020
Intro to feature extraction and selection and how they are different 6/13/2020
Elaborate more on Feature Extraction and various methodologies
Feature selection and its importance
Various feature selection/engineering techniques
Boruta
Building efficient and effective models
Splitting data into test and train datasets, cross validation
ML Algorithms:
Intro to Classification and Regression problems/models
Linear Regression
Logistic Regression
Cost function & Gradient Descent
Overfitting & Underfitting
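The linear regression, cost function and gradient descent items above can be tied together in one short sketch. It assumes a mean squared error cost and a single input feature. The learning rate, step count and toy data are illustrative choices, not values from the course.

```python
def mse_cost(w, b, xs, ys):
    """Mean squared error of the line y = w*x + b on the data."""
    n = len(xs)
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / n


def gradient_descent(xs, ys, learning_rate=0.05, steps=2000):
    """Fit w and b by repeatedly stepping against the gradient of the MSE."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Prediction error for every data point at the current w and b
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        # Partial derivatives of the MSE with respect to w and b
        grad_w = 2 * sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = 2 * sum(errors) / n
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Toy data generated from y = 2x + 1 (hypothetical, noise-free)
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = gradient_descent(xs, ys)
```

A learning rate that is too large makes the updates diverge on this data, which is a useful thing to try when experimenting with the sketch.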
Week #5 - Applied Data Science & ML - Problem-solving (23rd June - 1st July) TBD
HR Analytics problem - predicting employee churn TBD
Ed-tech customer analysis - predicting user churn TBD
Fraud analytics - fraud detection TBD
Anti-money laundering analytics - predicting money laundering cases in transactions data TBD
Real-estate price analysis - problem TBD
Sentiment analysis on movie reviews TBD
Getting started with Data Science competitions - Kaggle TBD
Explainable AI showcase - problem solving on a sample dataset TBD
Data Storytelling by Vijay Pravin Maharajan
Day Duration Tutor/Speaker
Day-wise modules
Sunday 1-1.5 hrs Gayatri
Day-wise Modules
Ayon/Admond/Dikscha/Gayatri (4)
Gradient Descent