
Data Science Learning Path

Course Duration - 6 Months (265 Hours)


Training (208 Hours) + Q&A (57 Hours)

Python Programming & SQL

 Introduction to Data Science using Python


 Python basic constructs
Assignment 1 : Basic Python Programs
 Data types in python
Assignment 2 : Data Types and Functions
 Conditional Statements
Assignment 3 : Conditional Logic
 Iterative Statements
Assignment 4 : Iterative Logic
 Functions in Python
Assignment 5 : Miscellaneous
 OOP in Python
 File Handling
 Exception Handling
Assignment 6 : Classes and Objects
 Databases and SQL
 Regular Expressions

Project 1 : Creating Databases and Operations
Project 2 : Handling Databases and CRUD operations using Python

Statistics & Data Analysis

Probability
o Basic Terminology: Events, Sample Space, Experiment, types of events
o Probability
o Conditional Probability
o Multiplication theorem
o Partition of sample space, Theorem of total probability
o Bayes' theorem
o Random variables and probability distributions
o Continuous and Discrete distributions
o Binomial Distribution
o Normal and Standard Normal Distributions
Assignment 1 : Probability Questions

Descriptive Statistics
o Measures of central tendency
o Measures of dispersion
o Visualizations
o Outlier Detection
o Covariance and correlation

Inferential Statistics
o Central Limit Theorem
o Confidence interval and confidence level
o Hypothesis testing: Null and Alternate hypothesis
o P value, Significance level
o Statistical tests

 Assignment 2 : Applying Statistics using SciPy


 NumPy for mathematical computing
 Data manipulation using Pandas
 Data visualization with Matplotlib and Seaborn
 Data Pre-Processing
 Web Scraping using Beautiful Soup

Assignment 3 : Data Visualization using Automobile or Titanic Dataset
Assignment 4 : Data Analysis using Pandas, Matplotlib and Seaborn (2 Datasets)
Assignment 5 : Create your own dataset using Web Scraping and perform Data Analysis on it

Project 1 : End-to-End Project (Web Scraping, Data Pre-processing and Data Cleaning, Exploratory Data Analysis and Data Visualization)

Machine Learning

Introduction to Machine Learning


Supervised Learning
 Regression
o Linear Regression
o Multiple and Polynomial regression
o Regularization, Ridge and Lasso regression
 Classification
o Logistic Regression
o K Nearest Neighbours
o Decision Trees
o Naïve Bayes
o Support Vector Machine
o Ensemble Techniques: Random Forests, Gradient boosting machines

Unsupervised Learning
 Clustering
 Principal Component Analysis

 Assignments : Regression Analysis, Classification (Binary and Multiclass), Image classification, Clustering etc.
Projects : Credit Card Fraud Detection

AI & Deep Learning

 Introduction to Deep Learning and Neural Networks


 Introduction to Linear Algebra
 Artificial Neural Networks
Assignment 1 : 1. Image Classification using ANN 2. Regression using ANN
 Neurons, Layers and Activation Functions
 Convolutional Neural Networks - Computer Vision
Assignment 2 : 1. Image Classification (CIFAR-10)
 Convolution Operation, Pooling, Padding and Strides
 Transfer Learning (LeNet-5, AlexNet, VGG16 & VGG19, ResNet, Inception V3)
Assignment 3 : 1. Image Classification (Cats & Dogs)
 Recurrent Neural Networks - Natural Language Processing
Assignment 4 : 1. Text Pre-Processing
 LSTM, GRU
Assignment 5 : 1. Movie Review Analysis 2. Sentiment Analysis
 Computer Vision & Natural Language Processing using CNNs & RNNs
 Assignments: Image classification using neural networks, Transfer learning, Sentiment Analysis

Projects : 1. Covid Detection using X-Ray Images 2. Flexible Project

R for Data Science

Introduction to Data Science using R


Installation and introduction to R Studio
Variables and Data types in R
Operators, Statements and expressions
Conditional Statements
Iterative Statements
Functions
Data manipulation using R (dplyr)
Data visualization using R (ggplot)
Time series forecasting using R

Tableau and Power BI

(Self-learning material)

Power BI:

• Power BI introduction
• Power BI components
• Power BI Architecture
• Connecting to the different data sources
• Power BI Data Modeling
• Power BI Reports
• Various Charts
• DAX (Data Analysis Expressions)
• Dashboards

Tableau:

• Tableau introduction
• Tableau architecture
• Connecting to the different data sources
• Data Modeling
• Tableau Calculations
• Various Charts
• Reports and Dashboards

Capstone (Industry Specific) Projects

1. COVID-19 detection - Healthcare
2. Real-time face detection - Cyber Security
3. Sentiment analysis - E-commerce
