Data Science and Analytics Course Outline
Topics
Introduction to Data
• Roadmap Towards Data Science
Science
• Introduction
• Installation
• Print function
• Variable
• Input
• Python Casting
• Arithmetic operation
• If statement
• Logical operator
• Comparison operator
• Comment
• Switch-case
• List
Programming Language • For Loop
(Python) • Python Statements
• Nested loops
• While loops
• Tuples
• Dictionaries
• Array
• Function
• Parameters
• Math Function
• Return statements
• Lambda
• Math Function
• Return statements
• Python Datetime
• Python Strftime method
• Class
• Inheritance
• polymorphism
• Exception
• File read & write
• Problem Solving
• Project
• Setup Jupyter Notebook
• Import numpy
• Numpy ndarray object
• Find min/max/mean
• Slicing array
• Generating dummy variable
Data Analysis (Numpy)
• Arrange array
• Random value
• Numpy Arithmetic Operations
• Reshape array
• Joining numpy array
• Splitting numpy array
• Project
• Collect Dataset
• Import dataset
• Pandas basic method
Data Analysis (Pandas)
o Drop
o Copy dataset
o create new dataset from
existing dataset
o Add column
o Unicode error solving
o Groupby method
o Loc, iloc
o And many more
• Analyzing various dataset
• Project
• Import Matplotlib
• Plot
• Marker
• Line
• Labels
• Grid
• Subplot
Data Visualization • Scatter
(Matplotlib) • Bar
• Horizontal Bar
• Figsize
• Histogram
• Pie
• Legend
• Matplotlib Style
• Project
•
• Import Seaborn
• Seaborn plot
• Relational Plot
Data Visualization
o Scatterplot
(Seaborn) o Lineplot
• Categorical Plot
o Barplot
o Boxplot
o Stripplot
o Swarmplot
o Violin Plot
o Pointplot
• Distributions plot
o Histplot
o Kdeplot
o Ecdfplot
o Rugplot
• Countplot
• Seaborn Colors Palettes
• Multi-Plot grids
• Plot Aesthetics
• Project
• Introduction
• Create worksheet
BI Tool (Tableau)
• Create dashboard
• Project
• Descriptive Statistics
• Probability Distribution
Statistics • Normal Distribution
• Hypothesis Testing
• Data Preprocessing
o Data Cleaning
o Feature Engineering
o Feature Selection
Machine Learning
o Feature Transformation and
Scaling
o Data Mining
o Data Splitting
• Supervised Learning
• Regression & Classification
Algorithm like
oLinear Regression
o Logistic Regression
o Ridge Regression
o Lasso Regression
o Neural Network
o Decision Tree
o SVM
o Random Forest
o KNN
o Naive Bayes
o XgBoost
• Accuracy and other parameters
• Explain threshold, ROC, AUC,
Hypothesis Testing
• Project
• Handling Imbalanced Dataset
o Undersampling
o Oversampling
o SMOTE
o Project on Imbalanced
dataset
• Unsupervised Learning
o K-means clustering
o Hierarchal clustering
o Market Basket Analysis
o Project
Natural Language • Import nltk
Processing • Tokenization
• Stemming
• Lemmatization
• Bag of Words
• TF-IDF
• Word2Vec
• Project
• Installation of Tensorflow
• Introduction to neural network
• Workflow of neural network
• Activation function
• Multillayer neural networK
• Gradient descent
• Drop out layers in Multillayer neural
network
• Stochastic Gradient descent
• Global Minima and Local Minima
• Loss function
Deep Learning • Create Artificial Neural Network
• Hyperparameter
• Padding in CNN
• Max pooling in CNN
• Convolution neural network
• Forward Propogation
• Back Propogation
• Create Recurrent neural network
• Image classification
• Project
• Installation
• Create API
Web Framework (Flask) • HTML & CSS Basic Operation
HTML & CSS • Creating web app
• Project
Database Mysql
o Installation
o Create Database
o Basic Queries
o Create connection to project
• Installation
Streamlit • Creating web app
• Installation and Setup
• Creating profile
Git & Github • Understanding bash
• Uploading project in repositories
• Project Submission
Final Project • Discussion
Prerequisite for this course:
❖ Basic Knowledge of Statistics will be preferred.
❖ Basic Knowledge of Mathematics will be preferred.
❖ Basic Knowledge of Calculas will be preferred.