You are on page 1of 36

Advance

Data Science & AI


Certification Program
In Collaboration with

Domain Specialization

Capstone Project Certified from IBM

100% Guaranteed Job Referrals

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


TABLE OF
CONTENTS

Program Details 1

Why Choose Us 2

Dual Certification 3

Domain Electives 4

Job Referrals 5

Success Stories 6

Transition Process 7

Program Fee & Financing 8

Program Outline 9

Real-Time Projects &


10
Detailed Syllabus

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


PROGRAM DETAILS

COURSE PREREQUISITE
There are no such hard prerequisite criteria.
Just the urge to learn programming and basic
ideas about advanced math is enough.

WHO IS THIS PROGRAM FOR?

Working professional having more than 6 months of


experience in any domain (Technical/Non-Technical)
Qualification: BE/B.Tech (from any branch), BBA/MBA,
MCA/M.Tech, B.Com, B.Sc (in any branch)

INDUSTRIAL EXPERTS

Our trainers are working professionals having more than


8+ years of experience as Sr. Data Scientist, Machine
Learning Engineer, AI Engineer, BI Developer, Big Data
Architect, Sr. Data Analyst etc.

Weekday Batch : 7 Months


Course Monday to Friday - 2 Hours/Day
Duration Weekend Batch : 9 Months
Saturday & Sunday - 3.5 Hours/Day

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


WHY CHOOSE US?
We focus on working professionals and help them achieve the peak of success
without losing their designation or wasting their existing experience.

DOMAIN SPECIALIZATION
Make a switch as a professional,
not as a fresher
Master with domain specific
industrial projects
Break through the crowd to get
noticed by recruiters

PROJECT INNOVATION LAB


Experts from MNCs and MAANG
assist in online and offline project
sessions
Attain classroom session in 7+ cities
(Pune, Mumbai, Delhi, Kolkata,
Hyderabad, Chennai, Bangalore)

*Project Sessions are also


..available in Online Mode

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


WHY CHOOSE US?

1-ON-1 DEDICATION
Live interactive session with
expert for every individual
Each session is guided by
industrial expert
24*7 seamless technical support
from our dedicated team

2 YEARS SUBSCRIPTION
Limitless access for all the learning
materials, live batches, and project
sessions
Professionals get to switch
between weekdays and weekends
Make your learning calendar as per
your convenience

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


WHY CHOOSE US?

Others

Learnbay

Boring Recorded Sessions

Access to only Recorded


100% Live Interactive
Videos
Sessions from Expert

No Model
2 Years of Subscription to
Live Classes No Classroom sessions for
projects.
Hybrid Model
Live + Classroom Project No Guaranteed Interview
Sessions in 7+ Cities calls

10 Guaranteed Interview No Doubt Clearing Session


calls with Expert

1:1 Doubt Clearing Session


with Expert

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


DUAL CERTIFICATION

Priority access to startup job sites Get certified by IBM on


and requirements for individual completion of industry-level
projects. projects.

Capstone Project Certificate

Highlight your profile and get


recognition from renowned
industries worldwide

1 capstone project certificate


from IBM

Course Completion Certificate

Complete your training with the


internationally recognized certificate

Get acknowledged in IT sector by


adding IBM Certificate to your profile

Validate your abilities and skills with


IBM Certificate

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


DOMAIN ELECTIVES
*Opt for any 2 domain electives

Sales, Marketing & HR

Healthcare

BFSI

Manufacturing, Automotive
and Telecom

Ecommerce & Supply Chain

Oil, Gas and Energy

Media, Hospitality
and Transportation

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


10 GUARANTEED INTERVIEW CALLS

We have partnered with 250+ Top MNC'S & FinTech Startups across
the globe to offer genuine job leads. Most of our learners were hired for
their dream jobs one month before the course completion.

Dedicated Placement Cell for Working


Professionals to ensure a smooth Career
Transition
Avg Hike 97%
Prioritize growth and salary hike
with in-demand skillset

Make a transition without losing their


designation or wasting their existing
experience

10k+ Learners
Mock Interviews

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


Success Stories

Thanks to the Learnbay data


Got placed in: science course and excellent
Mohd. Israr guidance, I was able to ace the
TCS interview and secure a job
Designation With a hike of with a 400% pay raise. All of the
real-world time projects helped
Data Scientist 210%
me develop my concepts as a data
scientist.
Domain: Mechanical

Learnbay has helped me a lot to


Got placed in:
learn data science applications in
Shravanthi A
the e-commerce industry. The
live class concept was really
Designation With a hike of
helpful in receiving proper DS
Data Scientist 230% training. Thanks to all my
mentors and the placement team.

Domain: Mechanical

Got placed in: I knew nothing about data science


Ritesh Kumar before I joined Learnbay. But
through a variety of instructors, I
steadily developed my notion and
Designation With a hike of
received solid knowledge and
Associate 150% conceptual training in data science
Consultant
with hike of 119%.
Domain: Mechanical

Read More Reviews

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


Success Stories

Got placed in: When I joined Learnbay I did not


Saurabh Kumar have any knowledge apart from
the very basics. I gradually build
Designation my concept via various trainers
With a hike of
and get trained in data science
Data Scientist & 135% with strong knowledge/concepts.
Statistician

Domain: Math Professor

I come from a nontechnical


Got placed in:
background. However, with
Ankit Biswas
Learnbay's well-structured
course, amazing mentorship, and
Designation With a hike of
consistent support, I was able to
Data Scientist 180% not only enhance my skills but
also land my dream career.

Domain: Software Engineer

Got placed in: The course structure is excellent


Preksha Mishra with emphasis on concept
building and tools & software at
Designation the same time. The support team
With a hike of
is excellent and supportive and
Lead Data 140% quite agile to respond to doubts.
Scientist

Domain: Telecom

Read More Reviews

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


TRANSITION PROCESS
LEARNING PHASE
Learn updated tools and modules from
basic to advance by industry expert

ASSESSMENT
Evaluate your skillset with real-time
case studies and assignments

12 LIVE PROJECTS
Work on domain specific industrial projects
and make your experience relevant

IBM CERTIFICATION
Earn Dual Certification from IBM and
get globally recognized

PROFILE GROOMING
Get Interview Ready with experts.
Attain Resume Build-Up, 1:1 Mock
Interview

10 INTERVIEW CALLS
Guaranteed Interviews call from
FinTech Startups and top MNCs

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


PROGRAM FEE &
FINANCING

We provide a choice of financing alternatives to make it more


cost-effective, and make our programs accessible for all learners

Financing as low as

No Cost EMI Rs. 9,342/month


We have partnered with the
following financing companies
For one-time payment
to provide competitive finance
options at 0% interest rate with
no hidden costs.
Internet Credit/Debit
Banking Card

Program Fee
Rs. 95000/- +18% GST
Rs. 1,12,100/-
To know more about course fees &
scholarship, click here

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


PROGRAM OUTLINE

Cohort Orientation + Special Programming


8 Hrs
Classes

Python Programming (Basic + Advance)


50 Hrs
Python, Anaconda, Github, Pandas

Statistics and Machine Learning


70 Hrs
Matplotlib, Scikit-Learn, Seaborn

Data Science Tools


SQL, MongoDB, Tableau, PowerBI, 86 Hrs
Big Data & Spark Analytics, Time Series

Artificial Intelligence Tools


Deep Learning, NLP, 54 Hrs
Deployment (AWS+GCP)

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263


REAL-TIME INDUSTRIAL PROJECTS

Domain: HR Domain: Marketing

1. Career progression planning of 2. Descriptive study of trends and


employees with workforce irregularities with prediction
defections & efficiency analysis for conversion.

IBM intends to boost its HR Swiggy seeks a broad marketing


department by identifying campaign. But they need automated
employees' masked inconsistency. keyword generation tools. They also
They need models to identify the require proper message preparation
graphical variations in their 14000+ and delivery of the same to the right
employees' performances. Help audience at the right time. You can
them build models with your help them with text analytics and
regressions and other ML abilities. NLP-based keyword research.

Machine Learning Python Exploratory Data Analysis

SQL PySpark Big Data NLP

Domain: Sales Domain: Healthcare

3. Forecasting future sales with 4. Understanding covid-19 cases


trends and price maximization and fatality rate by time series
forecasting

BMW customers can sell old Samsung will launch a new


vehicles, but rivals provide superior healthcare app soon. The key goal
resale prices. BMW's data science- of this app is an accurate human
powered software will deliver the activity tracking and providing
greatest market value for used relevant health-related
vehicles based on Km travelled, recommendations. Continuous
daily price changes, production analysis of a massive amount of
dates, etc. Such tasks build mobile data is required for such an
analytical abilities. app.
Scikit-learn XG Boost Supervised Machine Learning

Customer Segmentation Python (Pandas Library)


REAL-TIME INDUSTRIAL PROJECTS

Domain: BFSI Domain: Media

5. Learn and develop classification 6. Building a content


techniques for the digital recommendation model on the basis
transformation of banking of regional viewer categorization

JPMorgan offers tax-friendly Netflix is a global entertainment


insurance choices. You can help video streaming site. They offer
them forecast insurance premiums. content in various regional
Targeted marketing using your languages. Build a local
random forest algorithm skills can recommendation engine for Netflix
help obtain better premium values. customers residing in south
Bangalore on their weekend and
weekdays activities, utilizing NLP.

Data Analytics Matplotlib ML Customer Segmentation

Logical Regression Python (Data-Preprocesssing)

Domain: Transportation Domain: Oil, Gas and


energy

7. Reduction of waiting time via a 8. Understanding in-depth about


highly precise forecasting model logging while drilling (LWD)
technique

Make a demand forecasting model Saudi Aramco company is working


based on specific time period rider on the development of high-
demands. Such a model will help efficiency drilling models. Use the
both riders and cab drivers to bright sides of big data analytics to
ensure the least possible waiting identify the most cost-effective and
time. You can include measures like highly productive drilling sites.
latitude and longitude
identification.

Machine Learning Hadoop Matplotlib in Python

Time Series Analysis Big Data


REAL-TIME INDUSTRIAL PROJECTS

Domain: Telecom Domain: E-comm

9. Churn forecasting for the 10. Recommendation system with


telecom industry using R customer lifetime value analysis
programming with ML (CLV)

The goal of this project is to design Amazon wants to find the most
a precise customer churn successful electronics. Live
prediction model. Based on the consumer reviews are needed. Using
same, Jio can identify the exact data visualisation, help regenerate
reason for customer dissatisfaction consumer insights from ongoing and
and work accordingly. current reviews.

R Programming Decision Tree Deep Neural Network

Data Preprocessing Machine Learning MongoDB

Domain: Manufacturing Domain: Supply chain

11. Condition-based preventative 12. Automated inventory monitoring


maintenance and fault prediction for supportable supply chain
in depth management

This project helped BOSCH to An automated inventory


predict their internal failures by management system will keep track
production line dataset analysis. of stock levels and upcoming
But still, they are struggling to orders. In addition, you can
predict automated faults in their contribute to DataCo's intelligent
assembly stage. Help them by supply chain software generation
building more advanced predictive project by using ML algorithms and
models for assembly stage R programming skills.
monitoring.
ML (Reinforcement Learning) Python PowerBI

Data Warehousing (Tableau) Machine Learning


Preparatory Session
Module 0
8 hours

1. Cohort Orientation 2. Cohort Orientation


A brief introduction to tools related to Significance of data in decision-making
data Scope of data in research and
Learn about particular real-time development
projects and Capstone projects Utilizing data, to enhance industrial
Data and its impact on career operations and management
opportunities Data in performance evaluation
Fundamental relevance of projects Data in customer segmentation
using data
Role of data in businesses

3. Fundamentals of programming 4. Fundamentals of Statistics


Types of code editors in python Mean, Median, Mode
Introduction to Anaconda & Jupyter Standard Deviation, Average.
notebook Probability, permutations, and
Flavors of python combinations
Introduction to Git, GitHub Introduction to Linear Algebra
Python Fundamentals
Source code vs Byte code vs
Machine code
Compiler & Interpreter
Memory Management in Python

Tools Covered

Note: This module 0 is for those who are from a non-technical background like
Mechanical, BBA, MBA, B.Com, M.Com, etc. Or for those who work in Non-IT sectors,
who are new to programming & statistics (basic mathematics)
Python Programming Term 1
Module 1
50 hours

1. Programming Basics & 2. Python Programming Overview


Environment Setup Python Overview
Installing Anaconda, Anaconda Basics Python 2.7 vs Python 3
and Introduction Writing your First Python Program
Get familiar with version control, Git and Lines and Indentation, Python
GitHub. Identifiers
Basic Github Commands. Various Operators and Operators
Introduction to Jupyter Notebook Precedence
environment. Basics Jupyter notebook Getting input from User, Comments,
Commands. Multi line Comments.
Programming language basics.

3. Strings, Decisions & Loop Control 4. Python Data Types


Working With Numbers, Booleans List, Tuples, Dictionaries 
and Strings, String types and Python Lists, Tuples, Dictionaries
formatting, String operations Accessing Values, Basic Operations
Simple if Statement, if-else Statement Indexing, Slicing, and Matrixes
if-elif Statement. Built-in Functions & Methods
Introduction to while Loops, for Loops, Exercises on List, Tuples And
Using continue and break. Dictionary
Class hands-on :
6 programs/coding exercise on string,
loop and conditions in classroom

5. Functions And Modules Class hands-on (Python Data


Introduction To Functions Types):
Defining & Calling Functions Program to convert tuple to
Functions With Multiple Arguments. dictionary
Anonymous Functions - Lambda Remove Duplicate from Lists
Using Built-In Modules, User-Defined Python program to reverse a tuple
Modules, Module Namespaces, Program to add all elements in
Iterators And Generators list.
Class hands-on : + 3 more programs to be covered
8+ Programs to be covered in class of in class
functions, Lambda, modules, Generators
and Packages.
Python Programming Term 1
Module 1
50 hours

6. File I/O And Exceptional Handling Regular Expression Modifiers


and Regular Expression Regular Expression Patterns
Opening and Closing Files
open Function,file Object Attributes Class hands-on :
close() Method ,Read,write,seek. 10+ Programs to be covered in class
Exception Handling, try-finally Clause from File IO, Reg-ex and exception
Raising an Exceptions,User-Defined handling.
Exceptions
Regular Expression- Search and
Replace

7. Data Analysis Using Numpy 8. Data Analysis Using Pandas

Introduction to Numpy. Array Pandas : Introduction to Pandas


Creation, Printing Arrays, Basic Importing data into Python
Operation - Indexing, Slicing and Pandas Data Frames, Indexing Data
Iterating, Shape Manipulation - Frames ,Basic Operations With Data
Changing shape, stacking and splitting frame, Renaming Columns, Subsetting
of array and filtering a data frame.
Vector stacking, Broadcasting with
Numpy, Numpy for Statistical Operation.

Assignment 1 (Week 2):


10 Coding exercises on Python Basics - Variables,
Operators, Strings, Loops, Control Statement
Assignment 2 (Week 3):
10 Python programs and practice set on List, Tuples,
Dictionaries & Matrices operations
Assignment 3 (Week 4):
10 Coding exercises on Functions, Lambda,
Input-Output, File and Regular Expression
Python Programming Term 1
Module 1
50 hours

9. Data Visualization using 10. Data Visualization using


Matplotlib Seaborn

Matplotlib: Introduction, plot(), Seaborn :


Controlling Line Properties, Subplot Intro to Seaborn And Visualizing
with Functional Method, Multiple statistical relationships , Import and
Plot, Working with Multiple Figures, Prepare data. Plotting with categorical
Histograms data and Visualizing linear
relationships.
Seaborn Exercise

3 Case Study on Numpy, Pandas, REAL TIME USE CASES IN PYTHON


Matplotlib TO BE COVERED IN CLASS
1 Case Study on Pandas And Seaborn
WITH 5 ASSIGNMENTS
Assessment Test in Python :
2 hour of Assessment Test in
Python (Coding & Objective
Questions)
Statistics Term 2
Module 1
30 hours

1. Fundamentals of Math and 2. All about Population & Sample


Probability Population vs Sample, Sample Size
Probability distributed function & Simple Random Sampling,
cumulative distribution function. Systematic Sampling, Cluster
Conditional Probability, Baye’s Sampling, Stratified Sampling,
Theorem Convenience Sampling, Quota
Problem solving for probability Sampling, Snowball Sampling and
assignments Judgement Sampling
Random Experiments, Mutually
Exclusive Events, Joint Events,
Dependent & Independent Events

3. Introduction to Statistics, 4. Descriptive Statistics


Statistical Thinking Measures of Central Tendency –
Variable and its types Mean, Median and Mode
Quantitative, Categorical, Discrete, Measures of Dispersion – Standard
Continuous, Deviation, Variance, Range, IQR
*all with examples (Inter-Quartile Range)
Measure of Symmetricity/ Shape –
Five Point Summary and Box Plot Skewness and Kurtosis
Outliers, Causes of Outliers, How to
treat Outliers, I-QR Method and Z-
Score Method

5. Inferential Statistics 6. Hypothesis Testing


Central Limit Theorem Type of test and Rejection Region
Point estimate and Interval estimate Type o errors-Type 1 Errors, Type 2
Creating confidence interval for Errors. P value method, Z score
population parameter Method. The Chi-Square Test of
Characteristics of Z-distribution and T- Independence.
Distribution. Regression. Factorial Analysis of
Type of test and rejection region. Variance. Pearson Correlation
Type of errors in Hypothesis Testing Coefficients in Depth. Statistical
Significance
Statistics Term 2
Module 1
30 hours

Null and Alternative Hypothesis 7. Linear Algebra


One-tailed and Two-tailed Tests, Dot Product, Projecting Point on
Critical Value, Rejection region, Axis.
Inference based on Critical Value Matrices in Python, Element
Indexing, Square Matrix, Triangular
Binomial Distribution
Matrix, Diagonal Matrix, Identity
Assumptions of Binomial Distribution,
Matrix, Addition of Matrices, Scalar
Normal Distribution, Properties of
Multiplication, Matrix Multiplication,
Normal Distribution, Z table,
Matrix Transpose, Determinant,
Empirical Rule of Normal Distribution
Trace
& Central Limit Theorem and its
Applications

T-Test, Analysis of variance (ANOVA), 8. Data Processing & Exploratory


and Analysis of Covariance (ANCOVA) Data Analysis
Regression analysis in ANOVA What is Data Wrangling
Class Hands-on: Data Pre-processing and cleaning?
Problem solving for C.L.T How to Restructure the data?
Problem solving Hypothesis Testing What is Data Integration and
Problem solving for T-test, Z-score test Transformation
Case study and model run for
ANOVA, ANCOVA

9. EDA Note: Problem-Solving Techniques


Finding and Dealing with Missing and Case Studies using Statistics will
Values. be covered in class from week 2.
What are Outliers?
Using Z-scores to Find Outliers.
Bivariate Analysis, Scatter Plots and Statistics Assignments : Total 4
Heatmaps. practice set and Assignments from
Introduction to Multivariate Statistics
Analysis
Machine Learning Term 2
Module 2
40 hours

1. Machine Learning Introduction 2. Regression and Classification


Definition, Examples, Importance of Models
Machine Learning Definition of regression, OLS
Definition of ML Elements: Algorithm, Algorithm, Sum of Squares of
Model, Predictor Variable, Response residuals, Gradient Descent
Variable, Training - Test Split, Steps in Algorithm, Cost Function
Machine Learning, Evaluation Metrics for Regression
ML Models Type: Supervised Model: MAE, MSE, RMSE, R Square,
Learning, Unsupervised Learning Adjusted R Square
and Reinforcement Learning.

3. Linear Regression Model 4. Data Preprocessing


Comparing MAE, MSE, and RMSE. Types of Missing values (MCAR, MAR,
Significance of Adjusted R square. MNAR) , Methods to handle missing
Overfitting and Underfitting. Bias and values
Variance. Outliers, Methods to handle outliers:
Regularization methods: IQR Method, Z Method
Ridge and Lasso Feature Scaling: Definition , Methods:
Multicollinearity, VIF. Using Python Absolute Maximum Scaling, Min-Max
library Sklearn to create the Linear Scaler , Normalization, Standardization,
Regression Model and evaluate the Robust Scaling
model created.

5. Data Preprocessing 7. Evaluation Metrics for


Encoding the data: Definition, Classification model
Methods: OneHot Encoding, Mean Confusion Matrix, Accuracy,
Encoding, Label Encoding, Target Misclassification, TPR, FPR, TNR,
Guided Ordinal Encoding Precision, Recall, F1 Score, ROC
Curve, and AUC. Using Python
6. Logistic Regression Model library Sklearn to create the Logistic
Definition. Why is it called the Regression Model and evaluate the
“Regression model”? model created
Sigmoid Function, Transformation &
Graph of Sigmoid Function
Machine Learning Term 2
Module 2
40 hours

8. K Nearest Neighbours Model 9. Decision Tree Model


Definition, Steps in KNN Model, Definition, Basic Terminologies, Tree
Types of Distance: Manhattan Splitting Constraints, Splitting
Distance, Euclidean Distance, ‘Lazy Algorithms:
Learner Model’. CART, C4.5, ID3, CHAID
Confusion Matrix of Multi Class Splitting Methods:
Classification GINI, Entropy, Chi-Square, and
Using Python library Sklearn to Reduction in Variance
create the K Nearest Neighbours Using Python library Sklearn to create
Model and evaluate the model the Decision Tree Model and evaluate
the model created

10. Random Forest Model 11. Hyperparameter Tuning


Ensemble Techniques: GridSearchCV, Variable Importance.
Bagging/bootstrapping & Boosting. Using Python library Sklearn to create
Definition of Random Forest, OOB the Random Forest Model and
Score evaluate the model created.
K-Fold Cross-Validation Use cases

12. Naive Baye’s Model Case Study


Definition, Advantages, Baye’s Business Case Study for Kart
Theorem Applicability, Disadvantages Model
of Naive Baye’s Model, Laplace’s Business Case Study for  Random
Correction, Types of Classifiers: Forest
Gaussian, Multinomial and Bernoulli Business Case Study for  SVM
Using Python library Sklearn to create To classify an email as spam or
the Naive Baye’s Model and evaluate not spam using logistic
the model created Regression.
Application of Linear Regression
for Housing Price Prediction
Machine Learning Term 2
Module 2
40 hours

13. K Means and Hierarchical 14. Hierarchical Clustering


Clustering Dendrogram, Agglomerative
Definition of Clustering, Use cases of Clustering, Divisive Clustering,
Clustering Comparison of K Means Clustering and
K Means Clustering Algorithm, Hierarchical Clustering
Assumptions of K Means Clustering Using Python library Sklearn to create
Sum of Squares Curve or Elbow Curve and evaluate the clustering model

15. Principal Component 16. Support Vector


Analysis(PCA): Machine(SVM)
Definition, Curse of Dimensionality, Model: Definition, Use Cases,
Dimensionality Reduction Kernel Function, Aim of Support
Technique, When to use PCA, Vectors, Hyperplane, Gamma
Use Cases Value, Regularization Parameter
Steps in PCA, EigenValues and Using Python library Sklearn to
EigenVectors, Scree Plot. create and evaluate the SVM
Using Python library Sklearn to Model
create Principal Components

Summary of all Machine Learning Case Study


Models and Discussion about the Recommendation Engine for
Capstone Project e-commerce/retail chain
Twitter data analysis using NLP
Note :
All  Machine Learning Algorithms are
covered in depth with real time case
studies for each algorithm.
Once 60% of ML is completed,
Capstone Project will be released for
the batch.
SQL Term 3
Module 1
14 hours

1. SQL and RDBMS 2. Advance SQL


RDBMS And SQL Operations. Advance SQL Operations
Single Table Queries - SELECT, Data Aggregations and summarizing
WHERE, the data
ORDER BY, Distinct, And, OR Ranking Functions: Top-N Analysis
Multiple Table Queries: INNER, SELF, Advanced SQL Queries for Analytics
CROSS, and OUTER, Join, Left Join,
Right
Join, Full Join, Union

3. NoSQL, HBase & MongoDB 4. JSON Data & CRUD


NoSQL Databases Basics and CRUD Operation
Introduction to HBase Databases, Collection & Documents
HBase Architecture, HBase Shell & MongoDB drivers
Components, Storage Model of What is JSON Data
HBase. Create, Read, Update, Delete
HBase vs RDBMS Finding, Deleting, Updating, Inserting
Introduction to Mongo DB, CRUD Elements
Advantages of MongoDB over Working with Arrays
RDBMS Understanding Schemas and Relations

5. Programming with SQL Assignments


Mathematical Functions Working with multiple tables
Variables Practice Joins, Grouping and
Conditional Logic Subqueries
Loops Using GROUP BY and HAVING
Custom Functions Clauses
Grouping and Ordering Practice Aggregation Queries
Partitioning
Filtering Data
Subqueries
MongoDB Term 3
Module 2
14 hours

1. Introduction to MongoDB 2. MongoDB (Advance)


What is MongoDB MongoDB Use cases
Characteristics and Features MongoDB Structures
MongoDB Ecosystem MongoDB Shell vs MongoDB
Installation process Server
Connecting to MongoDB database Data Formats in MongoDB
Introduction to NoSQL MongoDB Aggregation Framework
Introduction of MongoDB module Aggregating Documents
What are Object Ids in MongoDB

2. MongoDB (Advance) Tool Covered


Working with MongoDB Compass
& exploring data visually
Understanding Create, Read,
Update, Delete
Schemas & Relations Assignment
Document Structure Obtain the data in the format you
Working with Numeric Data want by formulating queries that
Working on Scheme Designing are both effective and high-
performing.
Tableau Term 3
Module 3
14 hours

1. Introduction to Tableau 2. Visual Analytics


Connecting to data source Getting Started With Visual
Creating dashboard pages Analytics
How to create calculated columns Sorting and grouping
Different charts Working with sets, set action
Filters: Ways to filter, Interactive
Filters
Forecasting and Clustering

3. Dashboard and Stories 4. Tableau (Advance)


Working in Views with Mapping
Dashboards and Stories Coordinate points
Working with Sheets Plotting Latitude and Longitude
Fitting Sheets Custom Geocoding
Legends and Quick Filters Polygon Maps
Tiled and Floating Layouts, WMS and Background Image
Floating Objects

Hands-on Assignments
Connecting data source and
data cleansing
Working with various charts
Deployment of Predictive
model in visualization

Tool Covered
PowerBI Term 3
Module 4
14 hours

1. Getting Started With Power BI 2. Programming with Power BI


Installing Power BI Desktop and Working with Time Series
Connecting to Data Understanding aggregation and
Overview of the Workflow in Power granularity
BI Desktop Filters and Slicers in Power BI Maps
Introducing the Different Views of Scatterplots and BI Reports
the Data Mode
Query Editor Interface
Working on Data Model

Connecting Dataset with Power BI Assignments


Creating a Customer Create Bar charts
Segmentation Dashboard Create Pie charts
Analyzing the Customer Create Tree maps
Segmentation Dashboard Create Donut Charts
Create Waterfall Diagrams
Creating Table Calculations for
Gender

Tool Covered

Note: All the assignments will be


covered in-depth with real-time
examples
Big Data & Spark Analytics Term 3
Module 5
16 hours

1. Introduction To Hadoop & Big 2. What is Spark


Data Introduction to Spark RDD
Distributed Architecture - A Brief Introduction to Spark SQL and Data
Overview. Understanding Big Data frames
Introduction To Hadoop, Hadoop Using R-Spark for machine learning
Architecture Hands-on:
HDFS, Overview of MapReduce Installation and configuration of
Framework Spark
Hadoop Master: Slave Architecture Using R-Spark for machine learning
MapReduce Architecture programming
Use cases of MapReduce

3. Getting to know PySpark Hands-on


Pyspark Introduction Map reduce Use Case 1: Youtube data
Pyspark Environment Setup analysis
pySpark - Spark context
RDD , Broadcast and Accumulator Map reduce Use Case 2: Uber data
Sparkconf and Sparkfiles analytics
Spark MLlib Overview Algorithms
and utilities in Spark Mlib Spark RDD programming

Spark SQL and Data frame


programming

Tools Covered
Time Series Term 3
Module 6
14 hours

1. Introduction to Time Series 2. Introduction to ARIMA Models


Forecasting ARIMA Model Calculations, Manual
Basics of Time Series Analysis and ARIMA Parameter Selection
Forecasting ARIMA with Explanatory Variables
Method Selection in Forecasting Understanding Multivariate Time
Moving Average (MA) Forecast Example Series and their Structure
Different Components of Time Series Checking for Stationarity and
Data Differencing the MTS
Log Based Differencing, Linear
Regression for Detrending

Case Study Case Study


Time series classification of Performing Time Series
smartphone data to predict Analysis on Stock Prices
user behavior
Time series forecasting of
sales data
Note: All the assignments and case
studies will be covered in-depth with
real-time examples
Deep Learning using TensorFlow Term 4
Module 1
20 hours

1. Introduction to Deep Learning And Creating A Graph – Graph


TensorFlow Visualization
Neural Network Creating a Model – Logistic
Understanding Neural Network Model Regression
Installing TensorFlow Model Building using TensorFlow
Simple Computation, Constants, and
Variables
Types of file formats in TensorFlow

2. TensorFlow Classification 3. Understanding Neural Networks


Examples With TensorFlow
Introduction to TensorFlow Basic Neural Network
Installing TensorFlow Single Hidden Layer Model
Simple Computation, Contents Multiple Hidden Layer Model
and Variables Backpropagation – Learning Algorithm
Types of file formats in TensorFlow and visual representation
Creating A Graph - Graph Visualization Understand Backpropagation – Using
Creating a Model - Logistic Regression Neural Network Example
Model Building TensorBoard
TensorFlow Classification Examples

4. Convolutional Neural Network Project


(CNN) Building a CNN for Image
Convolutional Layer Motivation Classification
Convolutional Layer Application Project on backpropagation
The architecture of a CNN using Neural Networks with
Pooling Layer Application TensorFlow
Deep CNN
Understanding and Visualizing a Tool Covered
CNN
Natural Language Processing (NLP) Term 4
Module 2
24 hours

1. Natural Language Processing 2. Text Analysis


Text Analytics Distance Algorithms used in Text
Introduction to NLP Analytics
Use cases of NLP algorithms String Similarity
NLP Libraries Cosine Similarity Mechanism -
Need for Textual Analytics The similarity between two text
Applications of NLP documents
Word Frequency Algorithms for NLP Levenshtein distance - measuring the
Sentiment Analysis difference between two sequences.

Important 3. KNN
Applications of Levenshtein distance Information Retrieval Systems
LCS(Longest Common Sequence ) Information Retrieval - Precision,
Problems and solutions, LCS Recall,F- score TF-IDF
Algorithms. KNN for document retrieval
K-Means for document retrieval
Clustering for document retrieval

Use cases on NLP Use cases on NLP


Sentiment analysis for Application to translate and
marketing summarize the news
Toxic comments classification RESTful API for similarity check
Language identification
Generating research papers
titles
Model Training & Deployment using (AWS GCP) Term 4
Module 3
10 hours

1. AWS (Amazon Web Services) 3. Introduction to AWS and GCP


Deployment Strategies Cloud ML Engine
Automations CloudML Engine & AWS in Machine
Monitoring and Logging Learning WorkFlow
Communication and Collaboration Components of AWS & Cloud ML
Engine
2. GCP (Google Cloud Platform) GCP and AWS Console.
GCP Development Tools - Cloud SDK, gcloud command-line tool and Rest
Repositories, Plugins API
Deployment Manager and Cloud
Endpoint

4. Deploying Machine Learning 5. Training Machine Learning


Model Model
Deploying Models, Understanding Developing a trained model
training graphs and serving graphs application
Check and adjust model size Running and monitoring a machine
Build an optimal prediction graph learning model
Creating input function Using hyperparameter tuning
creating a model version Using GPUs for training models in
Getting Online Prediction the cloud

Tools Covered
Contact Us

Learn Here,
Lead Anywhere

Click on the icon to follow us!

Address Book a counselling session


#1090 with expert!
1st floor, 18th Cross Rd,
above Sangam Sweets,
Sector 3, HSR Layout,
Book a session
Bengaluru, Karnataka
560102

www.learnbay.co Learnvista Pvt. Ltd +91 73492 22263

You might also like