
Python Programming

Data Types, Python Introduction, Installation and Setup, Python Basics,
Conditionals & Loops, Working with Functions, List Manipulation, Tuple, Set &
Dictionary, Regular Expression, Date Time

Data Manipulation with Pandas


Pandas offers data structures and operations for manipulating numerical
tables and time series. Data Manipulation, Missing Values, Data Pre-processing,
Grouping, Merge, Broadcasting

NumPy

NumPy has functions for operating in the domain of linear algebra, matrices, and
more. Why NumPy is fast, Create NumPy arrays, Slicing & Indexing, Mathematical
Operations - 1D, Boolean Indexing - 1D, Boolean Indexing - 2D, NumPy Broadcasting.

Data Visualization with Matplotlib


Bar charts, scatter plots, count plots, line plots, pie charts, donut charts, etc.,
with Python matplotlib.

Data Visualization with Seaborn


Regression plots, categorical plots, area plots, etc., with Python seaborn.

Data Visualization with Plotly


Creating advanced and interactive plots with Plotly

Data Analysis with Excel


Reading the Data, Referencing in Formulas, Named Ranges, Logical Functions,
Conditional Formatting, Advanced Validation, Dynamic Tables in Excel, Sorting and
Filtering, Handling Text Data, Splitting, combining, data imputation on text data,
Working with Dates in Excel, Data Conversion, Handling Missing Values, Data
Cleaning, Working with Tables in Excel, etc. Charts, Pie charts, Scatter and
bubble charts, Bar charts, Column charts, Line charts, Maps. Binary Classification
Problems, Confusion Matrix, AUC and ROC curve, Multiple Classification Problems.
Standardization, Normalization, Probability Distributions, Inferential Statistics,
Hypothesis Testing, ANOVA, Covariance, Correlation, Linear Regression, Logistic
Regression, Error in regression, Information Gain using Regression, Probability,
Entropy, Dependence, Mutual Information.

R Programming
The basics of coding on the RStudio platform, Inputs and R objects (vector, matrix,
dataframes and factors), R datatypes, Using the dplyr package, Text manipulations using
String, Reading data (csv file), Data Visualization with ggplot, Supervised and
Unsupervised Modelling, H2O, Lubridate, Caret.

SQL
Introduction to DBMS, Schema Design, Key Constraints & Basics of Normalization,
Joins, Subqueries Involving Joins & Aggregations, Sorting, Independent Subqueries,
Correlated Subqueries, Analytic Functions, Set Operations, Grouping and Filtering

Data Preprocessing
Encoding, Scaling with Normalization and Min-Max Scaling, Outlier Correction,
Missing Values, Polynomial Variables, etc. Unstructured Data, Feature Extraction,
Feature Engineering, Bias-Variance Trade-off, Unbalanced Data.

Statistics
Introduction to Statistics, Data Types in Statistics, Sample & Population, Simple
Random Sampling, Stratified sampling, Cluster sampling, Systematic Sampling,
Categories of Statistics. Measures in Descriptive Statistics, Measures of central
tendency, Measures of Spread, Range, Variance & Standard Deviation, Measure of
Position. Introduction to Inferential Statistics, Why Inferential Statistics?,
Probability Distribution, Normal Distribution, Standard Normal Distribution,
Sampling Distribution, Central Limit Theorem. What is Hypothesis Testing, Null &
Alternative Hypothesis, Significance Level, Test statistic, Test Statistic:
Critical value & Rejection Region, Test Statistic: Type of Test, Errors in
Hypothesis Testing.

Supervised Machine Learning: Regression Analysis


Introduction to Linear Regression, Optimal Coefficients, Cost function, Coefficient
of Determination, Analysis of Linear Regression using dummy Data, Linear Regression
Intuition. Multiple regression and its use in solving real-world problems. Ridge,
Lasso, Elastic Net and Polynomial Regression, L1 and L2 regularization. Regression
Analysis, Handling, Residual analysis, AIC, BIC, Model Fitting, Training and Test
Data, R-Square, Dummy variables, Non-Linear Regression, Gradient descent algorithm,
an iterative optimization approach to finding a local minimum of
a given function, KNN, Regression using Decision Trees and Random Forest. Support
Vector Machine, Decision Tree. How to train the model, how to evaluate the model
and how to optimize the efficiency of the model.
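
The gradient descent idea listed above can be illustrated with a minimal sketch. It is not taken from the course material: the function names, the learning rate, the step count and the example function f(x) = (x - 3)^2 are all assumptions chosen for illustration.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0.

    grad is a function returning the derivative at a point.
    learning_rate and steps are illustrative defaults, not tuned values.
    """
    x = x0
    for _ in range(steps):
        # Move a small distance in the direction that decreases the function.
        x = x - learning_rate * grad(x)
    return x


# Hypothetical example: f(x) = (x - 3)^2 has derivative 2 * (x - 3)
# and a single minimum at x = 3.
def grad_f(x):
    return 2 * (x - 3)


x_min = gradient_descent(grad_f, x0=0.0)
print(x_min)  # very close to 3
```

Stepping along the gradient instead of against it (gradient ascent) would search for a local maximum; a learning rate that is too large can make the iterates diverge instead of settling.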

Supervised Machine Learning: Classification Analysis


Handling Classification Problems, Logistic Regression, Cost Function, Finding
Optimal Values, Solving Derivatives, Multiclass Logistic Regression, Finding
Complex Boundaries and Regularization, Using Logistic Regression from Sklearn.
Bayes Theorem, Independence Assumption in Naïve Bayes, Probability estimation for
Discrete Values Features, How to handle zero probabilities, Implementation of Naïve
Bayes, Finding the probability for continuous valued features, Text Classification
using Naïve Bayes. Introduction to KNN, Feature scaling, Cross Validation, Finding
Optimal K, Implement KNN, Curse of Dimensionality, Handling Categorical Data, Pros
& Cons of KNN. Intuition behind SVM, SVM Cost Function, Decision Boundary & the C
parameter, using SVM from Sklearn, Finding Non Linear Decision Boundary, Choosing
Landmark Points, Similarity Functions, How to move to new dimensions, Multi-class
Classification, Choosing Parameters using Grid Search, Using Support Vectors for
Regression. Decision Trees, Getting Best Decision Tree, Deciding Feature to Split
on, Continuous Valued Features, Code using Sklearn decision tree, information gain,
Gain Ratio, Gini Index, Decision Trees & Overfitting, Pruning. Introduction to
Random Forests, Data Bagging and Feature Selection, Extra Trees, Classification
report to evaluate the model on recall, precision, F1-score, support, accuracy
etc. Confusion matrix to evaluate the true positive, true negative, false positive
and false negative outcomes in the model.
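
As a rough illustration of the confusion matrix and the precision and recall figures mentioned above, the sketch below counts the four outcomes for a binary problem in plain Python. The helper names and the small label lists are hypothetical, made up for this example; in practice a library such as scikit-learn would usually be used instead.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = tn = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1  # predicted positive, actually positive
            else:
                fp += 1  # predicted positive, actually negative
        else:
            if actual == positive:
                fn += 1  # predicted negative, actually positive
            else:
                tn += 1  # predicted negative, actually negative
    return tp, fp, tn, fn


def precision_recall_f1(tp, fp, fn):
    """Derive precision, recall and F1 from the counts (0.0 if undefined)."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Made-up labels for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
accuracy = (tp + tn) / len(y_true)
print(tp, fp, tn, fn)  # 3 1 3 1
print(precision, recall, f1, accuracy)  # 0.75 0.75 0.75 0.75
```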

Ensemble Modelling
Bagging, Boosting, Random Forest, AdaBoost (Adaptive Boosting), Gradient boosting,
Hyperparameter Tuning, Cross Validation, Grid Search and more

Unsupervised Machine Learning


Clustering, K-means, How to choose Optimal K, Silhouette algorithm to choose K,
Introduction to K-Medoids, K-Medoids Algorithm, Hierarchical Clustering, Bottom-up
(Agglomerative) and Top-down (Divisive) Approaches. Distance methods - Euclidean, Manhattan, Cosine, Mahalanobis.
Principal Component Analysis, Intuition behind PCA, Math behind PCA, Finding
Optimal Number of Features. LDA or linear discriminant analysis to reduce or
optimize the dimensions in the multidimensional data.

Recommendation System
Purposes of Recommender Systems, Paradigms of Recommender Systems, Collaborative
Filtering, Association Rule Mining, Market Basket Analysis, Generation Apriori
Algorithm, Apriori Algorithm, User Movie Recommendation Model

TensorFlow and Keras


Introduction to TensorFlow, Introduction to Keras, Creating Models with Keras,
Working with Keras APIs

Deep Learning
Implementing Neural Network, How to compose Models in PyTorch, Saving and Loading
model, Intuitively building networks, Introduction to Artificial Neural Networks,
Hidden layers, Activation function, Loss Functions, Understand Forward & Back
propagation, Regularization, Types of Regularization, Normalization, Different
Optimization Technique, Gradient Descent, Vanishing Gradient, Batch Norm, Transfer
Learning, Q-Learning, Encoder-Decoder, Reinforcement Learning.

Computer Vision
Pooling Layer, Data Flow in CNN, Architecture of CNN, Initializing weights, Forward
Propagation in TensorFlow, Convolution and Maxpool Functions, Regularization using
Dropout layer, Adding Dropout Layer to the network, Building CNN Keras, AlexNet,
VGGNet, ResNet, ResNeXt, Face Detection, Face Tracking, Face Recognition, Object
Detection.

Natural Language Processing


Regular Expression, Using Words as Features, Basics of word processing, Stemming,
Part of Speech, Lemmatization, Building Feature set, Classification using NLTK
Naïve Bayes, Count vectorizer, N-gram, TF-IDF, Word cloud, Principal Component
Analysis, Bigrams & Trigrams, Web Scraping with BeautifulSoup, Text summarization,
LexRank algorithm, Latent Dirichlet Allocation (LDA) Technique, Word2vec
Architecture (Skip Grams vs CBOW), Text classification, Document vectors, Text
classification using Doc2vec, Music Analytics, Machine Translation, Text
Classification, Text Segmentation, Sentiment Analysis, NLP vs. NLU vs. NLG,
Word2vec and GloVe, RNN / LSTM / Bi-LSTM / GRU

Time Series Analysis


Introduction to Time Series, Stationary and Non-Stationary, Auto-Correlation,
Rolling Forecast, Exponential Forecast, Autoregressive Moving Average (ARMA)
Models, Autoregressive Integrated Moving Average (ARIMA) Models, Financial Time
Series, Autoregressive Conditional Heteroscedasticity (ARCH) Models, Generalized
Autoregressive Conditional Heteroscedasticity (GARCH) Models, Vector
Autoregressive (VAR) Models, RNN and LSTM

Optimization
Linear Programming, Solver, Optimization Concepts.

Big Data
Python integration with Hadoop MapReduce and Spark, Introduction to Big Data and
Hadoop Ecosystem, HDFS and Hadoop Architecture, MapReduce and Sqoop, Basics of
Impala and Hive, Working with Hive and Impala, Type of Data Formats, Advanced Hive
concepts and Data File Partitioning, Apache Flume and HBase, Apache Pig, Basics of
Apache Spark, RDDs in Spark, Implementation of Spark Applications, Spark Parallel
Processing, Spark RDD Optimization Techniques, Spark Algorithm, Spark SQL

Cloud Computing
Cloud Computing Fundamentals, Traditional IT Infrastructure, Cloud Infrastructure,
Cloud Companies (IBM, Microsoft Azure, GCP, AWS) & their Cloud Services, Use Cases
of Cloud computing, Overview of Cloud Deployment, Models Implementation in Cloud

Tableau
Introduction to Data Visualization, Introduction to Tableau, Connect to and
Transform Data, Basic Charts and Dashboard, Descriptive Statistics, Dimensions and
Measures, Explore and Analyse, Create Content, Dashboard Design & Principles,
Advanced Design Components, Special Chart Types

Power BI
Introduction to Power BI, Power BI components, Power BI Desktop, workflows and
reports, Data Extraction with Power BI. Power Query Editor, Advanced Editor, Query
Dependency Editor, Data Transformations, Shaping and Combining Data, M Query and
Hierarchies in Power BI. Data Visualization with Analytics, Slicers, Filters, Drill
Down Reports, Power BI Query, Q&A and Data Insights

Marketing & Retail Analytics


Customer Analytics, KNIME, Customer Churn, Association Rules Mining

Financial & Risk Analytics


Credit Risk Models, Overview of Probability of Default (PD) Modelling, PD Models,
Types of Models, Market Risk, Value at Risk, Fraud Detection

Supply Chain & Logistics Analytics


Introduction to Supply Chain, Demand Planning, Inventory Control, Inventory
Management, Inventory Modelling, Advanced Forecasting Methods.

Fintech and Blockchain


Blockchain, Digital Signature, Distributed Ledger, SHA, Proof of Work, Proof of
Stake, Mining, Rigs, Public Blockchain, Private Blockchain, Cryptocurrency,
BigTech

Healthcare & Pharma Analytics


Marketing & Sales Analytics, Provider-Payer-Patient Analytics, Claims Analytics,
Fraud Analytics, MROI

Telecom Analytics
Network Analytics, Subscriber Analytics, Loyalty Analytics, Revenue Leakage
Analytics
