You are on page 1of 6

# Course in Data Science

Contact: +917095167689
In this course you will get an introduction to the main tools and ideas which are required for Data
Scientist/Business Analyst/Data Analyst. The course gives an overview of the data, questions, and
tools that data analysts and data scientists work with. There are two components to this course. The first
is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a
practical introduction to the tools that will be used in the program like R Programming, SAS, MINITAB and
EXCEL.

Course features:

## Real Time Case Study driven approach

Live Project

Placement Assistance

Qualification

## Mode of course delivery

Online Training

Faculty Details:

A team of faculty having an average 20 + years experience in the data analysis across various
industries and training.

## Module:1 - Descriptive & Inferential Statistics:(30 Hrs)

1. Turning Data into Information

5. Hypothesis Testing

Data Visualization

Hypothesis Testing

## Type I and Type II Errors

Measures of Variability

## Decision Making in Hypothesis Testing

Measures of Shape

Covariance, Correlation

## Using Software-Real Time Problems

2. Probability Distributions

## Hypothesis Testing for a Mean,

Variance, Proportion

Random Variables

## Two Variances Test(F-Test)

Normal distribution

## Using Software-Real Time Problems

3. Sampling Distributions

Proportions

## Central Limit Theorem

ANOVA Assumptions

## Multiple Comparisons (Tukey, Dunnett)

Proportion, p-hat

Mean, x-bar

## Using Software-Real Time Problems

4. Confidence Intervals

8. Association Between
Categorical Variables

## Statistical Significance of Observed

Statistical Inference
Constructing confidence intervals to

Proportion

## Calculating the Chi-Square Test

Statistic

Contingency Table
Using Software-Real Time Problems

## Module:2 Prediction Analytics (25Hrs)

1. Simple Linear Regression

Influence

Parameters

## Using Software-Real Time Problems

Intercept

7. Polynomial Regression

Coefficient of Determination

## Using Software-Real Time

2. Multiple Regression

## Multiple Regression Models

8. Dummy Variables

## Polynomial Model in One/ Two /More

Variable
Orthogonal Polynomials
Using Software-Real Time Problems

## The General Concept of Indicator

Variables

Regression

Multicollinearity

Building

## Forward Selection/Backward Elimination

Residual Analysis

Stepwise Regression

Concept of GLM

## Using Software-Real Time Problems

Logistic Regression

4. Transformations

Poisson Regression

## Negative Binomial Regression

Exponential Regression

Variance-Stabilizing Transformations

## Analytical Methods for selecting a

11. Autocorrelation

Transformation

Errors

## Module:3 Applied Multivariate Analysis (25hrs)

1. Measures of Central Tendency,

6. MANOVA

## Measures of Central Tendency/

Measures of Dispersion

Hypothesis Tests

MANOVA table

Distribution

## 3. Sample Mean Vector and

Sample Correlation

(PCA)

Procedure

## Using Software-Real Time Problems

5. Factor Analysis

## Principal Component Analysis method

Communalities

Factor Rotations
Using Software-Real Time Problem

MANOVA
Test Statistics for MANOVA

1. Introduction

## 6. Support Vector Machine

Application Examples

## Maximum Marginal Classifier

Supervised Learning

## Support Vector Classier

Unsupervised Learning

## 2. Regression Shrinkage Methods

Ridge Regression
Lasso Regression
Using Software-Real Time Problems

3. Classification

7. Cluster Analysis

## Agglomerative Hierarchical Clustering

K-Means Procedure

Logistic Regression

## Using Software-Real Time Problems

8. Dimensionality Reduction

Discriminant Analysis(LDA/QDA)

Classifier)

## Using Software-Real Time Problems

4. Tree-based Methods

## The Basics of Decision Trees

Regression Trees

Classication Trees

Ensemble Methods

## Using Software-Real Time Problems

5. Neural Networks

Introduction

## Single Layer Perceptron

Multi-layer Perceptron

## Using Software-Real Time Problems

9. Association rules

1. R Programming

R Basics

## Module 1-4 demonstrated using R

Numbers, Attributes

Creating Vector

Mixing Objects

Explicit Coercion

## Matrices, List, Factors, Data Frames,

Missing Values, Names

Using Dput/DDump

## Sub setting R objects

Vectorized Operations

## Managing Data Frames with the DPLYR

package

Control Structures

Functions

Loop Functions

Debugging

programming