You are on page 1of 6

# Course in Data Science

Contact: +917095167689
In this course you will get an introduction to the main tools and ideas which are required for Data
Scientist/Business Analyst/Data Analyst. The course gives an overview of the data, questions, and
tools that data analysts and data scientists work with. There are two components to this course. The first
is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a
practical introduction to the tools that will be used in the program like R Programming, SAS, MINITAB and
EXCEL.

Course features:

## Real Time Case Study driven approach

Live Project

Placement Assistance

Qualification

## Mode of course delivery

Classroom/Online Training

Faculty Details:

A team of faculty having an average 20 + years experience in the data analysis across various
industries and training.

## Module:1 - Descriptive & Inferential Statistics:(35 Hrs)

1. Turning Data into Information

5. Hypothesis Testing

Data Visualization

Hypothesis Testing

## Type I and Type II Errors

Measures of Variability

## Decision Making in Hypothesis Testing

Measures of Shape

## Hypothesis Testing for a Mean,

Covariance, Correlation

## Using Software-Real Time Problems

2. Probability Distributions

## Probability Distributions: Discrete

Variance, Proportion

Random Variables

## Two Variances Test(F-Test)

Normal distribution

## Using Software-Real Time Problems

3. Sampling Distributions

Proportions

## Central Limit Theorem

ANOVA Assumptions

## Multiple Comparisons (Tukey, Dunnett)

Proportion, p-hat

Mean, x-bar

## Using Software-Real Time Problems

4. Confidence Intervals

Statistical Inference

## Constructing confidence intervals to

8. Association Between
Categorical Variables

## Statistical Significance of Observed

Relationship / Chi-Square Test

Proportion

## Calculating the Chi-Square Test

Statistic

Contingency Table

## Module:2 Prediction Analytics (35Hrs)

1. Simple Linear Regression

Influence

Parameters

## Using Software-Real Time Problems

Intercept

7. Polynomial Regression

Coefficient of Determination

## Using Software-Real Time

Orthogonal Polynomials

2. Multiple Regression

## Multiple Regression Models

8. Dummy Variables

Variable

## The General Concept of Indicator

Variables

Regression

Multicollinearity

Building

## Forward Selection/Backward Elimination

Residual Analysis

Stepwise Regression

Concept of GLM

## Using Software-Real Time Problems

Logistic Regression

4. Transformations

Poisson Regression

Variance-Stabilizing Transformations

## Transformations to Linearize the Model

Exponential Regression

## Analytical Methods for selecting a

11. Autocorrelation

Transformation

Errors

## Module:3 Applied Multivariate Analysis (20hrs)

1. Measures of Central Tendency,
Dispersion and Association

## Measures of Central Tendency/

6. Discriminant Analysis

Measures of Dispersion

7. MANOVA

Distribution

MANOVA

Hypothesis Tests

MANOVA table

## 3. Sample Mean Vector and

Sample Correlation

(PCA)

Procedure

## Using Software-Real Time Problems

5. Factor Analysis

Communalities

Factor Rotations

Varimax Rotation

1. Introduction

## 6. Support Vector Machine

Application Examples

## Maximum Marginal Classifier

Supervised Learning

## Support Vector Classier

Unsupervised Learning

Ridge Regression

Lasso Regression

## Using Software-Real Time Problems

3. Classification

7. Cluster Analysis

## Agglomerative Hierarchical Clustering

K-Means Procedure

Logistic Regression

## Using Software-Real Time Problems

Discriminant Analysis(LDA/QDA)

Classifier)

## Using Software-Real Time Problems

4. Tree-based Methods

## The Basics of Decision Trees

Regression Trees

Classication Trees

Ensemble Methods

## Using Software-Real Time Problems

5. Neural Networks

Introduction

## Single Layer Perceptron

Multi-layer Perceptron

## Using Software-Real Time Problems

8. Dimensionality Reduction

9. Association rules

1. R Programming

R Basics

## Module 1-4 demonstrated using R

Numbers, Attributes

Creating Vector

Mixing Objects

Explicit Coercion

## Matrices, List, Factors, Data Frames,

Missing Values, Names

Using Dput/DDump

## Sub setting R objects

Vectorized Operations

## Managing Data Frames with the DPLYR

package

Control Structures

Functions

Loop Functions

Debugging

programming