
Input normalization (Preprocessing)

• Initialize the weights to small random values to avoid saturation.
• The connection weights from the inputs to a hidden unit
determine the orientation of the hyperplane. The bias
determines the distance of the hyperplane from the
origin.
• If the data are not centered at the origin, the hyperplane
may fail to pass through the data cloud.
• If all the inputs have a small coefficient of variation, it is quite possible that all the initial hyperplanes will miss the data entirely.
• To avoid saturation:
• If the bias terms are all small random numbers, then all the decision surfaces will pass close to the origin. If the data are not centered at the origin, the decision surfaces will not pass through the data points.
The ‘prestd’ or ‘mapstd’ command in MATLAB normalizes the inputs to zero mean and unit standard deviation.
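A minimal sketch of this normalization (made-up numbers), assuming the inputs are stored the way the toolbox expects, with one row per variable and one column per sample:

% Hypothetical raw inputs: 2 variables (rows) x 6 samples (columns)
p = [100 120  90 110 130 105;   % feature on a large scale
       2   3   1   2   4   2];  % feature on a small scale

% Normalize each row to zero mean and unit standard deviation
[pn, ps] = mapstd(p);           % older toolboxes: [pn, meanp, stdp] = prestd(p)

% Reuse the same settings on new data, or undo the mapping
pnew  = mapstd('apply', [95; 3], ps);
porig = mapstd('reverse', pn, ps);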
Consider an MLP with two inputs (X and Y) and 100 hidden units. With the inputs normalized, it will be easy to learn a hyperplane passing through any part of the data region at any angle.
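This is easy to see by plotting the initial decision lines of the hidden units. The sketch below is an illustration of my own (not taken from the slides): with a centred, unit-variance data cloud and small random weights and biases, every line w1*x + w2*y + b = 0 passes close to the origin and therefore through the data.

rng(0);                               % illustrative data and weights only
data = randn(200, 2);                 % centred, unit-variance 2-D data cloud
W = 0.1*randn(100, 2);                % small random input-to-hidden weights
b = 0.1*randn(100, 1);                % small random biases

figure; hold on;
plot(data(:,1), data(:,2), 'k.');
xs = linspace(-3, 3, 50);
for k = 1:100
    % decision line of hidden unit k: W(k,1)*x + W(k,2)*y + b(k) = 0
    plot(xs, -(W(k,1)*xs + b(k)) / W(k,2), 'b-');
end
axis([-3 3 -3 3]); xlabel('X'); ylabel('Y');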
Curse of Dimensionality

Example: The Fisher Iris problem is a 3-class pattern recognition problem.

Assume that we are taking only one feature (x1), say sepal length.
If we are forced to work with a limited quantity of data, then increasing the dimensionality of the space rapidly leads to the point where the data are very sparse, in which case they provide a very poor representation of the mapping.
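One rough way to make the sparsity concrete: if every input axis is divided into M intervals, the number of cells the data must populate grows as M^d with the dimensionality d, while the number of samples stays fixed. A small sketch (the sample count of 150 matches the Iris set; M = 10 is an arbitrary choice):

N = 150;                 % number of available samples (size of the Iris set)
M = 10;                  % intervals per input axis
for d = 1:4
    cells = M^d;
    fprintf('d = %d: %6d cells, %.4f samples per cell\n', d, cells, N/cells);
end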
Principal Component Analysis (PCA)

• Reduce the dimensionality of a data set which consists of a large number of interrelated variables by linearly transforming the original data set to a new set of usually fewer uncorrelated variables (PCs), while retaining as much as possible of the variation present in the original data set.
• A PC that accounts for more of the variation has more impact on the observations and is thus, intuitively, more informative.
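In terms of the eigen-decomposition used later in these notes, the share of variation attributed to each PC can be read off the eigenvalues of the covariance matrix. A sketch with placeholder data:

X = randn(100, 3);                    % placeholder data, one row per sample
[v, d] = eig(cov(X));                 % eigenvectors v, eigenvalues on diag(d)
lambda = sort(diag(d), 'descend');    % largest eigenvalue = first PC
explained = lambda / sum(lambda);     % fraction of total variation per PC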
Mean, Standard Deviation and Variance
The standard deviation is, roughly, the average distance from the mean of the data set to a point; the variance is the standard deviation squared.
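For a small sample vector these quantities can be computed directly; a sketch with example numbers:

x    = [0 8 12 20];                        % small example data set
xbar = mean(x);                            % mean = 10
s2   = sum((x - xbar).^2) / (numel(x)-1);  % sample variance, same as var(x)
s    = sqrt(s2);                           % standard deviation, same as std(x)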

Covariance

Covariance is always measured between 2 dimensions. If we have a data set with more than 2 dimensions, there is more than one covariance that can be calculated. For example, from a 3-dimensional data set (dimensions x, y and z) we could calculate cov(x,y), cov(x,z) and cov(y,z).

The covariance matrix collects all of these values:

C = ( cov(x,x)  cov(x,y)  cov(x,z)
      cov(y,x)  cov(y,y)  cov(y,z)
      cov(z,x)  cov(z,y)  cov(z,z) )
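In MATLAB the whole matrix can be obtained with one call to cov, which expects one column per dimension; a sketch with random placeholder data:

X = randn(10, 3);    % 10 samples of a 3-dimensional variable (columns x, y, z)
C = cov(X);          % 3x3 symmetric covariance matrix
% C(1,2) = cov(x,y), C(1,3) = cov(x,z), C(2,3) = cov(y,z)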
Original data set: mean of x = 1.81, mean of y = 1.91

[Plot of the mean-adjusted data, both axes running from -1.5 to 1.5]

Projecting the mean-adjusted data onto the principal eigenvector only:

v(:,2)' = -0.6779  -0.7352

datared = (v(:,2)'*[xadj yadj]')'

datared =
   -0.8280
    1.7776
   -0.9922
   -0.2742
   -1.6759
   -0.9130
    0.0991
    1.1446
    0.4381
    1.2239

Projecting onto both eigenvectors instead: datatrans = (v'*[xadj yadj]')'
Step 1: Get some data
Step 2: Subtract the mean
Step 3: Calculate the covariance matrix
Step 4: Calculate the eigenvectors and eigenvalues of the covariance matrix
Step 5: Choosing components and forming a feature vector
Step 6: Deriving the new data set

Recovering the original data from the transformed data:

dataorig = (v*datatrans')' + [xmean*ones(10,1) ymean*ones(10,1)]
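Putting the six steps together, a minimal MATLAB sketch of the whole procedure. The variable names (xadj, yadj, v, datared, datatrans, dataorig) follow the fragments above; the sample values are the widely used two-dimensional tutorial data set, which appears to be the one on these slides since its means (1.81 and 1.91) match:

% Step 1: get some data (10 two-dimensional samples)
x = [2.5 0.5 2.2 1.9 3.1 2.3 2.0 1.0 1.5 1.1]';
y = [2.4 0.7 2.9 2.2 3.0 2.7 1.6 1.1 1.6 0.9]';

% Step 2: subtract the mean from each dimension
xmean = mean(x);   ymean = mean(y);
xadj  = x - xmean; yadj  = y - ymean;

% Step 3: calculate the covariance matrix
C = cov([xadj yadj]);

% Step 4: eigenvectors (columns of v) and eigenvalues (diagonal of d)
[v, d] = eig(C);        % for this matrix the largest eigenvalue comes last

% Step 5: choose components -- keep only the principal eigenvector v(:,2)
datared = (v(:,2)' * [xadj yadj]')';    % data expressed in one dimension

% Step 6: derive the new data set with all components, then reconstruct
datatrans = (v' * [xadj yadj]')';
dataorig  = (v * datatrans')' + [xmean*ones(10,1) ymean*ones(10,1)];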
A principal component is a projection, defined by a linear combination of the original variables, that encapsulates the maximum amount of variation in a dataset while being orthogonal (and therefore uncorrelated) to the previous principal components of the same dataset.

The blue lines represent two consecutive principal components. Note that they are orthogonal (at right angles) to each other.
