PCA by Vikram Kumar

Uploaded by

Vikram Kumar

0% found this document useful (0 votes)

6 views19 pages

This PowerPoint slide describes how to apply principle component analysis on Boston Housing Prices Data set.

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

This PowerPoint slide describes how to apply principle component analysis on Boston Housing Prices Data set.

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

6 views19 pages

PCA by Vikram Kumar

Uploaded by

Vikram Kumar

This PowerPoint slide describes how to apply principle component analysis on Boston Housing Prices Data set.

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 19

Search inside document

Principal Component

Analysis(PCA)
What is PCA?
• The main idea of Principal Component Analysis (PCA) is
to reduce the dimensionality of a data set consisting of
many variables correlated with each other, while retaining
the variation present in the dataset, up to the maximum
extent.
Need For PCA

Problem of Over fitting

Inaccurate Assessment of Target Values

Original Data Set, High Dimensional, Data Set After PCA , Two Dimensional,
Overfitted Best Fit
PCA Method
• 1. Standardize the data.
• 2. Generate a Covariance matrix
• 3. Obtain Eigenvectors and Eigenvalues from the covariance matrix.
• 4. Sort the eigenvalues in descending order.
• 5. Select the k eigenvectors with the largest eigenvalues.
• 6. Construct a new matrix with the selected k eigenvectors.

BUT DON’T WORRY WE WILL BE USING A SHORTCUT APPROACH TO PCA.

So we are going to implement
PCA on
Boston House Prices Dataset
IMPORTING THE DATASET
DATA PREPROCESSING
EXPLORATORY DATA ANALYSIS
1.Understanding values of Target column
By Plotting a Histogram
INFERENCE: We see that the values of 'target' are distributed normally with few outliers.
• Next, we create a correlation matrix that measures the linear
relationships between the variables. We will use the heatmap function
from the seaborn library to plot the correlation matrix.¶
• The correlation coefficient ranges from -1 to 1. If the value is close to
1, it means that there is a strong positive correlation between the two
variables. When it is close to -1, the variables have a strong negative
correlation.
StandardScaler is a common method used to
standardize/normalize data: the mean of the
data is subtracted from each value and divided
by the standard deviation.
Fitting of Standardized Dataset
Using PCA function of sklearn
Displaying Principal Components
Concatenating Principal Components with
Target Values

This is our required Dataset

which has Been reduced from
(506,14) to (506,3)
Plotting of Principal Components

Principal Component Analysis
Document13 pages
Principal Component Analysis
Shil Shambharkar
No ratings yet
Pca&kmean
Document6 pages
Pca&kmean
goelkartik23
No ratings yet
Pca
Document18 pages
Pca
gerry
No ratings yet
A COMPLETE GUIDE TO PRINCIPAL COMPONENT ANALYSIS in ML 1598272724
Document16 pages
A COMPLETE GUIDE TO PRINCIPAL COMPONENT ANALYSIS in ML 1598272724
「瞳」你分享
No ratings yet
ML Unit - 3 DimensionalitY Reduction
Document39 pages
ML Unit - 3 DimensionalitY Reduction
Kunapuli Poojitha
No ratings yet
DuongToGiangSon 517H0162 HW2 Nov-26
Document17 pages
DuongToGiangSon 517H0162 HW2 Nov-26
Son Tran
No ratings yet
Dimensionality Reduction (Principal Component Analysis)
Document12 pages
Dimensionality Reduction (Principal Component Analysis)
Arnab Talukdar
No ratings yet
IML Assignment 6 Report
Document18 pages
IML Assignment 6 Report
Hasya Patel
No ratings yet
PCA Pres
Document15 pages
PCA Pres
inspiring 123
No ratings yet
Data Mining - Module 2 - HU
Document88 pages
Data Mining - Module 2 - HU
Test
No ratings yet
Principal Component Analysis
Document1 page
Principal Component Analysis
مزمل عبدالقیوم
No ratings yet
Dat Science: CLASS 11: Clustering and Dimensionality Reduction
Document30 pages
Dat Science: CLASS 11: Clustering and Dimensionality Reduction
ashishamitav123
No ratings yet
Principal Component Analysis
Document10 pages
Principal Component Analysis
Deeksha Manoj
No ratings yet
3.2 Pca
Document27 pages
3.2 Pca
Javada Javada
No ratings yet
Dimensionality Reduction: Principal Component Analysis (PCA)
Document11 pages
Dimensionality Reduction: Principal Component Analysis (PCA)
tanmayi nandiraju
No ratings yet
Unit - IV - DIMENSIONALITY REDUCTION AND GRAPHICAL MODELS
Document59 pages
Unit - IV - DIMENSIONALITY REDUCTION AND GRAPHICAL MODELS
Indumathy Paranthaman
No ratings yet
Principal Component Analysis
Document9 pages
Principal Component Analysis
Geethakshaya
100% (1)
PCA Analysis Validation Guide
Document2 pages
PCA Analysis Validation Guide
patryk langer
No ratings yet
Unit II e - PCA
Document28 pages
Unit II e - PCA
aruna
No ratings yet
Sess03 Dimension Reduction Methods
Document36 pages
Sess03 Dimension Reduction Methods
Kriti Sinha
No ratings yet
Dimensional Reduction in R
Document24 pages
Dimensional Reduction in R
Shil Shambharkar
No ratings yet
Feature Construction
Document8 pages
Feature Construction
MUHAMMAD NUR FITRI BIN NORAFFENDI
No ratings yet
PCA Finds Representation Through Linear Transformation
Document28 pages
PCA Finds Representation Through Linear Transformation
sartg
No ratings yet
Principal Component Analysis
Document17 pages
Principal Component Analysis
AsemSaleh
No ratings yet
Feature Extraction
Document3 pages
Feature Extraction
nandha shree
No ratings yet
Dimensionality Reduction Using Principal Component Analysis
Document32 pages
Dimensionality Reduction Using Principal Component Analysis
sai varun
No ratings yet
Data Preprocessing Part 2
Document14 pages
Data Preprocessing Part 2
new acc jeet
No ratings yet
Colloquium - Bayesian Optimization Algorithm - Sajib Kumar Biswas
Document25 pages
Colloquium - Bayesian Optimization Algorithm - Sajib Kumar Biswas
cs
No ratings yet
Neal Zhang
Document33 pages
Neal Zhang
saurabh_34
No ratings yet
Assignment
Document24 pages
Assignment
Santhi Palanisamy
No ratings yet
Coincent - Data Science With Python Assignment
Document23 pages
Coincent - Data Science With Python Assignment
Sai Nikhil Nellore
100% (2)
Education - Post 12th Standard - CSV
Document11 pages
Education - Post 12th Standard - CSV
Ruhee's Kitchen
No ratings yet
Comparison of Density-Based Clustering Algorithms: Mariam Rehman
Document5 pages
Comparison of Density-Based Clustering Algorithms: Mariam Rehman
suser
No ratings yet
Education - Post 12th Standard - CSV
Document11 pages
Education - Post 12th Standard - CSV
Zohaib Imam
88% (16)
Pca and t-SNE Dimensionality Reduction
Document3 pages
Pca and t-SNE Dimensionality Reduction
Иван Радонов
No ratings yet
A Short Course in Multivariate Statistical Methods With R
Document11 pages
A Short Course in Multivariate Statistical Methods With R
qwety300
No ratings yet
KMEANS
Document9 pages
KMEANS
johnzenbano120
No ratings yet
Lecture Slides-Week15,16
Document50 pages
Lecture Slides-Week15,16
moazzam kiani
No ratings yet
Unit 5 Big Data
Document55 pages
Unit 5 Big Data
Venkatesh Sharma
No ratings yet
Supervised Learning 1 PDF
Document162 pages
Supervised Learning 1 PDF
Alexander
No ratings yet
Dim Red
Document13 pages
Dim Red
Pratyush Jain
No ratings yet
VectorApplicationsInDS
Document31 pages
VectorApplicationsInDS
Sara Nukho
No ratings yet
DimensionalitY Reduction
Document29 pages
DimensionalitY Reduction
The Caspian
No ratings yet
Week10 KNN Practical
Document4 pages
Week10 KNN Practical
seerungen jordi
No ratings yet
Lecture2 2013
Document60 pages
Lecture2 2013
Fc Khan
No ratings yet
KNN VS Kmeans
Document3 pages
KNN VS Kmeans
Soubhagya Kumar Sahoo
No ratings yet
Principal Components Analysis (PCA) Final
Document23 pages
Principal Components Analysis (PCA) Final
endale
No ratings yet
PCA Lecture 9 and 10
Document30 pages
PCA Lecture 9 and 10
Ifra Noor
No ratings yet
Introduction To Data Analysis
Document72 pages
Introduction To Data Analysis
Thu Le
No ratings yet
1.variable Reduction 2.principal Component Analysis: Topic UNIT-4
Document19 pages
1.variable Reduction 2.principal Component Analysis: Topic UNIT-4
subithaperiyasamy
No ratings yet
6 - Data Pre-Processing-III
Document30 pages
6 - Data Pre-Processing-III
Kanika Chanana
No ratings yet
Dimensionality Reduction
Document19 pages
Dimensionality Reduction
Atul Patil
No ratings yet
Feature Extraction: - Saheni Patra
Document17 pages
Feature Extraction: - Saheni Patra
Arindam Roy
No ratings yet
Unit 3 ML
Document24 pages
Unit 3 ML
Samarth Pratap Singh
No ratings yet
Principal Component Analysis: Term Paper For Data Mining & Data Warehousing
Document11 pages
Principal Component Analysis: Term Paper For Data Mining & Data Warehousing
girish90
No ratings yet
TOD 212 - Digging Through Data - PPT - For Students - Monsoon 2023 (Autosaved)
Document18 pages
TOD 212 - Digging Through Data - PPT - For Students - Monsoon 2023 (Autosaved)
dhyani.s
No ratings yet
Unit 3
Document110 pages
Unit 3
Nishanth Nuthi
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Radial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks
From Everand
Radial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks
Fouad Sabry
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet