ID NO : 2000080061
Date: DD/MM/YYYY
Outcome: Students are able to implement dimensionality reduction and classification using ANN.
Pre Lab:
1) What is dimensionality reduction and what is the need for reducing the
dimensions?
A) Dimensionality reduction is a technique for reducing the number of attributes (features) in a dataset. Some
features are removed or combined, and the remaining features are transformed so that as little information as
possible is lost. It is needed to cut computation time and resource usage, because the model no longer has to
process redundant or uninformative features.
In Lab:
EXP4:
a) The dataset records various factors involved when a patient is hospitalized. On the basis of these
factors, predict whether the patient will survive. Because the dataset has 85 columns, first perform
dimensionality reduction using PCA.
b) Normalize the data in the given dataset and perform classification using ANN.
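Task b) can be sketched in plain NumPy as below. This is only an illustration, not the program submitted for this experiment: the data is synthetic, and the min-max normalization, the single hidden layer of 8 tanh units, the learning rate and the epoch count are all assumptions chosen to keep the example small.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for the hospital data (assumption): 200 rows, 4 numeric
# features on very different scales, and a binary label.
X = rng.normal(size=(200, 4)) * np.array([1.0, 10.0, 100.0, 0.1])
y = ((X[:, 0] + X[:, 1] / 10.0) > 0).astype(float).reshape(-1, 1)

def min_max_normalize(X):
    """Rescale every column to the range [0, 1]."""
    lo, hi = X.min(axis=0), X.max(axis=0)
    return (X - lo) / np.where(hi > lo, hi - lo, 1.0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_ann(X, y, hidden=8, lr=0.5, epochs=3000, seed=0):
    """Train a one-hidden-layer network with full-batch gradient descent."""
    init = np.random.default_rng(seed)
    W1 = init.normal(scale=0.5, size=(X.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = init.normal(scale=0.5, size=(hidden, 1))
    b2 = np.zeros(1)
    n = len(X)
    for _ in range(epochs):
        # Forward pass: tanh hidden layer, sigmoid output.
        H = np.tanh(X @ W1 + b1)
        p = sigmoid(H @ W2 + b2)
        # Backward pass for the mean binary cross-entropy loss.
        dz2 = (p - y) / n
        dW2 = H.T @ dz2
        db2 = dz2.sum(axis=0)
        dz1 = (dz2 @ W2.T) * (1.0 - H ** 2)
        dW1 = X.T @ dz1
        db1 = dz1.sum(axis=0)
        # Gradient descent update.
        W1 -= lr * dW1
        b1 -= lr * db1
        W2 -= lr * dW2
        b2 -= lr * db2
    return W1, b1, W2, b2

def predict(params, X):
    """Return 1.0 where the predicted probability is at least 0.5."""
    W1, b1, W2, b2 = params
    return (sigmoid(np.tanh(X @ W1 + b1) @ W2 + b2) >= 0.5).astype(float)

X_norm = min_max_normalize(X)
params = train_ann(X_norm, y)
accuracy = float((predict(params, X_norm) == y).mean())
print("training accuracy:", accuracy)
```

Normalizing first matters here because the raw columns differ in scale by a factor of about 1000, which would otherwise let the largest column dominate the weighted sums. The accuracy printed is on the training rows only; a real experiment would hold out a test split.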
Program:
Applying CountEncoder, RobustScaler and PCA on data
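The three steps named above can be sketched as follows. The original program is not reproduced in this report, so this is an assumption about what each step does, not the code that was run: count encoding replaces each category by its frequency (what CountEncoder does), robust scaling centres each column on its median and divides by its interquartile range (the default behaviour of RobustScaler), and PCA is computed with an SVD. The functions are re-implemented in NumPy, and the column names and values are hypothetical.

```python
import numpy as np
from collections import Counter

def count_encode(column):
    """Replace each category by the number of times it occurs in the column."""
    counts = Counter(column)
    return np.array([counts[v] for v in column], dtype=float)

def robust_scale(X):
    """Centre each column on its median and divide by its interquartile range."""
    median = np.median(X, axis=0)
    q1, q3 = np.percentile(X, [25, 75], axis=0)
    iqr = np.where(q3 > q1, q3 - q1, 1.0)  # avoid dividing by zero
    return (X - median) / iqr

def pca(X, n_components):
    """Project X onto its first principal components; also return variance ratios."""
    Xc = X - X.mean(axis=0)
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained_variance = S ** 2 / (len(X) - 1)
    ratios = explained_variance / explained_variance.sum()
    return Xc @ Vt[:n_components].T, ratios[:n_components]

# Hypothetical toy rows standing in for the 85-column dataset.
gender = ["M", "F", "F", "M", "F", "F"]
age = np.array([68.0, 77.0, 25.0, 81.0, 19.0, 67.0])
heart_rate = np.array([118.0, 120.0, 102.0, 114.0, 60.0, 113.0])

X = np.column_stack([count_encode(gender), age, heart_rate])
X_scaled = robust_scale(X)
X_reduced, variance_ratios = pca(X_scaled, n_components=2)

print("reduced shape:", X_reduced.shape)
print("variance ratios:", variance_ratios)
```

Each variance ratio is the share of the total variance carried by one component, so the ratios show how many components are worth keeping.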
Output
Info of the Data
Variance Ratios
Post Lab:
Ans) PCA (Principal Component Analysis) is one of the most widely used dimensionality reduction techniques in
machine learning, both for predictive models and for data analysis. It is an unsupervised technique that examines
the interrelations between the features (variables) in the data. PCA applies an orthogonal transformation that
converts a set of correlated variables into a set of uncorrelated variables, the principal components. It is
sometimes described as a general form of factor analysis, although the two methods differ. Where regression fits a
line of best fit for predicting one variable, the first principal component is the line that minimizes the squared
perpendicular distances to the data points.
It increases interpretability while minimizing information loss. The components that are kept are new features
computed from all the original ones, chosen so that dropping the remaining components loses as little variance as
possible.
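The two properties in this answer can be checked numerically. The sketch below is an illustration on synthetic data (an assumption, not the lab dataset). It computes PCA from the eigenvectors of the covariance matrix and shows that the transformed variables are uncorrelated, and that the information lost by dropping components equals the variance of the dropped components.

```python
import numpy as np

rng = np.random.default_rng(42)

# Synthetic data (assumption): two strongly correlated features and one independent feature.
a = rng.normal(size=500)
X = np.column_stack([a, 2.0 * a + 0.1 * rng.normal(size=500), rng.normal(size=500)])

# Centre the data and diagonalize its covariance matrix.
Xc = X - X.mean(axis=0)
cov = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)      # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]           # sort by decreasing variance
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Orthogonal transformation: rotate the data onto the principal axes.
Z = Xc @ eigvecs
cov_Z = np.cov(Z, rowvar=False)             # diagonal, so the new variables are uncorrelated

# Keep the first k components and measure what is lost.
k = 2
X_hat = Z[:, :k] @ eigvecs[:, :k].T
retained = eigvals[:k].sum() / eigvals.sum()
lost_variance = ((Xc - X_hat) ** 2).sum() / (len(X) - 1)

print("covariance of components:\n", np.round(cov_Z, 6))
print("fraction of variance retained:", retained)
print("variance lost:", lost_variance, "sum of dropped eigenvalues:", eigvals[k:].sum())
```

Because two of the three features are almost copies of each other, two components keep nearly all of the variance, which is the sense in which PCA reduces dimensions with little information loss.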