
Business Intelligence

Practical #3
Name: Priti Yadav    Roll Number: 20302C0005
Class: TYIT    Division: D
Subject/Course: Business Intelligence
Topic: Principal Component Analysis

Overview of Principal Component Analysis


What are the steps to perform PCA?

Download the Wine dataset, upload it to the Files section in Google Colab, and then run the following commands.
Wine Dataset:

1) import numpy as np
import pandas as pd
2) df=pd.read_csv("/content/wine.csv")
3) df.head()

4) x=df.drop(['Wine','Malic.acid','Ash','Acl','Mg','Proanth','Color.int','Hue','OD','Proline'],axis=1)

Vidyalankar School of Information Technology


5) y=df['Proline']

6) from sklearn.model_selection import train_test_split


x_train, x_test, y_train, y_test=train_test_split(x,y,test_size=0.2,random_state=0)



7) from sklearn.preprocessing import StandardScaler



sc=StandardScaler()
x_train=sc.fit_transform(x_train)
x_test=sc.transform(x_test)
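
The scaling in step 7 can be illustrated by hand. The sketch below is not the practical's code: it uses a small made-up array (the names train and test are hypothetical) to show what fit_transform and transform do conceptually. The mean and standard deviation are learned from the training rows only and then reused on the test rows.

```python
import numpy as np

# Hypothetical toy data: 4 training rows and 2 test rows, 2 features each.
train = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [4.0, 40.0]])
test = np.array([[2.5, 25.0], [5.0, 50.0]])

# "fit": learn per-column statistics from the training rows only.
mean = train.mean(axis=0)
std = train.std(axis=0)  # population std (ddof=0), as StandardScaler uses

# "transform": apply the same training statistics to both splits.
train_scaled = (train - mean) / std
test_scaled = (test - mean) / std

print(train_scaled.mean(axis=0))  # each column is centred on zero
print(test_scaled)                # test rows are expressed in training units
```

Reusing the training statistics on the test rows keeps information about the test set out of the preprocessing step.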

8) from sklearn.decomposition import PCA


pca=PCA()
x_train=pca.fit_transform(x_train)
x_test=pca.transform(x_test)
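
To see what step 8 computes, PCA can be sketched with NumPy alone: standardize, build the covariance matrix, take its eigenvectors, and project. This is a minimal illustration on randomly generated data (the variable names and the data are assumptions, not part of the practical), and scikit-learn itself uses an SVD rather than this eigendecomposition, so component signs may differ.

```python
import numpy as np

# Hypothetical data: 100 samples, 3 features, the first two strongly correlated.
rng = np.random.default_rng(0)
base = rng.normal(size=(100, 1))
data = np.hstack([
    base + 0.1 * rng.normal(size=(100, 1)),
    2 * base + 0.1 * rng.normal(size=(100, 1)),
    rng.normal(size=(100, 1)),
])

# Standardize each column to zero mean and unit variance.
z = (data - data.mean(axis=0)) / data.std(axis=0)

# Covariance matrix of the standardized features (3 x 3).
cov = np.cov(z, rowvar=False)

# eigh returns eigenvalues in ascending order; reorder to descending.
eigvals, eigvecs = np.linalg.eigh(cov)
order = np.argsort(eigvals)[::-1]
eigvals = eigvals[order]
eigvecs = eigvecs[:, order]

# Project the data onto the principal axes.
scores = z @ eigvecs

# Share of total variance carried by each component.
ratio = eigvals / eigvals.sum()
print(ratio)
```

The ratio array plays the same role as pca.explained_variance_ratio_ in the steps that follow.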



9) explained_variance=pca.explained_variance_ratio_
explained_variance

10) print('Variance of each component:',pca.explained_variance_ratio_)


print('\n Total Variance Explained :',round(sum(list(pca.explained_variance_ratio_))*100))
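
Because PCA() is called without n_components, every component is kept, so the total printed in step 10 is 100. A common follow-up is to ask how many components are needed to reach a chosen share of the variance. The helper below is a hypothetical sketch (components_for_variance and the example ratios are made up for illustration); in the practical, pca.explained_variance_ratio_ would be passed in instead.

```python
import numpy as np

def components_for_variance(ratios, threshold=0.90):
    """Return the smallest number of leading components whose
    cumulative explained-variance ratio reaches the threshold."""
    cumulative = np.cumsum(ratios)
    # Index of the first cumulative value >= threshold, converted to a count.
    count = int(np.searchsorted(cumulative, threshold) + 1)
    # Guard against thresholds above the total due to rounding.
    return min(count, len(cumulative))

# Made-up ratios, sorted in descending order as PCA reports them.
example_ratios = np.array([0.5, 0.3, 0.15, 0.05])
n_keep = components_for_variance(example_ratios, threshold=0.90)
print(n_keep)
```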

