Welcome to Scribd!

Skip carousel

Eda 3

Uploaded by

diyalap01

0% found this document useful (0 votes)

2 views6 pages

Original Title

eda3

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

2 views6 pages

Eda 3

Uploaded by

diyalap01

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 6

Search inside document

Name: Vidya Janani V

Register Number: 913121205090

Ex. No: 03 Feature Extraction with Correlation (Bivariate) Analysis and

Date: 04.03.2024 Categorization Using Python/R

Aim:

To Perform Feature Extraction with correlation (Bivariate) Analysis and categorisation using
Python/R.

Steps:

1. Data Preparation:

• Import necessary libraries (pandas', 'seaborn', and 'matplotlib').

• Load your data from a CSV file into a Pandas DataFrame.
2. Correlation Matrix:

• Create a correlation matrix for specific columns related to air quality indices
3. Visualize the correlation matrix:

• Create a heatmap of the correlation matrix, enhancing visual understanding.

• Customize the heatmap with annotations, color palette, and grid lines.
4. Display the correlation heatmap:

• Show the heatmap with correlations between the air quality indices, helping identify
relationships.
5. Feature selection based on the correlation matrix:

• Calculate correlations between all columns and the "PM2.5 AQI Value."
• Sort and select features with correlations greater than 0.25 in absolute value.
• Print and display the selected features, helping identify which variables correlate significantly
with the target variable.
Importing the dataset and formation of the Co-relation Matrix
Python Code:
import pandas as pd

# Load the dataset into a DataFrame

# Replace 'your_dataset.csv' with the actual path to your dataset
job_placement_df = pd.read_csv('job_placement.csv')

# Display the first few rows of the DataFrame to understand its structure
print(job_placement_df.head())

# Compute the correlation matrix

corr_matrix = job_placement_df.corr()

21PCS02 – Exploratory Data Analysis Laboratory Dept of IT

Name: Vidya Janani V
Register Number: 913121205090

# Display the correlation matrix

print("Correlation Matrix:")
print(corr_matrix)
Output:

Visualizing the correlation matrix

Python Code

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Load the dataset into a DataFrame

# Replace 'your_dataset.csv' with the actual path to your dataset
job_placement_df = pd.read_csv('job_placement.csv')

# Compute the correlation matrix

corr_matrix = job_placement_df.corr()

# Plotting the correlation matrix using seaborn heatmap

21PCS02 – Exploratory Data Analysis Laboratory Dept of IT

Name: Vidya Janani V
Register Number: 913121205090

plt.figure(figsize=(10, 8))
sns.heatmap(corr_matrix, annot=True, cmap='coolwarm', fmt=".2f", linewidths=.5)
plt.title('Correlation Matrix of Job Placement Dataset')
plt.show()

Output

Displaying the correlation heatmap:

Python Code

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Load the dataset into a DataFrame

# Replace 'your_dataset.csv' with the actual path to your dataset
job_placement_df = pd.read_csv('job_placement.csv')

# Compute the correlation matrix

corr_matrix = job_placement_df.corr()

# Plotting the correlation heatmap

21PCS02 – Exploratory Data Analysis Laboratory Dept of IT
Name: Vidya Janani V
Register Number: 913121205090
plt.figure(figsize=(10, 8))
sns.heatmap(corr_matrix, annot=True, cmap='coolwarm', fmt=".2f", linewidths=.5)
plt.title('Correlation Heatmap of Job Placement Dataset')
plt.show()

Output

Feature selection based on the correlation matrix:

Python Code

import pandas as pd

# Load the dataset into a DataFrame

# Replace 'your_dataset.csv' with the actual path to your dataset
job_placement_df = pd.read_csv('job_placement.csv')

# Compute the correlation matrix

corr_matrix = job_placement_df.corr()

# Display the correlation matrix

print("Correlation Matrix:")
print(corr_matrix)

# Setting a threshold for correlation

21PCS02 – Exploratory Data Analysis Laboratory Dept of IT
Name: Vidya Janani V
Register Number: 913121205090
# You can adjust this threshold based on your requirements
threshold = 0.5

# Selecting features highly correlated with each other

# Here, we remove one of the features from each pair of highly correlated features
correlated_features = set()
for i in range(len(corr_matrix.columns)):
for j in range(i):
if abs(corr_matrix.iloc[i, j]) > threshold:
colname = corr_matrix.columns[i]
correlated_features.add(colname)

print("Correlated Features:")
print(correlated_features)

Output

21PCS02 – Exploratory Data Analysis Laboratory Dept of IT

Name: Vidya Janani V
Register Number: 913121205090

21PCS02–Exploratory Data Analysis Marks

Laboratory
Observation ( 20 )

Record ( 5 )

Total ( 25 )

Result:
In this Experiment , Feature Extraction with correlation (Bivariate) Analysis and
categorization using Python/R was implemented and the output is verified successfully.

21PCS02 – Exploratory Data Analysis Laboratory Dept of IT

AEC 2018 Aluminum Extrusion Manual
Document191 pages
AEC 2018 Aluminum Extrusion Manual
Juan Andrés Díaz Rivero
No ratings yet
60 ChatGPT Prompts For Data Science 2023
Document67 pages
60 ChatGPT Prompts For Data Science 2023
T L
100% (2)
Bollard Pull Calculations
Document16 pages
Bollard Pull Calculations
Luis Sierra
100% (1)
Machine Learning With SQL
Document12 pages
Machine Learning With SQL
prince krish
100% (1)
Credit Card NCC BANK
Document1 page
Credit Card NCC BANK
Kazi Foyez Ahmed
No ratings yet
Manual Bomba Koomey
Document95 pages
Manual Bomba Koomey
Diego De Jesus
No ratings yet
Machine Learning LAB: Practical-1
Document24 pages
Machine Learning LAB: Practical-1
Tsering Jhakree
100% (1)
Chemistry: Quarter 1 - Module 5: "Recognize Common Isotopes and Their Uses."
Document13 pages
Chemistry: Quarter 1 - Module 5: "Recognize Common Isotopes and Their Uses."
Norman
100% (2)
Electronic Document Management System of RMTU
Document6 pages
Electronic Document Management System of RMTU
Daniel Bachillar
No ratings yet
Warehouse Order Processing
Document67 pages
Warehouse Order Processing
Yogitha Balasubramanian
No ratings yet
Scala Data Analysis Cookbook
From Everand
Scala Data Analysis Cookbook
Manivannan Arun
No ratings yet
Me IoT Governance
Document16 pages
Me IoT Governance
Elly Wong
100% (1)
Vid 4
Document6 pages
Vid 4
diyalap01
No ratings yet
AIDS - DM Using Python - Lab Programs
Document19 pages
AIDS - DM Using Python - Lab Programs
yelubandirenukavidyadhari
No ratings yet
Unit1 ML Programs
Document5 pages
Unit1 ML Programs
diroja5648
No ratings yet
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
Document20 pages
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
Saloni Tuli
No ratings yet
ML - LAB - FILE Pankaj
Document13 pages
ML - LAB - FILE Pankaj
khatmalmain
No ratings yet
Ass-2 Ds
Document29 pages
Ass-2 Ds
Vedant Andhale
No ratings yet
ML - LAB - FILE Amrit
Document13 pages
ML - LAB - FILE Amrit
khatmalmain
No ratings yet
DS Practical
Document30 pages
DS Practical
XYZ NK
No ratings yet
Introduction To Python and Computer Programming 1704298503
Document44 pages
Introduction To Python and Computer Programming 1704298503
el.tico.138623
No ratings yet
Pattern Recognition
Document26 pages
Pattern Recognition
Aryan Attri
No ratings yet
DNN ALL Practical 28
Document34 pages
DNN ALL Practical 28
4073Himanshu Patle
No ratings yet
ML - Practical File
Document15 pages
ML - Practical File
Jatin Mathur
No ratings yet
2324 BigData Lab3
Document6 pages
2324 BigData Lab3
Elie Al Howayek
No ratings yet
Experiment No 3 Importing and Exporting Data in Python Using Pandas Student
Document6 pages
Experiment No 3 Importing and Exporting Data in Python Using Pandas Student
chavansrushti21
No ratings yet
KRAI Practical
Document14 pages
KRAI Practical
Contact Vishal
No ratings yet
How To Create A Correlation Matrix Using Pandas - Data To Fish
Document3 pages
How To Create A Correlation Matrix Using Pandas - Data To Fish
intluser
No ratings yet
Codes
Document37 pages
Codes
Tame PcAddict
No ratings yet
Finance
Document1 page
Finance
ahmadkhalil
No ratings yet
Muhammad Hassaan 288203 Software Construction Lab 7
Document7 pages
Muhammad Hassaan 288203 Software Construction Lab 7
Ahmad Dogar
No ratings yet
Operationalizing The Model
Document46 pages
Operationalizing The Model
Mohamed Rahal
No ratings yet
Python Code
Document7 pages
Python Code
Gnan Shetty
No ratings yet
Pattern
Document1 page
Pattern
ahmadkhalil
No ratings yet
ML With Python Practical
Document22 pages
ML With Python Practical
n58648017
No ratings yet
DEV Lab Material
Document16 pages
DEV Lab Material
dharun0704
No ratings yet
Activity 01: Python Set/s of Source Code Use in The Activity (Paste Below)
Document2 pages
Activity 01: Python Set/s of Source Code Use in The Activity (Paste Below)
SHELPTS
No ratings yet
Section 10 - Data Visualization - Part 2
Document15 pages
Section 10 - Data Visualization - Part 2
Amany Zaky
No ratings yet
DMC - Record
Document54 pages
DMC - Record
mrsanthoosh.edu
No ratings yet
Aiml Lab Copy Saurav
Document8 pages
Aiml Lab Copy Saurav
Pallabi Jaiswal
No ratings yet
Tensor Flow and Keras Sample Programs
Document22 pages
Tensor Flow and Keras Sample Programs
vinothkumar0743
No ratings yet
DWDM Lab Report
Document26 pages
DWDM Lab Report
Simran Shrestha
No ratings yet
Certificate
Document25 pages
Certificate
Tanmay Mane
No ratings yet
Informatics Practices Class 12 Study Material
Document128 pages
Informatics Practices Class 12 Study Material
Rishikesh Crafts and Tech
No ratings yet
Practical Record File XII Physics 22-23 Final
Document34 pages
Practical Record File XII Physics 22-23 Final
SUVODEEP SARKAR
No ratings yet
Final Lab Manual
Document34 pages
Final Lab Manual
SNEHAL RALEBHAT
No ratings yet
IP - Record 2023-24
Document79 pages
IP - Record 2023-24
Freya
No ratings yet
# Import Necessary Modules
Document2 pages
# Import Necessary Modules
4NM20IS003 ABHISHEK A
No ratings yet
CO-367 Machine Learning Lab File: Submitted To: Submitted by
Document12 pages
CO-367 Machine Learning Lab File: Submitted To: Submitted by
Shubham Anand
No ratings yet
t4 m2
Document49 pages
t4 m2
Amazon Mío
No ratings yet
Carreon WS06
Document4 pages
Carreon WS06
Keneth Carreon
No ratings yet
Bda Assign
Document15 pages
Bda Assign
Aishwarya Biradar
No ratings yet
Assvid
Document13 pages
Assvid
diyalap01
No ratings yet
Module 5 Pandas Assignment Updated
Document3 pages
Module 5 Pandas Assignment Updated
rashid
No ratings yet
A2 Vishal Borra
Document2 pages
A2 Vishal Borra
vishal.borra
No ratings yet
DS Manual
Document30 pages
DS Manual
Zoom Communication
No ratings yet
2nd Programme AIML 7th Sem
Document2 pages
2nd Programme AIML 7th Sem
awfullymeee
No ratings yet
Deep Learning Lab Manual - IGDTUW - Vinisky Kumar
Document33 pages
Deep Learning Lab Manual - IGDTUW - Vinisky Kumar
viniskykumar
No ratings yet
Roll NO 2020
Document8 pages
Roll NO 2020
Ali Mohsin
No ratings yet
Pandas I Notes 06 - June 20
Document13 pages
Pandas I Notes 06 - June 20
sukaina fatima
No ratings yet
Principal Component Analysis For Data Science
Document4 pages
Principal Component Analysis For Data Science
shivaybhargava33
No ratings yet
Advance Operations On Dataframes
Document20 pages
Advance Operations On Dataframes
Pranav Pratap Singh
No ratings yet
Lab5 Example Fall 23
Document4 pages
Lab5 Example Fall 23
Patel Vedant
No ratings yet
Experiment 2.1 - Bablu Kumar DT
Document24 pages
Experiment 2.1 - Bablu Kumar DT
Bablu Raaz
No ratings yet
Practical File Python
Document25 pages
Practical File Python
kaizenpro01
No ratings yet
Ass 2 DSBDL
Document29 pages
Ass 2 DSBDL
Anvi
No ratings yet
Implementing PCA in Python With Scikit
Document6 pages
Implementing PCA in Python With Scikit
Shobha Kumari Choudhary
No ratings yet
17 Ensemble Techniques Problem Statement
Document28 pages
17 Ensemble Techniques Problem Statement
Jadhav A.S
No ratings yet
Vidya PC Lab
Document9 pages
Vidya PC Lab
diyalap01
No ratings yet
Assvid
Document13 pages
Assvid
diyalap01
No ratings yet
Ex. No. 03 Construct An Application That Draws Basic Graphical Primitives On The Screen Date
Document4 pages
Ex. No. 03 Construct An Application That Draws Basic Graphical Primitives On The Screen Date
diyalap01
No ratings yet
Ex. No. 03 Construct An Application That Draws Basic Graphical Primitives On The Screen Date
Document4 pages
Ex. No. 03 Construct An Application That Draws Basic Graphical Primitives On The Screen Date
diyalap01
No ratings yet
Logs
Document7 pages
Logs
diyalap01
No ratings yet
PTG5
Document2 pages
PTG5
Josue Carlin
No ratings yet
Eng 10 Cur-Map
Document7 pages
Eng 10 Cur-Map
Fe Jandugan
No ratings yet
Fire Fighting
Document18 pages
Fire Fighting
kenny
No ratings yet
Capgemini Proposal Template
Document14 pages
Capgemini Proposal Template
Phillip
100% (1)
Examples For Autotrophs That Uses Chemosynthesis - Google Search
Document1 page
Examples For Autotrophs That Uses Chemosynthesis - Google Search
n9hhb88r95
No ratings yet
BRANIGAN - Eduard - A Point of View in The Cinema
Document6 pages
BRANIGAN - Eduard - A Point of View in The Cinema
Sonia Rocha
No ratings yet
Parts
Document4 pages
Parts
burakkkkkkkk
No ratings yet
Final Year 10 Career Education Booklet
Document44 pages
Final Year 10 Career Education Booklet
sophieminissale
No ratings yet
KCNCcatalog20190905－２-已壓縮 - compressed 2
Document61 pages
KCNCcatalog20190905－２-已壓縮 - compressed 2
Vladimir Kunitsa
No ratings yet
Functions of Limbic System
Document4 pages
Functions of Limbic System
Leonidah jerono
No ratings yet
Project Report On Business Eco Magazine - Vivek Kumar Shaw
Document36 pages
Project Report On Business Eco Magazine - Vivek Kumar Shaw
Anand shaw
No ratings yet
Joanne Wong Min Min (B2200130)
Document35 pages
Joanne Wong Min Min (B2200130)
joanne wong
No ratings yet
Getting Path To Executing Dll..
Document3 pages
Getting Path To Executing Dll..
Biochem M. July
No ratings yet
ST - Anne'S: Multiple Choice Questions UNIT-2 (50X1 50 Marks)
Document6 pages
ST - Anne'S: Multiple Choice Questions UNIT-2 (50X1 50 Marks)
St. Anne's CET (EEE Department)
No ratings yet
Proyectos de Aplicación en El Mundo Midas GTS NX
Document46 pages
Proyectos de Aplicación en El Mundo Midas GTS NX
Enrique Barragán
No ratings yet
Self-Accelerated Corrosion of Nuclear Waste Forms at Material Interfaces
Document9 pages
Self-Accelerated Corrosion of Nuclear Waste Forms at Material Interfaces
Muhammad Adnan Hafeez
No ratings yet
Variability in The Project Completion Date
Document12 pages
Variability in The Project Completion Date
Nguyễn Quốc Việt
No ratings yet
DLL - English 6 - Q2 - W2
Document3 pages
DLL - English 6 - Q2 - W2
Xyrelle Grace Borbon Vernaula
No ratings yet
Rajala Varaprasad Reddy: Objective
Document2 pages
Rajala Varaprasad Reddy: Objective
RAKESH REDDY THEEGHALA
No ratings yet
MC0082 - Theory of Computer Science
Document235 pages
MC0082 - Theory of Computer Science
Purushottam Kumar
No ratings yet
LimnaiosG Final Report
Document11 pages
LimnaiosG Final Report
Giorgos Lakeman
No ratings yet
Playbox Certified Profiles - v3
Document5 pages
Playbox Certified Profiles - v3
Cincu Cristian
No ratings yet