Avinash DA 6

Uploaded by

saurabh tiwari

NAME- Avinash Tiwari

ROLL NO.- 2100290110041

DATA ANALYTICS LAB 6
EXP 6: To perform data pre-processing operations: 1) Handling missing data 2) Min-Max normalization

CODE:
import numpy as np
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Example DataFrame with two missing salaries
data = {'Salary': [50000, np.nan, 60000, np.nan, 70000]}
df = pd.DataFrame(data)

# Handling Missing Data

# Method 1: Fill missing values using the previous valid value (forward fill).
# ffill() replaces fillna(method='pad'), which is deprecated in recent pandas.
df['Salary'] = df['Salary'].ffill()

# Method 2: Replace NaN values with 0
# df['Salary'] = df['Salary'].replace(to_replace=np.nan, value=0)

# Method 3: Interpolate missing values linearly
# (interpolate() has no 'direction' argument; the keyword is limit_direction)
# df['Salary'] = df['Salary'].interpolate(method='linear', limit_direction='forward')

# Min-Max Normalization

trans = MinMaxScaler()
# fit_transform expects a 2-D input and returns a 2-D array; ravel() flattens it to one column
df['Salary_normalized'] = trans.fit_transform(df[['Salary']]).ravel()

print(df)

THEORY:

Handling Missing Data:

This code handles missing data in the 'Salary' column by forward filling: each NaN is
replaced with the previous valid value, using ffill() (equivalent to the older fillna() call
with method='pad'). To use another method, such as replacing NaN with 0 or linear
interpolation, comment out the forward-fill line and uncomment the respective alternative.
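
To make the difference between the three methods concrete, the following minimal sketch applies each of them to the same sample column and prints the results side by side. The variable names (forward_filled, zero_filled, interpolated, comparison) are illustrative and not part of the lab code.

```python
import numpy as np
import pandas as pd

# Same sample column as the lab code: two gaps between known salaries.
salary = pd.Series([50000, np.nan, 60000, np.nan, 70000], name="Salary")

# Method 1: forward fill copies the previous valid value into each gap.
forward_filled = salary.ffill()

# Method 2: replace every NaN with the constant 0.
zero_filled = salary.fillna(0)

# Method 3: linear interpolation places each gap on the straight line
# between its two neighbouring valid values.
interpolated = salary.interpolate(method="linear")

# Collect the results in one frame for a side-by-side view.
comparison = pd.DataFrame({
    "original": salary,
    "ffill": forward_filled,
    "zero": zero_filled,
    "linear": interpolated,
})
print(comparison)
```

For this column, forward fill repeats 50000 and 60000, zero filling inserts 0 in both gaps, and linear interpolation yields 55000 and 65000. Note that zero filling pulls the column minimum down to 0, which changes the result of any later Min-Max normalization.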

Missing data can occur for various reasons, such as data entry errors, equipment
malfunction, or intentional omission.

It is essential to handle missing data appropriately, because unaddressed gaps can lead to
biased results and incorrect conclusions.

Common strategies for handling missing data include:

Imputation: Filling in missing values with estimated values. Methods include filling with
mean, median, mode, or using more sophisticated techniques like linear interpolation.

Deletion: Removing rows or columns with missing values. However, this approach can
lead to loss of valuable information if not done carefully.

Prediction: Using machine learning algorithms to predict missing values based on other
features in the dataset.

In Python, libraries like pandas provide convenient functions like fillna() and interpolate()
for handling missing data effectively.
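
The imputation and deletion strategies listed above can be sketched with pandas as follows. This is an illustration only: the 'Age' column and the variable names are hypothetical additions, not part of the lab code, and the prediction strategy is omitted because it needs a trained model.

```python
import numpy as np
import pandas as pd

# Hypothetical frame: 'Salary' comes from the lab code, 'Age' is added
# here only so that deleting a row visibly discards other information.
df = pd.DataFrame({
    "Salary": [50000, np.nan, 60000, np.nan, 70000],
    "Age": [25, 30, 35, 40, 45],
})

# Imputation with the mean: mean() skips NaN, so it averages the three known salaries.
mean_imputed = df["Salary"].fillna(df["Salary"].mean())

# Imputation with the median, which is less sensitive to outliers than the mean.
median_imputed = df["Salary"].fillna(df["Salary"].median())

# Deletion: drop every row whose Salary is missing.
rows_dropped = df.dropna(subset=["Salary"])

print(mean_imputed.tolist())
print(len(df), "rows before deletion,", len(rows_dropped), "after")
```

Here deletion discards two of the five rows, including their valid 'Age' values, which is the information loss the deletion strategy risks.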

Min-Max Normalization:

For Min-Max normalization, the code initializes a MinMaxScaler object and fits it to the
'Salary' column, transforming the data and storing the normalized values in a new column
called 'Salary_normalized'.

Min-Max normalization is a form of feature scaling that rescales numeric features to a
specific range, typically 0 to 1.

The formula for Min-Max normalization is:

X_{\text{normalized}} = \frac{X - X_{\text{min}}}{X_{\text{max}} - X_{\text{min}}}

Here, X is the original value, X_{\text{min}} is the minimum value of the feature, and
X_{\text{max}} is the maximum value of the feature.
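
As a check on the formula, the sketch below applies it by hand with NumPy to the salary values obtained after forward filling. The array x and the other variable names are illustrative. It assumes the column is not constant, since a constant column would make the denominator zero.

```python
import numpy as np

# Salary column from the lab code after forward filling.
x = np.array([50000.0, 50000.0, 60000.0, 60000.0, 70000.0])

x_min = x.min()  # 50000.0
x_max = x.max()  # 70000.0

# Apply (X - X_min) / (X_max - X_min) element-wise.
# Assumes x_max > x_min; a constant column would divide by zero.
x_normalized = (x - x_min) / (x_max - x_min)

print(x_normalized)
```

The minimum maps to 0, the maximum to 1, and 60000 lands at 0.5. MinMaxScaler with its default feature_range of (0, 1) should produce the same values for this column.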

Min-Max normalization ensures that all features have the same scale, which can be crucial
for algorithms sensitive to feature scales, such as gradient descent-based optimization
algorithms.

In Python, libraries like scikit-learn provide the MinMaxScaler class, which makes it easy
to perform Min-Max normalization on datasets.

By understanding and applying these data pre-processing techniques in Python, you can
ensure that your data is clean, standardized, and suitable for further analysis or
machine learning tasks.
