Professional Documents
Culture Documents
:1
Write python programs to demonstrate Preprocessing.
Program:
import numpy as np
import pandas as pd
#Dropping Rows With all NAN values in the row and reflect ont the original matrix
X.dropna(axis=0,how='all',inplace=True)
X.describe()
X[['f1','f2','f3']].corr()
X1=X[['f1','f2']]
X1.plot.scatter(x='f1',y='f2')
#Renaming Columns
X.rename(columns={"f1":"F1","f2":"F2","f3":"F3"},inplace=True)
#Creating new column sum
X["sum"]=X["F1"]+X["F2"]+X["F3"]
Output:
Experiment No.:2
Write a python program to Preprocess WeatherAus Dataset..
Program:
import numpy as np
import pandas as pd
data=pd.read_csv("weatherAUS.csv")
data.head()
data.dropna(axis=0,how="all",inplace=True)
data.reset_index(inplace=True)
data.drop(["index"],axis=1,inplace=True)
data.fillna(data.mean(),inplace=True)
data.describe()
data[['MinTemp','MaxTemp']].corr()
x=data[['MinTemp','MaxTemp']]
x.plot.scatter(x='MinTemp',y='MaxTemp')
data["AveragePressure"]=(data["Pressure9am"]+data["Pressure3pm"])/2
data.head()
data.drop(["AveragePressure"],axis=1,inplace=True)
data.head()
Output: