Professional Documents
Culture Documents
Assignment Spreadsheet Sheila Thalia
Assignment Spreadsheet Sheila Thalia
Table of Contents
Remove Duplicate Remove Empty Value Split Column Transform Blank Remove Outliers
a. combine data (func. >>> Using filtering and From Units ( RM ) using Transform Blank to 0 Removing notably
B2&C2&D2&I2&J2&L2&N2& conditional formatting splits function for Room Bathroom different values(by
O2&Q2&Z2&AA2) Splits Property Type and Carpark using pivot or IQR)
b. Count Duplicated data using data >> splits text Transform x + x
(func. >>> to column format to x (rooms)
countif(AB:AB,AB2))
c. deleting duplicated data
(func. >>>>>
if(AB2=AB1,"duplicate","")
Outlier Cleaning (IQR Method)
Order Dataset
Customer Dataset
Payment Dataset
Cleaning Data
Data Interpretation :
After data above upper value is taken out (IQR), there is a change in the data interpretation
which described as follows:
mean > median > mode ,indicates the data is still skewed positive.
Coefficient of variation has change to be lower than 1. This indicates that the mean value can
used for analysis.
Standar deviation lower value than mean value. This indicates that the data has become
narrowly dispersed.
This data has lower variance value, indicates that the data points has become narrowly spread
out from the mean, and from one another.
Kurtosis value has become positive, indicates that the data has high peak
Skewness is still positive but has lower value than before, indicates that the data is still skewed
positive or still concentrated at the lower value.
Descriptive Statistic (phase 2) Actual Delivery Days
Data Interpretation :
After data above upper value is taken out, there is a change in the actual delivery
days data interpretation which described as follows:
mean > median > mode ,indicates the data is still skewed positive.
Coefficient of variation has change to be lower than 1. This indicates that the mean
value can used for analysis.
Standar deviation lower value than mean value. This indicates that the data has
become narrowly dispersed.
This data has lower variance value, indicates that the data points has become
narrowly spread out from the mean, and from one another.
Kurtosis value has relatively small negative value, indicates that the data has low
peak.but higher than before
Skewness is still positive but has lower value than before, indicates that the data is
still skewed positive or still concentrated at the lower value.
Exploratory Data Analysis
a. Number of orders per month
Insight :
Insight :