Professional Documents
Culture Documents
CSE520 Assignment2
CSE520 Assignment2
Also, we upload the given “Boston” dataset and read the dataset.
For Task#1. We will need to generate histogram for each attribute. And we will find the
characteristics of the data distribution (positively-skewed or negatively-skewed).
Now, we will find out the characteristics of data distribution based on skewness.
Task 2: Generate Box Plot for each attribute. Also, comment on the number of outliers for each
attribute.
From the boxplots we find crim, zn, chas, rm, dis, black, ptratio, lstat, and medv have outliers.
*3*. Generate a correlation heatmap among all attributes. Find the top 5 positively correlated
attribute pairs and top 5 negatively correlated attribute pairs.
Pairplot to show how to get positive and negative correlated attribute pairs.
5 positively correlated attribute pairs and 5 negatively correlated attribute pairs are in the
following....