Professional Documents
Culture Documents
1.A There are 1581 Rows are and 14 Columns are there in the dataset and Object type, float
and integer types are there.
1.B Here we iden fied the bad data in columns of Gender and Parter_salary. There were
spelling mistake was no ced in the Gender Column and there were null values. In the
Partner_salary column there was bad data (0.00) was no ced.
Here all spelling Mistakes were corrected
Gender
Male 1199
Female 327
Femal 1
Femle 1
Name: count, dtype: int64
Gender
Male 1199
Female 329
Name: count, dtype: int64
Here all bad treated with mode value of the Gender Column. we can no ce it in Gender Row
Here we treated bad data with the mean value of the column of Partner_Salary. So now our
data set is cleaned
1.
1.C Exploring all the features of the data separately by using Histogram and Barplots
to visualize all feature and drawn insights that can be utilized by the business.
Here I used histogram to visualize the Price column
Here I used histogram with kde to visualize the insight of Salary column
Here I used Histogram with KDE to visualize the insight of total_salary column
Here I used countplot to visualize the make column
1.E1 Ans. False. According to gragh “Women prefer SUV by a large margin, compared
to the Men”
1.E2 Ans is True. According to the graph a salaried person is more likely to buy a Sedan
F1. There are 1252 men and 329 women are working.
Gender
Male 1252
Female 329
Name: count, dtype: int64
Personal_loan
Yes 792
No 789
Name: count, dtype: int64
F2. There are number of Men are having personal loan compare women. Since there less
number of women they can appoint more woman candidate they can reduce men workers.
H.
Marital_status
Married 1443
Single 138
Name: count, dtype: int64
Gender
Male 1252
Female 329
Name: count, dtype: int64