APPROACH
1) Importing and Cleaning of the Data provided.
2) Formatting or Grouping for an effective analysis.
3) Performing Univariate & Bivariate analysis on Categorical and Numerical fields.
4) Draw useful Insights.
UNIVARIATE DISCRETE ANALYSIS FOR AGE GROUPS
The number of applications increases with the applicant's age up to age 40 and declines after that.
The second plot shows that the default rate decreases as the applicant's age increases.
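The age grouping behind these two plots can be sketched as below. This is a minimal illustration, not the analysis code used for the slides: the records are made-up toy data, the 10-year bin width is an assumption, and `age_group` and `default_rate_by_group` are hypothetical helper names.

```python
from collections import defaultdict

# Hypothetical toy records: (age in years, TARGET), where TARGET = 1 means default.
applicants = [(23, 1), (27, 0), (34, 0), (38, 1), (36, 0), (45, 0), (52, 0), (61, 0)]

def age_group(age, width=10):
    """Return a label such as '30-39' for the bin that contains `age`."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"

def default_rate_by_group(records):
    """Map each age group to (number of applicants, default rate in percent)."""
    counts = defaultdict(int)
    defaults = defaultdict(int)
    for age, target in records:
        group = age_group(age)
        counts[group] += 1
        defaults[group] += target
    return {g: (counts[g], 100.0 * defaults[g] / counts[g]) for g in sorted(counts)}

age_summary = default_rate_by_group(applicants)
for group, (n, rate) in age_summary.items():
    print(f"{group}: {n} applicants, {rate:.1f}% default")
```

The first value per group corresponds to the applicant-count plot and the second to the default-rate plot.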
UNIVARIATE DISCRETE ANALYSIS FOR FAMILY STATUS
• From the chart, it can be inferred that most applicants belong to the Married, Single and Civil Marriage categories, in that order. Of these, Single and Civil Marriage applicants tend to default more, and the Unknown category never defaulted.
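A chart of this kind needs two numbers per category: how many applicants fall in it and what share of them defaulted. The sketch below shows one way to compute both, assuming rows with a categorical field and a 0/1 default flag. The data is invented for illustration, and `category_summary` and the lower-case field names are hypothetical.

```python
from collections import Counter

# Hypothetical toy rows; target = 1 means the applicant defaulted.
rows = [
    {"family_status": "Married", "target": 0},
    {"family_status": "Married", "target": 0},
    {"family_status": "Married", "target": 1},
    {"family_status": "Married", "target": 0},
    {"family_status": "Single", "target": 1},
    {"family_status": "Single", "target": 0},
    {"family_status": "Single", "target": 0},
    {"family_status": "Civil marriage", "target": 1},
    {"family_status": "Civil marriage", "target": 0},
    {"family_status": "Unknown", "target": 0},
]

def category_summary(rows, field, target="target"):
    """Return (category, applicant count, default %) sorted by count, largest first."""
    totals = Counter(r[field] for r in rows)
    defaults = Counter(r[field] for r in rows if r[target] == 1)
    return [(cat, n, 100.0 * defaults[cat] / n) for cat, n in totals.most_common()]

family_summary = category_summary(rows, "family_status")
for cat, n, rate in family_summary:
    print(f"{cat}: {n} applicants, {rate:.1f}% default")
```

The same function applies unchanged to any other categorical field with a 0/1 target.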
UNIVARIATE DISCRETE ANALYSIS FOR OCCUPATION
For most applicants the occupation is missing. Among the recorded occupations, the top applicant groups are Laborers and Sales Staff.
However, the highest default percentage occurs in the Low-Skill Laborers group.
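For "Missing" to appear as its own bar, empty values have to be given an explicit label before counting. A minimal sketch of that step follows, assuming missing occupations arrive as `None` or an empty string. The list is toy data and `label_missing` is a hypothetical helper.

```python
from collections import Counter

# Hypothetical toy column; None and "" both stand for a missing occupation.
occupations = ["Laborers", None, "Sales staff", None, "Laborers", "",
               "Low-skill Laborers", None]

def label_missing(values, placeholder="Missing"):
    """Replace None or empty strings with an explicit placeholder category."""
    return [v if v else placeholder for v in values]

labelled = label_missing(occupations)
occupation_counts = Counter(labelled)
missing_share = 100.0 * occupation_counts["Missing"] / len(labelled)

print(occupation_counts.most_common())
print(f"Missing share: {missing_share:.1f}%")
```

Keeping the placeholder as a category, rather than dropping those rows, keeps the applicant totals consistent with the other charts.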
UNIVARIATE DISCRETE ANALYSIS FOR INCOME TYPE
Most applicants are from the Working and Commercial Associate categories, and the fewest are Businessman and Student.
However, the default percentage is highest in the Maternity Leave and Unemployed applicant groups.
UNIVARIATE CATEGORICAL ANALYSIS FOR ORGANIZATION TYPE
Most applicants are from Business Entity Type 3, Missing and Self Employed.
However, the highest default percentages are in the Transport Type 3, Industry Type 13 and Industry Type 8 groups.
ORDERED/CONT. NUMERICAL VARIABLE ANALYSIS ON WORK EXP
• AMT_GOOD_PRICE, AMT_CREDIT and AMT_ANNUITY do not seem to have any impact on the default rate.
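One simple way to check whether a numerical field is related to default is to compare its typical value for defaulters and non-defaulters. The sketch below does this with medians. It is only an illustration: the slides do not say which comparison was used, the figures are toy data, and `median_by_target` is a hypothetical helper.

```python
from statistics import median

# Hypothetical toy rows; TARGET = 1 means the applicant defaulted.
loans = [
    {"AMT_CREDIT": 200000, "TARGET": 0},
    {"AMT_CREDIT": 450000, "TARGET": 0},
    {"AMT_CREDIT": 600000, "TARGET": 0},
    {"AMT_CREDIT": 900000, "TARGET": 0},
    {"AMT_CREDIT": 300000, "TARGET": 1},
    {"AMT_CREDIT": 500000, "TARGET": 1},
    {"AMT_CREDIT": 800000, "TARGET": 1},
]

def median_by_target(rows, field, target="TARGET"):
    """Return {0: median for non-defaulters, 1: median for defaulters}."""
    groups = {0: [], 1: []}
    for r in rows:
        groups[r[target]].append(r[field])
    return {t: median(values) for t, values in groups.items()}

credit_medians = median_by_target(loans, "AMT_CREDIT")
print(credit_medians)
```

Medians that sit close together for both groups, as in this toy data, are consistent with the field having little visible effect on default.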
BIVARIATE ANALYSIS - EDUCATION VS GENDER VS INCOME
The top 10 correlations between variables range from 0.33 to 0.99. Both datasets (defaulter and non-defaulter) show almost the same correlations, except for Region rating client vs Region population relative.
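Ranking the strongest correlations separately for each dataset can be sketched as follows. The assumptions are labeled: Pearson correlation is used and pairs are ranked by absolute value, neither of which the slides confirm. The columns are toy data, and `pearson` and `top_correlations` are hypothetical helpers.

```python
from itertools import combinations
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length lists (assumes non-constant columns)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def top_correlations(columns, k=10):
    """Return up to k (|r|, column_a, column_b) tuples, strongest first."""
    pairs = [(abs(pearson(columns[a], columns[b])), a, b)
             for a, b in combinations(columns, 2)]
    return sorted(pairs, reverse=True)[:k]

# Hypothetical toy columns for the two subsets.
non_defaulters = {
    "AMT_CREDIT": [100, 200, 300, 400, 500],
    "AMT_GOOD_PRICE": [90, 180, 270, 360, 450],
    "AMT_ANNUITY": [10, 25, 20, 45, 40],
}
defaulters = {
    "AMT_CREDIT": [150, 250, 350, 450, 550],
    "AMT_GOOD_PRICE": [140, 230, 330, 420, 520],
    "AMT_ANNUITY": [15, 20, 30, 35, 50],
}

for name, subset in [("non-defaulters", non_defaulters), ("defaulters", defaulters)]:
    print(name)
    for r, a, b in top_correlations(subset, k=3):
        print(f"  {a} vs {b}: {r:.2f}")
```

Running the ranking once per subset and comparing the two lists side by side is what exposes a pair that behaves differently between defaulters and non-defaulters.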
INFERENCE