BUSINESS REPORT

Financial Risk Analysis Project


By Surabhi Chaughule
Milestone 1

FRA - Part A: Credit Risk


Problem Statement

Businesses or companies can fall prey to default if they are unable to keep up with their debt
obligations. A default lowers the company's credit rating, which in turn reduces its chances of
obtaining credit in the future, and the company may have to pay higher interest on existing
debts as well as on any new obligations. From an investor's point of view, a company is worth
investing in if it is capable of handling its financial obligations, can grow quickly, and can
manage that growth.

A balance sheet is a financial statement that provides a snapshot of what a company owns,
what it owes, and the amount invested by its shareholders. It is therefore an important tool
for evaluating the performance of a business.

Data that is available includes information from the financial statement of the companies for the
previous year.

Dependent variable - No new variable needs to be created; the 'Default' variable is already
provided in the dataset and can be taken as the dependent variable.

Test Train Split - Split the data into train and test datasets in the ratio of 67:33 and use a
random state of 42 (random_state=42). Model building is to be done on the train dataset and
model validation is to be done on the test dataset.

Dataset: Credit Risk Dataset

Data Dictionary: Data Dictionary

Preliminary exploration of the Dataset:


All messy column names are cleaned: spaces, commas, and brackets are replaced with '_'.
The first 5 rows of the dataset are as follows:

This dataset contains 2058 rows and 58 columns
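The column-name cleanup described above can be sketched as follows. The sample column names below are hypothetical stand-ins (the actual dataset has 58 columns); the cleaning logic is the point:

```python
import pandas as pd

# Hypothetical sample with messy column names (the real dataset's columns differ)
df = pd.DataFrame({
    "Net Worth": [1, 2],
    "PBDITA (%)": [3, 4],
    "Total income, net": [5, 6],
})

# Replace runs of spaces, commas, brackets and similar punctuation with a single '_'
df.columns = (
    df.columns.str.strip()
    .str.replace(r"[ ,()%/\-]+", "_", regex=True)
    .str.strip("_")
)
print(list(df.columns))  # ['Net_Worth', 'PBDITA', 'Total_income_net']
```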

Check for Duplicate Records


No duplicate records exist
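The duplicate check can be done with pandas as below. The toy frame is a hypothetical stand-in for the 2058-row dataset:

```python
import pandas as pd

# Hypothetical toy frame; the actual dataset has 2058 rows and 58 columns
df = pd.DataFrame({"a": [1, 2, 2], "b": [3, 4, 4]})

# duplicated() marks rows identical to an earlier row
n_dupes = int(df.duplicated().sum())
df = df.drop_duplicates()
print(n_dupes, len(df))  # 1 2
```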

Number of variables per data type:


A description of the first few columns of the dataset is given below:
From the descriptive stats, we can see that most of the variables have outliers.
1.1 Outlier Treatment
Detecting Outliers
Conventionally, outliers are identified based on the inter-quartile range (IQR) as follows:
Q1 = 25th percentile
Q3 = 75th percentile
IQR = Q3 - Q1
Lower outlier: value < Q1 - 1.5 * IQR
Upper outlier: value > Q3 + 1.5 * IQR
Based on this definition, the number of outliers in the dataset is as follows:
A view for each of the individual variables is as follows:
Observations
Many variables have outliers as defined by the above criteria.
- If all the records with outliers are dropped, only 112 records remain. Modeling
cannot be done with so few records, so dropping records is not an option.
- The next option is to cap the outliers at both ends.
- After capping the values at the 10th and 90th percentiles, the number of remaining
outliers is as follows:
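The IQR-based detection and percentile capping can be sketched as below. The sample series is a hypothetical single column with one extreme value; the real analysis loops over all numeric columns:

```python
import pandas as pd

def count_outliers(s: pd.Series) -> int:
    # Count values outside the 1.5*IQR fences
    q1, q3 = s.quantile(0.25), s.quantile(0.75)
    iqr = q3 - q1
    return int(((s < q1 - 1.5 * iqr) | (s > q3 + 1.5 * iqr)).sum())

def cap_outliers(s: pd.Series, lo: float = 0.10, hi: float = 0.90) -> pd.Series:
    # Cap values at the 10th and 90th percentiles
    return s.clip(lower=s.quantile(lo), upper=s.quantile(hi))

s = pd.Series([1, 2, 3, 4, 5, 100])  # hypothetical column with one extreme value
print(count_outliers(s))  # 1
print(cap_outliers(s).max())
```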

1.2 Missing Value Treatment


The dataset with the treated outliers is used for further analysis. There is a
total of 220 missing values, as seen below:
Accordingly, the missing values have been treated using the KNNImputer.
After imputation, it is confirmed that there are no missing values remaining.
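The KNNImputer step can be sketched as follows. The two-column frame is a hypothetical stand-in for the 58-column dataset; `n_neighbors=2` is an illustrative choice, not necessarily the value used in the report:

```python
import numpy as np
import pandas as pd
from sklearn.impute import KNNImputer

# Hypothetical toy frame standing in for the real dataset
df = pd.DataFrame({"x": [1.0, 2.0, np.nan, 4.0], "y": [10.0, 20.0, 30.0, 40.0]})

# Each NaN is filled with the mean of that feature over the 2 nearest rows
imputer = KNNImputer(n_neighbors=2)
df_imputed = pd.DataFrame(imputer.fit_transform(df), columns=df.columns)
print(int(df_imputed.isna().sum().sum()))  # 0 missing values remain
```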

1.4 Univariate & Bivariate analysis with proper interpretation

Univariate Analysis:
Univariate analysis has been carried out for the 'Default' variable.
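A univariate view of the target can be produced with value counts, as sketched below. The sample values are hypothetical; the real column comes from the dataset:

```python
import pandas as pd

# Hypothetical 'Default' values standing in for the target column
df = pd.DataFrame({"Default": [0, 0, 0, 1, 0, 1, 0, 0]})

counts = df["Default"].value_counts()                # class counts
props = df["Default"].value_counts(normalize=True)  # class proportions
print(counts.to_dict())  # {0: 6, 1: 2}
print(props.to_dict())
```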

Bivariate Analysis:
Bivariate analysis of the top 6 variables against the 'Default' variable is shown.
Multivariate Analysis:

1.5 Train-Test Split


As stated in the problem, the dataset has to be split into train and test sets in the ratio of
67:33 using random_state=42.
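The split can be sketched as below. The synthetic frame is a hypothetical stand-in for the dataset; the `test_size=0.33` and `random_state=42` settings come from the problem statement:

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical stand-in: 100 rows with a binary 'Default' target
rng = np.random.default_rng(42)
df = pd.DataFrame(rng.normal(size=(100, 3)), columns=["f1", "f2", "f3"])
df["Default"] = rng.integers(0, 2, size=100)

X = df.drop(columns="Default")
y = df["Default"]

# 67:33 split with the required random state
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.33, random_state=42
)
print(len(X_train), len(X_test))  # 67 33
```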

1.6 Build Logistic Regression Model (using the statsmodels library) on the most important
variables on the Train Dataset and choose the optimum cutoff

Selecting top 14 features for model building


Training result:

Test result:
Since precision is low for this model, we will apply SMOTE. Using random_state=42 again,
the model is re-run.
Training Set result:

Test set result:
