MBA PA Sem1 23 Assignment2

Uploaded by

2022mb21048

0% found this document useful (1 vote)

66 views4 pages

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (1 vote)

66 views4 pages

MBA PA Sem1 23 Assignment2

Uploaded by

2022mb21048

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

Assignment 2

MCA Technology Solutions Private Limited was established in 2015 in

Bangalore with an objective to integrate analytics and technology with business.
MCA Technology Solutions helped its clients in areas such as customer
intelligence, forecasting, optimization, risk assessment, web analytics, and text
mining, and cloud solutions. Risk assessment vertical at MCA technology
solutions focuses on problems such as fraud detection and credit scoring. Sachin
Kumar, Director at MCA Technology Solutions, Bangalore was approached by
one of his clients, a commercial bank, to assist them in detecting earning
manipulators among the bank’s customers. The bank provided business loans to
small and medium enterprises and the value of loan ranged from INR 10 million
to 500 million ($1 = INR 66.82, August 16, 2016). The bank suspected that their
customers may be involved in earnings’ manipulations to increase their chance
of securing a loan.

Saurabh Rishi, the Chief Data Scientist at MCA Technologies, was assigned the
task of developing a use case for predicting earning manipulations. He was aware
of models such as Beneish model that was used for predicting earning
manipulations; however, he was not sure of its performance, especially in the
Indian context. Saurabh decided to develop his own model for predicting earning
manipulations using data downloaded from the Prowess database maintained by
the Centre of Monitoring Indian Economy (CMIE). Daniel received information
related to earning manipulators from Securities Exchange Board of India (SEBI)
and the Lexis Nexis database. Data on 220 companies was collected to develop
the model.

The data is available in the file “Earnings Manipulation 220.csv” in the

following link.

https://drive.google.com/file/d/1h-qteu-
obJ5y5nQhHn_ypUOzZSp8lEVn/view?usp=sharing

Description of the columns is given in the following Table. In this table

 netPPE is net Property, Plant, and Equipment. Property Plant and

Equipment is the value of all buildings, land, furniture, and other physical
capital that a business has purchased to run a business.
 Subscript (t) means value in the current year financial statement and
subscript (t – 1) means the value in the previous year financial statement.
Answer the following questions using the above dataset.

1. How many cases of manipulators versus non-manipulators are there in the

dataset? Draw a bar plot to depict.

2. Create a 80:20 partition, and find how many positives are present in the
test data.

3. The number of cases of manipulators are very less compared to non-

manipulators. Use upsampling technique to create a balance dataset.

4. Build the following models using balanced dataset. Comment on which

metric should be given preference for this dataset. Finalize the model for
each technique after Hyperparameter tuning using GridsearchCV based
on the metric selected. Compare the model performances with respect to
different evaluation metrices.

a. Naïve Bayes
b. KNN
c. SVM
d. Logistic regression
e. Random Forest
f. Adaboost
g. Gradientboost
h. XGBoost

5. Comment on which are the most important features for predicting the
manipulators.

6. The number of cases of manipulators are very less compared to non-

manipulators. Use downsampling technique to create a balance dataset.

7. Compare the results of using both upsampling and downsampling

techniques. Report the best model of all the models.

What will you have to submit?

You need to submit a single jupyter notebook (.ipynb) file with all the work
done including comments via the elearn(Taxila) portal. Naming convention of
the file should be <BITS-ID-number>.ipynb.
Deadline:
19th November, 2023
Note that this is a hard deadline and no extension will be granted further.
Copying efforts will be dealt very seriously, students involved, if caught, will be
given zero marks.

Data Mining Business Report Hansraj Yadav
Document34 pages
Data Mining Business Report Hansraj Yadav
P Venkata Krishna Rao
83% (12)
Format Sec 14 Sarfaesi
Document11 pages
Format Sec 14 Sarfaesi
Hari
100% (6)
ITNPBD6 Assignment 2018-2 PDF
Document2 pages
ITNPBD6 Assignment 2018-2 PDF
Vaibhav Jain
No ratings yet
Arbitrage Presentation
Document26 pages
Arbitrage Presentation
Veronica Tabirta
No ratings yet
Karnataka
Document50 pages
Karnataka
Adarsh Jain
75% (8)
Sample Constitution Bylaws
Document3 pages
Sample Constitution Bylaws
Mark Heddison Bacsain
100% (3)
A Summary of The Rituals of Hajj
Document7 pages
A Summary of The Rituals of Hajj
Tori Small
No ratings yet
Predictive Analytics
Document4 pages
Predictive Analytics
chandan
No ratings yet
Prediction of Company Bankruptcy: Amlan Nag
Document16 pages
Prediction of Company Bankruptcy: Amlan Nag
Express Business Services
100% (2)
MCQs - Big Data Analytics - Predictive Analytics
Document10 pages
MCQs - Big Data Analytics - Predictive Analytics
ZuniButt
No ratings yet
Name - Santoshi Devi Nayudubathula Reg No - 2021Pgp220
Document4 pages
Name - Santoshi Devi Nayudubathula Reg No - 2021Pgp220
Santoshi Satyanarayana
No ratings yet
ML Use Cases Ebook
Document53 pages
ML Use Cases Ebook
Sliptnock Martinez
100% (2)
Project Team Report 13
Document9 pages
Project Team Report 13
Mukilan Baskaran
No ratings yet
Using Deep Learning For Recommender Systems: by Lakshman Bulusu Consultant, Matlen Silver Inc. and
Document20 pages
Using Deep Learning For Recommender Systems: by Lakshman Bulusu Consultant, Matlen Silver Inc. and
Dalia Mahmoud
No ratings yet
Final Project Report - Kelompok 4
Document6 pages
Final Project Report - Kelompok 4
andonifikri
No ratings yet
2020 6 17 Exam Pa Project Statement PDF
Document6 pages
2020 6 17 Exam Pa Project Statement PDF
Hông Hoa
No ratings yet
FAQ's - Applied Statistics
Document3 pages
FAQ's - Applied Statistics
nitheeshbharathi.m
No ratings yet
Customer Churn Prediction in Banking Sector - A Hybrid Approach
Document6 pages
Customer Churn Prediction in Banking Sector - A Hybrid Approach
IJRASETPublications
No ratings yet
House Price Forecasting Using Machine Learning Methods: Uter and Mathematics Education 11 (2021), 3624-3632
Document9 pages
House Price Forecasting Using Machine Learning Methods: Uter and Mathematics Education 11 (2021), 3624-3632
Rahul Patil
No ratings yet
Tugas Analitika Data (Yasa Hapipudin)
Document4 pages
Tugas Analitika Data (Yasa Hapipudin)
yasa
No ratings yet
Predicting Personal Loan Approval Using Machine Learning Handbook
Document31 pages
Predicting Personal Loan Approval Using Machine Learning Handbook
Ezhilarasi
No ratings yet
Assignment 2 (Group) : Basic Data Report
Document2 pages
Assignment 2 (Group) : Basic Data Report
besillysilly
No ratings yet
Chandan Gautam Resume
Document4 pages
Chandan Gautam Resume
Chandan Gautam
No ratings yet
Resume Subbaraju
Document3 pages
Resume Subbaraju
silverbyte
No ratings yet
DS Curso
Document15 pages
DS Curso
punisherneogo
No ratings yet
Improving Bug Triage Based On Predictive Model
Document5 pages
Improving Bug Triage Based On Predictive Model
IJSTE
No ratings yet
Cluster Credit Risk R PDF
Document13 pages
Cluster Credit Risk R PDF
Fabian Chahin
No ratings yet
Prabhjyot Resume 20230723B
Document2 pages
Prabhjyot Resume 20230723B
Basheer Khan
No ratings yet
Data Science 1B
Document50 pages
Data Science 1B
kagome
No ratings yet
A2 Estimation (Individual) : MIS772 2020 T2
Document1 page
A2 Estimation (Individual) : MIS772 2020 T2
Queen Rania
No ratings yet
Databyte ML Task 1
Document6 pages
Databyte ML Task 1
Mohini Thakur
No ratings yet
19 Stock Price Trend Predictionusing Multiple Linear Regression
Document6 pages
19 Stock Price Trend Predictionusing Multiple Linear Regression
nhoxsock8642
No ratings yet
DWM Lab Manual
Document92 pages
DWM Lab Manual
Hemamalini
No ratings yet
IJCRT2111135
Document7 pages
IJCRT2111135
Divya G K
No ratings yet
Chapter 1: Introduction: Overview of Business Analytics
Document12 pages
Chapter 1: Introduction: Overview of Business Analytics
Shairamay Aludino
No ratings yet
ML Final
Document34 pages
ML Final
4023 Keerthana
No ratings yet
Experience Using The Ibm Supply Chain Simulator
Document8 pages
Experience Using The Ibm Supply Chain Simulator
Manjiree Ingole Joshi
No ratings yet
PM Alternate Project
Document2 pages
PM Alternate Project
Anmol Singh
No ratings yet
Capstone Project Proposal
Document2 pages
Capstone Project Proposal
chinudash
No ratings yet
Final DMT Report PDF
Document27 pages
Final DMT Report PDF
Anonymous Sqb2WO
No ratings yet
Predictive Analytics - Chapter 1 PDF
Document10 pages
Predictive Analytics - Chapter 1 PDF
Deniece C. Castillo
No ratings yet
Data Analytics PDF
Document6 pages
Data Analytics PDF
Prazavi Jain
0% (1)
Sales-Forecasting of Retail Stores Using Machine Learning Techniques
Document7 pages
Sales-Forecasting of Retail Stores Using Machine Learning Techniques
shabir Ahmad
No ratings yet
Sales Analysis and Forecasting in Shopping Mart: Amit Kumar, Kartik Sharma, Anup Singh, Dravid Kumar
Document4 pages
Sales Analysis and Forecasting in Shopping Mart: Amit Kumar, Kartik Sharma, Anup Singh, Dravid Kumar
Kartik Sharma
No ratings yet
CV 202403071730503
Document2 pages
CV 202403071730503
info4nirbhay
No ratings yet
DR - Subbiah RR 10.09.2020
Document5 pages
DR - Subbiah RR 10.09.2020
Subbiah S
No ratings yet
Comparing Methods Assignment
Document2 pages
Comparing Methods Assignment
Ali Nasar
No ratings yet
COMP1801 - Copy 1
Document18 pages
COMP1801 - Copy 1
SujiKrishnan
No ratings yet
File 482621234 482621234 - Assignment 2 - 7378831553794248
Document5 pages
File 482621234 482621234 - Assignment 2 - 7378831553794248
Bob Philip
No ratings yet
Optimizing Fraudulent Firm Prediction Using Ensemble Machine Learning A Case Study of An External Audit
Document12 pages
Optimizing Fraudulent Firm Prediction Using Ensemble Machine Learning A Case Study of An External Audit
Marllus Lustosa
No ratings yet
Seth Surabhi PDF
Document1 page
Seth Surabhi PDF
Vaibhav Kapil
No ratings yet
Data Science: Sales Forecasting For Marketing
Document52 pages
Data Science: Sales Forecasting For Marketing
Saikiran Mamidi
No ratings yet
Day13-K-Means Clustering
Document10 pages
Day13-K-Means Clustering
SBS Movies
No ratings yet
FinalPaper SalesPredictionModelforBigMart
Document14 pages
FinalPaper SalesPredictionModelforBigMart
Guru Kowshik
No ratings yet
CV 2024030809232876
Document2 pages
CV 2024030809232876
info4nirbhay
No ratings yet
Sanjay Ghodawat Group of Institutions: Synopsis
Document7 pages
Sanjay Ghodawat Group of Institutions: Synopsis
aditya joshi
No ratings yet
Five Ways Artificial Intelligence and Machine Learning Deliver Business Impacts - Gartner
Document8 pages
Five Ways Artificial Intelligence and Machine Learning Deliver Business Impacts - Gartner
Keren Mendes
No ratings yet
Arindam Banerjee Arindam Banerjee SecB 71310058 346
Document6 pages
Arindam Banerjee Arindam Banerjee SecB 71310058 346
sonalz
No ratings yet
Important Questions
Document4 pages
Important Questions
Adilrabia rsl
No ratings yet
Fintech Assignment5
Document4 pages
Fintech Assignment5
vaibhav modi
No ratings yet
Data Science
Document38 pages
Data Science
DINESH REDDY
No ratings yet
Internship Task: Data Science & Machine Learning
Document3 pages
Internship Task: Data Science & Machine Learning
Abhimanyu Kudla
No ratings yet
Task Details
Document3 pages
Task Details
anujmodi763
No ratings yet
IBM Cognos Business Intelligence
From Everand
IBM Cognos Business Intelligence
Dustin Adkison
No ratings yet
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
From Everand
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
Zemelak Goraga
No ratings yet
Socpro18 0298
Document9 pages
Socpro18 0298
Alejandra CF
No ratings yet
Business Administration Finances Burlington Removed (1) Removed
Document3 pages
Business Administration Finances Burlington Removed (1) Removed
Leticia Silva dos Santos
No ratings yet
Music History Unit 5 Timeline
Document17 pages
Music History Unit 5 Timeline
DMMsmart
No ratings yet
A New Dawn For Politics 1St Edition Alain Badiou Full Chapter
Document67 pages
A New Dawn For Politics 1St Edition Alain Badiou Full Chapter
tom.dimaggio515
100% (9)
English Books
Document4 pages
English Books
Sidra Rasool
No ratings yet
What Is Poetry
Document4 pages
What Is Poetry
Talha Basheer
No ratings yet
Region Memo 101 2017 Regional Schools Press Conference (RSPC)
Document17 pages
Region Memo 101 2017 Regional Schools Press Conference (RSPC)
paul christian singco
No ratings yet
Noriega V Sison
Document6 pages
Noriega V Sison
Sinon Doods Novo
No ratings yet
Indian Railway II
Document56 pages
Indian Railway II
rohityadavalld
No ratings yet
01 GenPact Overview
Document9 pages
01 GenPact Overview
amitbharadwaj7
No ratings yet
Law Relating To Domestic Violence, Vijendra Kumar A Gogia & Company Hyderabad 2007
Document4 pages
Law Relating To Domestic Violence, Vijendra Kumar A Gogia & Company Hyderabad 2007
ashoknalsar
No ratings yet
Jussie Smollett Appeal
Document102 pages
Jussie Smollett Appeal
BICR
No ratings yet
Equinox Precession and The Beginning of The Biblical Year
Document4 pages
Equinox Precession and The Beginning of The Biblical Year
putzer19669
No ratings yet
The Spatial Distribution of Services
Document15 pages
The Spatial Distribution of Services
Bedri Muktar
No ratings yet
Tru-Fit Parts Inc Case 3
Document5 pages
Tru-Fit Parts Inc Case 3
CHRISLYN
100% (2)
8 Virtual Market Space
Document18 pages
8 Virtual Market Space
John Edwinson Jara
No ratings yet
Credit Transactions Memory Aid
Document35 pages
Credit Transactions Memory Aid
Gloria Cruz Revita
No ratings yet
SCDL Project Guideline
Document8 pages
SCDL Project Guideline
Pranav Shree
100% (1)
HR Retention Management System Tech Mahindra
Document115 pages
HR Retention Management System Tech Mahindra
Nilabjo Kanti Paul
100% (2)
Judaism Rules
Document2 pages
Judaism Rules
Patrice108365
No ratings yet
All About Pak Affairs (DMG Officer) - Recommended - CSS Forums
Document5 pages
All About Pak Affairs (DMG Officer) - Recommended - CSS Forums
Azeem Chaudhary
No ratings yet
Lucas v. Tuaño, G.R. No. 178763, April 21, 2009
Document22 pages
Lucas v. Tuaño, G.R. No. 178763, April 21, 2009
ARCHIE AJIAS
No ratings yet
Mipham Sherab Raltri Verses PDF
Document18 pages
Mipham Sherab Raltri Verses PDF
JSALAMONE
No ratings yet
Sakthi Mineral Co
Document33 pages
Sakthi Mineral Co
jai kumar
No ratings yet
Golden Rules of Accounts: Prof. Pallavi Ingale 9850861405
Document32 pages
Golden Rules of Accounts: Prof. Pallavi Ingale 9850861405
Pallavi Ingale
100% (1)