Professional Documents
Culture Documents
There are many methods and strategies from business patterns and fraud patterns
that can be used in the fraud detection in retail financing companies. The use of
process. Basically, every fraud incident has Machine Learning for fraud detection in
certain characteristics, both environmental retail financing companies has not been
conditions and a person’s behavior. widely used, because it is quite difficult to
Therefore, fraud can be detected if there is find patterns of fraud.
a good understanding of the characteristics This study aims to classify fraud
of each fraud condition that may arise incidents based on historical data. This
in the company. Classification analysis research is conducted through case studies
can be used to detect fraud based on the on retail consumer financing transactions.
characteristics that arise. The results of this The research method is to find a Machine
classification can be used as a sample in Learning algorithm with the highest level
carrying out the process of observation or of accuracy. Machine Learning analysis
field inspection to reveal the presence or output will be used as a parameter to
absence of fraud in these conditions. determine field inspection samples
Given that there is a large number of in order to find field facts about the
transactions in the retail financing business, incident of fraud. This is quite important
which means that the availability of data is to do considering the large volume of
also abundant, the fraud detection process transactions.
can use this data to classify a transaction
whether it meets the conditions as fraud 2. LITERATURE REVIEW AND HYPO-
or not. It is very difficult to perform THESIS
large amounts of data analysis manually. Definition of fraud
Therefore, the data analysis process must Tuanakotta (2013), stated that fraud is an
be assisted by smart or intelligent machine illegal display described as misleading,
technology to improve human capabilities disguising or destroying trust.The purpose
(Nayak & Dutta, 2017). The intelligence of this activity is to obtain cash, abundance
demonstrated by machines is usually or administrations, to keep away from
referred to as artificial intelligence, which installment or administrative losses, or to
is the study of computer science. Artificial obtain benefits for private issues. ACFE
intelligence is broadly used to solve defines fraud as a deliberate act aimed at
different issues such as business problems, persuading another person to act that is
robotics, natural language, mathematics, detrimental to that person.
games, perception, medical diagnosis, Fraud is a part of operational risk. In
engineering, financial analysis, scientific the scientific categorization, operational
analysis, and reasoning (Rahardja et al., risk of fraud can be categorized into
2017; Russell & Norvig, 2016). Internal Fraud and External Fraud (BCBS,
One form of artificial intelligence is the 2002). Ramadan (2020), explained that
machine learning method. This method this distinction is based on the fraudster.
(machine learning) can be defined as Internal parties such as employees, which
computer applications and mathematical is fraud committed by personwithin the
algorithms that are adopted by means of organization who abuses his/her power
learning derived from data and producing or assets for individual benefit (ACFE,
predictions in the future (Goldberg & 2020). Meanwhile, external partiessuch as
Holland, 1988). Machine Learning can customers and business partners, and a
be applied in the fraud detection process combination of the two parties.
by classifying whether a transaction is Remote Audit
categorized as fraud or not. Remote audit is an innovation and business
Shirgave (2019), uses machine learning transformation that has been going on
to conduct fraud detection in the credit for several years. The development of
card business. This is of course different technology is one of the factors that can
Asia Pacific Fraud Journal, 6(2) July-December 2021: 213-221| 215
support the remote audit process. There Figure 1. Machine Learning Algorithms
are several latest technologies that can be
utilized in the remote audit process, such as
the use of live video streaming, drones, and
artificial intelligence. The implementation
of technological developments for
remote audit elements, such as planning,
document checking, field inspection,
remote interviews, and closed meetings
can be done using video teleconferences
and other available technology platforms Source: Processed data
such as Microsoft Teams, Skype, and Zoom
(Litzenberg& Ramirez, 2020).
Recent research has revealed that
Remote audit is a solution to new
machine learning methods are divided
challenges, especially when the inspection
into Supervised, Unsupervised, and
team has limited space due to the
Reinforcement Learning (Somvanshi&
pandemic. There are pros and cons from
Chavan, 2016). When historical data with
various parties regarding this remote
data classifications are available, direct
audit process. Several positive things were
machine learning output classifications
found when implementing the remote
can be determined. In this case, the output
audit process, namely reducing travel
classification is divided into Fraud and
costs, expanding the use of specialists,
Non Fraud categories. This method of
increasing the use of existing technology to
determining fraud can be solved using the
strengthen documentation and reporting,
Supervised Learning method.
mitigating audit burdens on operational
facilities, and improving organization and Supervised Learning
confirmation of required documentation The supervised learning method is based
(Litzenberg& Ramirez, 2020). Apart on a data set collection with label. This
from simplifying the audit process, collection of data set is used to summarize
remote auditing has limitations that also the characteristics of the distribution
need attention. Lack of direct personal of attitude dimensions in each type of
interaction opens up opportunities for application so as to form an attitude model
fraud. This can increase the chances of from the data (Amei et al, 2011). Moreover,
submitting manipulated documents and supervised learning will use regression
omitting relevant information. and classificationmethod. Brownlee (2016)
revealed that the regression problem is
Machine Learning
when the output variable is aexact value,
Machine learning described as computer
such as price or sales number. At the same
applications and mathematical algorithms
time,the classification problem is when the
that are adopted by means of learning
output variable is categorical, such as a
derived from historical data and producing
one or zero, yes or no..Supervised learning
predictions in the future (Goldberg &
has several popular algorithms such as
Holland, 1988). Learning process is an
Logistic Regression,Back-propagation,
attempt to acquire intelligence that is taken
Neural Network, Decision Tree,Random
through two stages: training and testing
Forest, Naive Bayesian, Rocchio Method,
(Huang, Zhu, & Siew, 2006). Here is the
Linear Regression,k-Nearest Neighbor,
Machine Learning Algorithm:
and Support Vector Machines (SVM).
216| Nadya I Mustika et al., Machine Learning Algorithms in Fraud Detection
Table 5. Accuracy, Precision, and Recall Amei, W., Huailin, D., Qingfeng, W., & Ling,
Values L. (2011). A survey of application-
Value level protocol identification based
on machine learning. Presented
Accuracy 0.722654351
at International Conference
Precision 0.730547911 on Information Management,
Recall 0.978552279 Innovation Management and
Source: Processed data Industrial Engineering.
As shown on the table 4.4, the accuracy Amrizal. (2004). Pencegahan dan Pen-
value generated at the validation stage deteksian Kecurangan oleh Internal
using new input data is 0.722654351, which Auditor. Jakarta.
means that predictions using the Random
A. Said., & V. Torra (eds.). (2019). Data
Forest Method are quite accurate for the
science in practice, studies in big data
data held.
46. Springer International Publishing
5. CONCLUSION AG, part of Springer Nature 2019.
Based on the research steps that have Bank Indonesia. (2011). Surat Edaran Bank
been carried out, the results show that the Indonesia No.13/28/DPNP tanggal
Random Forest algorithm has the highest 9 Desember 2011 perihalPenerapan
train score and test score, 0.994999 and Strategi Anti Fraudbagi Bank Umum.
0.745437, which means that the Random
Forest algorithm is a more precise algorithm Basel Committee on Banking Supervision
in detecting the level of fraud based on (BCBS). (2002). Operational Risk
the classification of characteristics that Data Collection Exercise.
arise from each transaction. In addition, Brownlee, J. (2016). Master Machine
in the validation stage, an accuracy value Learning Algorithms: discover how
of 0.722654351 is obtained which proves they work and implement them from
that the Random Forest algorithm has a scratch. Jason Brownlee.
fairly good accuracy in predicting data
Goldberg, D. E., & Holland, J. H. (1988).
regarding the characteristics and levels of
Genetic algorithms and machine
fraud in 2020 at retail financing company
learning. Machine Learning, 3(2),
X.
95–99.
It is suggested that further research
add more predictor variables that have Huang, G. B., Zhu, Q. Y., & Siew, C.
an influence on the classification of fraud K. (2006). “Extreme learning
incidents with the aim of increasing the machine: theory and applications.”
accuracy value of the applied Machine Neurocomputing, 70(1–3), 489–501.
Learning algorithm and apply this method
Litzenberg, Roy., & Ramirez, C.F. (2020).
to different financial institutions and
Proses audit jarak jauhselama dan
different industries.
setelah covid-19: implikasi jangka
REFERENCE pendek dan panjang. The Institute of
Internal Auditors.
Association of Certified Fraud Examiners
(ACFE). (2020) Report to the Nation.
Asia Pacific Fraud Journal, 6(2) July-December 2021: 213-221| 221
Nayak, A., & Dutta, K. (2017). Impacts Russell, S. J., &Norvig, P. (2016). Artificial
of Machine Learning and Intelligence: A Modern Approach.
Artificial Intelligence on Mankind. Malaysia; Pearson Education
Dipresentasikan dalam International Limited.
Conference on Intelligent Computing
Shirgave, Suresh., Awati, Chetan., More,
and Control (I2C2), 1–3.
Rashmi., Patil, Sonam. (2019). A
Prasetyo. (2014). Data Mining Mengolah Review On Credit Card Fraud
Data Menjadi Informasi Meng- Detection Using Machine Learning.
gunakan Matlab. Penerbit Andi. International Journal of Scientific &
Technology Research. 8. 1217-1220.
Peraturan Otoritas Jasa Keuangan
Nomor 35/POJK.05/2018 tentang Somvanshi, M., & Chavan, P. (2016).
Penyelenggaraan Usaha Perusahaan A review of machine learning
Pembiayaan. techniques using decision tree
and support vector machine.
Ramadhan, Dona. (2020). Root Cause
Dipresentasikan dalam International
Analysis Using Fraud Pentagon
Conference on Computing Com-
Theory Approach (A conceptual
munication Control and Automation
framework).” Asia Pacific Fraud
(ICCUBEA).
Journal. Vol 5 No 1 2020.
Tuanakotta, Theodorus M. (2013).
Sudaryono, & Rahardja, Untung & Roihan,
Mendeteksi Manipulasi Laporan
Ahmad. (2017). Design of Business
Keuangan. Jakarta: Salemba Empat.
Intelligence in Learning Systems
Using iLearning Media. Universal
Journal of Management. Vol 5.