Professional Documents
Culture Documents
PROJECT
“CREDIT CARD FRAUD DETECTION”
By Sunrise Team
Data Science Bootcamp Batch 30
MEET THE TEAM
2) Dataset Story
3) Key Features
4) Observations
5) Analysis Steps
6) Conclusion
BACKGROUND
Credit card fraud is one of the most common types of identity fraud.
pandemic alone.
This has been sustained since, with the National Fraud Hunter Prevention
Service revealing that UK credit card fraud reached a five-year high in the last
1 2 3
Data Preprocessing Ex p lo ra t o ry D a t a Mo de lling
A n a ly s is ( ED A)
1
Heatmap * Few features have high co-relation among different features.
* V17 and V18 are highly co-related.
* V16 and V17 are highly co-related.
* V14 has a negative correlation with V4.
* V12 is also negatively correlated with V11.
* V11 is ngetively co-related with V10 and positvely with V4.
* V3 is positevely co-related with V10 and V12.
* V9 and V10 are also positively co-related.
EDA V1
V2
V3
-0.08
-1.40
0.01
V4 -0.04
V5 1.51
V6 -0.20
V7 19.03
2
V8 0.30
V9 0.17
V10 0.74
V11 -0.02
V12 0.07
V13 0.01
V14 0.21
V15 0.01
V16 0.27
V17 0.37
3
The Distribution of
'amount feature'
MODELLING
1
Logistic
Regression
MODELLING
4
XGBoost
Conclusion
Kinerja model terbaik didapatkan pada metode XGBoost dengan akurasi 0,97 yang
artinya sebesar 97% model dapat mengklasifikasikan true positive dan true negative
dengan benar
Pengembangan model yang efektif untuk deteksi penipuan adalah penting. Model
harus memiliki kemampuan untuk mengenali pola-pola yang mencurigakan dalam
transaksi kartu kredit.
Karena pola penipuan dapat berubah seiring waktu, model mungkin perlu disesuaikan
secara berkala untuk tetap efektif dalam mendeteksi penipuan yang baru muncul.
Memahami fitur-fitur yang paling berpengaruh dalam deteksi penipuan adalah
penting. Beberapa fitur mungkin memiliki keterkaitan yang tinggi dengan
kemungkinan penipuan.
Pentingnya deteksi dini penipuan kartu kredit. Semakin cepat penipuan terdeteksi,
semakin kecil kerugian yang mungkin terjadi.
THANK YOU