Welcome to Scribd!

IEEE CIS Fraud Detection: Kaveri Biswas (DT2019003), Keerthana P Girijan (DT2019004), Shefali Bedarkar (DT2019008)

Uploaded by

0% found this document useful (0 votes)

24 views7 pages

The document summarizes a fraud detection project using credit card transaction data. It outlines the features in the transaction and identity data, including transaction amount, product code, payment card details, and engineered features. It also describes exploratory data analysis of the data, finding it is sparse with 3.52% fraudulent transactions. Missing values were observed in some features. New time-based features like hour, day, and month were created to analyze patterns in fraudulent transactions over time. Fraud was found to be higher between 4am-12pm and lowest from 2pm-4pm, with the highest from 7am-10am.

Original Description:

Original Title

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

24 views7 pages

IEEE CIS Fraud Detection: Kaveri Biswas (DT2019003), Keerthana P Girijan (DT2019004), Shefali Bedarkar (DT2019008)

Uploaded by

Kavya

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 7

Search inside document

IEEE CIS Fraud Detection

Team: Seekers
Kaveri Biswas (DT2019003), Keerthana P Girijan (DT2019004),
Shefali Bedarkar (DT2019008)
Agenda

• Project Description
• Features
• Exploratory Data Analysis
• Missingness and Imputation
• PCA and Feature Engineering
• Balancing Techniques
• Modeling and Score
• Future Work
Project Description

• The dataset of credit card transactions is provided by the Vesta Corporation, said to be world’s
leading payment service company
• The dataset divided into two files, transaction and identity for both train and test
• Train dataset: 354324 x 434; Test dataset: 236216 x 433
• ‘isFraud’ is the binary target variable
Features
• Transaction features:
• TransactionDT: timedelta from a given reference datatime (not an actual timestamp)
• TransactionAMT: transaction amount paid in USD
• ProductCD: product code, the product for each transaction
• card1 – card6: payment card information, such as card type, card category, issue bank, country, etc
• addr: address
• dist: distance
• P_ and R_ emaildomain: purchaser and recipient email domain
• C1-C14: counting, such as how many addresses are found to be associated with the payment card, etc. The actual
meaning is masked
• D1-D15: timedelta, such as days between previous transaction, etc.
• M1-M9: match, such as names on card and address, etc.
• Vxxx: Vesta engineered rich features, including ranking, counting, and other entity relations.
• Identity Features:
• Categorical Features: DeviceType, DeviceInfo, id_12 – id_38
Exploratory Data Analysis (EDA)
• While conducting EDA, we found that the data was sparse
• Only 3.52% of the total transactions were positively classified as ‘isFraud’
• V and id features in train data have more than 70% missing values
• Another observation we made was of ‘TransactionDT’. Both train and test data details had been
taken at the same time but train amount values more than test data
• We created new columns like hours, day, week, and month to take a closer look at the time and
target
• It seems that in the hours from 4am to 12pm the fraction of fraudulent transaction is significantly
higher than other hours. And from hour 2pm to 4pm, the fractions of fraud is the lowest. While
from 7am to 10am the fraction is the highest. So we can create another new feature, classifying
time periods into different levels of warning sign in terms of their fraud fraction.

Data Collection: Six Sigma Thinking, #1
From Everand
Data Collection: Six Sigma Thinking, #1
Sumeet Savant
No ratings yet
Insiders' Guide to Technology-Assisted Review (TAR)
From Everand
Insiders' Guide to Technology-Assisted Review (TAR)
Ernst & Young LLP
No ratings yet
DBMS
Document36 pages
DBMS
gag90
No ratings yet
Sub-Aspects of Analytical CRM: Relationship Data Management, Data Mining and Data Warehouse
Document10 pages
Sub-Aspects of Analytical CRM: Relationship Data Management, Data Mining and Data Warehouse
Bhaskar Saha
No ratings yet
DBMS
Document24 pages
DBMS
Tapaswini Satapathy
No ratings yet
Projectproposal
Document11 pages
Projectproposal
opticdashy10
No ratings yet
Ida A1 12736625
Document11 pages
Ida A1 12736625
opticdashy10
No ratings yet
ACC217 JAN2021 Seminar 5 (S) TDH
Document49 pages
ACC217 JAN2021 Seminar 5 (S) TDH
Chan Yi Lin
No ratings yet
CH 16 Data and Competitive Advantage
Document48 pages
CH 16 Data and Competitive Advantage
thomas alvarez
No ratings yet
Distributed Data Mining in Credit Card Fraud Detection
Document57 pages
Distributed Data Mining in Credit Card Fraud Detection
Pankaj Gorasiya
No ratings yet
Unique Identification System ABSTRACT
Document9 pages
Unique Identification System ABSTRACT
Abhinish Swaroop
No ratings yet
Fundamentals of Data Science
Document62 pages
Fundamentals of Data Science
Dr. C. Deepa HoD AI&DS
100% (1)
Week7 SCM IntegrationTechnologiesEDI WS Cloud
Document130 pages
Week7 SCM IntegrationTechnologiesEDI WS Cloud
Ayush K Saxena
No ratings yet
ECommerce Unit II
Document121 pages
ECommerce Unit II
VIDHYANSH JAIN
No ratings yet
Untitled
Document14 pages
Untitled
Lakhvir Kaur
No ratings yet
HSC Ipt Notes
Document45 pages
HSC Ipt Notes
uhhwot
No ratings yet
Digital Assignment-2: Software Engineering
Document10 pages
Digital Assignment-2: Software Engineering
StarK FTW
No ratings yet
Dice Profile Sireesha Kandula
Document6 pages
Dice Profile Sireesha Kandula
Dave Jones
No ratings yet
Trends in Information Technology Infrastructure: DR Ritu Yadav
Document16 pages
Trends in Information Technology Infrastructure: DR Ritu Yadav
sudhanshu
No ratings yet
Data Mining Techniques (DMT) by Kushal Anjaria Session-1 (Lecture Note)
Document2 pages
Data Mining Techniques (DMT) by Kushal Anjaria Session-1 (Lecture Note)
Mighty Singh
No ratings yet
Ais Week 4
Document2 pages
Ais Week 4
Cristel Ann Dotimas
No ratings yet
E - Commerce
Document34 pages
E - Commerce
Mohit Saini
No ratings yet
PrepaidPayment System
Document3 pages
PrepaidPayment System
saikumarreddy
No ratings yet
Churn Analysis
Document12 pages
Churn Analysis
Isha Handayani
100% (3)
CRC Cards-1
Document17 pages
CRC Cards-1
Naveen Kumar
No ratings yet
Srs Final
Document11 pages
Srs Final
QUDDUS LARIK
No ratings yet
Determining Entity Relationships in Combating Refund Fraud
Document20 pages
Determining Entity Relationships in Combating Refund Fraud
Anto Ili
No ratings yet
Ramu Nelapati
Document5 pages
Ramu Nelapati
Ramu Nelapati
No ratings yet
E-Ticketing On Airline Reservation System
Document34 pages
E-Ticketing On Airline Reservation System
Arun Kanti Manna
No ratings yet
AIS - Chapter 5 System Development (Slide) .
Document102 pages
AIS - Chapter 5 System Development (Slide) .
Ermias Guragaw
No ratings yet
Electronic Payment System: Presented by
Document43 pages
Electronic Payment System: Presented by
sheetal28sv
No ratings yet
Introduction and Objective of Project: Project Genesis: An E-Commerce Website
Document17 pages
Introduction and Objective of Project: Project Genesis: An E-Commerce Website
Vinayak SIngh
No ratings yet
Cad Data 2
Document1 page
Cad Data 2
jahremade jahremade
No ratings yet
Financial Status Analysis of Credit Score Rating Using HMM (Hidden Markov Model)
Document21 pages
Financial Status Analysis of Credit Score Rating Using HMM (Hidden Markov Model)
Anonymous pKxfg8N
No ratings yet
T1 - Data Collection, Reliability and Validity of Data
Document8 pages
T1 - Data Collection, Reliability and Validity of Data
Alba Quintero
No ratings yet
Introduction To Data Science
Document10 pages
Introduction To Data Science
Dewa Sunandar
No ratings yet
Information Technology Infra
Document54 pages
Information Technology Infra
Prem Bahadur Kc
No ratings yet
Ruben A. Parazo Department of Computer Studies
Document7 pages
Ruben A. Parazo Department of Computer Studies
Paula Rodalyn Mateo
No ratings yet
Analyzing Systems Using Data Dictionaries
Document60 pages
Analyzing Systems Using Data Dictionaries
sheshpal
No ratings yet
Management Information System in Telecom Sector
Document31 pages
Management Information System in Telecom Sector
Rahul Pant
83% (6)
Lecture 8 - Evolution, Skills, Challenges Data For Ba
Document5 pages
Lecture 8 - Evolution, Skills, Challenges Data For Ba
Gracy Singh
No ratings yet
Learning Objectives: Identify The Major Categories and Trends of E-Commerce Applications
Document67 pages
Learning Objectives: Identify The Major Categories and Trends of E-Commerce Applications
simbu50
No ratings yet
I-FaKTOR Testimony Bureau Full Document
Document25 pages
I-FaKTOR Testimony Bureau Full Document
myibmcareer
No ratings yet
Synopsis Gillu
Document12 pages
Synopsis Gillu
stifler joe
No ratings yet
Data-Driven Fraud Detection: Bwanika Najib
Document34 pages
Data-Driven Fraud Detection: Bwanika Najib
dhyna
No ratings yet
CSFinal Report
Document33 pages
CSFinal Report
J Nikle Reddy
No ratings yet
Unit 1
Document68 pages
Unit 1
202302090036
No ratings yet
Credit Card Fraud Detection
Document28 pages
Credit Card Fraud Detection
Darshan Jagdale
No ratings yet
Data Mining: Concepts & Techniques
Document29 pages
Data Mining: Concepts & Techniques
Deepika Aggarwal
100% (1)
Lecture 2 SSDM
Document58 pages
Lecture 2 SSDM
PatrickLimo
No ratings yet
Banking Management System-REPORT
Document26 pages
Banking Management System-REPORT
Afnan Bin Abbas
No ratings yet
Week 2
Document36 pages
Week 2
FARYAL FATIMA
No ratings yet
Electronic Procurement Management
Document24 pages
Electronic Procurement Management
RONALD
No ratings yet
Mastercard Identity Check Early Adopter Program Learnings
Document5 pages
Mastercard Identity Check Early Adopter Program Learnings
stan ned
No ratings yet
DM ITERA 2020 w1
Document35 pages
DM ITERA 2020 w1
Afdi Fauzul Bahar
No ratings yet
CS 2 3 4 Aml
Document70 pages
CS 2 3 4 Aml
shruti katare
No ratings yet
Infrastructure E-Business
Document19 pages
Infrastructure E-Business
m jagadish
No ratings yet
Tugas Sistem Informasi Manajemen DFD Rental Cuci Mobil: Disusun Oleh
Document8 pages
Tugas Sistem Informasi Manajemen DFD Rental Cuci Mobil: Disusun Oleh
Muhammad Abrar Raihan
No ratings yet
Principles of Information Systems FUNDSYS 2019
Document12 pages
Principles of Information Systems FUNDSYS 2019
Zdhe
No ratings yet
Digital Project Management: A Comprehensive Guide: cybersecurity and compute, #40
From Everand
Digital Project Management: A Comprehensive Guide: cybersecurity and compute, #40
Chase Roger
No ratings yet
Logistic Regression
Document10 pages
Logistic Regression
Nikhil Gandhi
No ratings yet
DS PGM Using CPP
Document18 pages
DS PGM Using CPP
anand5703
No ratings yet
Planets, Luminaries, Asteroids, and Points in Astrology
Document3 pages
Planets, Luminaries, Asteroids, and Points in Astrology
Sushant Chhotray
No ratings yet
The Postulates of Quantum Mechanics: Postulate 1
Document6 pages
The Postulates of Quantum Mechanics: Postulate 1
sgyblee
No ratings yet
Che Vol1
Document139 pages
Che Vol1
abiraman
No ratings yet
Chapter 7
Document23 pages
Chapter 7
enes_ersoy_3
No ratings yet
Vastu Shastra
Document36 pages
Vastu Shastra
Anjanaya Lamani
0% (1)
Tour & Travel Management System
Document59 pages
Tour & Travel Management System
shravan
95% (21)
The Common Java Cookbook
Document333 pages
The Common Java Cookbook
tmo9d
100% (20)
1 Cath
Document12 pages
1 Cath
Hashem Alsmadi
No ratings yet
Jay Bird
Document37 pages
Jay Bird
vlcmstne04
100% (1)
Light: Year 9 Science Semester Revision
Document4 pages
Light: Year 9 Science Semester Revision
api-32133818
No ratings yet
DTC Table: Caution: Be Sure To Perform Before Starting Diagnosis
Document3 pages
DTC Table: Caution: Be Sure To Perform Before Starting Diagnosis
Bumbu Permata
No ratings yet
k215-165b (15amp Trip Sel CB)
Document1 page
k215-165b (15amp Trip Sel CB)
Claudio Diaz
No ratings yet
Mani Kaul Answer
Document10 pages
Mani Kaul Answer
Keshab R
No ratings yet
Ib Lab - Lenz's Law (DCP Ce)
Document2 pages
Ib Lab - Lenz's Law (DCP Ce)
ringo_tiger
100% (1)
Big-M Two Phase Methods
Document51 pages
Big-M Two Phase Methods
bits_who_am_i
No ratings yet
IT MAth Questions
Document24 pages
IT MAth Questions
Technicianccna
100% (2)
IT Companies - in Mumbai
Document22 pages
IT Companies - in Mumbai
Ricky Ortiz
No ratings yet
Log TMBAG6NEXD0028904 210446km 130765mi
Document6 pages
Log TMBAG6NEXD0028904 210446km 130765mi
Sasa Mitrovic
No ratings yet
Worksheet: Circular Motion and Gravitation-Answers Part A: Multiple Choice
Document17 pages
Worksheet: Circular Motion and Gravitation-Answers Part A: Multiple Choice
elena
No ratings yet
Activate 1 Biology
Document120 pages
Activate 1 Biology
Marina Belloni
100% (1)
Elbi Vessel Data Sheet
Document20 pages
Elbi Vessel Data Sheet
MAZEN
No ratings yet
EN5254 8 10 20 - MobileValves
Document5 pages
EN5254 8 10 20 - MobileValves
Amit Gupta
No ratings yet
11.4.2.5 Packet Tracer - Backing Up Configuration F
Document2 pages
11.4.2.5 Packet Tracer - Backing Up Configuration F
RichardWhitley
100% (1)
Executive Summary by Dr. Eugene Brigham and Dr. Joel Houston
Document12 pages
Executive Summary by Dr. Eugene Brigham and Dr. Joel Houston
CharisseMaeM.Carreon
No ratings yet
SEAWEED
Document118 pages
SEAWEED
JeromeGenilan
No ratings yet
Unification of Euler and Werner Deconvolution in Three Dimensions Via The Generalized Hilbert Transform
Document6 pages
Unification of Euler and Werner Deconvolution in Three Dimensions Via The Generalized Hilbert Transform
Mithun
No ratings yet
HCIP Datacom Advanced RS H12 831 - V1.0 ENU
Document123 pages
HCIP Datacom Advanced RS H12 831 - V1.0 ENU
guido.martini
100% (2)
3 - Intermetallic Compounds of Ni and Ga As Catalysts For The Synthesis of Methanol
Document12 pages
3 - Intermetallic Compounds of Ni and Ga As Catalysts For The Synthesis of Methanol
tunganh1110
No ratings yet