Department of Computer Engineering: Phishing Website Detector Using ML

G.H.
RAISONI COLLEGE OF ENGINEERING

& MANAGEMENT, PUNE
Department of Computer Engineering
< Phishing Website Detector Using ML>
Guide: Co-Guide:
Prof.Nivedita Kadam Prof.Gayatree Bedre
Name of Projectees
BCOB81-Safin Tamboli BCOB82-Sairaj Jadhav

BCOB85-Sarthak Pawar BCOB90-Shoeb Shah
Contents
Introduction
Justifications for Selecting the Title
Problem Statement
Examples of Phishing Websites
Literature Survey
Block Diagram
Expected Result
Work plan
Future Scope
References
Introduction
Nowadays Phishing becomes a main area of concern for security researchers

because it is not difficult to create the fake website which looks so close to
legitimate website. Experts can identify fake websites but not all the users can
identify the fake website and such users become the victim of phishing attack.
Main aim of the attacker is to steal banks account credentials.
Phishing attacks are becoming successful because lack of user awareness. Since
phishing attack exploits the weaknesses found in users, it is very difficult to
mitigate them, but it is very important to enhance phishing detection techniques.
Justification For Selecting The Title
The main purpose of the project is to detect a website created by any phisher
for hacking data from a user and making aware the user of such threats once
detected. It proposes to prove much beneficial for users for safe browsing and
keeping their data untouched by any phisher who is trying to use the user’s
credentials in illegal means.
So the title mentioned clearly gives the ideology and goal of our project.
i.e. “Phishing Website Detector Using ML)”
Problem Statement
Machine learning technology consists of a many algorithms which requires past

data to make a decision or prediction on future data. Using this technique,
algorithm will analyze various blacklisted and legitimate URLs and their
features to accurately detect the phishing websites including zero- hour
phishing websites.
The following were the questions we will be proposing to solve through this
project-
1.How URL detectors identify the phishing URLs or websites?
2.How to apply ML methods to classify malicious and legitimate websites?
3.How to evaluate a URL detector performance?

Examples of Phishing Websites-
1. Phishing Website send via mail-
2. Phishing Website sent via SMS-

Literature Survey
Sr. No. Paper title & its author Methodology Advantages Future Scope
1. Title: Detecting phishing websites using The proposed framework employs RNN- The outcome of this study reveals The future direction of this study is to
machine learning technique LSTM to identify the properties Pm and that the proposed method presents develop an unsupervised deep learning
Pl in an order to declare an URL as superior results rather than the method to generate insight from a URL.
Author: Ashit Kumar Dutta malicious or legitimate. existing deep learning methods
2. Title: URLs of benign websites were collected We have implemented python In future hybrid technology will be
from www.alexa.com and The URLs of program to extract features from implemented to detect phishing
Phishing Website Detection using Machine phishing websites were collected from URL. Below are the features that we websites more accurately, for which
Learning Algorithms www.phishtank.com. have extracted for detection of random forest algorithm of machine
phishing URLs. learning technology and blacklist method
Authors: And Classifies using Decision Tree as
will be used.
Splitter.
Rishikesh Mahajan
3. Title :Phishing Website Detection Using To select features, we used the Weka In our approach, to find most This is important, as we hope with a
Machine Learning Classifiers Optimized by tool and its algorithms for feature valuable features we used multiple decrease in the number of features, we
selection. feature selection filters. The outputs decreased time needed to build a model.
Feature Selection.
of these filters are analyzed and
To perform phishing websites
features that are proposed as most
detection, in this work we applied K-
important.
Nearest Neighbor (KNN)
Authors: Dželila Mehanović* | Jasmin Kevrić
4. Title: AN APPROACH FOR DETECTING PHISHING It consists of the parallel Support vector machine gives an In the future, we can find a better way to
ATTACKS accuracy of 91.3% on test data set. find a phishing website by using
decision tree which take the input and
This helps in providing accuracy. advanced features of the URL.
USING MACHINE LEARNING TECHNIQUES. produce a specific class. Thus, n number
of trees produce different classes.
Author: K.Venkateshwara Rao

Block Diagram
Block diagram of various stages of project-
Detecting Phising Websites using ML

Phising Website Detector Using ML
Algorithms
Phistank
Feature
Malicious URLs Legitimate URLs Extraction
Crawler
Data
RNN & RNN

Emails/SMS/ Random
Forest Testing Training
Enterprises
Phase Phase
Evaluating The
Result
Future Scope-
1.Creating a safe user friendly environment which can detect illegitimate activities.

2.It is possible to report and block a hacker using phishing website URL and tracing the
location of such anonymous hackers.

3.Awareness can be created among users by displaying certain type of Phishing URLs
available or cause more harm to our system like zero hour phishing websites.

Expected Result
System Description: Detecting Websites/URLs
Input: URLs, Random websites, Transaction IDs, Suspicious Mails

Output: Safe for Browsing (Continue) / Unsafe For Browsing (Block Website)
Possible Success Conditions: Developing a cautious way of browsing on internet,

checking random URLs forwarded on our mails or social media.
Failure Conditions: New format of phishing website may go undetected.
Work – Plan
Months Activities AUG’22 SEP’22 OCT’22 NOV’22 DEC’23 JAN’23 FEB’23
Literature Reviews √ √
Component Identification & Selection √
Designing √ √ √
Experimental Analysis √ √
Fabrication
Testing and Debugging
Preparation of Project Report

References
1. Anti-Phishing Working Group (APWG), https://docs.apwg.org//reports/apwg_trends_report_q4_2019. pdf
2. Jain A.K., Gupta B.B. “PHISH-SAFE: URL Features-Based Phishing Detection System Using Machine Learning”, Cyber
Security. Advances in Intelligent Systems and Computing, vol. 729, 2018, https://doi. org/10.1007/978-981-10-
8536-9_44
3. Purbay M., Kumar D, “Split Behavior of Supervised Machine Learning Algorithms for Phishing URL Detection”,
Lecture Notes in Electrical Engineering, vol. 683, 2021, https://doi.org/10.1007/978-981- 15-6840-4_40
4. Gandotra E., Gupta D, “An Efficient Approach for Phishing Detection using Machine Learning”, Algorithms for
Intelligent Systems, Springer, Singapore, 2021, https://doi.org/10.1007/978-981-15-8711-5_ 12.
5. Hung Le, Quang Pham, Doyen Sahoo, and Steven C.H. Hoi, “URLNet: Learning a URL Representation with Deep
Learning for Malicious URL Detection”, Conference’17, Washington, DC, USA, arXiv:1802.03162, July 2017.
6. Hong J., Kim T., Liu J., Park N., Kim SW, “Phishing URL Detection with Lexical Features and Blacklisted Domains”,
Autonomous Secure Cyber Systems. Springer, https://doi.org/10.1007/978-3-030-33432- 1_12.
7. J. Kumar, A. Santhanavijayan, B. Janet, B. Rajendran and B. S. Bindhumadhava, “Phishing Website Classification

and Detection Using Machine Learning,” 2020 International Conference on Computer Communication and
Informatics (ICCCI), Coimbatore, India, 2020, pp. 1–6, 10.1109/ ICCCI48352.2020.9104161.
Thank you !

Department of Computer Engineering: Phishing Website Detector Using ML

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Department of Computer Engineering: Phishing Website Detector Using ML

Uploaded by

Copyright:

Available Formats

G.H.

RAISONI COLLEGE OF ENGINEERING

< Phishing Website Detector Using ML>

BCOB81-Safin Tamboli BCOB82-Sairaj Jadhav

Nowadays Phishing becomes a main area of concern for security researchers

Machine learning technology consists of a many algorithms which requires past

1.How URL detectors identify the phishing URLs or websites?

2.How to apply ML methods to classify malicious and legitimate websites?

3.How to evaluate a URL detector performance?

1. Phishing Website send via mail-

2. Phishing Website sent via SMS-

Author: K.Venkateshwara Rao

Detecting Phising Websites using ML

RNN & RNN

System Description: Detecting Websites/URLs

Input: URLs, Random websites, Transaction IDs, Suspicious Mails

Possible Success Conditions: Developing a cautious way of browsing on internet,

Testing and Debugging

Preparation of Project Report

1. Anti-Phishing Working Group (APWG), https://docs.apwg.org//reports/apwg_trends_report_q4_2019. pdf

7. J. Kumar, A. Santhanavijayan, B. Janet, B. Rajendran and B. S. Bindhumadhava, “Phishing Website Classification

You might also like