Fake URL Detection Using
Machine Learning
Presented by: [Your Name(s) & USN]
Department: Computer Science and
Engineering
Under the guidance of: [Faculty
Name, Designation]
ABSTRACT
• With the increasing use of the internet,
phishing and fake websites have become a
serious threat to users. Attackers create fake
URLs that look similar to legitimate websites
to steal sensitive information.
• This project aims to develop a Fake URL
Detection System using machine learning
algorithms that classify URLs as legitimate or
fake based on their features such as length,
INTRODUCTION
• The growth of e-commerce, online banking,
and digital communication has increased
cybercrime risks. Fake URLs are one of the
most common tools used by attackers for
phishing.
• By analyzing patterns in URLs, we can build
intelligent systems capable of predicting
whether a given website is genuine or not.
PROBLEM STATEMENT
• Identifying fake URLs manually is difficult for
users.
• Traditional security mechanisms like antivirus
or firewalls may not detect newly created
phishing sites.
• Hence, there is a need for an automated
machine learning-based system that
accurately distinguishes between fake and
LITERATURE SURVEY
• 1. 'Phishing Website Detection Using Machine
Learning' – Proposed Random Forest classifier
achieving 95% accuracy.
• 2. 'URL Feature Extraction for Malicious
Detection' – Focused on lexical and host-based
features for better performance.
• 3. 'Deep Learning Approach for Fake URL
Detection' – Applied neural networks to
METHODOLOGY
• Step 1: Data Collection – Dataset of real and
fake URLs.
• Step 2: Feature Extraction – Extract features
like URL length, dots count, domain age,
HTTPS usage, etc.
• Step 3: Model Training – Train using ML
algorithms such as Logistic Regression,
Random Forest, or Decision Tree.
• Step 4: Testing & Evaluation – Evaluate model
accuracy, precision, and recall.
HARDWARE AND SOFTWARE
IMPLEMENTATION
• Hardware Requirements:
• - Standard PC/Laptop (Intel i5, 4GB RAM or
higher)
• Software Requirements:
• - Python 3.x
• - Libraries: scikit-learn, pandas, numpy, flask
• - Jupyter Notebook / VS Code
• - Dataset (Phishing/Legitimate URLs dataset)
CONCLUSION
• The Fake URL Detection System effectively
detects phishing and fake websites with high
accuracy.
• It can be integrated into browsers or email
clients to protect users from online fraud.
• Future work includes using deep learning
models and real-time browser extensions.
THANK YOU!
•