Problem Statement - Phishing URL Detection

Uploaded by

23ume534

Problem Statement: Phishing URL Detection

Objective: Develop a robust phishing URL detection model using machine learning techniques.

Dataset Description:

● Dataset Type: Tabular
● Associated Tasks: Classification
● Feature Types: Real, Categorical, Integer
● Number of Instances: 188690
● Number of Features: 54

Features Available:

● URLLength: Integer representing the length of the URL.
● Domain: Categorical feature indicating the domain of the URL.
● DomainLength: Integer representing the length of the domain name.
● IsDomainIP: Binary integer indicating if the domain is represented as an IP address.
● TLD: Categorical feature denoting the top-level domain of the URL.
● URLSimilarityIndex: Integer measuring similarity to known phishing or legitimate URLs.
● CharContinuationRate: Integer indicating character continuation rate in the URL.
● TLDLegitimateProb: Continuous feature representing probability of the TLD being legitimate.
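The simpler features in this list can be illustrated with a short sketch. The dataset's exact feature definitions are not given here, so the function below is an assumption about how URLLength, Domain, DomainLength, IsDomainIP, and TLD might be derived from a raw URL; the name `basic_url_features` is hypothetical and uses only the Python standard library.

```python
import ipaddress
from urllib.parse import urlparse


def basic_url_features(url: str) -> dict:
    """Derive a few URL features (assumed definitions, not the dataset's own)."""
    domain = urlparse(url).hostname or ""

    # IsDomainIP: 1 if the host parses as an IPv4/IPv6 address, else 0.
    try:
        ipaddress.ip_address(domain)
        is_ip = 1
    except ValueError:
        is_ip = 0

    # TLD: text after the last dot; left empty for IP hosts or dotless hosts.
    tld = "" if is_ip or "." not in domain else domain.rsplit(".", 1)[1]

    return {
        "URLLength": len(url),
        "Domain": domain,
        "DomainLength": len(domain),
        "IsDomainIP": is_ip,
        "TLD": tld,
    }


if __name__ == "__main__":
    print(basic_url_features("https://www.example.com/path"))
    print(basic_url_features("http://192.168.0.1/login"))
```

URLSimilarityIndex, CharContinuationRate, and TLDLegitimateProb are left out because their definitions cannot be recovered from the descriptions above.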

Target Variable:

● Label: Binary variable where 1 denotes a legitimate URL and 0 denotes a phishing URL.
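Because phishing is encoded as 0 here, metrics that assume the class of interest is 1 would describe the legitimate class instead. The sketch below, with the hypothetical helper `phishing_metrics`, shows one way to compute precision and recall with phishing (0) treated as the positive class. It is a plain-Python illustration, not a prescribed evaluation method.

```python
def phishing_metrics(y_true, y_pred, phishing_label=0):
    """Precision and recall with the phishing class (Label == 0) as positive."""
    pairs = list(zip(y_true, y_pred))
    # True positives: phishing URLs predicted as phishing.
    tp = sum(1 for t, p in pairs if t == phishing_label and p == phishing_label)
    # False positives: legitimate URLs predicted as phishing.
    fp = sum(1 for t, p in pairs if t != phishing_label and p == phishing_label)
    # False negatives: phishing URLs predicted as legitimate.
    fn = sum(1 for t, p in pairs if t == phishing_label and p != phishing_label)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {"tp": tp, "fp": fp, "fn": fn, "precision": precision, "recall": recall}


if __name__ == "__main__":
    # Toy labels for illustration only; 0 = phishing, 1 = legitimate.
    truth = [0, 0, 0, 1, 1, 1]
    predicted = [0, 0, 1, 1, 1, 0]
    print(phishing_metrics(truth, predicted))
```

In a notebook that uses scikit-learn, the same effect is usually obtained by passing `pos_label=0` to `precision_score` and `recall_score`.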

Task:

Conduct a thorough exploratory data analysis (EDA): examine feature distributions, handle missing values, and visualize relationships between the features and the target variable (Label). Then select the features that best differentiate phishing from legitimate URLs, engineering new features where doing so improves model performance. After feature selection, choose an appropriate classification approach, split the dataset into training and testing sets, and train the model on the training data. Finally, evaluate the model on the testing set to confirm that it correctly predicts whether a URL is phishing or legitimate from the selected features.

Instructions:
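Two of the steps described above, checking for missing values and splitting into training and testing sets, can be sketched without any third-party library. The helpers `missing_counts` and `stratified_split` are hypothetical names, and the 80/20 split and fixed seed are assumptions rather than requirements of the task. Rows are assumed to be dictionaries keyed by column name, with `None` or an empty string marking a missing value.

```python
import random
from collections import defaultdict


def missing_counts(rows):
    """Count missing values (None or empty string) per column."""
    counts = defaultdict(int)
    for row in rows:
        for key, value in row.items():
            if value is None or value == "":
                counts[key] += 1
    return dict(counts)


def stratified_split(rows, label_key="Label", test_fraction=0.2, seed=42):
    """Split rows into train/test while keeping the class ratio of label_key."""
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[label_key]].append(row)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, test = [], []
    for group in by_label.values():
        shuffled = group[:]  # copy so the caller's data is left untouched
        rng.shuffle(shuffled)
        n_test = round(len(shuffled) * test_fraction)
        test.extend(shuffled[:n_test])
        train.extend(shuffled[n_test:])
    return train, test


if __name__ == "__main__":
    # Toy rows for illustration only; they are not drawn from the real dataset.
    toy = [{"URLLength": 20 + i, "Label": i % 2} for i in range(20)]
    toy[3]["URLLength"] = None
    print(missing_counts(toy))
    train_rows, test_rows = stratified_split(toy)
    print(len(train_rows), len(test_rows))
```

Stratifying on Label keeps the phishing/legitimate ratio the same in both sets. In a notebook, pandas `DataFrame.isna().sum()` and scikit-learn's `train_test_split` with its `stratify` argument cover the same ground.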

● Use Google Colab or Jupyter Notebook for coding. Extract and load the dataset provided in a zip folder for analysis.
● Write original code; direct copy-pasting will result in disqualification. Ensure your code is well-structured with clear comments explaining each step.
● Be ready to explain the mathematical concepts and basic workings of your code during evaluation.
● Submit your finalized Notebook (.ipynb) file by July 27, 2024. Late submissions will not be considered. Provide the link to your .ipynb file on the shared submission sheet.
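Loading the dataset from the zip folder might look like the sketch below. The archive and file names are not stated in the instructions, so the function simply reads the first `.csv` member it finds; `load_csv_from_zip` is a hypothetical helper, and the assumption that the data is a UTF-8 CSV file should be checked against the actual archive.

```python
import csv
import io
import zipfile


def load_csv_from_zip(zip_path):
    """Read the first .csv file inside a zip archive into a list of dict rows."""
    with zipfile.ZipFile(zip_path) as archive:
        csv_names = [n for n in archive.namelist() if n.lower().endswith(".csv")]
        if not csv_names:
            raise FileNotFoundError(f"no .csv file found in {zip_path}")
        with archive.open(csv_names[0]) as raw:
            # Wrap the binary stream so csv can read it as text (UTF-8 assumed).
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.DictReader(text))
```

Every value comes back as a string, so numeric columns such as URLLength need converting before analysis. In a notebook, `pandas.read_csv` can also read a zip archive that contains a single CSV file directly.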
