AI - Predicting Cyberbullying On Social Media

Nothing

Uploaded by

Kethavath Saritha76

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views4 pages

AI - Predicting Cyberbullying On Social Media

Nothing

Uploaded by

Kethavath Saritha76

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

1.

Title:
Predic ng Cyberbullying on Social Media Using Machine Learning Techniques

2. Project Statement:

The project aims to address the escala ng concern of cyberbullying on social media pla orms (such
as Twi er (or X), Instagram, Facebook, etc.) by u lising machine learning and deep learning
algorithms, such as Support Vector Machine (SVM), Convolu onal Neural Networks (CNN), Random
Forest etc to predict and report them.
To improve the accuracy of the solu on, the project will leverage the Natural Language Toolkit for
data preprocessing and feature extrac on. Then, using the ML techniques, models will be built and
evaluated to e ec vely dis nguish cyberbullying on social media. This project will be helpful for
mely detec on of bullying episodes and providing assistance to vic ms.

3. Outcomes:

• Real Time Detec on of Cyberbullying Episodes: Crea on of a real- me system that

monitors social media data and alerts authori es or organisers of poten al harassment
or bullying on social media.
• Understanding of Types of Cyberbullying: Researchers can also gain insights on the kind
of cyberbullying such as harassment over Age, Religion, Ethnicity, Gender, etc. The
authori es can then take necessary steps based on the type of cyberbullying.
• Deployment and integra on: Researchers can focus on the deployment and integra on
of cyberbullying tweet predic on models into exis ng social media pla orms. This can
provide real- me feedback to users and contribute to a safer and more inclusive online
environment.
• Contribu on to Cyber Safety: The ul mate outcome of such a project would be to
contribute to cyber safety and security by providing a tool that can detect harassment on
social media. Cyberbullying is a grave issue with severe consequences and such ML models
can provide promising solu ons to combat it.

Modules to be Implemented:
1. Data Inges on
2. Exploratory Data Analysis (EDA)
3. Data Preprocessing using NLP techniques
4. Machine Learning Models (Random Forest, SVM, CNN, etc.)
5. Evalua on and Compara ve Analysis of Models
6. Project Presenta on & Documenta on
ti
tt
ti
ti
ti
ti
ti
ti
ti
ff
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
tf
ti
ti
tf
Week-wise Implementa on Plan of Modules:

Milestone 1: Week 1-2

Module 1: Data Collec on and Impor ng Relevant Libraries
• Understand the problem statement
• Gather Twitter (or X) data from relevant sources
• Import relevant libraries on Python
Module 2: Exploratory Data Analysis (EDA)
The goal is to perform EDA on the raw data and provide data visualisa ons in the form of
charts. Examples below:
• Plot the distribu on of tweets labelled on type of cyberbullying
• Plot distribution charts based on word lengths
• Plot word clouds for different label classifications
• Bar charts based on common words

Milestone 2: Week 3-4

Module 3: Data Preprocessing
The social media data (tweets in this case) consists of massive amounts of noise. Therefore, a
rigorous data preprocessing will be implemented to ensure the quality and reliability of the
dataset. This will involve:
• Cleaning and ltering the social media content to remove noise, irrelevant informa on, and
duplicate posts.
• NLP techniques will be used for text normalisa on, tokenisa on, stemming, and removal of
stop words to standardise the textual data.
fi
ti
ti
ti
ti
ti
ti
ti
ti
Milestone 3: Week 5-6
Module 4: Building Machine Learning Techniques
The goal is to build a suite of sophis cated ML models on the transformed textual data to
iden fy cyberbullying. These models are recursively evaluated and tuned to make them more
e ciently predic ve. Some of the proposed models are:
• Convolu onal Neural Networks: CNN models are designed to process data through
mul ple layers of arrays. Text-based CNNs work on word embeddings in the form of
matrices.
• Random Forest: RF combines several di erent classi ers to nd solu ons to complex tasks.
A random forest is essen ally an algorithm consis ng of mul ple decision trees, trained by
bagging or bootstrap aggrega ng. A random forest text classi ca on model predicts an
outcome by taking the decision trees' mean output.
• Naïve Bayes model: A probabilis c supervised learning approach that works with a
likelihood func on that illustrates the probability of witnessing a speci c value of a feature.
• Support Vector Machine: SVM is a supervised ML model that uses classi ca on techniques
to categorise new text a er being given labeled training data sets for each category.
Milestone 4: Week 7
Module 5: Evalua on and Compara ve Analysis of Models
The goal is to do a compara ve analysis of the results obtained from the implementa on of
various algorithms on selected datasets.
• U lise parameters such as accuracy, recall, precision and F1-score to carry out this analysis.
• To the best performing models, provide as series of texts to observe and record real- me
predic ons.

Milestone 5: Week 8
Module 6: Project Presenta on and Documenta on
• Prepare a presenta on and demo with following structure:
o Problem Statement and Objec ve
o Methodology (Brief overview of models used)
o Results & Insights (emphasise on key takeaways)
o Visualisa ons
o Q&A Session
• Clear visualisa ons and minimum overly technical text in presenta ons.
• Documenta on prepara on in below men oned format:
ti
ti
ffi
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
ft
ti
ti
ti
ti
ti
ti
ti
ff
ti
ti
ti
ti
fi
fi
ti
fi
ti
ti
ti
fi
fi
ti
ti
ti
o Project Overview: Problem statement, goals, expected outcomes
o Data Sources: Details on where data was acquired
o Data Preprocessing and Cleaning: Steps taken, techniques used, jus ca on
o Exploratory Data Analysis: Summary of ndings, key visualisa ons
o Model Development: Explana on of model choices, ra onale for parameter selec on
o Model Evalua on: Performance metrics used, comparison of di erent models
o Predic ve Results: Examples of predictions of cyberbullying
o Appendix: Code snippets (well-commented), addi onal visualisa ons, etc.

Evalua on Criteria:
Milestone 1 Evalua on (Week 1-2):
• Successful loading of the dataset into a suitable format (e.g., Pandas Data Frame inPython).
• Iden ca on of missing and duplicate values and handling strategy
• Approval of Ini al summary sta s cs to understand the data distribu on.
• Approval of thorough examina on of data distribu ons (histograms, box plots, etc.).
Milestone 2 Evalua on (Week 3-4):
• Approval of steps for data preprocessing techniques and its implementa on.
• Approval of outcomes of the data preprocessing through visualisa ons of input vs output data
for each data preprocessing step.

Milestone 3 Evalua on (Week 5-6):

• Approval of the Machine Learning models and architectures to be used on the processed
dataset.
• Approval of the hyperparameter tuning process and the range of parameters explored.
• Comple on and approval of performance metrics for all built models.

Milestone 4 Evalua on (Week 7-8):

• Approval of the nal model based on evalua on criteria
• Approval of the presenta on and project documenta on.
• Final code submission on GitHub.

Trigger Warning: The cyberbullying datasets may contain strong language on sensi ve topics, such as
violence, abuse, discrimina on, and/or mental health issues
ti
fi
ti
ti
ti
ti
ti
ti
fi
ti
ti
ti
ti
ti
ti
ti
ti
ti
ti
fi
ti
ti
ti
ti
ti
ti
ff
ti
ti
ti
fi
ti
ti
ti
ti
ti

Team 7
No ratings yet
Team 7
11 pages
Nasimuzzaman (M230105004 Project)
No ratings yet
Nasimuzzaman (M230105004 Project)
32 pages
Rahoof
No ratings yet
Rahoof
14 pages
CyberbullyingDetection - Documentation
No ratings yet
CyberbullyingDetection - Documentation
12 pages
Predicting Cyberbullying in Social Media Using Machine Learning
No ratings yet
Predicting Cyberbullying in Social Media Using Machine Learning
7 pages
Detection of Cyber Bullying On Social Media Using
No ratings yet
Detection of Cyber Bullying On Social Media Using
10 pages
Cyber-Bullying Detection via Fuzzy Logic
No ratings yet
Cyber-Bullying Detection via Fuzzy Logic
3 pages
Detection Oof Cyber Bullying in Social Media Using Machine Learningppt
No ratings yet
Detection Oof Cyber Bullying in Social Media Using Machine Learningppt
19 pages
CBDPPT
No ratings yet
CBDPPT
25 pages
Abs 1
No ratings yet
Abs 1
2 pages
Cyberbullying Detection Using Deep Learning
No ratings yet
Cyberbullying Detection Using Deep Learning
5 pages
Cyberbullying Detection Using Machine Learning and Deep Learning - PPTX 1
No ratings yet
Cyberbullying Detection Using Machine Learning and Deep Learning - PPTX 1
6 pages
Batch-9 Paper
No ratings yet
Batch-9 Paper
8 pages
Detection of Cyberbullying On Social Media Using Machine Learning
No ratings yet
Detection of Cyberbullying On Social Media Using Machine Learning
67 pages
Machine Learning for Cyber Bullying Detection
No ratings yet
Machine Learning for Cyber Bullying Detection
5 pages
Cyberbullying Detection with AI
No ratings yet
Cyberbullying Detection with AI
13 pages
Cyberbullying Detection Using NLP (r3) - 1
No ratings yet
Cyberbullying Detection Using NLP (r3) - 1
45 pages
Cyberbullying IPR
No ratings yet
Cyberbullying IPR
25 pages
Automated Detection of Cyber Bullying
No ratings yet
Automated Detection of Cyber Bullying
3 pages
CBDA Research Paper
No ratings yet
CBDA Research Paper
29 pages
Cyberbullying
No ratings yet
Cyberbullying
18 pages
Yshu
No ratings yet
Yshu
23 pages
Bullynet Unmasking Cyberbullying.2nd Review
No ratings yet
Bullynet Unmasking Cyberbullying.2nd Review
35 pages
Big Data Meets Social Media: Predicting Cyberbullying With Machine Learning Algorithms
No ratings yet
Big Data Meets Social Media: Predicting Cyberbullying With Machine Learning Algorithms
10 pages
Project Documentation
No ratings yet
Project Documentation
11 pages
Paper 4
No ratings yet
Paper 4
5 pages
Yogeesh
No ratings yet
Yogeesh
9 pages
CBDA Research Paper
No ratings yet
CBDA Research Paper
19 pages
1.social Media Cyber Bullying Detection Using Machine Learning.
No ratings yet
1.social Media Cyber Bullying Detection Using Machine Learning.
11 pages
Pre-Defence PPT - pptx201-15-14053
No ratings yet
Pre-Defence PPT - pptx201-15-14053
20 pages
INTRO Merged
No ratings yet
INTRO Merged
52 pages
Batch 17
No ratings yet
Batch 17
27 pages
Project A1
No ratings yet
Project A1
3 pages
Cyberbullying
No ratings yet
Cyberbullying
6 pages
Deep Learning Algorithms For Cyber-Bulling Detection in Social Media Platfo 20250424 135605 0000
No ratings yet
Deep Learning Algorithms For Cyber-Bulling Detection in Social Media Platfo 20250424 135605 0000
8 pages
A Comprehensive Review On Cyberbullying Prevention
No ratings yet
A Comprehensive Review On Cyberbullying Prevention
7 pages
Cyberdetection Report
No ratings yet
Cyberdetection Report
57 pages
Cyberbully Detection on Twitter Using NLP & CNN
No ratings yet
Cyberbully Detection on Twitter Using NLP & CNN
50 pages
Cyberbullying Paper
No ratings yet
Cyberbullying Paper
9 pages
Impact Factor: 8.165: Volume 10, Issue 3, March 2022
No ratings yet
Impact Factor: 8.165: Volume 10, Issue 3, March 2022
7 pages
Cyberbullying Detection
No ratings yet
Cyberbullying Detection
20 pages
Capstone Project Proposal: Suraksha
No ratings yet
Capstone Project Proposal: Suraksha
12 pages
Icest Journal Paper
No ratings yet
Icest Journal Paper
12 pages
Machine Learning for Cyberbullying Detection
No ratings yet
Machine Learning for Cyberbullying Detection
9 pages
Cyberbullying Detection Using Machine Learning
No ratings yet
Cyberbullying Detection Using Machine Learning
6 pages
Major Rohith Major
No ratings yet
Major Rohith Major
41 pages
Intro Merged
No ratings yet
Intro Merged
46 pages
Detection of Cyberbullying Using Machine Learning and Deep Learning
No ratings yet
Detection of Cyberbullying Using Machine Learning and Deep Learning
20 pages
Cyberbullying Detection Based On Semantic Enhanced Marginalised Denoising Autoencoder - Report
No ratings yet
Cyberbullying Detection Based On Semantic Enhanced Marginalised Denoising Autoencoder - Report
71 pages
21BCM503 Anushya Mijorproject
No ratings yet
21BCM503 Anushya Mijorproject
57 pages
Cyberbullying Detection Using ML
No ratings yet
Cyberbullying Detection Using ML
54 pages
Survey Paper
No ratings yet
Survey Paper
8 pages
Spring 2025 - CS619 - 10969
No ratings yet
Spring 2025 - CS619 - 10969
4 pages
Final Project Report
No ratings yet
Final Project Report
31 pages
Machine Learning-Based Strategies For Detecting Cyberbullying in Online Chats
No ratings yet
Machine Learning-Based Strategies For Detecting Cyberbullying in Online Chats
4 pages
Major Project Presentation Template For Review 1
No ratings yet
Major Project Presentation Template For Review 1
49 pages
Assessment 2 UEL CN 7000
No ratings yet
Assessment 2 UEL CN 7000
10 pages
ML Digit Classification Report
No ratings yet
ML Digit Classification Report
2 pages
160 300 2 PB
No ratings yet
160 300 2 PB
10 pages
A Comparison of Machine Learning Algorithms For Customer Churn Prediction
No ratings yet
A Comparison of Machine Learning Algorithms For Customer Churn Prediction
6 pages
Online Payment Fraud Detection ML
No ratings yet
Online Payment Fraud Detection ML
17 pages
Network Traffic Prediction in 4G - LTE Using Machine Learning Techniques
No ratings yet
Network Traffic Prediction in 4G - LTE Using Machine Learning Techniques
7 pages
Advanced Data Science & AI Course
No ratings yet
Advanced Data Science & AI Course
51 pages
Stock Market Prediction Using Big Data: February 2023
No ratings yet
Stock Market Prediction Using Big Data: February 2023
9 pages
1 s2.0 S2772442522000016 Main
No ratings yet
1 s2.0 S2772442522000016 Main
18 pages
Andres Limon Alcocer English
No ratings yet
Andres Limon Alcocer English
1 page
Final
No ratings yet
Final
21 pages
Scikit-learn ML Course Guide
100% (1)
Scikit-learn ML Course Guide
23 pages
1 s2.0 S2214509524001426 Main
No ratings yet
1 s2.0 S2214509524001426 Main
21 pages
Project 3
No ratings yet
Project 3
4 pages
Diabetes Prediction System
No ratings yet
Diabetes Prediction System
25 pages
Project Report
70% (10)
Project Report
47 pages
Credit Scoring Optimization for Banks
No ratings yet
Credit Scoring Optimization for Banks
26 pages
Medical Insurance Cost Prediction
100% (1)
Medical Insurance Cost Prediction
18 pages
Bagging Trees & Random Forests Guide
No ratings yet
Bagging Trees & Random Forests Guide
50 pages
Milk Cap - Project - Report
No ratings yet
Milk Cap - Project - Report
67 pages
Week 7 Solution
100% (1)
Week 7 Solution
4 pages
Machine Learning With Boosting
100% (1)
Machine Learning With Boosting
212 pages
Ajanah, Hakeema Ize Final Project
No ratings yet
Ajanah, Hakeema Ize Final Project
97 pages
Ritesh Tandon Machine Learning Project
100% (5)
Ritesh Tandon Machine Learning Project
23 pages
Discovering The Symptom Patterns of COVID-19 From Recovered and
No ratings yet
Discovering The Symptom Patterns of COVID-19 From Recovered and
9 pages
Used Car Price Prediction Model
No ratings yet
Used Car Price Prediction Model
10 pages
IEEE Format Paper
No ratings yet
IEEE Format Paper
20 pages
Decision Tree
No ratings yet
Decision Tree
18 pages
Multi-Disease Prediction System Proposal
No ratings yet
Multi-Disease Prediction System Proposal
4 pages