Sentiment Analysis Project Report

The sentiment analysis project utilized DistilBERT to classify tweets into positive, neutral, and negative categories. The methodology included text preprocessing, tokenization, and model training, resulting in an accuracy of 85%. Future enhancements and dataset augmentation are suggested for improved performance in real-world applications.

Uploaded by

AMR 66

Sentiment Analysis Project Report

Amr Khaled 21100834 Amira Ali 21100789

1. Introduction:

Sentiment analysis is the process of analyzing text data to determine the
sentiment or emotion it conveys, such as positive, negative, or neutral. This
project uses a transformer-based architecture, specifically DistilBERT, to
classify tweets into three categories: positive, neutral, and negative.

2. Dataset Details:
Columns:

• id: Unique identifier for each tweet.

• label: Sentiment label (0 for neutral, 1 for positive, -1 for negative).

• tweet: The text of the tweet.

3. Methodology

3.1. Text Preprocessing

• Removed URLs, HTML tags, and special characters using regular expressions.

• Stripped extra whitespace.

• Ensured all text was converted to lowercase.
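
The cleaning steps above can be sketched with Python's standard re module. This is a minimal illustration, not the report's actual code: the patterns, the function name clean_tweet, and the choice to treat everything except lowercase letters, digits, and whitespace as a "special character" are all assumptions. Lowercasing is done first here so the later patterns only need to handle lowercase text.

```python
import re

# Assumed patterns; the report does not list the exact expressions it used.
URL_RE = re.compile(r"https?://\S+|www\.\S+")   # http(s) links and bare www links
TAG_RE = re.compile(r"<[^>]+>")                 # HTML tags such as <b> or <br/>
SPECIAL_RE = re.compile(r"[^a-z0-9\s]")         # anything but letters, digits, whitespace
SPACE_RE = re.compile(r"\s+")                   # runs of whitespace


def clean_tweet(text: str) -> str:
    """Lowercase a tweet, drop URLs, HTML tags and special characters,
    then collapse extra whitespace."""
    text = text.lower()
    text = URL_RE.sub(" ", text)      # remove URLs before punctuation is stripped
    text = TAG_RE.sub(" ", text)
    text = SPECIAL_RE.sub(" ", text)
    return SPACE_RE.sub(" ", text).strip()


print(clean_tweet("Check <b>THIS</b> out!!  https://t.co/abc #Great"))
```

Note that a pattern this strict also splits contractions ("don't" becomes "don t") and removes emoji, both of which can carry sentiment, so the definition of "special characters" is worth tuning for tweets.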

3.2. Tokenization

• Used the DistilBertTokenizer to tokenize the text data, ensuring padding and
truncation to a fixed length of 128 tokens.
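
To make the effect of padding and truncation concrete, the following pure-Python sketch shows what happens to a sequence of token ids when it is forced to a fixed length of 128. It is an illustration only and does not call the real tokenizer: the function name pad_or_truncate and the padding id are placeholders. With the Hugging Face tokenizer, the same behavior is typically requested with tokenizer(texts, padding="max_length", truncation=True, max_length=128).

```python
MAX_LEN = 128   # fixed sequence length stated in the report
PAD_ID = 0      # placeholder padding id used for this illustration


def pad_or_truncate(token_ids, max_len=MAX_LEN, pad_id=PAD_ID):
    """Return (input_ids, attention_mask), both exactly max_len long.

    Simplification: a real tokenizer keeps the closing special token when it
    truncates; this sketch simply cuts the sequence off.
    """
    ids = list(token_ids[:max_len])          # truncate sequences that are too long
    mask = [1] * len(ids)                    # 1 marks a real token
    n_pad = max_len - len(ids)
    return ids + [pad_id] * n_pad, mask + [0] * n_pad   # 0 marks padding


input_ids, attention_mask = pad_or_truncate([101, 2023, 2003, 102])
print(len(input_ids), sum(attention_mask))
```

The attention mask is what lets the model ignore the padded positions, so short tweets are not penalized for the filler tokens.
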
3.3. Data Splitting

• Split the dataset into training (70%) and testing (30%) sets using train_test_split().

• Encoded labels into numeric values using LabelEncoder.
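
The two steps above can be mimicked with the standard library to show what they produce. This is a sketch under stated assumptions, not the report's code: encode_labels imitates LabelEncoder by assigning indices in sorted class order, and split_70_30 is a simplified stand-in for train_test_split() (the helper names, the seed, and the rounding rule are all hypothetical).

```python
import random


def encode_labels(labels):
    """Map each distinct label to an index in sorted order, so that
    -1 -> 0, 0 -> 1, 1 -> 2 for the label scheme used in this dataset."""
    classes = sorted(set(labels))
    index = {label: i for i, label in enumerate(classes)}
    return [index[label] for label in labels], classes


def split_70_30(rows, test_size=0.3, seed=42):
    """Shuffle rows reproducibly and return (train, test) in a 70/30 split."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_test = round(len(rows) * test_size)   # simplified rounding rule
    return rows[n_test:], rows[:n_test]


encoded, classes = encode_labels([-1, 0, 1, 0])
train, test = split_70_30(range(10))
print(encoded, classes, len(train), len(test))
```

Encoding matters here because the original labels include -1, which cannot be used directly as a class index by a three-way classifier.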

3.4. Model Selection

• Selected DistilBERT (distilbert-base-uncased) for its efficiency and accuracy in text
classification tasks.

• Added a classification head with three output neurons for multi-class classification.
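
A classification head with three output neurons produces three raw scores (logits) per tweet, which are turned into probabilities and then into a label. The sketch below shows only that final step in plain Python. The label order in ID_TO_LABEL is an assumption (it presumes the encoded classes follow the sorted order -1, 0, 1), and the logits are made-up numbers. With the transformers library, such a head is commonly obtained via DistilBertForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=3); the report does not show which call it used.

```python
import math

# Assumed mapping from encoded class index to sentiment name.
ID_TO_LABEL = {0: "negative", 1: "neutral", 2: "positive"}


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)                               # subtract max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(logits):
    """Return the most probable label and the full probability vector."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID_TO_LABEL[best], probs


label, probs = predict([0.1, 0.2, 2.5])   # hypothetical logits for one tweet
print(label)
```
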
3.5. Training

• Optimizer: AdamW with a learning rate of 2e-5.

• Batch Size: 16.

• Epochs: 3.

• Used the training dataset to fine-tune the pre-trained DistilBERT model.
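
The relationship between these hyperparameters and the amount of training can be sketched as follows. The helper names are hypothetical and the training-set size of 1,000 is a made-up figure used only for the arithmetic, since the report does not state the dataset size. In PyTorch the optimizer itself would typically be created with torch.optim.AdamW(model.parameters(), lr=2e-5).

```python
import math

LEARNING_RATE = 2e-5   # would be passed to the AdamW optimizer
BATCH_SIZE = 16
EPOCHS = 3


def iter_batches(n_examples, batch_size=BATCH_SIZE):
    """Yield index ranges for consecutive mini-batches; the last may be smaller."""
    for start in range(0, n_examples, batch_size):
        yield range(start, min(start + batch_size, n_examples))


def total_steps(n_examples, batch_size=BATCH_SIZE, epochs=EPOCHS):
    """Number of optimizer updates: one per mini-batch, repeated every epoch."""
    return math.ceil(n_examples / batch_size) * epochs


# Hypothetical training-set size, for illustration only.
print(total_steps(1000))
```

Knowing the total number of steps is useful when a learning-rate schedule (for example, linear warmup and decay) is added on top of the fixed rate listed above.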

Conclusion:
This project successfully built a sentiment analysis model using DistilBERT,
achieving an accuracy of 85%. With further improvements and dataset
augmentation, the model’s performance can be enhanced for real-world
applications.
