0% found this document useful (0 votes)

12 views16 pages

SML 1

Uploaded by

freefire1523143

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views16 pages

SML 1

Uploaded by

freefire1523143

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Sentiment Analysis

Project Based Learning (PBL) Report

for the course
Statistics for Machine Learning – 20MA32L01

BACHELOR OF TECHNOLOGY

COMPUTER SCIENCE AND ENGINEERING

By
23R15A0525- S.Akshara
23R15A0524- R.Navaneetha
23R15A0523-P.Envitha

Under the guidance of

Dr. A. Srinivasulu

Department of Computer Science and Engineering

Accredited by NBA

Geethanjali College of Engineering and Technology

(UGC Autonomous)
(Affiliated to J.N.T.U.H, Approved by AICTE, New Delhi)
Cheeryal (V), Keesara (M), Medchal.Dist.-501 301.
JUNE-2025
TABLE OF CONTENTS

S.No. Contents Page No

1 ACKNOWLEDGEMENT 1

2 ABSTRACT 2

3 INTRODUCTION 3

4 SYSTEM DESIGN 7

5 IMPLEMENTATION 8

6 SAMPLE CODE 9

7 OUTPUT SCREENS 11

8 CONCLUSION 13

9 REFERENCES 14
ACKNOWLEDGEMENT
We would like to acknowledge and give my warmest thanks to our faculty Dr. A. srinivasulu
sir who made this work possible. Their guidance and advice carried us through all the stages
of writing our project. We would also like to thank our classmates for letting our defence
be an enjoyable moment, and for your brilliant comments and suggestions, thanks to you.
We would also like to give special thanks to our families as a whole for their continuous
support and understanding when undertaking my research and writing my project and
providing the required equipment. The project would not have been successful without their
Cooperation and inputs.

1
ABSTRACT

This project presents a foundational approach to sentiment analysis using deep learning with
TensorFlow and Keras. Sentiment analysis is a key task in Natural Language Processing (NLP) that
involves determining the emotional tone behind text data. The goal of this project is to classify short
sentences into positive or negative sentiments. A small custom dataset is created with labeled
sentences expressing either positive or negative emotions. The preprocessing phase includes
tokenizing the text using Keras’ Tokenizer and padding the sequences to ensure uniform input
lengths for the neural network.

The model is a simple sequential neural network that processes the tokenized text and learns to
identify sentiment patterns. It is trained using binary classification techniques to predict whether a
given sentence has a positive (label 1) or negative (label 0) sentiment. Through this implementation,
the project demonstrates key steps such as data preparation, text vectorization, neural network
construction, and training for binary sentiment classification.

Although this is a basic implementation, it effectively introduces important concepts in text

classification and serves as a practical guide for beginners in machine learning and NLP. The project
can be further extended by incorporating a larger dataset, using word embeddings, or applying more
complex deep learning architectures like LSTM or GRU.

2
INTRODUCTION

About the project

In today's digital era, vast amounts of textual data are generated daily through social media, product
reviews, forums, and other online platforms. Analyzing the sentiment behind this text helps
businesses, researchers, and developers understand public opinion, improve customer experiences,
and make informed decisions. Sentiment analysis, a subfield of Natural Language Processing (NLP),
involves determining whether a piece of text expresses a positive, negative, or neutral sentiment.

This project aims to build a basic sentiment analysis model using Python and deep learning libraries
such as TensorFlow and Keras. The model classifies short sentences into binary categories: positive
or negative sentiment. A small, manually created dataset is used to train the model, with each
sentence labeled accordingly. The workflow includes text preprocessing through tokenization and
sequence padding, followed by the development and training of a neural network model.

The purpose of this project is to provide a practical and educational implementation of sentiment
analysis suitable for beginners. It introduces core concepts such as text-to-sequence conversion,
neural network design, and binary classification. While the model is simple, it lays the foundation
for more advanced techniques in NLP. The project can be enhanced by using larger datasets, pre-
trained word embeddings, or more sophisticated models like LSTMs or Transformers.

Project outcomes and objectives

1. Understand Sentiment Analysis Concepts

To gain a clear understanding of what sentiment analysis is and how it is used in real-world
applications.

3
2. Implement Text Preprocessing Techniques
To learn and apply preprocessing steps such as tokenization and padding, preparing raw
text for deep learning models.
3. Develop a Binary Sentiment Classification Model
To build a neural network using TensorFlow and Keras that can classify text into positive
or negative sentiments.
4. Train and Evaluate the Model
To train the model on a small labeled dataset and assess its performance using appropriate
metrics.
5. Provide a Simple Educational NLP Solution
To create a beginner-friendly implementation that demonstrates the basic steps in building
a text classification model using deep learning.

Project Outcomes

 A working sentiment analysis model capable of predicting positive or negative sentiment

from short text inputs.
 A clear understanding of how to preprocess textual data for machine learning.
 Practical experience in building and training neural networks using Keras.
 A foundation for more advanced NLP tasks such as multi-class sentiment analysis, emotion
detection, or model deployment.
 A Jupyter Notebook that serves as a learning tool for future NLP or machine learning
projects.

4
Key Features

1. Text Preprocessing Module

Function:
Prepares raw textual data for analysis by converting text into a numerical format usable by machine
learning models.

Key Features:

 Tokenization: Converts words into integer sequences using Keras Tokenizer.

 Padding: Ensures consistent input lengths using pad_sequences.
 Vocabulary Generation: Builds a word index to maintain consistent encoding.

2. Sentiment Classification Model Module

Function:
Trains a deep learning model to classify text inputs as positive or negative sentiments.

Key Features:

 Neural Network Structure: Uses Embedding, Flatten, and Dense layers for classification.
 Training with Labels: Learns sentiment patterns from labeled training data.
 Binary Output: Outputs a probability between 0 and 1 (positive vs. negative sentiment).

3. Inference and Prediction Module

Function:
Uses the trained model to predict sentiment for new, unseen text inputs.

5
Key Features:

 Dynamic Input Handling: Accepts new text, tokenizes, and pads based on the original
tokenizer.
 Probability Output: Returns the model’s confidence score for positive or negative
sentiment.
 Decision Thresholding: Applies a threshold (e.g., > 0.5 is positive) to decide final output.

4. Decision-Making Module

Function:
Interprets model outputs to make final sentiment decisions and guide responses.

Key Features:

 Threshold-Based Classification: Converts prediction probabilities into labels.

 Confidence Assessment: Optionally outputs prediction certainty to the user.
 Rule-Based Actions: Could trigger different responses or actions based on sentiment (e.g.,
alert on negative sentiment).

5. User Interface (Optional/Future Module)

Function:
Provides a user-friendly interface for inputting text and viewing sentiment results.

Key Features:

 Input Box for Text: Allows users to enter custom sentences.

 Prediction Display: Shows whether the sentiment is positive or negative along with
confidence.

6
SYSTEM DESIGN

Software Requirements

Operating System : Microsoft Windows

Software Name : Jupyter Notebook

Type : IDE

Developers : Fernando Pérez

Hardware Requirements

Device name : DESKTOP-0OGA6I1

Processor : AMD Ryzen 3 3250U with Radeon Graphics 2.60 GHz

Installed RAM : 8.00 GB (5.94 GB usable)

Device ID : 76DDCDEE-6C4D-43DB-99D9-4E23080623F7

System type : 64-bit operating system, x64-based processor

7
IMPLEMENTATION
Modules Implementation

Text Preprocessing Module

# Define input sentences and labels
sentences = [
"I love this product",
"This is the best thing ever",
"Absolutely fantastic experience",
"I hate this",
"This is the worst",
"Terrible and disappointing"
]
labels = [1, 1, 1, 0, 0, 0]
# Tokenization
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

tokenizer = Tokenizer()
tokenizer.fit_on_texts(sentences)
sequences = tokenizer.texts_to_sequences(sentences)
# Padding
padded_sequences = pad_sequences(sequences, padding='post')

Sentiment Classification Model Module

# Define features and labels
X = padded_sequences
y = labels
# Build model
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, Flatten, Dense

model = Sequential()
model.add(Embedding(input_dim=len(tokenizer.word_index)+1, output_dim=8,
input_length=X.shape[1]))
model.add(Flatten())
model.add(Dense(16, activation='relu'))
model.add(Dense(1, activation='sigmoid'))
# Compile and train model
model.compile(optimizer='adam', loss='binary_crossentropy',
metrics=['accuracy'])
model.fit(X, y, epochs=10)

8
Inference and Prediction Module
# Predict sentiment of new text
new_text = ["I really enjoy this"]
new_seq = tokenizer.texts_to_sequences(new_text)
new_pad = pad_sequences(new_seq, maxlen=X.shape[1], padding='post')
prediction = model.predict(new_pad)
print("Positive" if prediction[0][0] > 0.5 else "Negative")

Sample Code

import numpy as np

texts = ["I love programming",

"Python is awesome",
"I hate bugs",
"Debugging is fun",
"I love solving problems",
"I don't like errors"]
labels = [1, 1, 0, 1, 1, 0]

from tensorflow.keras.preprocessing.text import Tokenizer

from tensorflow.keras.preprocessing.sequence import pad_sequences

tokenizer = Tokenizer()
tokenizer.fit_on_texts(texts)

sequences = tokenizer.texts_to_sequences(texts)
sequences

texts

['I love programming',

'Python is awesome',
'I hate bugs',
'Debugging is fun',
'I love solving problems',
"I don't like errors"]

9
max_length = max([len(sequence) for sequence in sequences])
max_length

X = pad_sequences(sequences, maxlen=max_length, padding='post')

y = np.array(labels)
y

from tensorflow.keras.models import Sequential

from tensorflow.keras.layers import Embedding, Dense, Flatten

model = Sequential()
model.add(Embedding(input_dim=len(tokenizer.word_index) + 1,
output_dim=8,
input_length=max_length))

model.add(Flatten())
model.add(Dense(10, activation='relu'))

model.add(Dense(1, activation='sigmoid'))
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

model.fit(X, y, epochs=20, batch_size=2)

sample_text = "i love programming"

sample_sequence = tokenizer.texts_to_sequences([sample_text]) # Tokenize the sample text
sample_padded = pad_sequences(sample_sequence, maxlen=max_length, padding='post') # Pad
the sequence
prediction = model.predict(sample_padded)
if prediction > 0.5:
print('positive')
else:
print('negative')
print(prediction[0][0])

10
Sample Output

11
12
CONCLUSION

This sentiment analysis project successfully demonstrates how natural language processing (NLP)
and deep learning techniques can be applied to classify text data into positive or negative sentiments.
Through a structured and modular approach, the project walks through the essential stages of
preprocessing raw text using tokenization and padding, building and training a neural network model
with Keras, and making predictions on new text inputs.

Despite using a small custom dataset for simplicity, the project effectively highlights the workflow
of a typical machine learning pipeline—from data preparation to inference. The use of embedding
layers enables the model to understand word relationships, while the binary classification output
provides an interpretable result.

This beginner-friendly project serves as a strong foundation for more complex NLP tasks. It can be
extended further by incorporating larger and real-world datasets, more advanced model architectures
like LSTM or BERT, and deploying the model through a user interface using frameworks such as
Flask or Streamlit.

Overall, the project not only accomplishes its goal of performing basic sentiment analysis but also
offers valuable insights into the end-to-end development of an AI-based text classification system.

13
REFERENCES
https://www.eneuro.org https://www.geeksforgeeks.org/python-programming-
language/ https://stackoverflow.com/

https://ailocallist.com

Sentiment Analysis for Students
No ratings yet
Sentiment Analysis for Students
26 pages
NLP Final Mini Project
No ratings yet
NLP Final Mini Project
17 pages
Praveen Phase 3
No ratings yet
Praveen Phase 3
6 pages
Mini Project
No ratings yet
Mini Project
16 pages
AI Report Shivam
No ratings yet
AI Report Shivam
8 pages
Complete Report
No ratings yet
Complete Report
56 pages
Project Report - M13 Sentiment Analyzer
No ratings yet
Project Report - M13 Sentiment Analyzer
9 pages
Experiment 6
No ratings yet
Experiment 6
3 pages
Shivamani
No ratings yet
Shivamani
63 pages
Sentiment Analysis Report
No ratings yet
Sentiment Analysis Report
31 pages
Restaurant Review Production Analysis Using Python
No ratings yet
Restaurant Review Production Analysis Using Python
33 pages
Synopsis 6th Sem
No ratings yet
Synopsis 6th Sem
5 pages
Sentiment Analysis with LSTM
No ratings yet
Sentiment Analysis with LSTM
38 pages
RES Presentation
No ratings yet
RES Presentation
21 pages
NLP Project (Documentation)
No ratings yet
NLP Project (Documentation)
8 pages
ISSS609 Project Proposal Group 7
No ratings yet
ISSS609 Project Proposal Group 7
8 pages
Project Report 2023
No ratings yet
Project Report 2023
32 pages
Sentimental Analysis of Twitter Using Emoji: A Creative and Innovative Project Report
No ratings yet
Sentimental Analysis of Twitter Using Emoji: A Creative and Innovative Project Report
19 pages
Predicting The Reviews of The Restaurant Using Natural Language Processing Technique
No ratings yet
Predicting The Reviews of The Restaurant Using Natural Language Processing Technique
4 pages
NM Project Report-Sentiment Analysis-2
No ratings yet
NM Project Report-Sentiment Analysis-2
36 pages
Twitter Sentiment Analysis Project Report
No ratings yet
Twitter Sentiment Analysis Project Report
15 pages
Machine Learning Sentiment Analysis
No ratings yet
Machine Learning Sentiment Analysis
8 pages
Maneesha Nidigonda Verzeo Major Project
No ratings yet
Maneesha Nidigonda Verzeo Major Project
11 pages
Software Engineering - Documentation 02023
No ratings yet
Software Engineering - Documentation 02023
9 pages
Report Sentiment Analysis Using NLP and Deep Learning
No ratings yet
Report Sentiment Analysis Using NLP and Deep Learning
65 pages
Review Analysis and Sentiment Learning Using NLP
No ratings yet
Review Analysis and Sentiment Learning Using NLP
15 pages
Comment Analyser Thesis
No ratings yet
Comment Analyser Thesis
63 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
9 pages
Project Review
No ratings yet
Project Review
17 pages
Sentiment Analysis Using NLP
No ratings yet
Sentiment Analysis Using NLP
42 pages
Sentiment Analysis
100% (1)
Sentiment Analysis
35 pages
Maneesha Nidigonda Major Project
No ratings yet
Maneesha Nidigonda Major Project
11 pages
Internship Presentation
No ratings yet
Internship Presentation
16 pages
GR22
No ratings yet
GR22
8 pages
Theolaaaa4273 Merged
No ratings yet
Theolaaaa4273 Merged
76 pages
1-5 Cs PDH
No ratings yet
1-5 Cs PDH
5 pages
Project Report
No ratings yet
Project Report
42 pages
NLP Exp1
No ratings yet
NLP Exp1
5 pages
Harsh Internship
No ratings yet
Harsh Internship
18 pages
Sentiment Analysis of Tweets Project
No ratings yet
Sentiment Analysis of Tweets Project
15 pages
Text Classification - Movie Review - News Wires
No ratings yet
Text Classification - Movie Review - News Wires
5 pages
21bce3701 Senti K9ar
No ratings yet
21bce3701 Senti K9ar
28 pages
Deep Learning Based Sentiment
No ratings yet
Deep Learning Based Sentiment
62 pages
Document Movie Review
No ratings yet
Document Movie Review
31 pages
Group 10 Data Science Project Report (Sentiment Analysis)
No ratings yet
Group 10 Data Science Project Report (Sentiment Analysis)
23 pages
Keywords::Sentimental Analysis, Naive Bayes, Support Vector Machine
No ratings yet
Keywords::Sentimental Analysis, Naive Bayes, Support Vector Machine
44 pages
Aditya, Aditya and Abishek
No ratings yet
Aditya, Aditya and Abishek
15 pages
Quantum Techniques in Sentiment Analysis
No ratings yet
Quantum Techniques in Sentiment Analysis
103 pages
Minor Project Presentation
No ratings yet
Minor Project Presentation
16 pages
Sentiment Analysis SRS
No ratings yet
Sentiment Analysis SRS
9 pages
BERT for Social Media Sentiment Analysis
No ratings yet
BERT for Social Media Sentiment Analysis
34 pages
Eeg Based Emotion Classification Using Deep Learning Models
No ratings yet
Eeg Based Emotion Classification Using Deep Learning Models
4 pages
Machine Learning Presentation
No ratings yet
Machine Learning Presentation
20 pages
NLP Project Report NLP Project Report
No ratings yet
NLP Project Report NLP Project Report
48 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
14 pages
Chapter One
No ratings yet
Chapter One
6 pages
Analyzing Sentiment Using IMDb Dataset
No ratings yet
Analyzing Sentiment Using IMDb Dataset
4 pages
Anna University: Chennai 600 025
No ratings yet
Anna University: Chennai 600 025
10 pages
Quantitative Analysis of The Impact of Social Media Usage On The Educational Quality of BSIT Students in CDSCDB
No ratings yet
Quantitative Analysis of The Impact of Social Media Usage On The Educational Quality of BSIT Students in CDSCDB
18 pages
Grade Level Section Filipino Class Reading Profile Summary Posttest
No ratings yet
Grade Level Section Filipino Class Reading Profile Summary Posttest
13 pages
Answers For Portfolio in PT
67% (6)
Answers For Portfolio in PT
230 pages
Rachel Mawuenyegah
No ratings yet
Rachel Mawuenyegah
2 pages
Margie's Mechanical Teacher Issues
No ratings yet
Margie's Mechanical Teacher Issues
2 pages
14 Learner Centered Principle
No ratings yet
14 Learner Centered Principle
2 pages
Vietnam War Protest Lesson Plan
No ratings yet
Vietnam War Protest Lesson Plan
4 pages
Ancient Greek Teaching Methods Explained
No ratings yet
Ancient Greek Teaching Methods Explained
4 pages
6th Grade Science: Understanding Gravity
No ratings yet
6th Grade Science: Understanding Gravity
5 pages
Experiment No: 8 TITTLE: Write A Program To Solve 8 Puzzle Problems. Exercise
No ratings yet
Experiment No: 8 TITTLE: Write A Program To Solve 8 Puzzle Problems. Exercise
3 pages
Teaching As A Profession
No ratings yet
Teaching As A Profession
10 pages
Fls BPP
No ratings yet
Fls BPP
3 pages
1.simple Interview Evaluation Form Template
No ratings yet
1.simple Interview Evaluation Form Template
2 pages
Grade 10 - English Curriculum Map
No ratings yet
Grade 10 - English Curriculum Map
4 pages
Elements of School Medicine
No ratings yet
Elements of School Medicine
20 pages
English 8 Quarter 3 Learning Competencies
No ratings yet
English 8 Quarter 3 Learning Competencies
2 pages
DLL Tle8nailcare q2 w1 w4
100% (1)
DLL Tle8nailcare q2 w1 w4
12 pages
Teaching Techniques for Educators
No ratings yet
Teaching Techniques for Educators
11 pages
SPED 405 Assignment A
No ratings yet
SPED 405 Assignment A
3 pages
Grade 10 Lesson Plan: National Parks
No ratings yet
Grade 10 Lesson Plan: National Parks
4 pages
EPP-ICT Accomplishment Report 2022-2023
No ratings yet
EPP-ICT Accomplishment Report 2022-2023
33 pages
Abu Shahid Salman: AI & Leadership Profile
No ratings yet
Abu Shahid Salman: AI & Leadership Profile
2 pages
Smarter Law Transforming Busy Lawyers Into Business Leaders Official Test Bank
No ratings yet
Smarter Law Transforming Busy Lawyers Into Business Leaders Official Test Bank
408 pages
Deconstructing The Educationindustrial Complex in The Digital Age Douglas Loveless Instant Download
100% (3)
Deconstructing The Educationindustrial Complex in The Digital Age Douglas Loveless Instant Download
80 pages
Introduction to Educational Research
No ratings yet
Introduction to Educational Research
2 pages
Richards & Rodgers CH 2 Nature-Of-Approaches-And-methods-In-language
100% (1)
Richards & Rodgers CH 2 Nature-Of-Approaches-And-methods-In-language
31 pages
Lesson Plan Wishes
100% (1)
Lesson Plan Wishes
4 pages
Callista Amanda, Widi Hadiyanti-Utilizing Memes For Vocabulary Learning
No ratings yet
Callista Amanda, Widi Hadiyanti-Utilizing Memes For Vocabulary Learning
16 pages
Audio Text Reading with SLiCK
100% (1)
Audio Text Reading with SLiCK
2 pages
SPG Training Proposal
No ratings yet
SPG Training Proposal
3 pages

SML 1

Uploaded by

SML 1

Uploaded by

Sentiment Analysis

Project Based Learning (PBL) Report

COMPUTER SCIENCE AND ENGINEERING

Under the guidance of

Department of Computer Science and Engineering

Geethanjali College of Engineering and Technology

S.No. Contents Page No

Although this is a basic implementation, it effectively introduces important concepts in text

About the project

Project outcomes and objectives

1. Understand Sentiment Analysis Concepts

 A working sentiment analysis model capable of predicting positive or negative sentiment

1. Text Preprocessing Module

 Tokenization: Converts words into integer sequences using Keras Tokenizer.

2. Sentiment Classification Model Module

3. Inference and Prediction Module

 Threshold-Based Classification: Converts prediction probabilities into labels.

5. User Interface (Optional/Future Module)

 Input Box for Text: Allows users to enter custom sentences.

Operating System : Microsoft Windows

Software Name : Jupyter Notebook

Developers : Fernando Pérez

Device name : DESKTOP-0OGA6I1

Processor : AMD Ryzen 3 3250U with Radeon Graphics 2.60 GHz

Installed RAM : 8.00 GB (5.94 GB usable)

System type : 64-bit operating system, x64-based processor

Text Preprocessing Module

Sentiment Classification Model Module

texts = ["I love programming",

from tensorflow.keras.preprocessing.text import Tokenizer

['I love programming',

X = pad_sequences(sequences, maxlen=max_length, padding='post')

from tensorflow.keras.models import Sequential

model.fit(X, y, epochs=20, batch_size=2)

sample_text = "i love programming"

You might also like