0% found this document useful (0 votes)
24 views15 pages

Sentiment Analysis of Tweets Project

Uploaded by

snehaguptavv
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views15 pages

Sentiment Analysis of Tweets Project

Uploaded by

snehaguptavv
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Shri Chhatrapati Shivaji Maharaj College of Engineering

DEPARTMENT OF COMPUTER ENGINEERING


ACADEMIC YEAR 2023-24

PROJECT REPORT
on

“Classify tweets into positive and negative tweets”


Submitted in partial fulfilment of the Third Year
Computer Engineering [2023-2024]

by
Aparna Gangishetty

UNDER THE GUIDANCE OF

Prof. N.B.Gade
( DEPARTMENT COMPUTER ENGINEERING)

Affiliated to:
Savitribai Phule Pune University

1
Department of Computer Engineering

CERTIFICATE

This is to certify that Miss. Aparna Gangishetty of class Third Year (2023-24) has
successfully completed Project on Classify tweets into positive and negative tweets under
the guidance of Prof.N.B.Gade in fulfilment of the requirement for the award of Degree in
Computer Engineering at Shri. Chhatrapati Shivaji Maharaj College of Engineering, Nepti,
Ahmednagar in the partial fulfilment of the Graduate Degree course in T.E. (Computer) at
the Department of Computer Engineering in the academic Year 2023-24 as prescribed by the
“Savitribai Phule Pune University”, Pune.

Prof.N.B.Gade. Prof.V.V.Jagtap. DR.Y.R.Kharde.

(Project Guide) (HOD) (Principal)

2
Abstract
This project addresses the problem of sentiment analysis in twitter; that is classifying
tweets according to the sentiment expressed in them: positive, negative or neutral.
Twitter is an online micro-blogging and social-networking platform which allows users
to write short status updates of maximum length 140 characters. It is a rapidly
expanding service with over 200 million registered users out of which 100 million are
active users and half of them log on twitter on a daily basis - generating nearly 250
million tweets per day. Due to this large amount of usage we hope to achieve a reflection
of public sentiment by analysing the sentiments expressed in the tweets. Analysing the
public sentiment is important for many applications such as firms trying to find out the
response of their products in the market, predicting political elections and predicting
socioeconomic phenomena like stock exchange. The aim of this project is to develop a
functional classifier for accurate and automatic sentiment classification of an unknown
tweet stream.

3
It is my privilege to acknowledge with deep sense of gratitude to my Project guide
Prof. N.B.Gade for her valuable suggestions and guidance throughout my course
of study and timely help given to me in completion of my project report.I express
my gratitude to Prof.V.V.Jagtap (HOD Computer Engineering Dept.) and
Dr.Y.R.Kharde(Principal) for their kind cooperation. I am highly obliged to the
entire staff of Computer Engineering Dept. for their kind cooperation and help. I
would also like to thank my parents who patiently helped me through my work.
Their support and faith are one of the reasons for the fulfilment of this Project..I’d
also like to extend special thanks to all my colleagues and friends for helping me
whenever needed. Also, a warm thanks to Library staff members for the
availability of the library facilities needed for preparing this report.Last but clearly
not the least I would thank the Almighty for giving me the strength to complete my
report on time.

Aparna Gangishetty

4
Contents

ABSTRACT

ACKNOWLEDGEMENT

1. INTRODUCTION ................................................................ 01

2. ARCHITECTURE ................................................................. 02
2.1 ER-Diagram .......................................................................... 02

3. PROPOSED SYSTEM ............................................................ 03


3.1 Problem statement. .................................................................. 03
3.2 Hardware Requirement. .......................................................... 03
3.3 Software Requirement. ............................................................ 03

3.4 Outcome……………………………………………………..

4. SNAPSHOTS. ............................................................................................. 04
5. CONCLUSION ........................................................................................... 09 6.
REFERENCES. .......................................................................................... 10

5
Chapter 1
Introduction

Domain Introduction

This project of analyzing sentiments of tweets comes under the domain of “Pattern
Classification” and “Data Mining”. Both of these terms are very closely related and
intertwined, and they can be formally defined as the process of discovering “useful”
patterns in large set of data, either automatically (unsupervised) or semi- automatically
(supervised). The project would heavily rely on techniques of “Natural Language
Processing” in extracting significant patterns and features from the large data set of tweets
and on “Machine Learning” techniques for accurately classifying individual unlabelled
data samples (tweets) according to whichever pattern model best describes them.The
features that can be used for modeling patterns and classification can be divided into two
main groups: formal language based and informal blogging based. Language based
features are those that deal with formal linguistics and include prior sentiment polarity of
individual words and phrases, and parts of speech tagging of the sentence. Prior sentiment
polarity means that some words and phrases have a natural innate tendency for expressing
particular and specific sentiments in general. For example the word “excellent” has a
strong positive connotation while the word “evil” possesses a strong negative connotation.
So whenever a word with positive connotation is used in a sentence, chances are that the
entire sentence would be expressing a positive sentiment. Parts of Speech tagging, on the
other hand, is a syntactical approach to the problem. It means to automatically identify
which part of speech each individual word of a sentence belongs.

6
Chapter 2
Architecture

Diagram:

7
Chapter 3

Proposed System

3.1 Problem Statement:

Use the following dataset and classify tweets into positive and negative tweets.
https://www.kaggle.com/ruchi798/data-science-tweets.

3.2 Hardware Requirement:


•A PC with Windows/Linux OS
• Processor with 1.7-2.4gHz speed
• Minimum of 8gb RAM 2gb Graphic card

3.3 Software Requirement:


• Text Editor (VS-code/WebStorm)
• Anaconda distribution package (PyCharm Editor)
• Python libraries Anaconda distribution:
Anaconda is a free and open-source distribution of the Python programming
languages for scientific computing (data science, machine learning applications,
large-scale data processing, predictive analytics, etc.), that aims to simplify package
management system and deployment. Package versions are managed by the package
management system conda. The anaconda distribution includes data- science
packages suitable for Windows, Linux and MacOS.3 Python libraries:

For the computation and analysis we need certain python libraries which are
used to perform analytics. Packages such as SKlearn, Numpy, pandas,
Matplotlib, Flask framework, etc are needed

8
Chapter 4
Snapshots

1.

9
2.

10
11
3.

4.

12
13
Chapter 5
Conclusion

The task of sentiment analysis, especially in the domain of micro-bloging, is still


in the developing stage and far from complete. So we propose a couple of ideas which we
feel are worth exploring in the future and may result in further improved performance.

Right now we have worked with only the very simplest unigram models; we can
improve those models by adding extra information like closeness of the word with a
negation word. We could specify a window prior to the word (a window could for example
be of 2 or 3 words) under consideration and the effect of negation may be incorporated
into the model if it lies within that window. The closer the negation word is to the unigram
word whose prior polarity is to be calculated, the more it should affect the polarity. For
example if the negation is right next to the word, it may simply reverse the polarity of that
word and farther the negation is from the word the more minimized ifs effect should be.

14
References

1. https://www.analyticsvidhya.com/blog/2021/06/twitter-sentiment-analysis-a-nlp-use-case- for-
beginners/#:~:text=Twitter%20sentiment%20analysis%20analyzes%20the%20sentiment%2
0or%20emotion,dataset%20related%20to%20a%20particular%20topic%20or%20event.
2. https://www.geeksforgeeks.org/twitter-sentiment-analysis-using-python/
3. https://www.bing.com/images/search?q=er+diagram+of+classification+of+tweets+into+posi
tive+negative&qpvt=er+diagram+of+classification+of+tweets+into+positive+negative&FO
RM=IGRE

15

You might also like