Sentiment_analysis
In [17]: import nltk
import pandas as pd
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
from nltk.stem import WordNetLemmatizer
import re
In [18]: nltk.download('punkt')
nltk.download('stopwords')
nltk.download('wordnet')
[nltk_data] Downloading package punkt to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package punkt is already up-to-date!
[nltk_data] Downloading package stopwords to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package stopwords is already up-to-date!
[nltk_data] Downloading package wordnet to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package wordnet is already up-to-date!
Out[18]: True
Method 1
In [19]: df = pd.read_csv('reviews.csv', usecols=['body'])  # placeholder filename; the original was lost in export
lemma = WordNetLemmatizer()
stop_words = stopwords.words('english')
In [20]: def text_prep(x):
    corp = str(x).lower()
    corp = re.sub('[^a-zA-Z]+', ' ', corp).strip()
    tokens = word_tokenize(corp)
    words = [t for t in tokens if t not in stop_words]
    lemmatize = [lemma.lemmatize(w) for w in words]
    return lemmatize
In [22]: preprocess_tag = [text_prep(i) for i in df['body']]
df["preprocess_txt"] = preprocess_tag
df['total_len'] = df['preprocess_txt'].map(lambda x: len(x))
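A quick sanity check of text_prep on a short string (a hypothetical cell, not in the original run; the expected tokens follow from the steps above: lowercase, strip non-letters, drop stopwords, lemmatize):
In [ ]: text_prep('I had the Samsung A600 for awhile')
# expected: ['samsung', 'awhile'] ('A600' loses its digits and the leftover 'a' is a stopword)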
In [24]: # positive/negative word lists (placeholder filenames; the originals were lost in export)
file = open('negative-words.txt', 'r')
neg_words = file.read().split()
file = open('positive-words.txt', 'r')
pos_words = file.read().split()
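Membership tests against a Python list scan the whole list for every token; converting the lexicons to sets makes each lookup O(1). An optional tweak, assuming the variables above:
In [ ]: neg_words = set(neg_words)
pos_words = set(pos_words)
# the counting cell below works unchanged, just faster on large corpora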
In [27]: num_pos = df['preprocess_txt'].map(lambda x: len([i for i in x if i in pos_words]))
df['pos_count'] = num_pos
num_neg = df['preprocess_txt'].map(lambda x: len([i for i in x if i in neg_words]))
df['neg_count'] = num_neg
df['sentiment'] = round((df['pos_count'] - df['neg_count']) / df['total_len'], 2)
df.head()
Out[27]:
                                                body                                     preprocess_txt  total_len  pos_count  neg_count  sentiment
0  I had the Samsung A600 for awhile which is abs...  [samsung, awhile, absolute, doo, doo, read, re...        162         18         18       0.00
1  Due to a software issue between Nokia and Spri...  [due, software, issue, nokia, sprint, phone, t...         67          8          3       0.07
2  This is a great, reliable phone. I also purcha...  [great, reliable, phone, also, purchased, phon...         68         10          4       0.09
3  I love the phone and all, because I really did...  [love, phone, really, need, one, expect, price...         41          3          0       0.07
4  The phone has been great for every purpose it ...  [phone, great, every, purpose, offer, except, ...         56          5          3       0.04
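Method 1 scores each review as (pos_count - neg_count) / total_len, rounded to two decimals, so values near zero mean balanced or sparse sentiment words. A hand-check against row 1 of the table:
In [ ]: round((8 - 3) / 67, 2)
# -> 0.07, matching the 'sentiment' column above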
Method 2
In [28]: df['sentiment'] = round(df['pos_count'] / (df['neg_count'] + 1), 2)
df.head()
Out[28]:
                                                body                                     preprocess_txt  total_len  pos_count  neg_count  sentiment
0  I had the Samsung A600 for awhile which is abs...  [samsung, awhile, absolute, doo, doo, read, re...        162         18         18       0.95
1  Due to a software issue between Nokia and Spri...  [due, software, issue, nokia, sprint, phone, t...         67          8          3       2.00
2  This is a great, reliable phone. I also purcha...  [great, reliable, phone, also, purchased, phon...         68         10          4       2.00
3  I love the phone and all, because I really did...  [love, phone, really, need, one, expect, price...         41          3          0       3.00
4  The phone has been great for every purpose it ...  [phone, great, every, purpose, offer, except, ...         56          5          3       1.25
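Method 2 instead takes the ratio pos_count / (neg_count + 1); the +1 guards against division by zero when a review has no negative words. A hand-check against row 0:
In [ ]: round(18 / (18 + 1), 2)
# -> 0.95, matching the 'sentiment' column above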
In [30]: nltk.download('vader_lexicon')
[nltk_data] Downloading package vader_lexicon to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
Out[30]: True
Method 3
In [35]: from nltk.sentiment.vader import SentimentIntensityAnalyzer
sent = SentimentIntensityAnalyzer()
df = pd.read_csv('reviews.csv', usecols=['body'])  # same placeholder filename as above
df['body'] = df['body'].fillna('')
polarity = [round(sent.polarity_scores(str(i))['compound'], 2) for i in df['body']]
df['sentiment_score'] = polarity
print(df.head())
body sentiment_score
0 I had the Samsung A600 for awhile which is abs... 0.86
1 Due to a software issue between Nokia and Spri... 0.89
2 This is a great, reliable phone. I also purcha... 0.80
3 I love the phone and all, because I really did... 0.96
4 The phone has been great for every purpose it ... 0.77
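VADER's 'compound' value is a normalized score in [-1, 1]; polarity_scores also returns the 'neg', 'neu' and 'pos' proportions, which can be inspected directly (an illustrative cell; exact numbers depend on the lexicon version):
In [ ]: sent.polarity_scores('This is a great, reliable phone.')
# returns a dict with 'neg', 'neu', 'pos' and 'compound' keys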
Extra
In [54]: # Create WordNetLemmatizer object
wnl = WordNetLemmatizer()
# single-word lemmatization examples
list1 = ['kites', 'babies', 'dogs', 'flying', 'smiling',
         'driving', 'tried', 'feet']
for words in list1:
    print(words + " ---> " + wnl.lemmatize(words))
print('better' + " ---> " + wnl.lemmatize('better', pos='a'))
kites ---> kite
babies ---> baby
dogs ---> dog
flying ---> flying
smiling ---> smiling
driving ---> driving
tried ---> tried
feet ---> foot
better ---> good
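lemmatize defaults to pos='n' (noun), which is why 'flying', 'smiling', 'driving' and 'tried' come back unchanged above; passing the verb tag handles them, just as pos='a' handled 'better':
In [ ]: for w in ['flying', 'smiling', 'driving', 'tried']:
            print(w + " ---> " + wnl.lemmatize(w, pos='v'))
# flying ---> fly, smiling ---> smile, driving ---> drive, tried ---> try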
In [59]: sentence = 'I am good in cricket, but best in Football.'
# Tokenize the sentence
tokens = nltk.word_tokenize(sentence)
# Get English stopwords
english_stopwords = set(stopwords.words('english'))
# Filter out stopwords
filtered_tokens = [word for word in tokens if word.lower() not in english_stopwords]
print(filtered_tokens)
['good', 'cricket', ',', 'best', 'Football', '.']
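The comma and period survive because punctuation is not in the stopword list; a common follow-up (not part of the original notebook) is to also keep only alphabetic tokens:
In [ ]: filtered_tokens = [w for w in tokens if w.lower() not in english_stopwords and w.isalpha()]
print(filtered_tokens)
# expected: ['good', 'cricket', 'best', 'Football']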
In [60]: import nltk
from nltk.stem import PorterStemmer
# Sentence to stem
sentence = 'I am good in cricket, but best in Football.'
# Tokenize the sentence
tokens = nltk.word_tokenize(sentence)
# Initialize PorterStemmer
stemmer = PorterStemmer()
# Perform stemming on each token
stemmed_tokens = [stemmer.stem(word) for word in tokens]
print(stemmed_tokens)
['I', 'am', 'good', 'in', 'cricket', ',', 'but', 'best', 'in', 'footbal', '.']
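Stemming is a rule-based truncation rather than a dictionary lookup, which is why 'Football' becomes the non-word 'footbal'. A small side-by-side with the lemmatizer (a sketch reusing wnl from the cell above) makes the trade-off visible:
In [ ]: for w in ['football', 'tried', 'cricket']:
            print(w, '-> stem:', stemmer.stem(w), '| lemma:', wnl.lemmatize(w, pos='v'))
# e.g. 'tried' stems to 'tri' but lemmatizes to 'try'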