Ai Phase - 1

The document outlines a sentiment analysis project aimed at understanding customer evaluations of competitor products using NLP techniques. It discusses the use of a dataset from Kaggle containing tweets about U.S. airlines, detailing steps for data preprocessing, feature extraction, and visualization of sentiment distribution. Additionally, it highlights insights that can be derived from the analysis to inform business decisions and improve customer satisfaction.

Uploaded by

Sakthi Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views21 pages

Ai Phase - 1

Uploaded by

Sakthi Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 21

NAAN MUDHALVAN

PROJECT TITLE: Sentiment Analysis

Problem statement:
This type of project can show you what working as an NLP
specialist is like. For this project, you want to find out how
customers evaluate competitor products, i.e., what they
likes and dislikes. It’s a great business case. Learning
what customers like about competing products can be a
great way to improve your own product, so this is
something that many companies are actively trying to do.
Employ different NLP methods to better understand
customer feedback and opinion.
Dataset Link:
https://www.kaggle.com/datasets/crowdflower/twitter-airline-sentiment
Description:

This dataset originally came

from Crowdflower's Data for Everyone
library.
As the original source says,
A sentiment Analysis job about the
problems of each major U.S. airline.
Twitter data was scraped from February
of 2015 and contributors were asked to
first classify positive, negative, and
neutral tweets, followed by categorizing
negative reasons (such as "late flight" or
"rude service").
Dataset Contents
• The dataset you've provided from
Kaggle, the "Twitter US Airline
Sentiment," is a suitable dataset for
sentiment analysis but primarily
focuses on customer reviews related
to airline experiences rather than
competitor products.
• It contains tweets related to different
U.S. airlines and the sentiment labels
classify tweets as positive, negative,
or neutral based on the sentiment
expressed.
• If you are specifically
interested in competitor
products and their customer
reviews, you may need to
explore other datasets or data
sources.

• Consider searching for

datasets on general e-
commerce websites, product
review platforms, or customer
sentiment analysis datasets
that cover a broader range of
products and services
• Using the Kaggle platform's
search and filtering options to
look for datasets that match
your specific criteria, or you
may need to explore other
sources and repositories for
datasets related to
competitor products and
customer sentiments.
Steps of approach

1. Import Libraries:
Start by Importing the necessary
libraries.

Code:
import numpy as np
import pandas as pd
import nltk
from nltk.sentiment.vader import SentimentIntensityAnalyser
import re
from textblob import TextBlob
from word cloud import wordcloud
import seaborn as sns
import matplotlib.pyplot as plt
import cufflinks as cf
inline %matplotlib
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
from nltk.stem import WordNetLemmatizer
from plotly.offline import init_notebook_mode,iplot
init_notebook_mode(connected = True)
cf.go_offline();
import plotly.graph_objs as go
from plotly.subplots import make_subplots

import warnings
warnings.flilterwarning(“ignore”)
warning.warn(“this will not show”)

pd.set_option(‘display.max_columns’,None)
2. Load the Dataset:
Load the dataset into a Pandas
DataFrame.
Code:
data = pd.read_csv("Tweets.csv")
3. Data cleaning:
Clean the text data by performing the
following steps:
•Remove special characters and URLs:
Code:
data['text']
=data['text'].apply(lambda
x:re.sub(r"http\S+|www\S+|<.*?>|[^
a-zA-Z0-9\s]", '', x))
• Convert text to lowercase:
Code:
data['text'] = data['text’].str.lower()
• Remove stopwords (common
words that don't provide
meaningful information):
Code:
stop_words =
set(stopwords.words('english'))
data['text'] = data['text'].apply(lambda
x: ' '.join(word for word in
word_tokenize(x) if word not in
stop_words))
• Lemmatize words to
reduce them to their base
form:
Code:
lemmatizer = WordNetLemmatizer()
data['text'] =
data['text'].apply(lambda x: '
'.join(lemmatizer.lemmatize(word) for
word in word_tokenize(x)))
4. Select Relevant Columns:
If you're only interested in the
cleaned text and sentiment labels,
select those columns:
Code:
data = data[['text', 'airline_sentiment']]
5. Save the Preprocessed Data:
Optionally, you can save the
preprocessed data to a new CSV
file.
Code:
data.to_csv("preprocessed_tweets.
csv", index=False)
Feature Extraction
1. Load and Preprocess the Data:
Begin by loading and preprocessing the
textual data as previously described in
the "Data Preprocessing" section.
Code:
import pandas as pd
# Load the preprocessed data
data =
pd.read_csv("preprocessed_tweets.
csv")
# Split the data into features (text)
and labels (sentiment)
X = data['text']
y = data['airline_sentiment']
2. Sentiment Analysis:
You can apply sentiment analysis
techniques to predict the sentiment labels.
In this example, we'll use the Multinomial
Naive Bayes classifier.
Code:
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import accuracy_score,
classification_report
# Train a Multinomial Naive Bayes classifier
clf = MultinomialNB()
clf.fit(X_train_tfidf, y_train)
# Make predictions
y_pred = clf.predict(X_test_tfidf)
# Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)
print(classification_report(y_test, y_pred))
Visualizations
Using Python libraries like Matplotlib, Seaborn, and
Plotly we can Visualizing sentiment distribution and
analyzing trends.
Sentiment Distribution:
To visualize the sentiment distribution, you can
create a bar chart or pie chart showing the count
of each sentiment category (positive, negative,
neutral).
Code:
import matplotlib.pyplot as plt
import seaborn as sns
sentiment_counts =
data['airline_sentiment'].value_counts()
plt.figure(figsize=(8, 6))
sns.countplot(data=data, x='airline_sentiment',
order=sentiment_counts.index)
plt.title('Sentiment Distribution')
plt.xlabel('Sentiment')
plt.ylabel('Count')
plt.show()
Word Clouds:
You can create word clouds to visualize the
most frequently occurring words in each
sentiment category.
Code:
from wordcloud import WordCloud
# Generate word clouds for each sentiment
sentiment_labels = data['airline_sentiment'].unique()
for sentiment in sentiment_labels:
text = " ".join(data[data['airline_sentiment'] ==
sentiment]['text'])
wordcloud = WordCloud(width=800, height=400,
background_color='white').generate(text)
plt.figure(figsize=(8, 6))
plt.imshow(wordcloud, interpolation='bilinear')
plt.title(f'Word Cloud for {sentiment} Sentiment')
plt.axis('off')
plt.show()
These visualizations will help you gain insights into
sentiment distribution, temporal trends, and the most
frequently used words in each sentiment category in
your dataset.
Insights Generation
Analyzing sentiment analysis results can provide
valuable insights that can guide business decisions.
Here are some meaningful insights can extract from the
given dataset to inform business decisions:
1. Identify Common Complaints and
Issues: Analyze the most frequent negative
sentiment expressions to identify common
complaints and issues raised by customers. This can
help airlines prioritize areas for improvement, such
as customer service, flight delays, or seat comfort.

1. Monitor Brand Sentiment Over Time:

Use sentiment trends over time to monitor how
the sentiment towards different airlines evolves.
Are there specific periods when sentiment
becomes more positive or negative? This can
inform marketing and operational strategies.
3. Assess the Impact of Customer Service
Responses:
If the dataset includes customer service
responses to negative tweets, analyze whether
these responses lead to sentiment changes.
Determine if timely and helpful responses result
in more positive sentiments, potentially
improving customer satisfaction.

4.Identify Positive Sentiment Drivers:

Explore the positive sentiment tweets to
identify what customers appreciate about the
airlines. It could be factors like excellent
service, friendly staff, or smooth check-in
processes. Leverage these insights in
marketing campaigns.
3. Customer Loyalty and Advocacy:
Identify loyal customers who consistently
express positive sentiments. Consider engaging
with them for testimonials, case studies, or
loyalty programs to strengthen brand advocacy.

4. Iterative Improvement
Use sentiment analysis results as part of a
continuous improvement process. Regularly
monitor sentiment and take action based on
feedback to enhance customer satisfaction and
loyalty.
5. Customer Demographics and
Sentiment: If available, analyze sentiment
based on customer demographics (e.g., age,
gender) to identify patterns in sentiment. Tailor
marketing and communication strategies to
specific customer segments.

6. Impact of Special Promotions or

Events: Examine sentiment changes around
special promotions, events, or incidents (e.g.,
weather-related disruptions). Did these events
have a significant impact on sentiment? This
can inform crisis management and marketing
strategies.
Remember that these insights should
guide data-driven decision-making
processes.
Combining sentiment
analysis with other data sources and
feedback channels, such
as customer surveys and
reviews on other platforms,
can provide a more
comprehensive view of customer
sentiment
and preferences.

Twitter Sentiment Analysis For Product Review
No ratings yet
Twitter Sentiment Analysis For Product Review
19 pages
Sentiment Analysis for Airlines
No ratings yet
Sentiment Analysis for Airlines
4 pages
Southnlp2024 Poster 34
No ratings yet
Southnlp2024 Poster 34
1 page
NLP for Airline Sentiment Analysis
No ratings yet
NLP for Airline Sentiment Analysis
29 pages
Product Rating Through Sentiment Analysis
No ratings yet
Product Rating Through Sentiment Analysis
23 pages
Sentiment Analysis of Customer Reviews From e
No ratings yet
Sentiment Analysis of Customer Reviews From e
6 pages
Sentimental Analysis of Customer Reviews Which Should Be Represent in Graph by Using Plot Scatter
No ratings yet
Sentimental Analysis of Customer Reviews Which Should Be Represent in Graph by Using Plot Scatter
12 pages
Software Engineering - Project Proposal
No ratings yet
Software Engineering - Project Proposal
13 pages
BERT NLP Sentiment Analysis of Airlines
No ratings yet
BERT NLP Sentiment Analysis of Airlines
33 pages
Social Media Sentiment Analysis Report
No ratings yet
Social Media Sentiment Analysis Report
7 pages
Significant Labels in Sentiment Analysis of Online Customer Reviews of Airlines
No ratings yet
Significant Labels in Sentiment Analysis of Online Customer Reviews of Airlines
18 pages
Business Sentiment Analysis Guide
No ratings yet
Business Sentiment Analysis Guide
6 pages
Airline Sentiment Analysis via Mutual Information
No ratings yet
Airline Sentiment Analysis via Mutual Information
6 pages
Proposalwriting
No ratings yet
Proposalwriting
16 pages
Social Media Sentiment Project
No ratings yet
Social Media Sentiment Project
2 pages
Part C - Assignment No. 2 Mini-Project On Twitter
No ratings yet
Part C - Assignment No. 2 Mini-Project On Twitter
7 pages
Customer Sentiment Analysis Guide
No ratings yet
Customer Sentiment Analysis Guide
3 pages
Social Media Se
No ratings yet
Social Media Se
3 pages
Sentiment Analysis For Twitter Comments Project Exp
No ratings yet
Sentiment Analysis For Twitter Comments Project Exp
5 pages
Python-Based Tweet Sentiment Analysis
No ratings yet
Python-Based Tweet Sentiment Analysis
4 pages
Fin Ijprems1714118825
No ratings yet
Fin Ijprems1714118825
6 pages
Sentiment Analyzer For E-Commerce
No ratings yet
Sentiment Analyzer For E-Commerce
16 pages
Arsalan's Project
No ratings yet
Arsalan's Project
4 pages
Sentiment Analysis with ML
No ratings yet
Sentiment Analysis with ML
10 pages
Arsalan's Project New
No ratings yet
Arsalan's Project New
4 pages
Social Media Sentiment Analysis
No ratings yet
Social Media Sentiment Analysis
9 pages
Sentiment Analysis of US Airlines
No ratings yet
Sentiment Analysis of US Airlines
25 pages
NM Project
No ratings yet
NM Project
18 pages
NLP Sentimental Analysis 1736351356
No ratings yet
NLP Sentimental Analysis 1736351356
32 pages
Anjali Presentation
No ratings yet
Anjali Presentation
21 pages
Twitter Sentiment Analysis Guide
No ratings yet
Twitter Sentiment Analysis Guide
27 pages
Twitter Sentiment Analysis with Hadoop
No ratings yet
Twitter Sentiment Analysis with Hadoop
27 pages
6 Project Report Sem6
No ratings yet
6 Project Report Sem6
13 pages
Dataset Description: Amazon Reviews of Unlocked Phone
No ratings yet
Dataset Description: Amazon Reviews of Unlocked Phone
4 pages
Textual Analysis Sentiment Analysis Presentation
No ratings yet
Textual Analysis Sentiment Analysis Presentation
15 pages
Sentiment Analysis of Twitter Data My
75% (4)
Sentiment Analysis of Twitter Data My
14 pages
Mukesh Joshiyara FInal
No ratings yet
Mukesh Joshiyara FInal
31 pages
Unnati Thesiss. Orignal1
No ratings yet
Unnati Thesiss. Orignal1
52 pages
Project Report
No ratings yet
Project Report
9 pages
Analyzing Customer Feedback Using NLP
No ratings yet
Analyzing Customer Feedback Using NLP
21 pages
Final Year Project PPT Template
No ratings yet
Final Year Project PPT Template
12 pages
Paper Main Findings Methodology Study Design Theoretical Framework Research Gaps Research Question Dataset
No ratings yet
Paper Main Findings Methodology Study Design Theoretical Framework Research Gaps Research Question Dataset
7 pages
Detailed Report
No ratings yet
Detailed Report
6 pages
Twittersentiment
No ratings yet
Twittersentiment
12 pages
NLP Project Report
No ratings yet
NLP Project Report
17 pages
Enhancing Customer Insights through Sentiment Analysis
No ratings yet
Enhancing Customer Insights through Sentiment Analysis
10 pages
VG Computer Science AI Recommender
No ratings yet
VG Computer Science AI Recommender
18 pages
Sentiment Analysis On Twitter Data Using Machine Learning Algorithms in Python
No ratings yet
Sentiment Analysis On Twitter Data Using Machine Learning Algorithms in Python
15 pages
Major Project Presentationn (2) - 1
No ratings yet
Major Project Presentationn (2) - 1
51 pages
Final Presentation
No ratings yet
Final Presentation
8 pages
Twitter Sentiment Analysis Project
No ratings yet
Twitter Sentiment Analysis Project
18 pages
TM Assignment
No ratings yet
TM Assignment
16 pages
Sentiment Analysis: A NLP And: 2. Detailed Approach
No ratings yet
Sentiment Analysis: A NLP And: 2. Detailed Approach
6 pages
Sentiment Analysis E-commerce System
No ratings yet
Sentiment Analysis E-commerce System
1 page
Sentiment Analysis
No ratings yet
Sentiment Analysis
7 pages
Twitter Sentiment Analysis Using Machine Learning Algorithms IJERTV12IS070128
No ratings yet
Twitter Sentiment Analysis Using Machine Learning Algorithms IJERTV12IS070128
3 pages
Capstone Project
No ratings yet
Capstone Project
15 pages
Sentiment of Tweets
No ratings yet
Sentiment of Tweets
7 pages
Vendor Pre Qualification Form
No ratings yet
Vendor Pre Qualification Form
6 pages
Fraud Detection Using Machine Learning
No ratings yet
Fraud Detection Using Machine Learning
36 pages
Letter From The President
No ratings yet
Letter From The President
2 pages
Concrete Standards for Durable Structures
No ratings yet
Concrete Standards for Durable Structures
13 pages
ТЕМА 3
No ratings yet
ТЕМА 3
4 pages
Time in Germany
No ratings yet
Time in Germany
1 page
Unit III. Learning Theories and Models
No ratings yet
Unit III. Learning Theories and Models
40 pages
Destiny Consultancy: "No Advice Only Solution"
No ratings yet
Destiny Consultancy: "No Advice Only Solution"
13 pages
Student Project Acknowledgments
No ratings yet
Student Project Acknowledgments
1 page
Trane Model CRHR-400 Physical and Electrical SpecificationsNEW
No ratings yet
Trane Model CRHR-400 Physical and Electrical SpecificationsNEW
1 page
Bulk SMS Service - Smsmenow
No ratings yet
Bulk SMS Service - Smsmenow
2 pages
ABB Low Voltage Coils & Kits Pricing
No ratings yet
ABB Low Voltage Coils & Kits Pricing
1 page
STS1500014-001 Rev 4 HAZID Report For OceanGuard BWMS Complete
No ratings yet
STS1500014-001 Rev 4 HAZID Report For OceanGuard BWMS Complete
71 pages
01a. Questionnaire Hf. Recurrent Rev. 01, Jan. 04, 2023-Lgtc-tt-Am-f004
No ratings yet
01a. Questionnaire Hf. Recurrent Rev. 01, Jan. 04, 2023-Lgtc-tt-Am-f004
4 pages
Diploma in Human Resources Brochure 2024
No ratings yet
Diploma in Human Resources Brochure 2024
13 pages
Getting Somewhere by Lilian A. Aujo-Group 1
100% (1)
Getting Somewhere by Lilian A. Aujo-Group 1
3 pages
Amithlon Kernel
No ratings yet
Amithlon Kernel
9 pages
LMCC Impact Study - Lindsay Johnson
No ratings yet
LMCC Impact Study - Lindsay Johnson
5 pages
TDD: Topics in Distributed Databases: Parallel Database Management Systems
No ratings yet
TDD: Topics in Distributed Databases: Parallel Database Management Systems
38 pages
Erebuni Yerevan - Concert Instruments
No ratings yet
Erebuni Yerevan - Concert Instruments
1 page
Sancoale Slab
No ratings yet
Sancoale Slab
17 pages
Automatic Congestion Handling Feature Parameter Description: Issue Date
No ratings yet
Automatic Congestion Handling Feature Parameter Description: Issue Date
61 pages
System Formwor
100% (1)
System Formwor
55 pages
Sanctions For Examination Misconducts
No ratings yet
Sanctions For Examination Misconducts
2 pages
Karanlık Üçlü Ölçeği: Türkçe Adaptasyon
No ratings yet
Karanlık Üçlü Ölçeği: Türkçe Adaptasyon
15 pages
SS2 Biology Lesson Note
No ratings yet
SS2 Biology Lesson Note
35 pages
UT1 Solution 1
No ratings yet
UT1 Solution 1
10 pages
Newcastle Disease Scientific - & Technico Booklet
No ratings yet
Newcastle Disease Scientific - & Technico Booklet
45 pages
Allowable Stress Values of Stainless Steel and Carbon Steel PDF
No ratings yet
Allowable Stress Values of Stainless Steel and Carbon Steel PDF
2 pages
Unit 9 Management Accounting Costing and Budgeting
0% (1)
Unit 9 Management Accounting Costing and Budgeting
7 pages

Ai Phase - 1

Uploaded by

Ai Phase - 1

Uploaded by

NAAN MUDHALVAN

PROJECT TITLE: Sentiment Analysis

This dataset originally came

• Consider searching for

1. Monitor Brand Sentiment Over Time:

4.Identify Positive Sentiment Drivers:

6. Impact of Special Promotions or

You might also like