0% found this document useful (0 votes)

16 views19 pages

Sentiment Analysis

Uploaded by

dikshayadav0728

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views19 pages

Sentiment Analysis

Uploaded by

dikshayadav0728

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 19

SENTIMENT ANALYSIS OF PRODUCT

REVIEWS
AGENDA

Introduction

Objective

Data cleaning & pre-

processing

Data analysis and

Visualisation

Conclusion
INTRODUCTION • This project is focused on analyzing customer
sentiment from Amazon product reviews.
• The goal is to extract meaningful insights from the
dataset by cleaning the data, performing
exploratory data analysis (EDA), and visualizing
key metrics—within Power BI and python.
• The insights drawn from this project will help in
understanding customer satisfaction levels,
identifying strengths and weaknesses across
product categories, and guiding data-driven
business decisions.
OBJECTIVE • Data Collection & Cleaning: Process Amazon
reviews, remove duplicates, and standardize text.
• Exploratory Data Analysis (EDA): Identify
sentiment trends, visualize common words, and
analyze rating distributions.
• Sentiment Analysis in Power BI: Use text-based
metrics to classify sentiments (positive, negative,
neutral).
• Visualization: Create interactive dashboards with bar
charts, pie charts, and word clouds.
• Insights & Recommendations: Identify customer
preferences, product strengths/weaknesses, and
improvement areas..
DATA CLEANING & PREPROCESSING
Step 1: load dataset

The dataset is loaded using pd.read_excel().

It is stored in a DataFrame (df) to facilitate structured data analysis.

Step 2: Handling duplicates and counting missing value

•Converting ASINs to uppercase and removing spaces using .str.strip().str.upper().

•Removing rows where ASIN does not start with an alphabet using df[df['asin'].str.match(r'^[A-Za-z]')].

•Count total rows before cleaning using df.shape[0].

•Identify duplicate rows using df.duplicated(subset=['user_id', 'asin']).

•Remove duplicates using df.drop_duplicates(subset=['user_id', 'asin'], keep='first').

•Count missing values across all columns using df.isnull().sum().sum().

Step 3: missing value

We check for missing values using df.isna().sum().

Rows with missing values in text, ASIN, parent_ASIN, user_ID, Product Category, Sub-Category, Actual Price,
Discount Percentage, and Discount Price are dropped using df.dropna().
Step 4:Text Cleaning

We apply Regular Expressions (RegEx) to remove non-alphabetic characters

.Only letters and spaces are retained in the text_cleaned column.

Step 5: saved cleaned data

The cleaned data is saved as an Excel file (removed_data.xlsx).

Step 6: Extracting Removed Data Summary

We compare the original dataset with the cleaned version.

Display the total number of removed duplicates and missing values.
Print a summary showing the total rows before and after cleaning.
MARKET
EXPANSION
SENTIMENT ANALYSES

Step 1: Importing Required Libraries

To perform sentiment analysis, we need the following
Python libraries:
•nltk: A powerful NLP library for text analysis.
•pandas: To handle and manipulate structured data.
•VADER SentimentIntensityAnalyzer: A pre-trained
sentiment model for analyzing text sentiment.
Step 2: Initializing the Sentiment Analyzer
•sia = SentimentIntensityAnalyzer()
•This loads the VADER Sentiment Analyzer, allowing us to process sentiment scores.
•def get_sentiment(text):
•score = sia.polarity_scores(str(text))
•return score['compound']
•The function get_sentiment() takes a cleaned review (text) as input.It converts the text to a string (if not
already) and calculates its sentiment score using sia.polarity_scores().
•The function returns the compound score, which ranges from -1 (negative) to +1 (positive)
•df['sentiment_score'] = df['text_Cleaned'].apply(get_sentiment)
•This applies the get_sentiment() function to the text_Cleaned column.
•The sentiment score is stored in a new column called sentiment_score.
•print(df[['text_Cleaned', 'sentiment_score']].head())This prints the first five cleaned reviews along with their
sentiment scores.
Step 3: Applying Sentiment Classification on Reviews
•Since we have a compound score for each review, we classify sentiment based on its value:
Positive → compound score > 0.05
Neutral → compound score between -0.05 and 0.05
Negative → compound score < -0.05
def classify_sentiment(score):
if score >= 0.05: return "Positive"
elif score <= -0.05: return "Negative"
else: return "Neutral
•The function classify_sentiment() is applied to each row in the "sentiment_score" column.
A new column "sentiment" is created with labels Positive, Neutral, or Negative.
['sentiment'] = df['sentiment_score'].apply(classify_sentiment)
•The "sentiment_score" column shows numerical values.
The "sentiment" column has corresponding labels.
print(df[['text_Cleaned', 'sentiment_score', 'sentiment']].head())
•We save the final dataset, which includes the text reviews, sentiment scores, and sentiment labels, for
further analysis.
sentiment_file = r"C:\Users\HP\Downloads\Sentiment_data.xlsx"df.to_excel(sentiment_file, index=False)
KPI Cards
DATA VISUALIZATION
• 158M (Actual Price): Represents the total sum of actual product prices
across all reviewed products.

• 35.47M (Discount Price): Reflects the total sum of discounts provided on

these products.

Pie Chart: Count of Sentiment by Sentiment:

• Positive: 78.66%

• Neutral: 8.84%

• Negative: 12.51%

Slicers

• Sentiment Slicer Column:

• Filters data based on sentiment (Positive, Neutral, or Negative).

• Main Category Slicer Column:

• List Function: Allows users to filter sentiment analysis for different
product categories (e.g., Electronics, Clothing, Accessories).

• Sub-Category Slicer Column:

• List Function: Provides a more detailed view by filtering sentiment
data for specific product types within a category (e.g., Men’s Shoes,
Lingerie, Watches).
Average rating by Sentiment (Donut Chart)

• Findings:
• Higher rating are associated with neutral and positive sentiments.
• Almost 1/3 rating (33.09%) of rating are linked to neutral reviews, indicating potential strategic
pricing.

Count of Sentiment by Product Category (Bar Chart)

• Findings:
• "Home & Living" receives the highest sentiment count, with a major share being positive.
• "Beauty" and "Automotive" categories have mixed sentiments with a noticeable portion of neutral
and negative reviews.

Count of Sentiment by Sub-Category (Bar Chart)

• Findings:
• "Fragrance" sub-category has the highest sentiment count, primarily positive.
• "Car Accessories," "Furniture," and "Cleaning Supplies" show a mix of positive and neutral
sentiments.
• Some sub-categories like "Wearable Tech" and "Phone Accessories" have a relatively small number of
sentiment counts.
THANK YOU

PRESENT ED BY: Bhavay, diksha yadav and dimple

2202390060, 2202390021,
2202390039

BBA(BIA) – 6TH SEM

GROUP – 3

Sentiment Analysis of Customer Reviews From e
No ratings yet
Sentiment Analysis of Customer Reviews From e
6 pages
Kayak Brand Sentiment Analysis Guide
No ratings yet
Kayak Brand Sentiment Analysis Guide
13 pages
VG Computer Science AI Recommender
No ratings yet
VG Computer Science AI Recommender
18 pages
Episode 3 - Transcription
No ratings yet
Episode 3 - Transcription
4 pages
Ai Phase1
No ratings yet
Ai Phase1
12 pages
Sentiment Analysis of Online Reviews
No ratings yet
Sentiment Analysis of Online Reviews
11 pages
Dataset Description: Amazon Reviews of Unlocked Phone
No ratings yet
Dataset Description: Amazon Reviews of Unlocked Phone
4 pages
Comsats University Islamabad Wah Campus (Project Report) : Submitted by
No ratings yet
Comsats University Islamabad Wah Campus (Project Report) : Submitted by
14 pages
Paper 8848
No ratings yet
Paper 8848
4 pages
Project Report
No ratings yet
Project Report
9 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
11 pages
Sentiment Analysis with ML
No ratings yet
Sentiment Analysis with ML
10 pages
Paper PDF Data
No ratings yet
Paper PDF Data
3 pages
Ai Phase - 1
No ratings yet
Ai Phase - 1
21 pages
NM Project
No ratings yet
NM Project
18 pages
Experiment
No ratings yet
Experiment
5 pages
SentimentScanner Report (1) .PDF 157
No ratings yet
SentimentScanner Report (1) .PDF 157
20 pages
Gokul
No ratings yet
Gokul
10 pages
Amazon Reviews Sentiment Analysis Insights
No ratings yet
Amazon Reviews Sentiment Analysis Insights
10 pages
Final Year Project PPT Template
No ratings yet
Final Year Project PPT Template
12 pages
Sentiment Analysis of E Commerce Product Reviews
No ratings yet
Sentiment Analysis of E Commerce Product Reviews
8 pages
SSRN 3886135
No ratings yet
SSRN 3886135
16 pages
Sentimental Analysis of Customer Reviews Which Should Be Represent in Graph by Using Plot Scatter
No ratings yet
Sentimental Analysis of Customer Reviews Which Should Be Represent in Graph by Using Plot Scatter
12 pages
Report On Sentiment Analysis For Customer Reviews
No ratings yet
Report On Sentiment Analysis For Customer Reviews
4 pages
Mukesh Joshiyara FInal
No ratings yet
Mukesh Joshiyara FInal
31 pages
Data Science: Amazon Review Analysis
No ratings yet
Data Science: Amazon Review Analysis
22 pages
Polarity Categorization On Product Reviews
No ratings yet
Polarity Categorization On Product Reviews
4 pages
Enhancing Customer Insights through Sentiment Analysis
No ratings yet
Enhancing Customer Insights through Sentiment Analysis
10 pages
Hotels Review Classification Final
No ratings yet
Hotels Review Classification Final
34 pages
Harsha Edunet
No ratings yet
Harsha Edunet
10 pages
Business Review Sentiment Insights
No ratings yet
Business Review Sentiment Insights
6 pages
Amna Bagh Ali
No ratings yet
Amna Bagh Ali
6 pages
Sentiment Analysis On E-Commerce Product Using Mac
No ratings yet
Sentiment Analysis On E-Commerce Product Using Mac
6 pages
Tejano - Final
No ratings yet
Tejano - Final
29 pages
212 Dishu
No ratings yet
212 Dishu
42 pages
Amazon Review Data Analysis
No ratings yet
Amazon Review Data Analysis
23 pages
AI-web Scraping
No ratings yet
AI-web Scraping
18 pages
Starbucks Sentiment Analysis Using VADER
No ratings yet
Starbucks Sentiment Analysis Using VADER
23 pages
Review of Products Using Sentiment Analysis (4-2 Project Report) - 3
No ratings yet
Review of Products Using Sentiment Analysis (4-2 Project Report) - 3
75 pages
Sentiment Analysis Insights
No ratings yet
Sentiment Analysis Insights
1 page
Product Rating Through Sentiment Analysis
No ratings yet
Product Rating Through Sentiment Analysis
23 pages
Sentiment Analysis of Product Review
No ratings yet
Sentiment Analysis of Product Review
6 pages
Amazon Reviews Dataset Analysis
100% (1)
Amazon Reviews Dataset Analysis
7 pages
Research Aml
No ratings yet
Research Aml
22 pages
Amazon Review Sentiment Analysis Techniques
No ratings yet
Amazon Review Sentiment Analysis Techniques
27 pages
Sentiment Analysis: Natural Language Processing (NLP) Customer Feedback
No ratings yet
Sentiment Analysis: Natural Language Processing (NLP) Customer Feedback
12 pages
Fusion Sentiment Analysis Presentation
No ratings yet
Fusion Sentiment Analysis Presentation
20 pages
Sentiment Analysis of A Product Based On User Reviews Using Random Forests Algorithm
No ratings yet
Sentiment Analysis of A Product Based On User Reviews Using Random Forests Algorithm
5 pages
E-Commerce Sentiment Analysis Study
No ratings yet
E-Commerce Sentiment Analysis Study
1 page
Sentimental Analysis Research Paper 1
No ratings yet
Sentimental Analysis Research Paper 1
3 pages
Sentiment Analysis On Amazon Reviews Using Machine Learning
No ratings yet
Sentiment Analysis On Amazon Reviews Using Machine Learning
77 pages
Restaurant Review Sentiment Analysis
No ratings yet
Restaurant Review Sentiment Analysis
18 pages
Opinion Mining Classification Using Naiv
No ratings yet
Opinion Mining Classification Using Naiv
4 pages
ICIEM23 Presentation Format
No ratings yet
ICIEM23 Presentation Format
11 pages
Detailed Report
No ratings yet
Detailed Report
6 pages
Product Review Sentiment Analysis
No ratings yet
Product Review Sentiment Analysis
4 pages
Final Set Paper-2
No ratings yet
Final Set Paper-2
4 pages
Sentiment Analysis Internship Report
No ratings yet
Sentiment Analysis Internship Report
27 pages
QR Code For SMT Catalogue
No ratings yet
QR Code For SMT Catalogue
1 page
ETE Brochure
No ratings yet
ETE Brochure
4 pages
Assignment 1 Diksha 2202390021
No ratings yet
Assignment 1 Diksha 2202390021
5 pages
Assignment 1 Diksha 2202390021
No ratings yet
Assignment 1 Diksha 2202390021
5 pages
Cost
No ratings yet
Cost
9 pages
Assignment 2 Diksha 2202390021
No ratings yet
Assignment 2 Diksha 2202390021
3 pages
Revision Questions For Cswip Exams
91% (11)
Revision Questions For Cswip Exams
65 pages
O&M February 2024 Manpower Schedule v1
No ratings yet
O&M February 2024 Manpower Schedule v1
2 pages
Journey Roadmap To Problem Solving
No ratings yet
Journey Roadmap To Problem Solving
4 pages
DCCN 1
No ratings yet
DCCN 1
3 pages
2024-12-20T135152.513
No ratings yet
2024-12-20T135152.513
6 pages
ADAM24Pxx ETC
No ratings yet
ADAM24Pxx ETC
37 pages
John Dave Fuentes - Research - Progress - Report - Template
No ratings yet
John Dave Fuentes - Research - Progress - Report - Template
2 pages
Practice Programs - Day 4 and 5
No ratings yet
Practice Programs - Day 4 and 5
4 pages
Windows 10 Remote Desktop Guide
No ratings yet
Windows 10 Remote Desktop Guide
11 pages
PD Flow I - Floorplan - Physical Design, STA & Synthesis, DFT, Automation & Flow Dev, Verification Services. Turnkey Projects
No ratings yet
PD Flow I - Floorplan - Physical Design, STA & Synthesis, DFT, Automation & Flow Dev, Verification Services. Turnkey Projects
21 pages
ICT Exam Marking Scheme
No ratings yet
ICT Exam Marking Scheme
3 pages
Paper - 3: Advanced Auditing and Professional Ethics: (5 Marks)
No ratings yet
Paper - 3: Advanced Auditing and Professional Ethics: (5 Marks)
15 pages
Manchester Enterprise Front Page Oct. 4, 2012
No ratings yet
Manchester Enterprise Front Page Oct. 4, 2012
1 page
MacotronicB2 Rev2 1
No ratings yet
MacotronicB2 Rev2 1
140 pages
WIFI7
No ratings yet
WIFI7
24 pages
2013 Maria Miranda Unsitely Aesthetics
No ratings yet
2013 Maria Miranda Unsitely Aesthetics
157 pages
Chapter 5 - The Effects of Using ICT
No ratings yet
Chapter 5 - The Effects of Using ICT
11 pages
E-03-Sectoral NIMP-Electrical Electronics Industry
No ratings yet
E-03-Sectoral NIMP-Electrical Electronics Industry
32 pages
Apple vs Samsung: B2B Marketing Analysis
No ratings yet
Apple vs Samsung: B2B Marketing Analysis
11 pages
For Authors Certificate
0% (1)
For Authors Certificate
20 pages
Fit of CADCAM Implant Frameworks A Comprehensive Review
No ratings yet
Fit of CADCAM Implant Frameworks A Comprehensive Review
9 pages
Solarpv&iv PDF
No ratings yet
Solarpv&iv PDF
8 pages
Unit 6
No ratings yet
Unit 6
12 pages
BSIT Fall 2024 Midterm Exam Schedule
No ratings yet
BSIT Fall 2024 Midterm Exam Schedule
4 pages
Max6922 Max6934-3471063
No ratings yet
Max6922 Max6934-3471063
15 pages
FOS Script
No ratings yet
FOS Script
2 pages
Flow Iron Plug Valves Ps
No ratings yet
Flow Iron Plug Valves Ps
1 page
2nd SCH 2013-14 PDF
No ratings yet
2nd SCH 2013-14 PDF
67 pages
Iphone 13 Pro Max Battery Watt - Google Search
No ratings yet
Iphone 13 Pro Max Battery Watt - Google Search
1 page
Kohler Engine Owner's Manual Guide
No ratings yet
Kohler Engine Owner's Manual Guide
8 pages

Sentiment Analysis

Uploaded by

Sentiment Analysis

Uploaded by

SENTIMENT ANALYSIS OF PRODUCT

Data cleaning & pre-

Data analysis and

The dataset is loaded using pd.read_excel().

Step 2: Handling duplicates and counting missing value

•Converting ASINs to uppercase and removing spaces using .str.strip().str.upper().

•Count total rows before cleaning using df.shape[0].

•Remove duplicates using df.drop_duplicates(subset=['user_id', 'asin'], keep='first').

•Count missing values across all columns using df.isnull().sum().sum().

Step 3: missing value

We check for missing values using df.isna().sum().

We apply Regular Expressions (RegEx) to remove non-alphabetic characters

Step 5: saved cleaned data

The cleaned data is saved as an Excel file (removed_data.xlsx).

Step 6: Extracting Removed Data Summary

We compare the original dataset with the cleaned version.

Step 1: Importing Required Libraries

• 35.47M (Discount Price): Reflects the total sum of discounts provided on

Pie Chart: Count of Sentiment by Sentiment:

• Sentiment Slicer Column:

• Main Category Slicer Column:

• Sub-Category Slicer Column:

Count of Sentiment by Product Category (Bar Chart)

Count of Sentiment by Sub-Category (Bar Chart)

PRESENT ED BY: Bhavay, diksha yadav and dimple

BBA(BIA) – 6TH SEM

You might also like