Sentiment Analysis

This project aims to perform sentiment analysis on Apple's reviews (6000+ samples, from 2011 to 2023) gathered from Trustpilot using various Python libraries such as Requests, BeautifulSoup, NLTK, and TextBlob. By extracting review content and other relevant information from the web page, this analysis provides insights into the sentiment expressed by customers towards Apple's products and services.

Roadmap

Web scrape Apple's reviews from Trustpilot
Perform data cleaning
Create additional features (word_count, char_count, average_word_length, stopword_count, stopword_rate)
Pre-process text data (to lowercase, remove punctuation, stop words, recursive words, extra custom stop words, and lemmatization)
Perform sentiment analysis
Evaluate and analyse polarity and subjectivity
Draw conclusions

Stack

Requests
BeautifulSoup
NLTK
TextBlob
Data Cleaning and Interpretation

Methodology

The methodology consists of several key steps, including web scraping, data cleaning, and sentiment analysis using NLTK and TextBlob.

Web Scraping

The first step involved web scraping the reviews from Apple's Trustpilot page. The BeautifulSoup library was used to extract the review content and other relevant information from the HTML structure of the webpage. The Requests library facilitated the retrieval of the web page's HTML content for further processing.

Data Cleaning and Feature Creation

Once the reviews were collected, a comprehensive data cleaning process was applied to ensure the quality and reliability of the dataset. Additional features were also created from the reviews to provide more context and enable a more in-depth data analysis (word_count, char_count, average_word_length, stopword_count, stopword_rate).

The NLTK library was utilised for tasks such as stop word removal to preprocess the text data. TextBlob, a natural language processing library, was employed to lemmatize the reviews and assign sentiment scores, indicating whether each review expressed a positive, negative, or neutral sentiment. The sentiment analysis results were used to gain insights into the overall sentiment of the reviews and to identify any sentiments associated with specific aspects of Apple's products or services.

Evaluation and Conclusion

To evaluate the results we used TextBlob's sentiment analysis capabilities to assess the polarity and subjectivity of Apple's reviews. The polarity score indicates the overall sentiment expressed in the reviews, ranging from -1 to 1, where -1 indicates negative sentiment, 0 indicates neutral sentiment, and 1 indicates positive sentiment. On the other hand, the subjectivity score measures the degree of subjectiveness in the reviews, ranging from 0 to 1, where 0 indicates an objective statement, and 1 indicates a subjective statement.

The analysis revealed that the average polarity score of Apple's reviews (6000+ samples, from 2011 to 2023) was 0.030779, indicating a slightly positive sentiment overall. This suggests that there were more positive or neutral opinions expressed compared to negative ones. However, it is important to note that the magnitude of the polarity score is quite small, suggesting a relatively balanced sentiment.

Furthermore, the average subjectivity score was found to be 0.476655, indicating a moderate level of subjectivity in the reviews. This suggests that while some reviews were objective and based on factual information, a significant portion of the reviews contained subjective opinions or personal experiences.

The findings from this project can be valuable for Apple to gain insights into customer sentiment, identify areas of improvement, and address customer concerns. By understanding the sentiment and subjectivity of customer reviews, Apple can make data-driven decisions to enhance its products and services, improve customer satisfaction, and maintain a positive brand reputation.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
imgs		imgs
License.txt		License.txt
README.md		README.md
apple-sentiment-analysis.ipynb		apple-sentiment-analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis - Apple's reviews

Roadmap

Stack

Methodology

Web Scraping

Data Cleaning and Feature Creation