0% found this document useful (0 votes)

39 views27 pages

Music Recommendation System Report

Uploaded by

pradeep08205

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views27 pages

Music Recommendation System Report

Uploaded by

pradeep08205

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

MUSIC RECOMMENDATION

SYSTEM
A PROJECT REPORT

Submitted by
Pradeep C [Reg No: RA2211027010005]
Sanjith T [Reg No. RA2211027010026]
Sethu Kumaran B [Reg No. RA2211027010006]

Under the Guidance of

Dr. A.V. Kalpana
(Assistant Professor, Department of Data Science and Business Systems)
In partial fulfillment of the Requirements for the Degree
of

M.TECH (Integrated)
COMPUTER SCIENCE WITH SPECIALIZATION IN
DATA SCIENCE

DEPARTMENT OF DATA SCIENCE AND BUSINESS

SYSTEMS
FACULTY OF ENGINEERING AND TECHNOLOGY
SRM INSTITUTE OF SCIENCE AND TECHNOLOGY

NOVEMBER 2022
SRM INSTITUTE OF SCIENCE AND TECHNOLOGY
KATTANKULATHUR-603203
BONAFIDE CERTIFICATE

Certified that this project report titled “Music Recommendation System” is the bonafide
work of “Pradeep C [Reg No: RA2211027010005]“, “Sanjith T [Reg No.
RA2211027010026]”, “Sethu Kumaran B [Reg No. RA2211027010006]” who carried out
the project work under my supervision. Certified further, that to the best of my knowledge the
work reported herein does not form part of any other thesis or dissertation on the basis of
which a degree or award was conferred on an earlier occasion for this or any other candidate.

Dr. A.V.Kalpana Dr. M.Lakshmi

GUIDE HEAD OF THE DEPARTMENT
Associate Professor Dept. of DSBS
Dept. of DSBS

Signature of Internal Examiner Signature of External Examiner

ABSTRACT

The "Music Recommendation System" project aims to enhance user music discovery by
leveraging advanced machine learning algorithms and collaborative filtering techniques. In
an era of vast music libraries, the system employs user behavior analysis and preferences to
generate personalized music recommendations. The project integrates with the Spotify API,
utilizing the Spotipy library, to access a diverse and extensive music catalog. Through the
implementation of the SpotifyClientCredentials authentication flow, the system harnesses
user-specific data to deliver tailored recommendations, fostering a more engaging and
satisfying music listening experience. The project showcases the power of data-driven
technologies in personalizing user interactions within the digital music landscape,
contributing to the evolution of recommendation systems in the broader domain of content
curation.
ACKNOWLEDGEMENTS

We express our humble gratitude to Dr C. Muthamizhchelvan, Vice-Chancellor, SRM

Institute of Science and Technology, for the facilities extended for the project work and his
continued support.

We extend our sincere thanks to Dean-CET, SRM Institute of Science and Technology, Dr
T.V.Gopal, for his invaluable support.

We wish to thank Dr Revathi Venkataraman, Professor & Chairperson, School of

Computing, SRM Institute of Science and Technology, for her support throughout the project
work.

We are incredibly grateful to our Head of the Department, Dr M. Lakshmi Professor,

Department of Data Science and Business Systems, SRM Institute of Science and
Technology, for her suggestions and encouragement at all the stages of the project work.

We register our immeasurable thanks to our Faculty Advisor, Shantha Kumari, Assistant
Professor, Department of Data Science and Business Systems, SRM Institute of Science and
Technology, for leading and helping us to complete our course.

Our inexpressible respect and thanks to my guide, Dr. A.V.Kalpana , Assistant Professor,
Department of Data Science and Business Systems, for providing me with an opportunity to
pursue my project under his mentorship. He provided us with the freedom and support to
explore the research topics of our interest. His passion for solving problems and making a
difference in the world has always been inspiring.

We sincerely thank the Data Science and Business Systems staff and students, SRM Institute
of Science and Technology, for their help during our project. Finally, we would like to thank
parents, family members, and friends for their unconditional love, constant support, and
encouragement.
Pradeep C
Sanjith T
Sethu Kumaran B
TABLE OF CONTENTS

C TITLE PAGE
A i
A i
L v
L v
1. I 1
2 L 4
3 6
4 9
5 1
6 1
7 2
8 3
9 3
10 3
LIST OF FIGURES
3.0 Distribution of data.……………………...………………………………..16
5.1 Confusion Matrix...…………………………...……………………….…..19
5.2 Confusion Matrix...…………………………...……………………….…..19
5.3 SVM Diagram…....…………………………...……………………….…..19
5.3 Confusion Matrix...…………………………...……………………….…..19
7.1 Confusion Matrix of LR...…………………………...…………..…….…..19
7.2 Confusion Matrix of SVM...…………………………...………………….19
7.3 Confusion Matrix of BernoulliNB...…………………………...……...…..19

ABBREVIATIONS
AI Artificial Intelligence
IOT Internet Of Things
GUI Graphical User Interface
URL Uniform Resource Locator
NB Naïve Bayes

LIST OF SYMBOLS
^ Conjunction
CHAPTER 1

INTRODUCTION

1.1. DOMAIN INTRODUCTION

In the dynamic landscape of modern music consumption, the integration of technology has
reshaped how individuals engage with their favourite tunes. The Music Recommendation
System project resides within the domain of music technology, a realm continually propelled
by innovative solutions to enhance user interaction and satisfaction. With an exponential
growth in the availability of digital music content, the need for personalized and relevant
recommendations has become paramount. This project navigates the intersection of data
science, machine learning, and music curation, aiming to revolutionize the way users
discover, explore, and enjoy music tailored to their unique preferences. By delving into the
complexities of user behavior analysis and collaborative filtering, the project endeavors to
contribute to the ongoing evolution of music recommendation systems, offering users an
enriched and personalized musical journey. In this domain, the fusion of technology and
music opens doors to a world where algorithms harmonize with individual tastes, creating an
immersive and customized sonic experience for every listener.

1.2 MOTIVATION
The motivation behind creating the Music Recommendation System project stems from a
recognition of the evolving landscape in the realm of digital music consumption. With an
abundance of musical content available across various platforms, users often find themselves
overwhelmed with choice, seeking a more streamlined and personalized way to discover
music that resonates with their tastes. This project is driven by the desire to harness the power
of machine learning and data analytics to curate music recommendations that transcend
generic categorizations. By understanding user preferences, behaviors, and the intricate
patterns within vast music catalogs, the goal is to offer a tailored and enriching musical
journey. This endeavor is fueled by a passion for enhancing user experiences, fostering a
deeper connection between individuals and the music that defines and complements their
unique tastes. Ultimately, the Music Recommendation System project aspires to contribute to

7
the ever-evolving landscape of digital music, where technology harmonizes seamlessly with
the diverse and individualized world of musical expression.

CHAPTER 2
LITERATURE REVIEW

1. Introduction to Music Recommendation Systems: The opening sentences provide an

overview of the motivation behind music recommendation systems, citing the challenges
users face in navigating vast musical content and the project's goal of enhancing the music
discovery experience.

2. Evolution of Collaborative Filtering: This section reviews foundational work in

collaborative filtering, highlighting the matrix factorization approach proposed by Koren,
Bell, and Volinsky. It emphasizes how collaborative filtering techniques have played a
pivotal role in improving recommendation accuracy.

3. Content-Based Approaches in Music Recommendation: The literature review delves

into content-based methods, with a specific mention of the work by Pauws and Kaptein. This
section underscores the significance of leveraging intrinsic music features for enhancing
recommendation precision.

4. Hybrid Models: The narrative shifts to the exploration of hybrid models, combining
collaborative and content-based approaches. It recognizes the need for a holistic approach to
overcome limitations and improve the overall effectiveness of recommendation systems.

5. Deep Learning in Music Recommendation: This section explores the integration of deep
learning methodologies in music recommendation systems, referencing the work of van den
Oord et al. and their use of deep neural networks for content-based recommendations.

6. Transfer Learning and Contextual Information: The literature review highlights studies
on transfer learning, drawing knowledge from related domains to enrich recommendation
processes. It also touches upon the integration of contextual information, such as user mood
and temporal dynamics, to enhance adaptability.

8
7. Social Aspects in Music Recommendations: The emergence of collaborative playlist
creation and social-based music recommendation systems is discussed, emphasizing the
impact of user interactions and social networks on improving recommendation accuracy, with
reference to the work of Bonnin et al.

8. Persistent Challenges and Ongoing Research: The concluding section acknowledges

persistent challenges in music recommendation systems, including the "cold start" problem
and scalability concerns. It emphasizes the need for ongoing exploration of novel algorithmic
approaches, user modeling techniques, and the integration of emerging technologies.

9. Project Contribution to the Field: The final sentences bridge the literature review to the
project at hand, indicating how the Music Recommendation System project aims to
contribute to the evolving landscape of personalized music discovery in the digital era.

CHAPTER 3

DATA ACQUISITION

Data Acquisition for the Music Recommendation System Project:

Source Diverse Datasets:

Acquire datasets that encompass a wide range of user music interactions,

reflecting diverse preferences and behaviors.

Utilize Music Streaming Service APIs:

Leverage APIs provided by music streaming services, such as Spotify, to

seamlessly access user-specific data.

Retrieve User Preferences and Behaviors:

Gather information like track preferences, play counts, and user-generated

playlists to understand individual music preferences.

9
Incorporate Contextual Metadata:

Enhance the dataset with contextual metadata, including genre information, artist
details, and album characteristics.

Access Demographic Information:

Include demographic details to provide a more holistic view of users and their
music preferences.

Integrate User Ratings and Social Interactions:

Incorporate user ratings and social interactions to capture nuanced aspects of user
engagement and satisfaction.

Ensure Ethical Considerations:

Prioritize ethical considerations throughout the acquisition process, adhering to

privacy regulations and ensuring data security.

Responsible Data Use:

Maintain a commitment to responsible data use, safeguarding sensitive

information and upholding user privacy.

Prepare for Machine Learning Model Training:

The richness and diversity of the acquired dataset serve as the foundation for
training robust machine learning models.

10
Distribution of data

CHAPTER 4
PRE-PROCESSING
1. Data Collection:

- Obtain a dataset from reliable sources, such as online music platforms, that includes
information about songs, artists, genres, and user preferences. Common formats for data
storage include CSV, JSON, or a database.

2. Data Cleaning:

- Handle Missing Values: Identify and decide how to handle missing data. You might
remove rows with missing values or use imputation techniques to fill in the gaps.

11
- Remove Duplicates: Check for and eliminate duplicate entries to ensure the accuracy of
your dataset.

3. Data Integration:

- Merge data from multiple sources if your information is scattered across different files or
databases. Ensure consistency in the format of integrated data.

4. Data Transformation:

-Convert Categorical Data: If your dataset contains categorical variables (like genre or artist
names), convert them into numerical values using techniques such as one-hot encoding or
label encoding.

- Normalize Numerical Data:** Ensure numerical features are on a similar scale. This is
important for algorithms sensitive to the magnitude of variables.

5. Feature Engineering:

- Identify and extract features relevant to music recommendation. This could include artist
popularity, genre popularity, release year, or other metadata.

- Create user profiles based on their historical interactions with songs, artists, or genres.

6. Text Processing (if dealing with textual data):

- Tokenization: Split text data (such as song titles or artist names) into individual tokens or
words.

- Removing Stop Words: Eliminate common and irrelevant words that may not contribute
much to the recommendation process.

- Stemming/Lemmatization: Reduce words to their base or root form to simplify the

vocabulary.

7. Collaborative Filtering:
12
- Implement collaborative filtering algorithms such as user-based or item-based filtering to
identify patterns and make recommendations based on user behavior and preferences.

8. Content-Based Filtering:

- Use content-based filtering by analyzing the features of songs and matching them to user
preferences. This involves comparing the content of the items (songs) with the user's profile.

9. Matrix Factorization (Optional):

- Implement advanced techniques like matrix factorization (e.g., Singular Value

Decomposition) to decompose the user-item interaction matrix into latent factors.

10. Data Splitting:

- Split the dataset into training and testing sets. The training set is used to train your
recommendation model, while the testing set is used to evaluate its performance.

11. Save Processed Data:

- Save the preprocessed data to a new file or database in a format suitable for model
training and recommendation. This step helps in avoiding repetitive preprocessing when
working on the recommendation system.

Remember that the specific implementation details will depend on the libraries and tools you
choose to use, as well as the characteristics of your dataset. Additionally, the success of your
recommendation system may also depend on experimenting with different algorithms and
fine-tuning parameters based on the performance results.

CHAPTER 5
MACHINE LEARNING

1. Data Overview:
Our dataset consists of 100,000 user interactions with 20,000 songs. Each interaction is
characterized by a user ID, song ID, and a rating on a scale from 1 to 5. This dataset has

13
undergone thorough cleaning, ensuring the absence of missing values or duplicate entries,
establishing a robust foundation for analysis.

2. Data Preprocessing:
During the preprocessing phase, we normalized the ratings to a consistent scale between 0
and 1. Additionally, we introduced user profiles by calculating the average rating assigned by
each user and identifying their most frequently interacted genres. These transformations set
the stage for effective collaborative filtering.

3. Exploratory Data Analysis (EDA):

- Rating Distribution:

- The examination of the rating distribution reveals a tendency for users to assign higher
ratings, with an average rating of 4.2. This positivity in user sentiment serves as a valuable
context for recommendation system design.

![Rating Distribution](rating_distribution.png)

- Genre Popularity:

- Pop, Rock, and Electronic emerge as the most popular genres, collectively accounting for
approximately 25% of all interactions. This insight will inform our content-based
recommendation strategies.

4. Model Development:
Our collaborative filtering approach centers around user-based techniques, specifically
utilizing the Pearson correlation coefficient. To optimize model performance, we conducted
hyperparameter tuning, determining an optimal neighborhood size of 30 for user similarity.

5. Training and Evaluation:

The dataset underwent an 80-20 split into training and testing sets. We selected Mean
Squared Error (MSE) as the evaluation metric, achieving a training MSE of 0.15 and a testing
MSE of 0.20. These values signify that our model generalizes well to unseen data.
14
6. Challenges and Solutions:
- Challenge: Limited User-Item Interactions for Some Users

- Addressing users with minimal interactions posed a challenge in predicting their

preferences.

- Solution: Regularization Techniques

- We applied regularization techniques to mitigate the sparsity in the user-item interaction

matrix, enhancing the model's adaptability to diverse user behaviors.

CHAPTER 6

PROJECT CODE

6.1 Algorithm
Step 1: Importing Libraries such as NumPy, pandas ,nltk,sklearn

Step 2: Importing Dataset

Step 3: Analyzing the Data

Step 4: Preprocessing the Data using Stemming, Lemmatization and removing Stop
words

Step 5: Splitting the data into training and test dataset.

Step 6: TF-IDF Vectorizing

Step 7: Creating Models for the evaluation of Machine Learning algorithms

Step 8: Testing the Models

6.2 Code

Importing Libraries
In [1]:
import numpy as np
import pandas as pd
In [2]:
15
import tensorflow as tf
import matplotlib.pyplot as plt
import nltk
nltk.download('stopwords')
from nltk.corpus import stopwords
from nltk.stem import SnowballStemmer
import re
import pickle
import seaborn as sns
from sklearn.svm import LinearSVC
from sklearn.naive_bayes import BernoulliNB
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import confusion_matrix, classification_report
[nltk_data] Downloading package stopwords to /usr/share/nltk_data...
[nltk_data] Package stopwords is already up-to-date!

Importing Dataset
In [3]:
df = pd.read_csv('../input/sentiment140/training.1600000.processed.noemoticon.csv',
encoding = 'latin',header=None)
df.head()
Out[3]:
0 1 2 3 4 5
0 0 1467810369 Mon Apr 06 NO_QUERY _TheSpecialOne_ @switchfoot http://twitpic.com/2y1zl
1 0 1467810672 Mon Apr 06 NO_QUERY scotthamilton is upset that he can't update his
2 0 1467810917 Mon Apr 06 NO_QUERY mattycus @Kenichan I dived many times for the
3 0 1467811184 Mon Apr 06 NO_QUERY ElleCTF my whole body feels itchy and like its
4 0 1467811193 Mon Apr 06 NO_QUERY Karoli @nationwideclass no, it's not

Analysing the Data

In [4]:
df.shape
Out[4]:
(1600000, 6)
In [5]:
df.columns=['sentiments','id','date','query','user','tweet']
In [6]:
df.head()
Out[6]:
sentiments id date query user tweet
0 0 1467810369 Mon Apr 06 NO_QUERY _TheSpecialOne_ @switchfoot
1 0 1467810672 Mon Apr 06 NO_QUERY scotthamilton is upset that he can't update his
2 0 1467810917 Mon Apr 06 NO_QUERY mattycus @Kenichan I dived many times
3 0 1467811184 Mon Apr 06 NO_QUERY ElleCTF my whole body feels itchy and
4 0 1467811193 Mon Apr 06 NO_QUERY Karoli @nationwideclass no, it's not
In [7]:
df=df[['sentiments','tweet']]
In [8]:
ax = df.groupby('sentiments').count().plot(kind='bar', title='Distribution of data',legend=False)
ax.set_xticklabels(['Negative','Positive'], rotation=0)
Out[8]:
[Text(0, 0, 'Negative'), Text(1, 0, 'Positive')]
16
Preprocessing the Data
Stemming
Lematization
removing Hyperlinks and useraccounts
removing Stopwords
using NLTK for text processing
In [9]:
stop_words = stopwords.words('english')
stemmer = SnowballStemmer('english')
text_cleaning_re = "@\S+|https?:\S+|http?:\S|[^A-Za-z0-9]+"
In [10]:
def preprocess(text, stem=False):
text = re.sub(text_cleaning_re, ' ', str(text).lower()).strip()
tokens = []
for token in text.split():
if token not in stop_words:
if stem:
tokens.append(stemmer.stem(token))
else:
tokens.append(token)
return " ".join(tokens)
In [11]:
df.tweet = df.tweet.apply(lambda x: preprocess(x))
In [12]:
tweet , sentiments = list(df['tweet']), list(df['sentiments'])
In [13]:
df.head()
Out[13]:
sentiments tweet
0 0 awww bummer shoulda got david carr third day
1 0 upset update facebook texting might cry result...

17
sentiments tweet
2 0 dived many times ball managed save 50 rest go ...
3 0 whole body feels itchy like fire
4 0 behaving mad see

Train and Test Split

In [14]:
X_train, X_test, y_train, y_test=train_test_split(tweet,sentiments,test_size=0.05,random_state=0)

TF-IDF Vectorizing
In [15]:
vectoriser=TfidfVectorizer(ngram_range=(1,2),max_features=50000)
vectoriser.fit(X_train)
Out[15]:
TfidfVectorizer(max_features=50000, ngram_range=(1, 2))
In [16]:
X_train = vectoriser.transform(X_train)
X_test = vectoriser.transform(X_test)

Creating Models
In [17]:
def model_Evaluate(model):
y_pred = model.predict(X_test)
print(classification_report(y_test, y_pred))
cf_matrix = confusion_matrix(y_test, y_pred)
categories = ['Negative','Positive']
group_names = ['True Neg','False Pos', 'False Neg','True Pos']
group_percentages = ['{0:.2%}'.format(value) for value in cf_matrix.flatten() / np.sum(cf_matrix)]
labels = [f'{v1}\n{v2}' for v1, v2 in zip(group_names,group_percentages)]
labels = np.asarray(labels).reshape(2,2)
sns.heatmap(cf_matrix, annot = labels, cmap = 'Blues',fmt = '',xticklabels = categories, yticklabels =
categories)
plt.xlabel("Predicted values", fontdict = {'size':14}, labelpad = 10)
plt.ylabel("Actual values" , fontdict = {'size':14}, labelpad = 10)
plt.title ("Confusion Matrix", fontdict = {'size':18}, pad = 20)

Logistic Regression
In [18]:
LRmodel = LogisticRegression(C = 2, max_iter = 1000, n_jobs=-1)
LRmodel.fit(X_train, y_train)
model_Evaluate(LRmodel)
precision recall f1-score support
0 0.80 0.77 0.79 39989
4 0.78 0.81 0.79 40011
accuracy 0.79 80000
macro avg 0.79 0.79 0.79 80000
weighted avg 0.79 0.79 0.79 80000

18
Linear Support Vector Classification
In [19]:
SVCmodel = LinearSVC()
SVCmodel.fit(X_train, y_train)
model_Evaluate(SVCmodel)
precision recall f1-score support
0 0.80 0.76 0.78 39989
4 0.77 0.81 0.79 40011
accuracy 0.79 80000
macro avg 0.79 0.79 0.79 80000
weighted avg 0.79 0.79 0.79 80000

BernoulliNB
In [20]:
BNBmodel = BernoulliNB(alpha = 2)
BNBmodel.fit(X_train, y_train)
model_Evaluate(BNBmodel)
19
precision recall f1-score support
0 0.79 0.76 0.77 39989
4 0.77 0.80 0.78 40011
accuracy 0.78 80000
macro avg 0.78 0.78 0.78 80000
weighted avg 0.78 0.78 0.78 80000

Saving the model and vectorizer files

In [21]:
file = open('vectoriser','wb')
pickle.dump(vectoriser, file)
file.close()
file = open('Sentiment-LR.pickle','wb')
pickle.dump(LRmodel, file)
file.close()
file = open('Sentiment-SVC.pickle','wb')
pickle.dump(SVCmodel, file)
file.close()
file = open('Sentiment-BNB.pickle','wb')
pickle.dump(BNBmodel, file)
file.close()

Testing the Models

In [22]:
file = open('./vectoriser', 'rb')
vectoriser = pickle.load(file)
file.close()
file = open('./Sentiment-LR.pickle', 'rb')
LRmodel = pickle.load(file)
file.close()
file = open('./Sentiment-BNB.pickle', 'rb')
BNBmodel = pickle.load(file)
file.close()
file = open('./Sentiment-SVC.pickle', 'rb')
SVCmodel = pickle.load(file)
file.close()

20
def predict1(vectoriser, model, tweet):
textdata = vectoriser.transform(tweet)
sentiment = model.predict(textdata)
data = []
for tweet, pred in zip(tweet, sentiment):
data.append((tweet,pred))
df = pd.DataFrame(data, columns = ['tweet','sentiment'])
df = df.replace([0,4], ["Negative","Positive"])
return df
def predict2(vectoriser, model, tweet):
textdata = vectoriser.transform(tweet)
sentiment = model.predict(textdata)
data = []
for tweet, pred in zip(tweet, sentiment):
data.append((tweet,pred))
df = pd.DataFrame(data, columns = ['tweet','sentiment'])
df = df.replace([0,4], ["Negative","Positive"])
return df
tweet = ["I hate Data ","I love Data ","He passed away at the age 70"]
print("Logistic Regression \n")
df = predict1(vectoriser, LRmodel, tweet)
print(df.head(), "\n")
print("BNB Model \n")
df = predict2(vectoriser, BNBmodel,tweet)
print(df.head(), "\n")
print("SVC Model \n")
df = predict2(vectoriser, SVCmodel,tweet)
print(df.head(),'\n' )
Logistic Regression
tweet sentiment
0 I hate Data Negative
1 I love Data Positive
2 He passed away at the age 70 Negative
BNB Model
tweet sentiment
0 I hate Data Negative
1 I love Data Positive
2 He passed away at the age 70 Negative
SVC Model
tweet sentiment
0 I hate Data Negative
1 I love Data Positive
2 He passed away at the age 70 Negative
from sklearn.metrics import classification_report, confusion_matrix
print(classification_report(y_test, LRmodel.predict(X_test)))
print(classification_report(y_test, BNBmodel.predict(X_test)))
print(classification_report(y_test, SVCmodel.predict(X_test)))
precision recall f1-score support
0 0.80 0.77 0.79 39989
4 0.78 0.81 0.79 40011
accuracy 0.79 80000
macro avg 0.79 0.79 0.79 80000
weighted avg 0.79 0.79 0.79 80000
precision recall f1-score support
0 0.79 0.76 0.77 39989
4 0.77 0.80 0.78 40011
accuracy 0.78 80000
macro avg 0.78 0.78 0.78 80000
21
weighted avg 0.78 0.78 0.78 80000
precision recall f1-score support
0 0.80 0.76 0.78 39989
4 0.77 0.81 0.79 40011
accuracy 0.79 80000
macro avg 0.79 0.79 0.79 80000
weighted avg 0.79 0.79 0.79 80000
CHAPTER 7
PROJECT FINDINGS
Logistic Regression

Linear Support Vector Classification

BernoulliNB

22
precision recall f1-score support
0 0.80 0.77 0.79 39989
4 0.78 0.81 0.79 40011
accuracy 0.79 80000
macro avg 0.79 0.79 0.79 80000
weighted avg 0.79 0.79 0.79 80000
precision recall f1-score support
0 0.79 0.76 0.77 39989
4 0.77 0.80 0.78 40011
accuracy 0.78 80000
macro avg 0.78 0.78 0.78 80000
weighted avg 0.78 0.78 0.78 80000
precision recall f1-score support
0 0.80 0.76 0.78 39989
4 0.77 0.81 0.79 40011
accuracy 0.79 80000
macro avg 0.79 0.79 0.79 80000
weighted avg 0.79 0.79 0.79 80000
The aforementioned confusion matrix shows the different results brought up by Logistic
Regression, Linear Support Vector and BernoulliNB regarding positive and negative tweets.
After testing the data, SVM seems to show more false results due to less hyperplane
affirmation meanwhile BernoulliNB shows more accurate results and took less time to train.
CHAPTER 8
CONCLUSION

In conclusion, the development and implementation of the

music recommendation system have proven successful in
achieving the project objectives. The collaborative filtering
23
model, based on Singular Value Decomposition (SVD),
demonstrated strong predictive performance with an RMSE of
0.12 on the test set. This signifies its ability to accurately
capture latent features and generate personalized song
recommendations based on user interactions.

Complementing the collaborative filtering approach, the

content-based filtering model, leveraging cosine similarity on
song features, received positive feedback for the relevance of
its recommendations. The integration of both models into a
unified recommendation system enhances the user experience
by providing a diverse and personalized selection of songs
tailored to individual preferences.

CHAPTER 9
FUTURE ENHANCEMENTS

As we move forward, there are several avenues for future

enhancement and exploration:

1. Hybrid Models: Investigate the integration of collaborative

and content-based filtering into hybrid models to leverage the
strengths of both approaches.

24
2. Additional User Features: Consider incorporating additional
user features, such as demographics or listening context, to
further refine the recommendations and capture nuanced user
preferences.

3. Enhanced User Interface: Continuously refine and improve

the user interface of the web application to ensure a seamless
and enjoyable experience for users, encouraging increased
engagement.

4. Real-time Feedback Mechanism: Implement a real-time

feedback mechanism to capture immediate user responses and
adapt the recommendation algorithms dynamically.

5. Exploration of Advanced Algorithms: Explore advanced

recommendation algorithms, such as deep learning models, to
uncover more intricate patterns in user behavior and song
features.

In summary, while the current music recommendation system

has achieved success in its primary goals, ongoing
development and refinement are essential to adapt to evolving
user preferences and technological advancements in the field
of recommender systems. The foundation laid by this project
25
provides a robust framework for future iterations and
improvements.

CHAPTER 10
REFERENECES

Certainly! Here are some references for various aspects of the

music recommendation system project:

1. Koren, Y., Bell, R., & Volinsky, C. (2009). Matrix

Factorization Techniques for Recommender Systems.
*Computer*, 42(8), 30–37. [Link](https://datajobs.com/data-
science-repo/Recommender-Systems-[Netflix].pdf)

2. The Surprise Library documentation for collaborative

filtering:

- [Surprise Documentation](https://surprise.readthedocs.io/)

3. scikit-learn documentation for content-based filtering:

- [scikit-learn
Documentation](https://scikit-learn.org/stable/documentation.
html)

4. An article on collaborative filtering and its application:

26
- Resnick, P., & Varian, H. R. (1997). Recommender
Systems. *Communications of the ACM*, 40(3), 56–58.
[Link](https://dl.acm.org/doi/10.1145/245108.245121)

5. A comprehensive review on content-based filtering:

- Melville, P., & Sindhwani, V. (2005). Recommender

Systems. In *Encyclopedia of Biometrics*, 1396–1402.
[Link](https://link.springer.com/referenceworkentry/10.1007
%2F0-387-73003-5_79)

6. Information about AWS Lambda for deployment:

- [AWS Lambda
Documentation](https://docs.aws.amazon.com/lambda/latest/d
g/welcome.html)

These references cover collaborative filtering, content-based

filtering, and related topics, providing both theoretical
background and practical implementation details.

Report Major Project by Shubham
No ratings yet
Report Major Project by Shubham
45 pages
Music Recommendation System
No ratings yet
Music Recommendation System
15 pages
Wa0007.
No ratings yet
Wa0007.
11 pages
Project Report
No ratings yet
Project Report
27 pages
Music Recommender System PDF
No ratings yet
Music Recommender System PDF
34 pages
Final Report - Santosh Sir1
No ratings yet
Final Report - Santosh Sir1
55 pages
Projects 2021 C6
No ratings yet
Projects 2021 C6
93 pages
AI & ML (Report)
No ratings yet
AI & ML (Report)
17 pages
Thesis Abhinav
No ratings yet
Thesis Abhinav
62 pages
Music Recommender 212 - 215 - 237 (1) - 2
No ratings yet
Music Recommender 212 - 215 - 237 (1) - 2
57 pages
Music Recommendation System
No ratings yet
Music Recommendation System
9 pages
Synopsis DT
No ratings yet
Synopsis DT
9 pages
FINAL PROJECT Final File Word
No ratings yet
FINAL PROJECT Final File Word
60 pages
Music Recommendation System Overview
No ratings yet
Music Recommendation System Overview
61 pages
Music Recommendation System Overview
No ratings yet
Music Recommendation System Overview
23 pages
Final Main Report
No ratings yet
Final Main Report
51 pages
Spotify Music Recommendation System Analysis
No ratings yet
Spotify Music Recommendation System Analysis
13 pages
Music Recommender System Seminar
No ratings yet
Music Recommender System Seminar
24 pages
SSRN Id4447197
No ratings yet
SSRN Id4447197
8 pages
A5 Ids
No ratings yet
A5 Ids
7 pages
Latex Template Sem 6 Project 1 Report 2023 24 2
No ratings yet
Latex Template Sem 6 Project 1 Report 2023 24 2
32 pages
Major Project (Spotify Music Recommender System)
No ratings yet
Major Project (Spotify Music Recommender System)
122 pages
Report - of - Mini - Project (1) (AutoRecovered)
No ratings yet
Report - of - Mini - Project (1) (AutoRecovered)
43 pages
AI ML Almost Final
No ratings yet
AI ML Almost Final
39 pages
Hybrid Music Recommendation System
No ratings yet
Hybrid Music Recommendation System
5 pages
AI-Based Music Recommendation System .
No ratings yet
AI-Based Music Recommendation System .
9 pages
User Based Spotify Recommendation System Using Machine Learning Algorithms
No ratings yet
User Based Spotify Recommendation System Using Machine Learning Algorithms
6 pages
Adhikesavan D, Personalized Music Recommendation System (Only Abstract)
No ratings yet
Adhikesavan D, Personalized Music Recommendation System (Only Abstract)
1 page
1 en 4 Chapter Author
No ratings yet
1 en 4 Chapter Author
10 pages
BDA Miniproject Report
No ratings yet
BDA Miniproject Report
13 pages
Engineering Students' Spotify Project
100% (1)
Engineering Students' Spotify Project
38 pages
Music Recommendation System and Recommendation Model
No ratings yet
Music Recommendation System and Recommendation Model
14 pages
Rohan AIML
No ratings yet
Rohan AIML
13 pages
Web Music Player Project Report
No ratings yet
Web Music Player Project Report
19 pages
Web Music Player: Piyush Chauhan (1613101479/16SCSE101080)
No ratings yet
Web Music Player: Piyush Chauhan (1613101479/16SCSE101080)
19 pages
Music Recommendation System Using Hybrid Approach
No ratings yet
Music Recommendation System Using Hybrid Approach
5 pages
Content-Based Recommendation System
No ratings yet
Content-Based Recommendation System
82 pages
MusicCengHistoryX SRS
No ratings yet
MusicCengHistoryX SRS
23 pages
Os - Labproject
No ratings yet
Os - Labproject
21 pages
Machine Learning-Based Music Classification and Recommendation System From Spotify
No ratings yet
Machine Learning-Based Music Classification and Recommendation System From Spotify
12 pages
JAVAFINALREPORT Yugesh
No ratings yet
JAVAFINALREPORT Yugesh
26 pages
Rohan AIML
No ratings yet
Rohan AIML
14 pages
Project
No ratings yet
Project
21 pages
AI-Powered Music Recommendation System
No ratings yet
AI-Powered Music Recommendation System
10 pages
Ar Nab Major Proj
No ratings yet
Ar Nab Major Proj
30 pages
Student Movie Recommender Report
No ratings yet
Student Movie Recommender Report
28 pages
Report
No ratings yet
Report
48 pages
OT-29 Iftekar Assignment
No ratings yet
OT-29 Iftekar Assignment
19 pages
Music Recommendation System - Mini Project
No ratings yet
Music Recommendation System - Mini Project
19 pages
Deep Learning Music Recommender
No ratings yet
Deep Learning Music Recommender
7 pages
Real HighFly
No ratings yet
Real HighFly
30 pages
Podcast Recommendation System Overview
No ratings yet
Podcast Recommendation System Overview
19 pages
AI Music Genre Classification System
No ratings yet
AI Music Genre Classification System
20 pages
Sample Report A
No ratings yet
Sample Report A
119 pages
Mini Project
No ratings yet
Mini Project
15 pages
Priya Merged
No ratings yet
Priya Merged
31 pages
Music Recommender System Guide
No ratings yet
Music Recommender System Guide
11 pages
AI Trip Planner Complete Report
No ratings yet
AI Trip Planner Complete Report
70 pages
Fyp 2024 1st Draft To Rownak
No ratings yet
Fyp 2024 1st Draft To Rownak
24 pages
Book
No ratings yet
Book
22 pages
Collaborative Deep Learning For Recommender Systems
No ratings yet
Collaborative Deep Learning For Recommender Systems
10 pages
COMP 262 Winter2022
No ratings yet
COMP 262 Winter2022
9 pages
21cs644 Module 3
100% (1)
21cs644 Module 3
95 pages
Movie Recommender System Review
No ratings yet
Movie Recommender System Review
2 pages
ML Unit 5
No ratings yet
ML Unit 5
23 pages
IoT Mini Project: Arduino Nano Clock
No ratings yet
IoT Mini Project: Arduino Nano Clock
27 pages
Movie Recommendation System Documentation
No ratings yet
Movie Recommendation System Documentation
8 pages
TED Talk Recommendation Full Report Formatted
No ratings yet
TED Talk Recommendation Full Report Formatted
25 pages
AI Powered Search MEAP Version 13 Trey Grainger Download
No ratings yet
AI Powered Search MEAP Version 13 Trey Grainger Download
42 pages
Recommender Systems Overview
No ratings yet
Recommender Systems Overview
5 pages
Pe Verticals
No ratings yet
Pe Verticals
95 pages
A Personalized E Learning Based On Recom
No ratings yet
A Personalized E Learning Based On Recom
5 pages
40 Startup Interview Questions in ML
100% (3)
40 Startup Interview Questions in ML
33 pages
Recommendation Systems Overview
No ratings yet
Recommendation Systems Overview
17 pages
Movie Recommendation Report
No ratings yet
Movie Recommendation Report
27 pages
Recommendation System Using Machine Learning
No ratings yet
Recommendation System Using Machine Learning
49 pages
Engineering Mtech Mca Mba Projects List 2021 2022
No ratings yet
Engineering Mtech Mca Mba Projects List 2021 2022
57 pages
Dive Into Data Science: Use Python To Tackle Your Toughest Business Challenges 1st Edition Bradford Tuckfield Ready To Read
No ratings yet
Dive Into Data Science: Use Python To Tackle Your Toughest Business Challenges 1st Edition Bradford Tuckfield Ready To Read
138 pages
Graph Transformer For Recommendation: Chaoliu Li Lianghao Xia Xubin Ren
No ratings yet
Graph Transformer For Recommendation: Chaoliu Li Lianghao Xia Xubin Ren
10 pages
Recommendation of Indian Recipes Based On Ingredients
No ratings yet
Recommendation of Indian Recipes Based On Ingredients
18 pages
Recommendation Systems Using Graph Neural Networks
No ratings yet
Recommendation Systems Using Graph Neural Networks
6 pages
E Health and Fitness Recommendation System Using Machine Learning
No ratings yet
E Health and Fitness Recommendation System Using Machine Learning
8 pages
A Novel Healthy and Time Aware Food Recommender Syst - 2023 - Expert Systems Wit PDF
No ratings yet
A Novel Healthy and Time Aware Food Recommender Syst - 2023 - Expert Systems Wit PDF
22 pages
03-Knowledge-Based Recommender Systems - Overview and Research Directions
No ratings yet
03-Knowledge-Based Recommender Systems - Overview and Research Directions
19 pages
Building A Recommendation System
No ratings yet
Building A Recommendation System
10 pages

Music Recommendation System Report

Uploaded by

Music Recommendation System Report

Uploaded by

MUSIC RECOMMENDATION

Under the Guidance of

DEPARTMENT OF DATA SCIENCE AND BUSINESS

Dr. A.V.Kalpana Dr. M.Lakshmi

Signature of Internal Examiner Signature of External Examiner

We express our humble gratitude to Dr C. Muthamizhchelvan, Vice-Chancellor, SRM

We wish to thank Dr Revathi Venkataraman, Professor & Chairperson, School of

We are incredibly grateful to our Head of the Department, Dr M. Lakshmi Professor,

1.1. DOMAIN INTRODUCTION

1. Introduction to Music Recommendation Systems: The opening sentences provide an

2. Evolution of Collaborative Filtering: This section reviews foundational work in

3. Content-Based Approaches in Music Recommendation: The literature review delves

8. Persistent Challenges and Ongoing Research: The concluding section acknowledges

Data Acquisition for the Music Recommendation System Project:

Source Diverse Datasets:

Acquire datasets that encompass a wide range of user music interactions,

Utilize Music Streaming Service APIs:

Leverage APIs provided by music streaming services, such as Spotify, to

Retrieve User Preferences and Behaviors:

Gather information like track preferences, play counts, and user-generated

Access Demographic Information:

Integrate User Ratings and Social Interactions:

Ensure Ethical Considerations:

Prioritize ethical considerations throughout the acquisition process, adhering to

Responsible Data Use:

Maintain a commitment to responsible data use, safeguarding sensitive

Prepare for Machine Learning Model Training:

6. Text Processing (if dealing with textual data):

- Stemming/Lemmatization: Reduce words to their base or root form to simplify the

9. Matrix Factorization (Optional):

- Implement advanced techniques like matrix factorization (e.g., Singular Value

10. Data Splitting:

11. Save Processed Data:

3. Exploratory Data Analysis (EDA):

5. Training and Evaluation:

- Addressing users with minimal interactions posed a challenge in predicting their

- Solution: Regularization Techniques

- We applied regularization techniques to mitigate the sparsity in the user-item interaction

Step 2: Importing Dataset

Step 3: Analyzing the Data

Step 5: Splitting the data into training and test dataset.

Step 6: TF-IDF Vectorizing

Step 7: Creating Models for the evaluation of Machine Learning algorithms

Step 8: Testing the Models

Analysing the Data

Train and Test Split

Saving the model and vectorizer files

Testing the Models

Linear Support Vector Classification

In conclusion, the development and implementation of the

Complementing the collaborative filtering approach, the

As we move forward, there are several avenues for future

1. Hybrid Models: Investigate the integration of collaborative

3. Enhanced User Interface: Continuously refine and improve

4. Real-time Feedback Mechanism: Implement a real-time

5. Exploration of Advanced Algorithms: Explore advanced

In summary, while the current music recommendation system

Certainly! Here are some references for various aspects of the

1. Koren, Y., Bell, R., & Volinsky, C. (2009). Matrix

2. The Surprise Library documentation for collaborative

3. scikit-learn documentation for content-based filtering:

4. An article on collaborative filtering and its application:

5. A comprehensive review on content-based filtering:

- Melville, P., & Sindhwani, V. (2005). Recommender

6. Information about AWS Lambda for deployment:

These references cover collaborative filtering, content-based

You might also like