
Movie Recommendation using Graph Neural Networks

Case Study Walkthrough

Introduction:

OTT platforms have become a major part of our lives. In this case study, we use Graph Neural
Networks (GNNs) to suggest movies to the users of these OTT platforms, based on the movies
they have watched and how they rated them. Users prefer recommendations personalized to
their taste, so let's take a look at how we can solve this problem using GNNs.

Topics:

1) Dataset
2) Creating the Graph Network
3) Traversing through the Graph
4) Preparing the data for the Graph Neural Network
5) Working of the Model

Dataset:

We have been given two datasets to work with here:

● First is the “movie” dataset, which includes the name of each movie, its unique ID, and the
genre it belongs to.

● The second one is the "ratings" dataset, which contains the ratings each user has given to
the movies they have watched.

● First, we look at the "ratings" dataset and decide on a rating threshold. We set the
threshold to 5, so any movie a user rated below 5 will not be considered for our activity.
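
As a rough illustration, the loading and filtering step might look like the sketch below. The file and column names follow the MovieLens CSV layout and are assumptions; adjust them to the actual files used in the notebook.

import pandas as pd

# Load the two datasets (assumed MovieLens-style CSVs).
movies = pd.read_csv("movies.csv")    # columns: movieId, title, genres
ratings = pd.read_csv("ratings.csv")  # columns: userId, movieId, rating

# Keep only the ratings that meet the threshold of 5.
min_rating = 5
rated_movies = ratings[ratings["rating"] >= min_rating]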

Creating the Graph Network:

● We group the "ratings" dataset based on which movies have been watched by which user.
Suppose user 1 has seen movie_1, movie_3, movie_6, movie_47, and movie_50, and user 2
has watched movie_1, movie_2, and movie_3. Then the groups would be
(movie_1, movie_3, movie_6, movie_47, movie_50), (movie_1, movie_2, movie_3), and so on.
So, each group represents the movies a user watched and rated at or above our threshold of 5.

● We define two dictionaries, item_frequency and pair_frequency, where item_frequency
records how many users have watched each movie, and pair_frequency records how many
times a certain pair of movies has appeared in the same group.

● We wrap the 'for' loops with the "tqdm()" function because we want to see the progress
of the iterations.

● Then, we want to create a graph whose nodes are the movies. To create edges, we look at
every pair of movies and compute the product of their pointwise mutual information (PMI)
and their pairing frequency. If this product is greater than the minimum edge-weight
threshold we have assigned, we create an edge between those two movies. Now, why are we
creating this graph? We are trying to model which movies are frequently watched together,
based on all the user data. To think of it more intuitively: the higher the weight of the
edge between two movies A and B, the higher the probability of movie B being suggested
after you have watched movie A, and vice versa. A condensed sketch of the grouping,
counting, and edge-creation steps appears below.
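
The following sketch builds on the filtered rated_movies frame from the earlier sketch; the min_weight value of 10 is an assumed placeholder, not the notebook's actual threshold.

import math
from collections import defaultdict
from itertools import combinations

import networkx as nx
from tqdm import tqdm

# One group (list of movie IDs) per user.
movies_per_user = rated_movies.groupby("userId")["movieId"].apply(list)

item_frequency = defaultdict(int)  # how many groups a movie appears in
pair_frequency = defaultdict(int)  # how many groups a pair of movies shares

for movie_ids in tqdm(movies_per_user):
    for movie_id in movie_ids:
        item_frequency[movie_id] += 1
    for pair in combinations(sorted(movie_ids), 2):
        pair_frequency[pair] += 1

# Add an edge whenever PMI * pairing frequency clears the threshold.
D = len(movies_per_user)  # total number of groups
min_weight = 10           # assumed minimum edge weight; tune as needed
graph = nx.Graph()

for (a, b), xy in pair_frequency.items():
    pmi = math.log((xy * D) / (item_frequency[a] * item_frequency[b]))
    weight = pmi * xy
    if weight >= min_weight:
        graph.add_edge(a, b, weight=weight)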

Traversing through the Graph:

● Now that we have the graph, we build a function called "next_step" that performs the
simple operation of traveling to the next node given the one you are currently on, i.e.,
when you have watched a movie, which movies you could consider next.

● Since each node in the graph is likely to have more than one neighbor (judging from the
average degree, which was 57), we have to take a probabilistic approach. In other words,
since we have more than one option for the next step, we assign a probability to each edge
arising from our current node/movie and then make our choice based on those probabilities.

● Now, we have two hyperparameters, p and q, through which we can adjust these
probabilities. One of the current node's neighbors is the previous node, i.e., the movie we
have just come from; we scale down the probability of immediately re-visiting it by
dividing its edge weight by p. If a neighbor has an edge with the previous movie besides
its edge with the current movie, we keep its weight as-is, so we are relatively more likely
to visit it. If a neighbor has an edge with the current node but not with the previous
node, we divide its weight by q, keeping its probability midway between those of the
earlier two scenarios (for typical p, q > 1). So, by adjusting the values of p and q, we
can control the probabilities of these scenarios; a sketch of this logic follows the output
example below.

● Then, we have a function called "random_walk". This function takes five arguments,
namely graph, num_walks, num_steps, p, and q. We have already discussed what p and q do:
they are simply forwarded to the "next_step" function, which "random_walk" calls to choose
each step. The graph argument accepts the graph of movies that we have created. To explain
the purpose of the other two arguments, we need to look at what the function does
altogether: it extracts a certain number of random walks within the graph, each starting
from a random node. The 'num_steps' argument defines the number of steps in each of those
random walks, and 'num_walks' defines how many such walks are collected.

● So, the output of this function would be an array of the nature:

[ [movie_1, movie_2, movie_67, movie_400, ..., movie_90],
  [movie_3, movie_1, movie_60, movie_1000, ..., movie_2],
  ...
]
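
Below is a minimal sketch of both functions under the weighting scheme just described: the edge back to the previous movie is divided by p, edges to neighbors that also connect to the previous movie are kept as-is, and all other edges are divided by q. Details such as how the starting nodes are chosen may differ from the original notebook.

import random

def next_step(graph, previous, current, p, q):
    """Pick the next movie from the current node's neighbors."""
    neighbors = list(graph.neighbors(current))
    weights = []
    for neighbor in neighbors:
        weight = graph[current][neighbor]["weight"]
        if neighbor == previous:
            weights.append(weight / p)  # discourage re-visiting the previous movie
        elif previous is not None and graph.has_edge(neighbor, previous):
            weights.append(weight)      # neighbor also linked to the previous movie
        else:
            weights.append(weight / q)  # neighbor linked only to the current movie
    total = sum(weights)
    probabilities = [w / total for w in weights]
    return random.choices(neighbors, weights=probabilities)[0]

def random_walk(graph, num_walks, num_steps, p, q):
    """Collect num_walks passes of num_steps-long walks over the graph."""
    walks = []
    nodes = list(graph.nodes())
    for _ in range(num_walks):
        random.shuffle(nodes)  # random starting order for this pass
        for node in nodes:
            walk = [node]
            while len(walk) < num_steps:
                previous = walk[-2] if len(walk) > 1 else None
                walk.append(next_step(graph, previous, walk[-1], p, q))
            walks.append(walk)
    return walks

walks = random_walk(graph, num_walks=5, num_steps=10, p=1.0, q=1.0)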

Preparing the data for the Graph Neural Network:

● We have the "generate_examples" function, in which we iterate through all our random
walks and apply the skipgram function from the [Link] module. Let's shed some
light on how the skipgram function works.

● Suppose there is a sentence: "I love to play cricket". If we apply the skipgram function
to it, it returns samples in the format: {word_1, word_2} -> label.

Here, word_1 is a word from the sentence, and word_2 can be any random word from the total
vocabulary. The value of the label becomes '1' if word_2, which has been randomly chosen
from the vocabulary, happens to be a part of the input sentence as well. Otherwise, the
value of the label becomes '0'. Below is an example that can help understand it better.

Input Sentence: ‘I love to play cricket’.

Examples of output samples:
{I, play} -> 1 (since both words are in the input sentence)
{I, cricket} -> 1
{love, play} -> 1
{love, football} -> 0 (since football doesn't appear in the input sentence)

● So, instead of sentences, we use the walks that we generated with the "random_walk"
function. When we run skipgram over these random walks, we get pairs of movies with a
corresponding label: if both movies have been part of the same random walk, the pair is
labeled '1', otherwise '0'. We also have a dictionary called example_weights, in which we
count the number of occurrences of each particular pair of movies.
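
A sketch of this step using the skipgrams utility from Keras is shown below. It assumes the walks contain integer movie indices (skipgrams expects integer sequences) and that total_movies holds the vocabulary size; the remaining names mirror the description above but are otherwise illustrative.

from collections import defaultdict

import numpy as np
from tensorflow.keras.preprocessing.sequence import skipgrams
from tqdm import tqdm

def generate_examples(sequences, window_size, num_negative_samples, vocabulary_size):
    example_weights = defaultdict(int)
    for sequence in tqdm(sequences):
        pairs, labels = skipgrams(
            sequence,
            vocabulary_size=vocabulary_size,
            window_size=window_size,
            negative_samples=num_negative_samples,
        )
        for (movie_a, movie_b), label in zip(pairs, labels):
            pair = (min(movie_a, movie_b), max(movie_a, movie_b))
            example_weights[(pair, label)] += 1  # count repeated examples

    targets, contexts, labels, weights = [], [], [], []
    for ((target, context), label), count in example_weights.items():
        targets.append(target)
        contexts.append(context)
        labels.append(label)
        weights.append(count)
    return (np.array(targets), np.array(contexts),
            np.array(labels), np.array(weights))

targets, contexts, labels, weights = generate_examples(
    walks, window_size=5, num_negative_samples=4, vocabulary_size=total_movies)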

Working of the Model:

● The output of the "generate_examples" function becomes the data on which we train our
neural network model. The first movie in each pair is now called the context movie, the
second is called the target movie, and the corresponding outputs are the labels. So, our
neural network model takes two inputs, target and context, and converts them to their
respective embeddings through its embedding layer.

● Embedding is a process in which we convert a word (or, here, a movie) into a vector with
a limited number of dimensions. The intuition behind embeddings is that similar items will
have similar embeddings. So, if we take the dot product of two similar embeddings, it will
give a higher value than that of two less similar ones.

● The neural network (as we can see in the create_model function) has two layers. One of
them is the embedding layer, which takes the target and context movies and converts them
into target and context embeddings. Then, we have a layer that takes the dot product of
these two embeddings to produce a similarity score, which is trained against the '1'/'0'
labels.

● After this model is trained on the data, we extract the embedding layer of the model,
which is the only matrix we need for our use case.

● Now that we have the embedding matrix after training the model, how do we use it? We
take a set of movies and call them query movies. Our objective is to find the top 5
recommendations for each of them. So, we convert each query movie to its embedding and
find the top 5 most similar movies in the embedding matrix that we extracted from the
trained network. Those 5 similar movies can then be suggested to the user. A compact
sketch follows.
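
In the sketch below, the layer names, the embedding size of 50, the training call, and the query indices are assumptions; the dot layer outputs a raw similarity score (a logit), so the model is trained with binary cross-entropy with from_logits=True.

import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

def create_model(vocabulary_size, embedding_dim):
    target = layers.Input(name="target", shape=(), dtype="int32")
    context = layers.Input(name="context", shape=(), dtype="int32")
    # A single embedding table shared by both inputs.
    embedding = layers.Embedding(vocabulary_size, embedding_dim,
                                 name="item_embeddings")
    # The dot product of the two embeddings is the similarity logit.
    logits = layers.Dot(axes=1)([embedding(target), embedding(context)])
    return tf.keras.Model(inputs=[target, context], outputs=logits)

model = create_model(total_movies, embedding_dim=50)  # total_movies as above
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
)
model.fit([targets, contexts], labels, sample_weight=weights, epochs=10)

# Extract the embedding matrix: the only part we keep for recommendations.
embeddings = model.get_layer("item_embeddings").get_weights()[0]
normalized = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)

# Top-5 most similar movies for each query movie (cosine similarity).
query_indices = np.array([1, 3, 6])                # assumed query movie indices
similarities = normalized[query_indices] @ normalized.T
top_5 = np.argsort(-similarities, axis=1)[:, 1:6]  # index 0 is the movie itself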

Additional Reading Material:

● For Word to Vector Embedding: [Link]

● A comprehensive research paper on node embedding with implementation code: [Link]

