0% found this document useful (0 votes)

24 views11 pages

Stochastic Blockmodel - Ipynb - Colab

Uploaded by

akhil.s18

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views11 pages

Stochastic Blockmodel - Ipynb - Colab

Uploaded by

akhil.s18

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

12/4/24, 8:08 AM Stochastic Blockmodel.

ipynb - Colab

keyboard_arrow_down The Stochastic Blockmodel

import networkx as nx
import numpy as np
import [Link] as plt
import sympy

keyboard_arrow_down The simplest community: a clique

G_clique = nx.from_edgelist([(i,j) for i in range(10) for j in range(10) if i!=j])
[Link](G_clique, pos=nx.circular_layout(G_clique))

# The adjacency matrix is (almost) all ones

A_clique = nx.adjacency_matrix(G_clique).todense()
[Link](A_clique)

# Visualize the adjacency matrix

[Link](A_clique)
[Link]()

[Link] 1/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
<[Link] at 0x167fb20bc20>

keyboard_arrow_down Two communities: the Caveman graph

G_caveman = nx.from_edgelist([(i,j) for i in range(20) for j in range(20) if i!=j and (i-10)*(j-10)>0])
A_caveman = nx.adjacency_matrix(G_caveman).todense()
[Link](A_caveman)

<[Link] at 0x167fca4fec0>

keyboard_arrow_down Suppose you didn't know who lived in which cave

In other words, the nodes were in some random order

[Link](42)
order = [Link](len(A_caveman))
A_caveman2 = A_caveman[order,:][:,order]
[Link](A_caveman2)

[Link] 2/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
<[Link] at 0x167fcad7c80>

keyboard_arrow_down How can we figure out which nodes are in the same cave?
Let's look at a few rows of the adjacency matrix.

fig, axes = [Link](nrows=3, ncols=2, figsize=(12,2))

for i, r in enumerate([0, 2, 4, 5, 15, 12]):
col = i % 2
row = int(i/2)
axes[row, col].matshow(A_caveman2[r:r+1])
axes[row, col].set_title(f"Row {r}")
axes[row, col].[Link].set_ticks([])
axes[row, col].[Link].set_ticks([])

Idea: Run K-Nearest Neighbors clustering with these rows as the feature vectors
It would group rows 0/4/15 into one cluster, and 2/5/12 into another
Clusters = Communities

(We'll improve upon this idea later)

keyboard_arrow_down How would we check if the communities were good?

SUPPOSE someone told us here are the communities.

Maybe by doing K-Nearest Neighbors.

How would we check?

We would reorder the nodes by grouping people from the same club together
Then, we would look at the new adjacency matrix

[Link](order>=10)[0]

[Link] 3/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
array([ 2, 5, 7, 8, 9, 12, 14, 16, 17], dtype=int64)

someone_says_community1 = [0, 1, 3, 4, 6, 10, 11, 13, 15, 18]

someone_says_community2 = [2, 5, 7, 8, 9, 12, 14, 16, 17]

# Reordering the nodes

ordering = [Link]([someone_says_community1, someone_says_community2])
ordering

array([ 0, 1, 3, 4, 6, 10, 11, 13, 15, 18, 2, 5, 7, 8, 9, 12, 14,

16, 17])

# Adjacency matrix with new ordering

A_caveman2_ordered = A_caveman2[ordering][:, ordering]

# What does the reordered adjacency matrix look like?

[Link](A_caveman2_ordered)

<[Link] at 0x167fcb2acf0>

SUPPOSE someone told us here are the communities. How would we check?
IF the memberships are correct, the reordered adjacency matrix is block-structured.

Note: Whether the big block is first or the small block doesn't matter.

keyboard_arrow_down Stochastic Blockmodel: Generalizing the caveman graph

First, we will restate what we did in the caveman graph using new terminology.

n = 10 # number of nodes

# Each node can belong to one of two clubs

clubs = [Link](2, size=n)
clubs

array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])

# interests = club memberships matrix

interests = [Link]((n, 2))
interests[[Link](n), clubs] = 1
interests

array([[0., 1.],
[0., 1.],
[0., 1.],
[1., 0.],
[1., 0.],
[0., 1.],
[Link] 4/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
[0., 1.],
[0., 1.],
[1., 0.],
[0., 1.]])

Each row represents one person

The first number is the person's interest in club #1.
The second number is the interest in club #2.

keyboard_arrow_down From interests to network

Fans of club #1 become friends
Fans of club #2 become friends

club1_fans = interests[:,0] # Everyone's interest in club #1

club1_fans

array([0., 0., 0., 1., 1., 0., 0., 0., 1., 0.])

# A1[i,j] = club1_fans[i] * club1_fans[j]

A1 = [Link](club1_fans, club1_fans)

[Link](A1)

<[Link] at 0x167fde95e50>

club2_fans = interests[:,1] # fans of club #2

club2_fans

array([1., 1., 1., 0., 0., 1., 1., 1., 0., 1.])

A2 = [Link](club2_fans, club2_fans)
[Link](A2)

[Link] 5/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
<[Link] at 0x167fdf3c410>

# All friendships together gives the adjacency matrix A

A = A1 + A2
[Link](A)

<[Link] at 0x167fca65460>

keyboard_arrow_down From interests to network (in one step)

# Same thing, without all the intermediate steps
A = interests @ interests.T # Matrix multiplication
[Link](A)

[Link] 6/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
<[Link] at 0x167fdf3f9b0>

keyboard_arrow_down From network to interests

You see the network. How can you figure out the club memberships?

⇒ To find the right memberships, we need to find the ordering that makes the matrix block-structured.

keyboard_arrow_down Method #1: Communities via modularity

G = nx.from_numpy_array(A)

# Communities via modularity

communities = [Link].louvain_communities(G)
communities

[{0, 1, 2, 5, 6, 7, 9}, {3, 4, 8}]

# Check if the communities give a block-structured matrix

ordering = [Link]([list(x) for x in communities])
[Link](A[ordering][:, ordering])

<[Link] at 0x167fe47d100>

[Link] 7/11
12/4/24, 8:08 AM Stochastic [Link] - Colab

keyboard_arrow_down Method #2: Communities via spectral decomposition

eigenvalues, eigenvectors = [Link](A)
eigenvalues

array([-4.88399708e-16, -4.17365745e-16, -4.44899761e-17, -5.41731251e-34,

3.81435172e-19, 3.71371796e-18, 2.61415001e-16, 4.46874861e-16,
3.00000000e+00, 7.00000000e+00])

The eigenvalues are returned in ascending order.

The two largest ones are 7 and 3; the rest are pretty much 0

Let's look at the last two eigenvectors

fig, ax = [Link](figsize=(10,5))
[Link](eigenvectors[:,-2:])

<[Link] at 0x167fca66300>

Each row corresponds to one node.

Clearly, the rows are of two types

$\Rightarrow$ K-Nearest Neighbors clustering of the rows of the eigenvector matrix

Previously for the caveman: we clustered the rows of the adjacency matrix

from [Link] import KMeans

model = KMeans(n_clusters=2)
[Link](eigenvectors[:,-2:]) # K-Means on the eigenvector rows
predicted_clubs = model.labels_ # Clusters found by K-Means
predicted_clubs

C:\Users\deepay\Miniconda\Lib\site-packages\sklearn\cluster\_kmeans.py:1446: UserWarning: KMeans is known to have a memory l

[Link](
array([0, 0, 0, 1, 1, 0, 0, 0, 1, 0])

predicted_club1_members = [Link](predicted_clubs==0)[0]
predicted_club2_members = [Link](predicted_clubs==1)[0]
print('Predicted clubs', predicted_club1_members, 'and', predicted_club2_members)

Predicted clubs [0 1 2 5 6 7 9] and [3 4 8]

ordering = [Link]([predicted_club1_members, predicted_club2_members])

[Link](A[ordering][:, ordering])

[Link] 8/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
<[Link] at 0x16780407fb0>

keyboard_arrow_down Generalization
Story so far:

all fans of the same club become friends

fans of different clubs do not become friends.

Generalization:

Fans of the same club become friends with probability 0.8 (say)
Fans of different clubs become friends with probability 0.10 (say)

# B = cluster-connection matrix
B = [Link]([[0.8, 0.1], [0.1, 0.8]])
[Link](B)

Previously: A = interests @ interests.T # Adjacency matrix

This gave us the caveman graph

Now: We create a probability matrix, from which we sample the adjacency matrix

P = interests @ B @ interests.T # Probability matrix depends on the cluster-connection matrix B

array([[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.1, 0.1, 0.1, 0.8, 0.8, 0.1, 0.1, 0.1, 0.8, 0.1],
[0.1, 0.1, 0.1, 0.8, 0.8, 0.1, 0.1, 0.1, 0.8, 0.1],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.1, 0.1, 0.1, 0.8, 0.8, 0.1, 0.1, 0.1, 0.8, 0.1],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8]])

A = [Link](1, P) # Friendships are random

array([[1, 0, 1, 0, 0, 1, 0, 1, 0, 1],
[1, 1, 0, 0, 0, 1, 1, 1, 0, 1],
[0, 1, 1, 0, 0, 1, 1, 1, 0, 0],
[0, 0, 1, 1, 1, 0, 0, 1, 0, 0],
[0, 0, 1, 1, 1, 0, 0, 0, 1, 0],
[1, 1, 1, 0, 0, 1, 0, 1, 1, 0],
[1, 1, 1, 0, 0, 1, 1, 1, 0, 0],

[Link] 9/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
[0, 1, 1, 0, 0, 1, 1, 1, 0, 0],
[0, 0, 0, 1, 1, 0, 0, 1, 1, 0],
[0, 1, 1, 0, 0, 1, 1, 0, 0, 1]])

[Link](A)

<[Link] at 0x167808be510>

keyboard_arrow_down Does the block-structure still apply?

ordering = [Link]([[Link](clubs==0)[0], [Link](clubs==1)[0]]) # Actual communities
[Link](A[ordering][:, ordering])

<[Link] at 0x167808e0950>

Still roughly block structured.

In the caveman graph, it was exactly block-structured.

keyboard_arrow_down Finding the Communities

Same ideas as before.

keyboard_arrow_down Method #1: Modularity

[Link] 10/11
12/4/24, 8:08 AM Stochastic [Link] - Colab
G = nx.from_numpy_array(A)
communities = [Link].louvain_communities(G)
communities

[{2}, {3, 4, 8}, {7}, {0, 1, 5, 6, 9}]

Got split into too many communities

# Play with the resolution to get the desired number of communities

communities = [Link].louvain_communities(G, resolution=0.5)
communities

[{3, 4, 8}, {0, 1, 2, 5, 6, 7, 9}]

ordering = [Link]([list(x) for x in communities])

[Link](A[ordering][:, ordering])

<[Link] at 0x167809c6510>

[Link] 11/11

HW 3
No ratings yet
HW 3
12 pages
Dictionary Graph
No ratings yet
Dictionary Graph
5 pages
PhysRevE 83 016107-Accepted
No ratings yet
PhysRevE 83 016107-Accepted
12 pages
SE KMeansClustering
No ratings yet
SE KMeansClustering
21 pages
Prac7 8 9 10
No ratings yet
Prac7 8 9 10
12 pages
Community Detection With Graph Neural Networks
No ratings yet
Community Detection With Graph Neural Networks
16 pages
23MCB0003 Sna 04
No ratings yet
23MCB0003 Sna 04
15 pages
Num Py
No ratings yet
Num Py
34 pages
Stochastic Blockmodels and Community Structure in Networks
No ratings yet
Stochastic Blockmodels and Community Structure in Networks
11 pages
AIML Lab
No ratings yet
AIML Lab
42 pages
Graph Analysis3 Code
No ratings yet
Graph Analysis3 Code
2 pages
From Import Import As Import As From Import From Import From Import From Import
No ratings yet
From Import Import As Import As From Import From Import From Import From Import
9 pages
Inbuilt Kmeans
No ratings yet
Inbuilt Kmeans
3 pages
Spectral Clustering
No ratings yet
Spectral Clustering
5 pages
Numpy Arrays: Creation and Operations
No ratings yet
Numpy Arrays: Creation and Operations
12 pages
Kmeans Clustering
No ratings yet
Kmeans Clustering
3 pages
Ucs813
No ratings yet
Ucs813
4 pages
LecN10 R
No ratings yet
LecN10 R
9 pages
Aiml Lab
No ratings yet
Aiml Lab
19 pages
Code BTL
No ratings yet
Code BTL
4 pages
AI and ML Lab Programs To Print
No ratings yet
AI and ML Lab Programs To Print
22 pages
Workgroup Ass 4
No ratings yet
Workgroup Ass 4
6 pages
Social Network Analysis Unit-3
No ratings yet
Social Network Analysis Unit-3
28 pages
Numpy Data Manipulation Tips 1
No ratings yet
Numpy Data Manipulation Tips 1
13 pages
Tutorial Exercises Clustering - K-Means, Nearest Neighbor and Hierarchical
No ratings yet
Tutorial Exercises Clustering - K-Means, Nearest Neighbor and Hierarchical
7 pages
AIML Lab Manual Final
No ratings yet
AIML Lab Manual Final
43 pages
Problem Set 5
No ratings yet
Problem Set 5
2 pages
Lab07 KMeans Assignment
No ratings yet
Lab07 KMeans Assignment
13 pages
KD Trees
No ratings yet
KD Trees
7 pages
DWM
No ratings yet
DWM
12 pages
AIML Manual V1-6-83
No ratings yet
AIML Manual V1-6-83
78 pages
Math Lab Code for Interpolation Methods
No ratings yet
Math Lab Code for Interpolation Methods
10 pages
Drawback of Standard K-Means Algorithm
No ratings yet
Drawback of Standard K-Means Algorithm
5 pages
FDS Program - Colaboratory
No ratings yet
FDS Program - Colaboratory
4 pages
Exercises695Clus Solution - Doc Exercises695Clus Solution
No ratings yet
Exercises695Clus Solution - Doc Exercises695Clus Solution
7 pages
TensorFlow Tensors for Deep Trading
No ratings yet
TensorFlow Tensors for Deep Trading
9 pages
Unsupervised Learning & Clustering
No ratings yet
Unsupervised Learning & Clustering
102 pages
Downloading Graphs in Jupyter Notebook
No ratings yet
Downloading Graphs in Jupyter Notebook
15 pages
AI Print
No ratings yet
AI Print
14 pages
AI Lab File Main
No ratings yet
AI Lab File Main
18 pages
2nd Year
No ratings yet
2nd Year
83 pages
Untitled Document
No ratings yet
Untitled Document
7 pages
AI LAB Contents
No ratings yet
AI LAB Contents
19 pages
Oxford SC2 Transcribed Notes
No ratings yet
Oxford SC2 Transcribed Notes
42 pages
Graph Search Algorithms and Classifiers
No ratings yet
Graph Search Algorithms and Classifiers
20 pages
Session 13 Numpy Fundamentals
No ratings yet
Session 13 Numpy Fundamentals
14 pages
Pandas
No ratings yet
Pandas
9 pages
Intro To Numpy With Examples
No ratings yet
Intro To Numpy With Examples
60 pages
Artificial Intelligence Lab Manual
No ratings yet
Artificial Intelligence Lab Manual
35 pages
Intro To Data Science
No ratings yet
Intro To Data Science
47 pages
Uninformed & Informed Search Algorithms
No ratings yet
Uninformed & Informed Search Algorithms
17 pages
Second Midterm
No ratings yet
Second Midterm
7 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
Unsupervised ML: Clustering Guide
No ratings yet
Unsupervised ML: Clustering Guide
10 pages
Ai Lab Report 04
No ratings yet
Ai Lab Report 04
7 pages
Mini-Project #6: Instructions
No ratings yet
Mini-Project #6: Instructions
3 pages
Understanding GroupBy in Pandas
No ratings yet
Understanding GroupBy in Pandas
32 pages
Dsintro RST
No ratings yet
Dsintro RST
15 pages
Introduction to Pandas Categorical Data
No ratings yet
Introduction to Pandas Categorical Data
22 pages
Boolean RST
No ratings yet
Boolean RST
2 pages
Style Ipynb
No ratings yet
Style Ipynb
42 pages
B.Tech. Second Year III Semester Syllabus
No ratings yet
B.Tech. Second Year III Semester Syllabus
17 pages
Matroid Theory and Optimization Techniques
No ratings yet
Matroid Theory and Optimization Techniques
6 pages
Graph Algorithms
No ratings yet
Graph Algorithms
45 pages
Tinh Closenness Centrality
No ratings yet
Tinh Closenness Centrality
5 pages
Parallel Dijkstra for Large Graphs
No ratings yet
Parallel Dijkstra for Large Graphs
31 pages
AI Agents: A Comprehensive Guide
No ratings yet
AI Agents: A Comprehensive Guide
63 pages
Graph Theory Concepts and Theorems
No ratings yet
Graph Theory Concepts and Theorems
18 pages
Graph Algorithms: Matrices & Shortest Paths
No ratings yet
Graph Algorithms: Matrices & Shortest Paths
4 pages
Relation Worksheet
No ratings yet
Relation Worksheet
4 pages
AI Problem Solving for Students
No ratings yet
AI Problem Solving for Students
117 pages
1615888543RME - Detail Syllabus PhD-2020
No ratings yet
1615888543RME - Detail Syllabus PhD-2020
28 pages
Math in The Modern World Reviewer
No ratings yet
Math in The Modern World Reviewer
13 pages
CS 180 Midterm Exam 2010
No ratings yet
CS 180 Midterm Exam 2010
6 pages
HYPERBOLA
No ratings yet
HYPERBOLA
18 pages
Unit 7
No ratings yet
Unit 7
9 pages
ITC TD3 Graph Thory
No ratings yet
ITC TD3 Graph Thory
2 pages
MST Final
No ratings yet
MST Final
51 pages
Artificial Intelligence Chapter 3: Solving Problems by Searching
No ratings yet
Artificial Intelligence Chapter 3: Solving Problems by Searching
63 pages
Graph Theory Proofs and Problems
No ratings yet
Graph Theory Proofs and Problems
4 pages
Ii Year - Daa - Lab - Manual
No ratings yet
Ii Year - Daa - Lab - Manual
70 pages
Quiz#03: Q1. What Is A and Its Concepts?
No ratings yet
Quiz#03: Q1. What Is A and Its Concepts?
4 pages
Vls I 04082014
No ratings yet
Vls I 04082014
51 pages
Spherical Fuzzy Graph Application in Traffic
No ratings yet
Spherical Fuzzy Graph Application in Traffic
7 pages
Exercise Sheet 9
No ratings yet
Exercise Sheet 9
2 pages
Adsa QB
No ratings yet
Adsa QB
5 pages
5 - Network Flow Problems
No ratings yet
5 - Network Flow Problems
30 pages
The University of Calgary
No ratings yet
The University of Calgary
220 pages
Wallace1994 Article SpaceVariantImageProcessing
No ratings yet
Wallace1994 Article SpaceVariantImageProcessing
20 pages
Daa M-4
No ratings yet
Daa M-4
28 pages
Merge Sort and Quick Sort With Sorting Operations With Complexity
No ratings yet
Merge Sort and Quick Sort With Sorting Operations With Complexity
3 pages

Stochastic Blockmodel - Ipynb - Colab

Uploaded by

Stochastic Blockmodel - Ipynb - Colab

Uploaded by

12/4/24, 8:08 AM Stochastic Blockmodel.

keyboard_arrow_down The Stochastic Blockmodel

keyboard_arrow_down The simplest community: a clique

# The adjacency matrix is (almost) all ones

# Visualize the adjacency matrix

keyboard_arrow_down Two communities: the Caveman graph

keyboard_arrow_down Suppose you didn't know who lived in which cave

fig, axes = [Link](nrows=3, ncols=2, figsize=(12,2))

(We'll improve upon this idea later)

keyboard_arrow_down How would we check if the communities were good?

Maybe by doing K-Nearest Neighbors.

someone_says_community1 = [0, 1, 3, 4, 6, 10, 11, 13, 15, 18]

# Reordering the nodes

array([ 0, 1, 3, 4, 6, 10, 11, 13, 15, 18, 2, 5, 7, 8, 9, 12, 14,

# Adjacency matrix with new ordering

# What does the reordered adjacency matrix look like?

keyboard_arrow_down Stochastic Blockmodel: Generalizing the caveman graph

# Each node can belong to one of two clubs

# interests = club memberships matrix

Each row represents one person

keyboard_arrow_down From interests to network

club1_fans = interests[:,0] # Everyone's interest in club #1

# A1[i,j] = club1_fans[i] * club1_fans[j]

club2_fans = interests[:,1] # fans of club #2

# All friendships together gives the adjacency matrix A

keyboard_arrow_down From interests to network (in one step)

keyboard_arrow_down From network to interests

keyboard_arrow_down Method #1: Communities via modularity

# Communities via modularity

[{0, 1, 2, 5, 6, 7, 9}, {3, 4, 8}]

# Check if the communities give a block-structured matrix

keyboard_arrow_down Method #2: Communities via spectral decomposition

array([-4.88399708e-16, -4.17365745e-16, -4.44899761e-17, -5.41731251e-34,

The eigenvalues are returned in ascending order.

Let's look at the last two eigenvectors

Each row corresponds to one node.

$\Rightarrow$ K-Nearest Neighbors clustering of the rows of the eigenvector matrix

from [Link] import KMeans

C:\Users\deepay\Miniconda\Lib\site-packages\sklearn\cluster\_kmeans.py:1446: UserWarning: KMeans is known to have a memory l

Predicted clubs [0 1 2 5 6 7 9] and [3 4 8]

ordering = [Link]([predicted_club1_members, predicted_club2_members])

all fans of the same club become friends

Previously: A = interests @ interests.T # Adjacency matrix

P = interests @ B @ interests.T # Probability matrix depends on the cluster-connection matrix B

A = [Link](1, P) # Friendships are random

keyboard_arrow_down Does the block-structure still apply?

Still roughly block structured.

keyboard_arrow_down Finding the Communities

keyboard_arrow_down Method #1: Modularity

[{2}, {3, 4, 8}, {7}, {0, 1, 5, 6, 9}]

# Play with the resolution to get the desired number of communities

[{3, 4, 8}, {0, 1, 2, 5, 6, 7, 9}]

ordering = [Link]([list(x) for x in communities])

You might also like