Unit 2: Unsupervised Learning
Below is a detailed elaboration of the topics under Unit 2: Unsupervised Learning,
including definitions, algorithms/procedures, applications, and examples for each
concept.
🟩 1. Clustering: K-means / Kernel K-means
🔹 Definition:
K-means Clustering: A popular iterative algorithm that partitions data into k
distinct clusters based on distances (typically Euclidean).
Kernel K-means: An extension of K-means that uses kernel methods to handle
non-linearly separable data by implicitly mapping it into a higher-dimensional feature space.
🔹 Algorithm:
✅ K-means:
1. Choose number of clusters k .
2. Randomly initialize k centroids.
3. Assign each data point to the nearest centroid.
4. Recalculate centroids as the mean of points in each cluster.
5. Repeat steps 3–4 until convergence (centroids don’t change significantly).
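A minimal NumPy sketch of the K-means steps above (function and variable names are illustrative, and the empty-cluster edge case is ignored for brevity):

```python
import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    """Minimal K-means on an (n, d) data array X."""
    rng = np.random.default_rng(seed)
    # Step 2: pick k distinct data points as the initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Step 3: assign every point to its nearest centroid (Euclidean distance)
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 4: recompute each centroid as the mean of its assigned points
        new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        # Step 5: stop once the centroids no longer change significantly
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids
```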
✅ Kernel K-means:
1. Use a kernel function (e.g., RBF) to implicitly map data into a higher-dimensional
feature space.
2. Perform clustering in this new space using inner products defined by the kernel.
3. Update cluster assignments based on kernel-induced distance.
4. Repeat until convergence.
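A sketch of kernel K-means along the same lines, assuming a precomputed RBF Gram matrix. The distance from a point to a cluster's implicit centroid is computed entirely from kernel values, so the feature-space mapping is never formed explicitly (names and the RBF choice are illustrative assumptions):

```python
import numpy as np

def rbf_kernel(X, gamma=1.0):
    # Gram matrix K[i, j] = exp(-gamma * ||x_i - x_j||^2)
    sq = np.sum(X**2, axis=1)
    return np.exp(-gamma * (sq[:, None] + sq[None, :] - 2 * X @ X.T))

def kernel_kmeans(K, k, n_iters=50, seed=0):
    """Kernel K-means on a precomputed (n, n) Gram matrix K."""
    n = K.shape[0]
    rng = np.random.default_rng(seed)
    labels = rng.integers(0, k, size=n)          # random initial assignment
    for _ in range(n_iters):
        dists = np.zeros((n, k))
        for c in range(k):
            members = labels == c
            m = members.sum()
            if m == 0:
                dists[:, c] = np.inf
                continue
            # squared feature-space distance to the centroid of cluster c:
            # K_ii - (2/|c|) * sum_j K_ij + (1/|c|^2) * sum_{j,l} K_jl
            dists[:, c] = (np.diag(K)
                           - 2 * K[:, members].sum(axis=1) / m
                           + K[np.ix_(members, members)].sum() / m**2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```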
🔹 Applications & Examples:
Customer segmentation
Image compression (color quantization)
Document clustering
Anomaly detection
Example: Grouping users by browsing behavior or grouping pixels in an image.
🟩 2. Dimensionality Reduction: PCA and Kernel PCA
🔹 Definition:
PCA (Principal Component Analysis): A linear technique that transforms data
into a lower-dimensional space while preserving maximum variance.
Kernel PCA: A nonlinear extension of PCA that uses the kernel trick to capture complex
patterns.
🔹 Algorithm:
✅ PCA:
1. Standardize the data.
2. Compute covariance matrix.
3. Find eigenvectors and eigenvalues of the covariance matrix.
4. Select top k eigenvectors corresponding to largest eigenvalues.
5. Project original data onto these eigenvectors.
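The PCA steps above can be sketched directly with NumPy (a minimal illustration; names are assumptions, and standardization is reduced to centering here):

```python
import numpy as np

def pca(X, k):
    """Project (n, d) data X onto its top-k principal components."""
    # Step 1: standardize (here simply center; optionally scale by std as well)
    X_centered = X - X.mean(axis=0)
    # Step 2: covariance matrix (d x d)
    cov = np.cov(X_centered, rowvar=False)
    # Step 3: eigendecomposition (eigh, since the covariance matrix is symmetric)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Step 4: keep the k eigenvectors with the largest eigenvalues
    top = np.argsort(eigvals)[::-1][:k]
    components = eigvecs[:, top]
    # Step 5: project the centered data onto those directions
    return X_centered @ components
```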
✅ Kernel PCA:
1. Apply a kernel function (e.g., Gaussian, polynomial) to compute similarity
between data points.
2. Construct Gram matrix (kernel matrix).
3. Center the kernel matrix.
4. Compute eigenvalues and eigenvectors of the centered kernel matrix.
5. Project data into the reduced space using top eigenvectors.
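A corresponding sketch of kernel PCA with an RBF kernel (an illustrative assumption; the centering formula and the scaling of the projections follow the standard kernel PCA derivation):

```python
import numpy as np

def kernel_pca(X, k, gamma=1.0):
    """Project (n, d) data onto top-k nonlinear components via an RBF kernel."""
    n = X.shape[0]
    # Steps 1-2: build the Gram (kernel) matrix
    sq = np.sum(X**2, axis=1)
    K = np.exp(-gamma * (sq[:, None] + sq[None, :] - 2 * X @ X.T))
    # Step 3: center the kernel matrix in feature space
    one_n = np.ones((n, n)) / n
    K_centered = K - one_n @ K - K @ one_n + one_n @ K @ one_n
    # Step 4: eigendecomposition of the centered kernel matrix
    eigvals, eigvecs = np.linalg.eigh(K_centered)
    top = np.argsort(eigvals)[::-1][:k]
    # Step 5: projections are the eigenvectors scaled by sqrt(eigenvalue)
    return eigvecs[:, top] * np.sqrt(np.maximum(eigvals[top], 1e-12))
```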
🔹 Applications & Examples:
Face recognition (Eigenfaces)
Data visualization (as a simpler, linear alternative to t-SNE)
Noise reduction
Feature extraction
Example: Reducing dimensions of gene expression data for analysis.
🟩 3. Matrix Factorization and Matrix Completion
🔹 Definition:
Matrix Factorization: Decomposes a matrix into multiple matrices whose product
approximates the original; often used for latent factor modeling.
Matrix Completion: Fills in the missing entries of a partially observed matrix, assuming a
low-rank structure.
🔹 Algorithm:
✅ Matrix Factorization (e.g., SVD, Alternating Least Squares):
1. Let R ≈ U·Vᵀ, where:
U : user latent feature matrix
V : item latent feature matrix
2. Minimize reconstruction error using optimization techniques like gradient descent
or ALS.
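A minimal sketch of this factorization using stochastic gradient descent on the observed entries (names, learning rate, and the convention that unobserved entries are stored as zeros are all illustrative assumptions):

```python
import numpy as np

def matrix_factorization(R, k, n_iters=200, lr=0.01, reg=0.1, seed=0):
    """Factor R (n_users x n_items) into U @ V.T, fitting only observed entries."""
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    U = rng.normal(scale=0.1, size=(n_users, k))   # user latent features
    V = rng.normal(scale=0.1, size=(n_items, k))   # item latent features
    rows, cols = R.nonzero()                       # observed (non-zero) entries
    for _ in range(n_iters):
        for i, j in zip(rows, cols):
            err = R[i, j] - U[i] @ V[j]            # reconstruction error at (i, j)
            u_i = U[i].copy()
            # gradient steps with L2 regularization
            U[i] += lr * (err * V[j] - reg * U[i])
            V[j] += lr * (err * u_i - reg * V[j])
    return U, V
```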
✅ Matrix Completion:
1. Start with a sparse matrix (e.g., ratings matrix).
2. Assume the matrix has a low rank.
3. Optimize to find a low-rank matrix that fits observed entries.
4. Techniques: Nuclear norm minimization, alternating minimization.
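One simple way to realize these steps is iterative soft-thresholded SVD (in the spirit of SoftImpute, a standard nuclear-norm-style heuristic); the sketch below assumes a boolean mask of observed entries and an illustrative threshold tau:

```python
import numpy as np

def soft_impute(R, mask, tau=5.0, n_iters=100):
    """Fill missing entries of R (mask[i, j] = True where observed)
    with a low-rank estimate obtained by soft-thresholding singular values."""
    X = np.where(mask, R, 0.0)                 # start with zeros at missing entries
    for _ in range(n_iters):
        U, s, Vt = np.linalg.svd(X, full_matrices=False)
        s = np.maximum(s - tau, 0.0)           # shrink singular values (low-rank bias)
        X_low = (U * s) @ Vt                   # low-rank reconstruction
        X = np.where(mask, R, X_low)           # keep observed entries fixed
    return X
```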
🔹 Applications & Examples:
Recommender systems (Netflix Prize)
Topic modeling
Collaborative filtering
Example: Predicting movie ratings a user hasn't seen yet.
🟩 4. Generative Models: Mixture Models and Latent Factor Models
🔹 Definition:
Mixture Models: Probabilistic models that represent subpopulations within the
overall population (e.g., Gaussian Mixture Models).
Latent Factor Models: Models that use hidden variables to explain observed data
(e.g., Probabilistic PCA, Factor Analysis).
🔹 Algorithm:
✅ Mixture Models (GMM + EM Algorithm):
1. Initialize parameters (means, covariances, mixing coefficients).
2. E-step : Estimate probability of each data point belonging to each component.
3. M-step : Update parameters using weighted maximum likelihood.
4. Iterate E-M steps until convergence.
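A compact sketch of EM for a GMM, using SciPy for the Gaussian densities (initialization choices and the small diagonal regularizer are illustrative assumptions):

```python
import numpy as np
from scipy.stats import multivariate_normal

def gmm_em(X, k, n_iters=100, seed=0):
    """Fit a Gaussian Mixture Model to (n, d) data X with the EM algorithm."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    # Step 1: initialize means, covariances, and mixing coefficients
    means = X[rng.choice(n, size=k, replace=False)]
    covs = np.array([np.cov(X, rowvar=False) + 1e-6 * np.eye(d) for _ in range(k)])
    weights = np.full(k, 1.0 / k)
    for _ in range(n_iters):
        # E-step: responsibility of each component for each data point
        resp = np.column_stack([
            weights[c] * multivariate_normal.pdf(X, means[c], covs[c])
            for c in range(k)
        ])
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: weighted maximum-likelihood parameter updates
        Nk = resp.sum(axis=0)
        weights = Nk / n
        means = (resp.T @ X) / Nk[:, None]
        for c in range(k):
            diff = X - means[c]
            covs[c] = (resp[:, c, None] * diff).T @ diff / Nk[c] + 1e-6 * np.eye(d)
    return weights, means, covs
```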
✅ Latent Factor Models:
Probabilistic PCA: Assumes observed data is generated from latent variables via
a linear transformation plus Gaussian noise.
Inference is typically done using Expectation-Maximization or variational Bayes.
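For Probabilistic PCA specifically, the maximum-likelihood solution also has a closed form (Tipping & Bishop): the loading matrix comes from the top eigenvectors of the sample covariance and the noise variance is the mean of the discarded eigenvalues. A small sketch under that assumption:

```python
import numpy as np

def probabilistic_pca(X, k):
    """Closed-form ML Probabilistic PCA: x = W z + mean + noise, z ~ N(0, I)."""
    mean = X.mean(axis=0)
    cov = np.cov(X - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # ML noise variance: average of the discarded (smallest d - k) eigenvalues
    sigma2 = eigvals[k:].mean()
    # ML loading matrix: W = U_k (Lambda_k - sigma^2 I)^(1/2)
    W = eigvecs[:, :k] * np.sqrt(np.maximum(eigvals[:k] - sigma2, 0.0))
    return W, mean, sigma2
```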
🔹 Applications & Examples:
Clustering (via GMMs)
Density estimation
Anomaly detection
Topic modeling (e.g., LDA – Latent Dirichlet Allocation)
Example: Segmenting customer groups or modeling topics in document
collections.
✅ Summary Table
TOPIC | METHOD | TYPE | KEY ALGORITHM | APPLICATION
Clustering | K-means / Kernel K-means | Partitioning | Iterative reassignment | Customer segmentation, image compression
Dimensionality Reduction | PCA / Kernel PCA | Linear / Nonlinear | Eigendecomposition | Face recognition, visualization
Matrix Methods | Matrix Factorization / Completion | Latent space | ALS, Gradient Descent | Recommender systems, collaborative filtering
Generative Models | GMM, LDA | Probabilistic | EM algorithm | Density estimation, topic modeling