Unsupervised machine learning models are algorithms designed to find patterns or structure in data without predefined labels. Here’s a categorized list of the most commonly used ones:
1. Clustering Algorithms
These algorithms group data points into clusters based on similarity; a minimal K-Means sketch follows the list.
K-Means Clustering
Hierarchical Clustering (e.g., Agglomerative and Divisive)
DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
OPTICS (Ordering Points to Identify Clustering Structure)
Mean Shift
Gaussian Mixture Models (GMM)
Spectral Clustering
Affinity Propagation
Self-Organizing Maps (SOMs)
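As a concrete starting point, here is a minimal clustering sketch using scikit-learn's KMeans; the synthetic three-blob data and the choice of k = 3 are assumptions for illustration only.

```python
# Minimal K-Means sketch on synthetic data (assumes scikit-learn is installed).
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Toy 2-D data with three well-separated groups.
X, _ = make_blobs(n_samples=300, centers=3, random_state=42)

# Fit K-Means with k=3; n_init=10 restarts to reduce sensitivity to
# the initial centroids.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)
labels = kmeans.fit_predict(X)

print(labels[:10])              # cluster index of the first ten points
print(kmeans.cluster_centers_)  # learned centroids
```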
2. Dimensionality Reduction Algorithms
These models reduce the number of features in a dataset while preserving as much of its structure as possible; a PCA sketch follows the list.
Principal Component Analysis (PCA)
Kernel PCA
t-SNE (t-Distributed Stochastic Neighbor Embedding)
UMAP (Uniform Manifold Approximation and Projection)
Factor Analysis
Independent Component Analysis (ICA)
Non-Negative Matrix Factorization (NMF)
Latent Dirichlet Allocation (LDA) (for topic modeling)
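A minimal dimensionality-reduction sketch with scikit-learn's PCA; the 10-feature random data and the choice of 2 components are illustrative assumptions.

```python
# Minimal PCA sketch: project 10-D data down to 2 principal components.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))         # toy data: 200 samples, 10 features

pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)

print(X_2d.shape)                      # (200, 2)
print(pca.explained_variance_ratio_)   # variance captured per component
```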
3. Association Rule Learning
Used to discover relationships between variables in large transactional datasets; a minimal Apriori-style sketch follows the list.
Apriori Algorithm
Eclat
FP-Growth (Frequent Pattern Growth)
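The core Apriori idea, counting itemset support and keeping the itemsets above a minimum-support threshold, fits in a few lines of plain Python. The toy transactions and the 0.5 threshold are made up for illustration; real implementations prune candidates level by level for efficiency.

```python
# From-scratch sketch of Apriori's support counting for itemsets of size 1 and 2.
from itertools import combinations

transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"milk", "butter", "bread"},
    {"milk"},
]
min_support = 0.5  # keep itemsets present in at least half the transactions

items = sorted(set().union(*transactions))
frequent = {}
for size in (1, 2):
    for candidate in combinations(items, size):
        # Support = fraction of transactions containing the whole itemset.
        support = sum(set(candidate) <= t for t in transactions) / len(transactions)
        if support >= min_support:
            frequent[candidate] = support

print(frequent)  # e.g. ('bread', 'milk') survives with support 0.5
```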
4. Anomaly Detection Algorithms
These are used to identify data points that deviate significantly from the majority; an Isolation Forest sketch follows the list.
Isolation Forest
One-Class SVM
Autoencoders (unsupervised variants)
Local Outlier Factor (LOF)
Elliptic Envelope
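A minimal anomaly-detection sketch with scikit-learn's IsolationForest; the contamination rate of 0.05 is an assumed guess at the true outlier fraction.

```python
# Minimal Isolation Forest sketch: flag points that are easy to isolate.
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(1)
X_normal = rng.normal(size=(200, 2))              # dense "normal" bulk
X_outliers = rng.uniform(-8, 8, size=(10, 2))     # scattered extremes
X = np.vstack([X_normal, X_outliers])

iso = IsolationForest(contamination=0.05, random_state=1)
labels = iso.fit_predict(X)   # +1 = inlier, -1 = flagged anomaly

print((labels == -1).sum(), "points flagged as anomalies")
```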
5. Matrix Factorization
Used in recommendation systems and collaborative filtering; a truncated-SVD sketch follows the list.
Singular Value Decomposition (SVD)
Non-Negative Matrix Factorization (NMF)
Alternating Least Squares (ALS)
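A minimal matrix-factorization sketch using NumPy's SVD: keep only the top-k singular values to form a low-rank approximation of a ratings-style matrix, the basic move behind many recommenders. The 4x5 matrix and k = 2 are illustrative assumptions.

```python
# Minimal truncated-SVD sketch: rank-2 approximation of a small matrix.
import numpy as np

R = np.array([[5, 3, 0, 1, 4],
              [4, 0, 0, 1, 3],
              [1, 1, 0, 5, 4],
              [0, 1, 5, 4, 0]], dtype=float)

U, s, Vt = np.linalg.svd(R, full_matrices=False)
k = 2
R_approx = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]  # rank-2 reconstruction

print(np.round(R_approx, 2))  # low-rank approximation of R
```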
6. Generative Models
Used to learn the data distribution and generate new samples that resemble the training data.
Generative Adversarial Networks (GANs)
Variational Autoencoders (VAEs)
Boltzmann Machines (e.g., Restricted Boltzmann Machines)
7. Graph-Based Models
Used to analyze data represented as graph structures; a community-detection sketch follows the list.
Graph Clustering (e.g., Louvain algorithm for community detection)
DeepWalk
Node2Vec
Spectral Graph Algorithms
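A minimal community-detection sketch with networkx; it assumes networkx 2.8 or newer, where louvain_communities is available.

```python
# Minimal Louvain community-detection sketch on a classic small graph.
import networkx as nx

G = nx.karate_club_graph()  # well-known 34-node social network benchmark

communities = nx.community.louvain_communities(G, seed=0)
for i, nodes in enumerate(communities):
    print(f"community {i}: {sorted(nodes)}")
```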
8. Neural Network-Based Approaches
Unsupervised learning techniques built on neural networks; an autoencoder sketch follows the list.
Autoencoders
o Variants: Denoising Autoencoders, Sparse Autoencoders, Contractive Autoencoders
Self-Organizing Maps (SOM)
Contrastive Predictive Coding
Deep Belief Networks (DBNs)
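A minimal fully connected autoencoder sketch in PyTorch (assuming torch is installed); the layer sizes, the 2-D bottleneck, and the random training data are illustrative assumptions.

```python
# Minimal autoencoder: compress 20-D inputs to 2-D and reconstruct them,
# trained purely on reconstruction error (no labels needed).
import torch
import torch.nn as nn

class Autoencoder(nn.Module):
    def __init__(self, dim_in=20, dim_latent=2):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(dim_in, 8), nn.ReLU(),
                                     nn.Linear(8, dim_latent))
        self.decoder = nn.Sequential(nn.Linear(dim_latent, 8), nn.ReLU(),
                                     nn.Linear(8, dim_in))

    def forward(self, x):
        return self.decoder(self.encoder(x))

X = torch.randn(256, 20)     # toy unlabeled data
model = Autoencoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

for _ in range(100):         # plain full-batch training loop
    optimizer.zero_grad()
    loss = loss_fn(model(X), X)   # reconstruction error
    loss.backward()
    optimizer.step()

print(float(loss))           # final reconstruction loss
```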
9. Density Estimation
Used to estimate the probability density function of the data; a KDE sketch follows the list.
Kernel Density Estimation (KDE)
Gaussian Mixture Models (GMMs)
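A minimal density-estimation sketch with scikit-learn's KernelDensity; the Gaussian kernel and a bandwidth of 0.5 are illustrative assumptions.

```python
# Minimal KDE sketch: estimate a 1-D density and evaluate it on a grid.
import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(2)
X = rng.normal(size=(500, 1))   # 1-D toy sample

kde = KernelDensity(kernel="gaussian", bandwidth=0.5).fit(X)

grid = np.linspace(-4, 4, 5).reshape(-1, 1)
log_density = kde.score_samples(grid)   # log p(x) at the grid points
print(np.exp(log_density))              # estimated densities
```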
STRENGTHS AND WEAKNESSES OF THE MODELS
1. Clustering Algorithms
K-Means Clustering
Strengths:
o Simple and efficient for large datasets.
o Easy to interpret.
o Works well when clusters are spherical and equally sized.
Weaknesses:
o Sensitive to the initial centroids.
o Struggles with non-spherical clusters and varying densities.
o Requires specifying the number of clusters (k) beforehand; the elbow heuristic sketched below is a common workaround.
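A minimal sketch of that heuristic, assuming synthetic three-blob data and a candidate range of 1-8: fit K-Means for each k and watch where the inertia (within-cluster sum of squares) stops dropping sharply.

```python
# Elbow-heuristic sketch: inertia flattens once k passes the true cluster count.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=3, random_state=42)

for k in range(1, 9):
    km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
    print(k, round(km.inertia_, 1))
```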
Hierarchical Clustering
Strengths:
o No need to pre-specify the number of clusters.
o Produces a dendrogram that shows how clusters nest within one another.
Weaknesses:
o Computationally expensive for large datasets (not scalable).
o Sensitive to noise and outliers.
DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
Strengths:
o Handles clusters of arbitrary shapes.
o Can detect outliers as noise points.
o No need to specify the number of clusters.
Weaknesses:
o Struggles with varying density clusters.
o Sensitive to its two main parameters, eps and min_samples (see the sketch below).
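A minimal DBSCAN sketch highlighting those two parameters; eps = 0.3 and min_samples = 5 happen to suit this toy two-moons data but would need tuning on real data.

```python
# Minimal DBSCAN sketch on non-spherical (two-moons) data.
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)

db = DBSCAN(eps=0.3, min_samples=5)
labels = db.fit_predict(X)   # label -1 marks points treated as noise

print(set(labels))           # e.g. {0, 1}, plus -1 if any noise is found
```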
OPTICS
Strengths:
o Extends DBSCAN to handle clusters of varying density.
o Better at recovering hierarchical cluster structure.
Weaknesses:
o Computationally more expensive than DBSCAN.
o Parameters are harder to fine-tune.
Mean Shift
Strengths:
o No need to specify the number of clusters.
o Detects clusters of arbitrary shapes.
Weaknesses:
o Computationally intensive for large datasets.
o Bandwidth parameter selection is challenging.
Gaussian Mixture Models (GMM)
Strengths:
o Handles overlapping clusters well.
o Provides probabilistic (soft) cluster assignments, as the sketch below shows.
Weaknesses:
o Assumes data follows a Gaussian distribution.
o Requires specifying the number of components.
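A minimal GMM sketch showing the soft assignments via predict_proba; n_components = 3 matches the synthetic data by construction.

```python
# Minimal GMM sketch: each point gets a membership probability per component.
from sklearn.datasets import make_blobs
from sklearn.mixture import GaussianMixture

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

gmm = GaussianMixture(n_components=3, random_state=0).fit(X)
probs = gmm.predict_proba(X)   # shape (300, 3); rows sum to 1

print(probs[:3].round(3))
```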
Spectral Clustering
Strengths:
o Effective for non-convex clusters.
o Works well with similarity graphs.
Weaknesses:
o Not scalable to large datasets.
o Requires specifying the number of clusters.
Affinity Propagation
Strengths:
o No need to predefine the number of clusters.
o Works well with sparse data.
Weaknesses:
o Computationally expensive.
o Tends to converge to suboptimal solutions for large datasets.
Self-Organizing Maps (SOMs)
Strengths:
o Useful for visualizing high-dimensional data.
o Can learn complex relationships in data.
Weaknesses:
o Convergence can be slow.
o Results depend on initialization and hyperparameters.
2. Dimensionality Reduction Algorithms
Principal Component Analysis (PCA)
Strengths:
o Computationally efficient.
o Works well for linearly correlated features.
Weaknesses:
o Assumes linear relationships.
o Components are linear mixtures of the original features, which hurts interpretability.
Kernel PCA
Strengths:
o Extends PCA to capture non-linear relationships.
o Uses the kernel trick to operate efficiently in high-dimensional feature spaces.
Weaknesses:
o Computationally expensive.
o Choice of kernel parameters affects performance.
t-SNE
Strengths:
o Excellent for visualizing high-dimensional data.
o Preserves local structure.
Weaknesses:
o Computationally expensive.
o Does not preserve global structure.
o Results vary with the perplexity parameter, as the sketch below illustrates.
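A minimal t-SNE sketch run at two perplexity values; on structured data the two embeddings can look noticeably different, which is exactly the sensitivity noted above.

```python
# Minimal t-SNE sketch: embed toy 30-D data at two perplexity settings.
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(3)
X = rng.normal(size=(200, 30))   # toy high-dimensional data

for perplexity in (5, 50):
    emb = TSNE(n_components=2, perplexity=perplexity,
               random_state=3).fit_transform(X)
    print(perplexity, emb.shape)  # a (200, 2) embedding per setting
```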
UMAP
Strengths:
o Faster than t-SNE and preserves global structure better.
o Works well with large datasets.
Weaknesses:
o Sensitive to hyperparameters.
o May not capture fine details in local relationships.
Factor Analysis
Strengths:
o Reduces redundancy by modeling shared latent factors.
o Handles linear dependencies among features.
Weaknesses:
o Assumes Gaussian distribution of data.
o Limited to linear relationships.
Independent Component Analysis (ICA)
Strengths:
o Finds independent components in data.
o Useful in blind source separation (e.g., separating mixed audio signals).
Weaknesses:
o Sensitive to noise and outliers.
o Assumes the sources are statistically independent and non-Gaussian.
Non-Negative Matrix Factorization (NMF)
Strengths:
o Produces interpretable, non-negative features.
o Good for text and image data.
Weaknesses:
o Sensitive to initialization.
o Struggles with non-linear relationships.
3. Association Rule Learning
Apriori Algorithm
Strengths:
o Easy to implement.
o Effective for small datasets.
Weaknesses:
o Computationally expensive for large datasets.
o Requires careful threshold selection.
Eclat
Strengths:
o More efficient than Apriori for large datasets.
o Uses a vertical data format.
Weaknesses:
o Limited scalability for high-dimensional data.
o Parameter tuning can be complex.
FP-Growth
Strengths:
o Efficient and scalable.
o Avoids candidate generation.
Weaknesses:
o Memory-intensive for very large datasets.
o More complex to implement than Apriori.
4. Anomaly Detection Algorithms
Isolation Forest
Strengths:
o Efficient for high-dimensional data.
o Fast and memory-efficient; needs no distance or density measures.
Weaknesses:
o Assumes anomalies are few and measurably different from normal points.
One-Class SVM
Strengths:
o Effective in high-dimensional spaces.
o Can model complex decision boundaries through kernels.
Weaknesses:
o Sensitive to kernel selection.
o Computationally expensive.
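A minimal One-Class SVM sketch; the RBF kernel and nu = 0.05 (an upper bound on the assumed training-outlier fraction) are illustrative choices.

```python
# Minimal One-Class SVM sketch: learn the support of "normal" data, then
# score new points as inliers (+1) or outliers (-1).
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(4)
X_train = rng.normal(size=(200, 2))          # assumed mostly normal data

ocsvm = OneClassSVM(kernel="rbf", nu=0.05, gamma="scale").fit(X_train)

X_new = np.array([[0.0, 0.0], [6.0, 6.0]])   # one typical, one extreme point
print(ocsvm.predict(X_new))                  # expected: [ 1 -1]
```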
Autoencoders
Strengths:
o Capable of learning complex representations.
o Useful for high-dimensional data.
Weaknesses:
o Requires large datasets for training.
o Sensitive to architecture and hyperparameters.
Local Outlier Factor (LOF)
Strengths:
o Detects local anomalies effectively.
o Works well when density varies from region to region.
Weaknesses:
o Computationally expensive.
o Sensitive to the number of neighbors (n_neighbors), as the sketch below shows.
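A minimal LOF sketch run at two n_neighbors settings to make that sensitivity visible.

```python
# Minimal LOF sketch: the set of flagged outliers can shift with n_neighbors.
import numpy as np
from sklearn.neighbors import LocalOutlierFactor

rng = np.random.default_rng(5)
X = np.vstack([rng.normal(size=(200, 2)),          # dense bulk
               rng.uniform(-10, 10, size=(5, 2))]) # scattered extremes

for n in (5, 35):
    labels = LocalOutlierFactor(n_neighbors=n).fit_predict(X)  # -1 = outlier
    print(n, (labels == -1).sum(), "points flagged")
```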
5. Generative Models
Generative Adversarial Networks (GANs)
Strengths:
o Generates realistic synthetic data.
o Handles complex data distributions.
Weaknesses:
o Training is unstable and sensitive to hyperparameters.
o Prone to mode collapse.
Variational Autoencoders (VAEs)
Strengths:
o Produces a smooth, structured latent space.
o Effective for generative tasks.
Weaknesses:
o Reconstructions tend to be blurry compared with GAN outputs.
o Requires careful balance between reconstruction loss and regularization.
Boltzmann Machines
Strengths:
o Capable of learning joint probability distributions.
o Useful for feature extraction.
Weaknesses:
o Computationally expensive to train.
o Limited scalability.
6. Graph-Based Models
Spectral Graph Algorithms
Strengths:
o Effective for graph-structured data.
o Captures community structure through the graph’s spectrum.
Weaknesses:
o Not scalable to very large graphs.
o Requires choosing the number of eigenvectors carefully.