MA5232 Modeling and Numerical Simulations
Lecture 2: Iterative Methods for Mixture-Model Segmentation
8 Apr 2015
National University of Singapore
Last time
PCA reduces the dimensionality of a data set while retaining as much of the data variation as possible.
Statistical view: the leading PCs are given by the leading eigenvectors of the covariance matrix.
Geometric view: fitting a d-dimensional subspace model via the SVD.
Extensions of PCA
Probabilistic PCA via MLE
Kernel PCA via kernel functions and kernel matrices
This lecture
Review basic iterative algorithms for central
clustering
Formulation of the subspace segmentation
problem
Segmentation by Clustering
From: Object Recognition as Machine Translation, Duygulu, Barnard, de Freitas, Forsyth, ECCV02
Example 4.1
Euclidean distance-based clustering is not invariant to linear transformations of the data.
The distance metric needs to be adjusted after a linear transformation.
Central Clustering
Assume the data are sampled from a mixture of Gaussians.
The classical distance metric between a sample and the mean of the jth cluster is the Mahalanobis distance.
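For reference, the standard form of this distance (a textbook definition, not transcribed from the slide), with \( \mu_j \) and \( \Sigma_j \) the mean and covariance of the jth cluster:
\[
d_j(x) = \sqrt{(x - \mu_j)^{\top} \Sigma_j^{-1} (x - \mu_j)}.
\]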
Central Clustering: K-Means
Assume a map function that assigns each ith sample a label j = c(i).
An optimal clustering minimizes the within-cluster scatter,
i.e., the average distance of all samples to their respective cluster means.
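A standard way to write this objective (assuming labels j = c(i) and cluster means \( \mu_j \), as above):
\[
\min_{c,\ \mu_1,\dots,\mu_K} \ \frac{1}{n}\sum_{i=1}^{n} \big\| x_i - \mu_{c(i)} \big\|^2 .
\]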
Central Clustering: K-Means
However, K is user-defined: if K is also allowed to grow, the scatter can be driven to zero by letting each point become a cluster by itself, K = n.
In this chapter, we assume the true K is known.
Algorithm
A chicken-and-egg view: the optimal labels depend on the cluster means, and the optimal means depend on the labels.
Two-Step Iteration
Step 1 (segmentation): assign each sample to its nearest cluster mean.
Step 2 (estimation): recompute each mean as the average of the samples assigned to it.
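A minimal NumPy sketch of this two-step iteration (illustrative only; the function and variable names are mine, not the handout's):

```python
import numpy as np

def kmeans(X, K, n_iter=100, seed=0):
    """Two-step (assign / update) k-means on the rows of X (n x D)."""
    rng = np.random.default_rng(seed)
    # Initialize means with K randomly chosen samples.
    mu = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(n_iter):
        # Step 1 (segmentation): label each sample by its nearest mean.
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        # Step 2 (estimation): recompute each mean as the average of its samples.
        new_mu = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                           else mu[j] for j in range(K)])
        if np.allclose(new_mu, mu):
            break  # converged to a (possibly local) optimum
        mu = new_mu
    return labels, mu
```

Different random seeds give different local optima, which is why one typically keeps the lowest-scatter result over several runs, as discussed on the following slides.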
Example
http://util.io/k-means
Feature Space
Source: K. Grauman
Results of K-Means Clustering
[Figure: an image, clusters on intensity, and clusters on color: k-means clustering using intensity alone and color alone. From Marc Pollefeys, COMP 256, 2003.]
A bad local optimum
Characteristics of K-Means
It is a greedy algorithm and is not guaranteed to converge to the global optimum.
Given fixed initial clusters/Gaussian models, the iterative process is deterministic.
Results may be improved by running k-means multiple times with different starting conditions.
The segmentation-estimation process can be treated as a generalized expectation-maximization (EM) algorithm.
EM Algorithm [Dempster-Laird-Rubin 1977]
Expectation Maximization (EM) estimates the model parameters and the segmentation in a maximum-likelihood (ML) sense.
Assume the samples are independently drawn from a mixture distribution, with the component indicated by a hidden discrete variable z.
The conditional distributions can be Gaussian.
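In symbols (standard mixture notation, with mixing weights \( \pi_j = P(z = j) \)):
\[
p(x \mid \theta) = \sum_{j=1}^{K} \pi_j \, p_j(x \mid \theta_j).
\]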
The Maximum-Likelihood Estimation
The unknown parameters are the mixing weights and the component parameters, θ = (π1, …, πK, θ1, …, θK).
The likelihood function is the product of the mixture density over all samples.
The optimal solution maximizes the log-likelihood.
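With n independent samples, the log-likelihood takes the standard form:
\[
\ell(\theta) = \sum_{i=1}^{n} \log \sum_{j=1}^{K} \pi_j \, p_j(x_i \mid \theta_j).
\]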
The Maximum-Likelihood Estimation
Directly maximizing the log-likelihood function is a high-dimensional nonlinear optimization problem.
Define a new function g(θ, w):
The first term is called the expected complete log-likelihood function;
The second term is the conditional entropy.
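In standard EM notation (a reconstruction following the usual EM lower bound, not a transcription of the slide), with \( w_{ij} \) the probability assigned to \( z_i = j \):
\[
g(\theta, w) = \sum_{i=1}^{n}\sum_{j=1}^{K} w_{ij} \log p(x_i, z_i = j \mid \theta)
\;-\; \sum_{i=1}^{n}\sum_{j=1}^{K} w_{ij} \log w_{ij},
\]
where the first sum is the expected complete log-likelihood and the second term (with its minus sign) is the conditional entropy of w.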
Observation: for fixed θ, maximizing g(θ, w) over w recovers the log-likelihood, with the maximum attained at the posterior w_ij = p(z_i = j | x_i, θ); hence g is a lower bound on the log-likelihood.
The Maximum-Likelihood Estimation
Regard the (incomplete) log-likelihood as a function of two variables, through g(θ, w).
Maximize g iteratively: an E step (maximizing over w), followed by an M step (maximizing over θ).
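In symbols (standard EM form, with t indexing the iteration):
E step: \( w^{t+1} = \arg\max_{w} g(\theta^{t}, w) \), i.e., \( w_{ij}^{t+1} = p(z_i = j \mid x_i, \theta^{t}) \).
M step: \( \theta^{t+1} = \arg\max_{\theta} g(\theta, w^{t+1}) \).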
The iteration converges to a stationary point of the log-likelihood.
Prop 4.2: Update of w (the E step).
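For a mixture model this update is the familiar posterior (responsibility); a standard form, assuming the notation above:
\[
w_{ij}^{t+1} = \frac{\pi_j^{t}\, p_j(x_i \mid \theta_j^{t})}{\sum_{l=1}^{K} \pi_l^{t}\, p_l(x_i \mid \theta_l^{t})}.
\]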
Update of θ (the M step)
Recall the definition of g(θ, w).
Assume w is fixed; then maximize the expected complete log-likelihood.
To maximize the expected complete log-likelihood, as an example, assume each cluster is an isotropic normal distribution.
Eliminate the constant term in the objective.
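For reference, the resulting closed-form updates (standard for isotropic Gaussians; the handout's exact form may differ in detail), where D is the ambient dimension:
\[
\pi_j = \frac{1}{n}\sum_{i=1}^{n} w_{ij}, \qquad
\mu_j = \frac{\sum_{i} w_{ij}\, x_i}{\sum_{i} w_{ij}}, \qquad
\sigma_j^2 = \frac{\sum_{i} w_{ij}\, \|x_i - \mu_j\|^2}{D \sum_{i} w_{ij}}.
\]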
Exer 4.2
Compared to k-means, EM assigns the
samples softly to each cluster according to a
set of probabilities.
EM Algorithm
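A minimal NumPy sketch of the full algorithm for an isotropic Gaussian mixture (illustrative only; the function and variable names are mine, not the handout's):

```python
import numpy as np

def em_gmm_isotropic(X, K, n_iter=100, seed=0):
    """EM for a mixture of K isotropic Gaussians on the rows of X (n x D)."""
    n, D = X.shape
    rng = np.random.default_rng(seed)
    pi = np.full(K, 1.0 / K)                      # mixing weights
    mu = X[rng.choice(n, size=K, replace=False)]  # means
    var = np.full(K, X.var())                     # isotropic variances
    for _ in range(n_iter):
        # E step: responsibilities w_ij = P(z_i = j | x_i, theta).
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)
        logw = np.log(pi) - 0.5 * D * np.log(2 * np.pi * var) - d2 / (2 * var)
        logw -= logw.max(axis=1, keepdims=True)   # for numerical stability
        w = np.exp(logw)
        w /= w.sum(axis=1, keepdims=True)
        # M step: re-estimate pi, mu, var from the soft assignments.
        nj = w.sum(axis=0)
        pi = nj / n
        mu = (w.T @ X) / nj[:, None]
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)
        var = (w * d2).sum(axis=0) / (D * nj)
    return pi, mu, var, w
```

In practice one clamps the variances away from zero (e.g., var = np.maximum(var, 1e-8)) to sidestep the singularity issue discussed later in this lecture.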
Example 4.3: A global maximum may not exist.
Alternative view of EM: Coordinate ascent
[Figure sequence: the EM iterates traced as coordinate ascent, alternately maximizing along the w1 and w2 coordinates.]
Visual example of EM
Potential Problems
Incorrect number of Mixture Components
Singularities
Incorrect Number of Gaussians
Singularities
A minority of the data can have a
disproportionate effect on the model
likelihood.
For example:
GMM example
When a mixture component collapses onto a single point, its mean becomes that point and its variance goes to zero.
Consider the likelihood function as the covariance goes to zero:
the likelihood approaches infinity.
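Concretely, if component j collapses onto a data point \( x_i \) (so \( \mu_j = x_i \)) with isotropic covariance \( \sigma_j^2 I \), that point contributes
\[
\mathcal{N}(x_i \mid \mu_j, \sigma_j^2 I) = \frac{1}{(2\pi \sigma_j^2)^{D/2}} \;\longrightarrow\; \infty
\quad \text{as } \sigma_j \to 0,
\]
so the likelihood is unbounded above and no global maximum exists.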
K-means vs. EM
K-means clustering and EM clustering on an artificial data set ("mouse"). The tendency of k-means to produce equal-sized clusters leads to bad results, while EM benefits from the Gaussian distributions present in the data set.
So far
K-means
Expectation Maximization
Next up
Multiple-Subspace Segmentation
K-subspaces
EM for Subspaces
Multiple-Subspace Segmentation
Given data drawn from a union of K linear subspaces, the goal is to simultaneously segment the points and estimate a basis for each subspace.
K-subspaces
With noise, we minimize the sum of squared distances from each point to its nearest subspace.
Unfortunately, unlike PCA, there is no constructive solution to the above minimization problem. The main difficulty is that the foregoing objective is hybrid: it is a combination of minimization over the continuous variables {Uj} and the discrete variable j.
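A standard way to write this objective (assuming orthonormal bases \( U_j \in \mathbb{R}^{D \times d_j} \); the handout's notation may differ in detail):
\[
\min_{U_1,\dots,U_K} \ \sum_{i=1}^{n} \ \min_{1 \le j \le K} \big\| x_i - U_j U_j^{\top} x_i \big\|^2 .
\]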
K-subspaces
The subspace update: given the segmentation, each basis Uj is re-estimated from the points assigned to subspace j, exactly the same as in PCA.
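A minimal NumPy sketch of the K-subspaces alternation (illustrative names; this sketch assumes linear subspaces through the origin and a common dimension d, which are assumptions of the sketch, not the handout):

```python
import numpy as np

def k_subspaces(X, K, d, n_iter=50, seed=0):
    """Fit K linear subspaces of dimension d to the rows of X (n x D)."""
    n, D = X.shape
    rng = np.random.default_rng(seed)
    labels = rng.integers(K, size=n)  # random initial segmentation
    U = [np.linalg.qr(rng.standard_normal((D, d)))[0] for _ in range(K)]
    for _ in range(n_iter):
        # Subspace update: refit each basis by PCA/SVD of its assigned points.
        for j in range(K):
            Xj = X[labels == j]
            if len(Xj) >= d:
                # The leading d right singular vectors span the best-fit subspace.
                U[j] = np.linalg.svd(Xj, full_matrices=False)[2][:d].T
        # Segmentation update: assign each point to the nearest subspace,
        # measured by the squared residual ||x - U_j U_j^T x||^2.
        resid = np.stack([((X - X @ Uj @ Uj.T) ** 2).sum(axis=1) for Uj in U])
        new_labels = resid.argmin(axis=0)
        if np.array_equal(new_labels, labels):
            break  # hard assignments stopped changing
        labels = new_labels
    return labels, U
```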
EM for Subspaces
In the E step, the expected membership of each sample in each subspace is computed; in the M step, the subspace parameters are re-estimated given these expected memberships.
Relationship between K-subspaces and EM
At each iteration:
The K-subspaces algorithm gives a definite (hard) assignment of every data point to one of the subspaces;
The EM algorithm views the membership as a random variable and uses its expected value to give a probabilistic (soft) assignment of the data point.
Homework
Read the handout, Chapter 4: Iterative Methods for Multiple-Subspace Segmentation.
Complete Exercise 4.2 (page 111) of the handout.