The Gaussian Distribution
Machine Learning and Pattern Recognition
Chris Williams
School of Informatics, University of Edinburgh
August 2014
(All of the slides in this course have been adapted from
previous versions by Charles Sutton, Amos Storkey, David Barber.)
Outline
A useful model for real-valued quantities
Univariate Gaussian
Multivariate Gaussian
Maximum likelihood estimation
Class conditional classification
Reading: Murphy 4.1.2, 4.1.3 (without proof), 4.2 up to end
of 4.2.1; or Barber 8.4 up to start of 8.4.1 and 8.8 up to start
of 8.8.2.
The Gaussian Distribution
The Gaussian distribution is one of the most common
distributions over continuous variables.
The one-dimensional Gaussian distribution is given by

$$P(x \mid \mu, \sigma^2) = \mathcal{N}(x; \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$$

$x \sim \mathcal{N}(\mu, \sigma^2)$ ($x$ is distributed as ...).
$\mu$ is the mean of the Gaussian and $\sigma^2$ is the variance.
If $\mu = 0$ and $\sigma^2 = 1$ then $\mathcal{N}(x; \mu, \sigma^2)$ is called a standard Gaussian.
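As a minimal sketch, this density can be evaluated directly in Python with numpy (the function name gauss_pdf and the test point are our choices, not from the slides):

```python
import numpy as np

def gauss_pdf(x, mu, sigma2):
    """Univariate Gaussian density N(x; mu, sigma^2)."""
    return np.exp(-(x - mu) ** 2 / (2 * sigma2)) / np.sqrt(2 * np.pi * sigma2)

# Standard Gaussian at x = 0: the peak, 1/sqrt(2*pi) ~ 0.3989
print(gauss_pdf(0.0, mu=0.0, sigma2=1.0))
```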
Plot
[Figure: density curve of a standard one-dimensional Gaussian distribution]
All Gaussians have the same shape subject to scaling and
displacement.
If $x$ is distributed $\mathcal{N}(\mu, \sigma^2)$, then $y = (x - \mu)/\sigma$ is distributed $\mathcal{N}(0, 1)$.
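A quick numpy check of this standardization (the seed, sample size, and parameters $\mu = 3$, $\sigma = 2$ are our choices):

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma = 3.0, 2.0
x = rng.normal(mu, sigma, size=100_000)  # x ~ N(3, 4)

y = (x - mu) / sigma                     # standardize
print(y.mean(), y.std())                 # close to 0 and 1
```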
Normalization
Remember all distributions must integrate to one. The $\sqrt{2\pi\sigma^2}$ is called a normalization constant; it ensures this is the case.
Hence tighter Gaussians have higher peaks:
[Figure: Gaussian densities with different variances]
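To see the normalization numerically, a small numpy check (the grid and the three variances are arbitrary choices): the area under each density stays at one while the peak height grows as the variance shrinks.

```python
import numpy as np

x = np.linspace(-10.0, 10.0, 10_001)
dx = x[1] - x[0]
for sigma2 in (0.25, 1.0, 4.0):
    p = np.exp(-x**2 / (2 * sigma2)) / np.sqrt(2 * np.pi * sigma2)
    # Riemann-sum area is ~1 for every variance; the peak is 1/sqrt(2*pi*sigma2)
    print(sigma2, p.sum() * dx, p.max())
```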
Maximum Likelihood Estimation
Maximum likelihood: set $\beta = 1/\sigma^2$ and take derivatives:

$$\log P(X \mid \mu, \beta) = \frac{N}{2}\log\beta - \frac{N}{2}\log(2\pi) - \frac{\beta}{2}\sum_n (x^n - \mu)^2$$

$$\frac{\partial}{\partial\mu}\log P(X \mid \mu, \beta) = \beta\sum_n (x^n - \mu)$$

$$\frac{\partial}{\partial\beta}\log P(X \mid \mu, \beta) = \frac{N}{2\beta} - \frac{1}{2}\sum_n (x^n - \mu)^2$$

Hence equating derivatives to zero:

$$\mu = \frac{1}{N}\sum_n x^n \quad\text{and}\quad \sigma^2 = \frac{1}{N}\sum_n (x^n - \mu)^2.$$
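A numpy sketch of these two estimators on synthetic data (the seed, sample size, and true parameters $\mu = 2$, $\sigma = 3$ are our choices):

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(2.0, 3.0, size=50_000)    # draws from N(mu=2, sigma^2=9)

mu_hat = x.mean()                        # (1/N) sum_n x^n
sigma2_hat = ((x - mu_hat) ** 2).mean()  # (1/N) sum_n (x^n - mu_hat)^2
print(mu_hat, sigma2_hat)                # close to 2 and 9
```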
Multivariate Gaussian
The vector $\mathbf{x}$ is multivariate Gaussian if, for mean $\boldsymbol{\mu}$ and covariance matrix $\Sigma$, it is distributed according to

$$P(\mathbf{x} \mid \boldsymbol{\mu}, \Sigma) = \frac{1}{|2\pi\Sigma|^{1/2}} \exp\left(-\frac{1}{2}(\mathbf{x} - \boldsymbol{\mu})^T \Sigma^{-1} (\mathbf{x} - \boldsymbol{\mu})\right)$$
The univariate Gaussian is a special case of this.
Shorthand: $\mathbf{x} \sim \mathcal{N}(\boldsymbol{\mu}, \Sigma)$
$\Sigma$ is called a covariance matrix, i.e., each element satisfies $\Sigma_{ij} = \mathrm{Cov}(X_i, X_j)$, where
$$\mathrm{Cov}(X_i, X_j) = E[(X_i - \mu_i)(X_j - \mu_j)]$$
$\Sigma$ must be symmetric and positive definite (a minimal sketch of the density follows below).
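A minimal numpy sketch of this density; the function name mvn_pdf and the toy parameter values are ours, and scipy.stats.multivariate_normal.pdf can serve as a cross-check:

```python
import numpy as np

def mvn_pdf(x, mu, Sigma):
    """Multivariate Gaussian density N(x; mu, Sigma), as in the formula above."""
    d = x - mu
    quad = d @ np.linalg.solve(Sigma, d)              # (x - mu)^T Sigma^{-1} (x - mu)
    norm = np.sqrt(np.linalg.det(2 * np.pi * Sigma))  # |2 pi Sigma|^{1/2}
    return np.exp(-0.5 * quad) / norm

mu = np.array([0.0, 0.0])
Sigma = np.array([[2.0, 0.5], [0.5, 1.0]])
print(mvn_pdf(np.array([0.0, 0.0]), mu, Sigma))
```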
Multivariate Gaussian: Picture
[Figure: surface plot of a two-dimensional Gaussian density]
Mahalanobis Distance
$$d^2(\mathbf{x}_i, \mathbf{x}_j) = (\mathbf{x}_i - \mathbf{x}_j)^T \Sigma^{-1} (\mathbf{x}_i - \mathbf{x}_j)$$

$d^2(\mathbf{x}_i, \mathbf{x}_j)$ is called the Mahalanobis distance between $\mathbf{x}_i$ and $\mathbf{x}_j$.
If $\Sigma$ is diagonal, the contours of $d^2$ are axis-aligned ellipsoids.
If $\Sigma$ is not diagonal, the contours of $d^2$ are rotated ellipsoids:
$$\Sigma = U \Lambda U^T$$
where $\Lambda$ is diagonal and $U$ is a rotation matrix (the eigendecomposition of $\Sigma$).
$\Sigma$ is positive definite $\Leftrightarrow$ the entries in $\Lambda$ are positive (see the sketch below).
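A short numpy sketch (mahalanobis2 is our name; the covariance values are toy numbers):

```python
import numpy as np

def mahalanobis2(xi, xj, Sigma):
    """Squared Mahalanobis distance (xi - xj)^T Sigma^{-1} (xi - xj)."""
    d = xi - xj
    return d @ np.linalg.solve(Sigma, d)

Sigma = np.array([[2.0, 0.5], [0.5, 1.0]])
print(mahalanobis2(np.array([1.0, 0.0]), np.array([0.0, 0.0]), Sigma))

# Eigendecomposition Sigma = U Lambda U^T: all eigenvalues are positive
# if and only if Sigma is positive definite
lam, U = np.linalg.eigh(Sigma)
print(lam)
```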
Multivariate Gaussian: Maximum Likelihood
The maximum likelihood estimates can be found in the same way:

$$\boldsymbol{\mu} = \frac{1}{N}\sum_{n=1}^{N} \mathbf{x}^n$$

$$\Sigma = \frac{1}{N}\sum_{n=1}^{N} (\mathbf{x}^n - \boldsymbol{\mu})(\mathbf{x}^n - \boldsymbol{\mu})^T$$

Sometimes the Gaussian is parameterized in terms of the precision matrix $\Lambda = \Sigma^{-1}$. The estimators are sketched in numpy below.
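The same estimators on synthetic data (the seed and the true parameters are our choices):

```python
import numpy as np

rng = np.random.default_rng(2)
mu_true = np.array([1.0, -1.0])
Sigma_true = np.array([[2.0, 0.8], [0.8, 1.0]])
X = rng.multivariate_normal(mu_true, Sigma_true, size=20_000)  # rows are x^n

mu_hat = X.mean(axis=0)             # (1/N) sum_n x^n
D = X - mu_hat
Sigma_hat = D.T @ D / len(X)        # (1/N) sum_n (x^n - mu)(x^n - mu)^T
print(mu_hat, Sigma_hat, sep="\n")  # close to mu_true and Sigma_true
```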
Example
The data.
[Figure: scatter plot of the data]
Example
The data. The maximum likelihood fit.
[Figure: the same data with the maximum likelihood Gaussian fit overlaid]
Class conditional classification
Example
Suppose you have variables position and class, where position is a location in D-dimensional space. Suppose you have data D consisting of examples of position and class. If we assume that all the points with a particular class label are Gaussian, describe how, using the data, you could predict the class for a previously unseen position (and give the accuracy of the prediction).
Class conditional classification
Learning: Fit a Gaussian to the data in each class (class-conditional fitting). This gives $p(\text{position} \mid \text{class})$.
Find an estimate for the probability of each class (see last lecture): $p(\text{class})$.
Inference: Given a new position, we can ask: what is the probability of this point being generated by each of the Gaussians?
Better still, give a probability using Bayes' rule:
$$P(\text{class} \mid \text{position}) \propto P(\text{position} \mid \text{class})\, P(\text{class})$$
Then we can get the ratio $P(\text{class}=1 \mid \text{position}) / P(\text{class}=0 \mid \text{position})$.
The decision boundary for two classes is where this ratio is one (both steps are sketched below).
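A sketch of the learning and inference steps, assuming numpy and scipy are available; the names fit and posterior, and the use of the biased (maximum likelihood) covariance, are our own illustrative choices:

```python
import numpy as np
from scipy.stats import multivariate_normal

def fit(X, y):
    """Fit one Gaussian per class, plus class priors, by maximum likelihood."""
    params = {}
    for c in np.unique(y):
        Xc = X[y == c]
        mu = Xc.mean(axis=0)
        Sigma = np.cov(Xc, rowvar=False, bias=True)  # (1/N) ML estimate
        params[c] = (mu, Sigma, len(Xc) / len(X))    # last entry: p(class = c)
    return params

def posterior(params, x):
    """p(class | position) by Bayes' rule, normalized over the classes."""
    joint = {c: multivariate_normal.pdf(x, mu, Sigma) * prior
             for c, (mu, Sigma, prior) in params.items()}
    Z = sum(joint.values())
    return {c: v / Z for c, v in joint.items()}
```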
Key Facts About Gaussians
Sums of independent Gaussian RVs are Gaussian.
Linear Gaussian models are jointly Gaussian. In general, let
$$p(\mathbf{x}) = \mathcal{N}(\mathbf{x} \mid \boldsymbol{\mu}_x, \Sigma_x)$$
$$p(\mathbf{y} \mid \mathbf{x}) = \mathcal{N}(\mathbf{y} \mid A\mathbf{x} + \mathbf{b}, \Sigma_n)$$
Then $p(\mathbf{x}, \mathbf{y})$ is Gaussian, and so is $p(\mathbf{x} \mid \mathbf{y})$. See Murphy 4.3.
If $p(\mathbf{x}, \mathbf{y})$ is a multivariate Gaussian, both the marginals $p(\mathbf{x}), p(\mathbf{y})$ and the conditionals $p(\mathbf{x} \mid \mathbf{y}), p(\mathbf{y} \mid \mathbf{x})$ are Gaussian.
Inference in Gaussian models
Partition the variables into two groups, $\mathbf{x}_1$ and $\mathbf{x}_2$:

$$\boldsymbol{\mu} = \begin{pmatrix} \boldsymbol{\mu}_1 \\ \boldsymbol{\mu}_2 \end{pmatrix}, \qquad \Sigma = \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{pmatrix}$$

Then $p(\mathbf{x}_1 \mid \mathbf{x}_2) = \mathcal{N}(\boldsymbol{\mu}_{1|2}, \Sigma_{1|2})$ with

$$\boldsymbol{\mu}_{1|2} = \boldsymbol{\mu}_1 + \Sigma_{12}\Sigma_{22}^{-1}(\mathbf{x}_2 - \boldsymbol{\mu}_2)$$

$$\Sigma_{1|2} = \Sigma_{11} - \Sigma_{12}\Sigma_{22}^{-1}\Sigma_{21}$$

For a proof see e.g. Section 4.3.4 of Murphy (2012) (not examinable). A numerical sketch follows below.
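A minimal numpy sketch of this conditioning for a two-dimensional joint (all the block values are our own toy numbers):

```python
import numpy as np

# Toy joint over (x1, x2) in the block partition above
mu1, mu2 = np.array([0.0]), np.array([1.0])
S11, S12 = np.array([[2.0]]), np.array([[0.8]])
S21, S22 = S12.T, np.array([[1.0]])

x2 = np.array([2.0])  # observed value of x2

# mu_{1|2} = mu1 + S12 S22^{-1} (x2 - mu2)
mu_cond = mu1 + S12 @ np.linalg.solve(S22, x2 - mu2)
# Sigma_{1|2} = S11 - S12 S22^{-1} S21
S_cond = S11 - S12 @ np.linalg.solve(S22, S21)
print(mu_cond, S_cond)  # parameters of p(x1 | x2)
```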
Summary
A useful model for real-valued quantities
Univariate Gaussian
Multivariate Gaussian
Maximum likelihood estimation
Class conditional classification