Machine Learning

CSE343/CSE543/ECE363/ECE563
Lecture 16 | Take your own notes during lectures
Vinayak Abrol <abrol@iiitd.ac.in>
Gaussian Mixture Model (GMM)

A GMM is a parametric probability density function represented as a weighted sum of Gaussian component densities.

Soft assignment: each data point is assigned to each cluster with a probability.
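For concreteness, a minimal sketch of "weighted sum of Gaussian component densities" and of soft assignment (this code is not from the slides; the component count and all parameter values are made up for illustration):

# Sketch: evaluate a GMM density p(x) = sum_k w_k * N(x; mu_k, Sigma_k) and the
# soft (probabilistic) cluster assignment of a point. Illustrative parameters only.
import numpy as np
from scipy.stats import multivariate_normal

weights = np.array([0.5, 0.3, 0.2])                        # mixture weights w_k (sum to 1)
means = [np.array([0.0, 0.0]), np.array([3.0, 3.0]), np.array([-3.0, 2.0])]
covs = [np.eye(2), 0.5 * np.eye(2), np.diag([1.0, 2.0])]   # component covariances Sigma_k

def gmm_density(x, weights, means, covs):
    """p(x) = sum_k w_k * N(x; mu_k, Sigma_k)."""
    return sum(w * multivariate_normal.pdf(x, mean=m, cov=c)
               for w, m, c in zip(weights, means, covs))

def soft_assignment(x, weights, means, covs):
    """Probability of each cluster given x (soft assignment)."""
    comp = np.array([w * multivariate_normal.pdf(x, mean=m, cov=c)
                     for w, m, c in zip(weights, means, covs)])
    return comp / comp.sum()

x = np.array([1.0, 1.0])
print(gmm_density(x, weights, means, covs))
print(soft_assignment(x, weights, means, covs))  # three probabilities summing to 1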
GMM: Mathematical Model

Assignment: wk is the probability that point i's cluster is k. It follows a discrete probability distribution. The wk are called mixture weights.

Generation: given the cluster assignment, generate each example from its cluster's distribution.
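A minimal sketch of this two-step generative process (not from the slides; it reuses the illustrative parameters from the sketch above):

# Sketch: sample from a GMM by (1) drawing a cluster assignment z_i from the
# discrete distribution given by the mixture weights, then (2) generating x_i
# from that cluster's Gaussian. Illustrative parameters only.
import numpy as np

rng = np.random.default_rng(0)
weights = np.array([0.5, 0.3, 0.2])
means = np.array([[0.0, 0.0], [3.0, 3.0], [-3.0, 2.0]])
covs = np.stack([np.eye(2), 0.5 * np.eye(2), np.diag([1.0, 2.0])])

def sample_gmm(n, weights, means, covs, rng):
    z = rng.choice(len(weights), size=n, p=weights)            # assignment step
    x = np.array([rng.multivariate_normal(means[k], covs[k])   # generation step
                  for k in z])
    return x, z

X, z = sample_gmm(500, weights, means, covs, rng)
print(X.shape, np.bincount(z))  # cluster counts roughly follow the mixture weights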
GMM: The Likelihood Function

Likelihood and the joint probability distribution

Recall the lectures on Linear Regression and MLE/MAP!

We already know the mathematical trick.
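For reference, the standard form of the GMM density and its log-likelihood (reconstructed here because the slide's equations are not in this export; notation follows the slides):

% GMM density and log-likelihood (standard form, reconstructed).
p(x_i \mid \theta) = \sum_{k=1}^{K} w_k \, \mathcal{N}(x_i ; \mu_k, \Sigma_k),
\qquad \theta = \{w_k, \mu_k, \Sigma_k\}_{k=1}^{K}

\log p(X \mid \theta) = \sum_{i=1}^{N} \log \sum_{k=1}^{K} w_k \, \mathcal{N}(x_i ; \mu_k, \Sigma_k)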


GMM: The Likelihood Function

In this case we cannot pass the log through the sum.

This problem occurs in many ML formulations when we have latent (hidden) variables, e.g., the cluster assignment here.

Note that if you knew the zi's (the cluster assignments), there would be no inner sum, and the issue with the sum and the log would go away!

The mathematical tool to solve this problem is called Expectation-Maximization (EM).

EM is an iterative procedure where we update the zi's and then update µ, Σ, and w.

E-step: compute the cluster assignments (which are probabilistic)

M-step: update θ (which are the clusters' properties)
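To see why knowing the zi's removes the problem, compare the log-likelihood above with the complete-data log-likelihood (a standard-form reconstruction, not copied from the slides): with zi known, the inner sum collapses and the log reaches each Gaussian directly.

% Complete-data log-likelihood: no inner sum, so the log splits into simple terms.
\log p(X, Z \mid \theta)
= \sum_{i=1}^{N} \log \big( w_{z_i} \, \mathcal{N}(x_i ; \mu_{z_i}, \Sigma_{z_i}) \big)
= \sum_{i=1}^{N} \big[ \log w_{z_i} + \log \mathcal{N}(x_i ; \mu_{z_i}, \Sigma_{z_i}) \big]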
EM Algorithm

The main idea in EM is to find a lower bound on the likelihood. Maximizing the lower bound always leads to higher values of the likelihood.

Procedure:
Starting at θt at iteration t, we construct a surrogate lower bound A(θ, θt).
Maximizing A(θ, θt) increases our likelihood, and the maximum occurs at θt+1.
We again construct a surrogate lower bound A(θ, θt+1) and maximize it to reach the next iterate, θt+2, and so on.
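For the GMM, the surrogate the slides call A(θ, θt) is, in the standard formulation (often written Q), the Jensen lower bound built from the posterior cluster probabilities at θt; this is a reconstruction, not copied from the slides:

% Standard EM surrogate lower bound for the GMM (equality holds at theta = theta^t).
A(\theta, \theta^{t}) = \sum_{i=1}^{N} \sum_{k=1}^{K}
    p(z_i = k \mid x_i, \theta^{t})
    \log \frac{w_k \, \mathcal{N}(x_i ; \mu_k, \Sigma_k)}{p(z_i = k \mid x_i, \theta^{t})}
\;\le\; \log p(X \mid \theta)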
Back to GMM

Bayes Rule

E Step: Assignment

This is similar to k-means: we assign each point to a cluster, but probabilistically, using the parameters from iteration t.
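A minimal sketch of this E-step (not the lecture's code; variable names are mine): Bayes rule turns the prior weights and Gaussian likelihoods into responsibilities, i.e., the probabilistic assignments at iteration t.

# Sketch of the E-step: responsibilities r_ik = P(z_i = k | x_i, theta^t).
import numpy as np
from scipy.stats import multivariate_normal

def e_step(X, weights, means, covs):
    """Return an (N, K) matrix of responsibilities; each row sums to 1."""
    N, K = X.shape[0], len(weights)
    r = np.zeros((N, K))
    for k in range(K):
        # Bayes-rule numerator: prior weight times Gaussian likelihood
        r[:, k] = weights[k] * multivariate_normal.pdf(X, mean=means[k], cov=covs[k])
    r /= r.sum(axis=1, keepdims=True)  # normalize over clusters (the evidence term)
    return r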
Back to GMM

M Step: Model Update

sum-log-sum → sum-log-E (convert the sum to an average) → sum-E-log (apply Jensen's inequality) → sum-sum-log (convert the average back to a sum)

Ignoring trivial substitutions and calculations, we get closed-form updates for µ and Σ.
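Those closed-form updates are the standard GMM ones; a reconstruction (the slide's equations are not in this export), with r_ik denoting the responsibilities from the E-step:

% Standard M-step updates for the means and covariances.
N_k = \sum_{i=1}^{N} r_{ik}, \qquad
\mu_k = \frac{1}{N_k} \sum_{i=1}^{N} r_{ik} \, x_i, \qquad
\Sigma_k = \frac{1}{N_k} \sum_{i=1}^{N} r_{ik} \, (x_i - \mu_k)(x_i - \mu_k)^{\top}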


Back to GMM

M Step: Model Update

The update for w is trickier because of the constraint that the mixture weights must sum to one.

And we are back to the lecture on SVMs. Can you guess what we can do?

Method of Lagrange Multipliers

Here we use the index k′ so as not to confuse it with the sum over k.
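A sketch of that Lagrange-multiplier step in standard form (reconstructed, not copied from the slides), keeping the constraint written with the index k′:

% Maximize the w-dependent part of the bound subject to sum_{k'} w_{k'} = 1.
\mathcal{L}(w, \lambda) = \sum_{i=1}^{N} \sum_{k=1}^{K} r_{ik} \log w_k
    + \lambda \Big( 1 - \sum_{k'=1}^{K} w_{k'} \Big)

\frac{\partial \mathcal{L}}{\partial w_k} = \frac{\sum_i r_{ik}}{w_k} - \lambda = 0
\;\Rightarrow\; w_k = \frac{\sum_i r_{ik}}{\lambda}
\;\Rightarrow\; \lambda = N, \quad w_k = \frac{1}{N} \sum_{i=1}^{N} r_{ik}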
GMM Likelihood in Detail
Visualization of EM procedure in GMM
Covariance Matrix: same vs different | full vs diagonal
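These "same vs different, full vs diagonal" choices correspond to how the component covariances are constrained. As one concrete illustration (not from the lecture), scikit-learn's GaussianMixture exposes them through covariance_type; the data below is synthetic.

# 'full'      - each cluster has its own full covariance matrix  (different, full)
# 'tied'      - all clusters share one full covariance matrix    (same, full)
# 'diag'      - each cluster has its own diagonal covariance     (different, diagonal)
# 'spherical' - each cluster has a single variance               (a further restriction)
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = np.vstack([rng.normal([0.0, 0.0], 1.0, size=(200, 2)),
               rng.normal([4.0, 4.0], 0.5, size=(200, 2))])  # synthetic two-blob data

for cov_type in ["full", "tied", "diag", "spherical"]:
    gmm = GaussianMixture(n_components=2, covariance_type=cov_type, random_state=0).fit(X)
    print(cov_type, "average log-likelihood:", gmm.score(X))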
Thanks
