
Engineering Tripos Part IIB FOURTH YEAR

Module 4F10: STATISTICAL PATTERN RECOGNITION


Examples Paper 1
Straightforward questions are marked †
Tripos standard (but not necessarily Tripos length) questions are marked ∗

Bayes Risk
1. In many pattern classification problems, one has the option either to assign the
pattern to one of the c classes, or to reject it as being unrecognizable. If the cost to
reject is not too high, rejection may be a desirable action. Let the cost of classification
be defined as
\[
\lambda(\omega_i|\omega_j) =
\begin{cases}
0 & \omega_i = \omega_j \quad \text{(correct classification)}\\
\lambda_r & \omega_i = \omega_0 \quad \text{(rejection)}\\
\lambda_s & \text{otherwise} \quad \text{(substitution error)}
\end{cases}
\]
Show that for minimum risk classification, the decision rule should associate a test
vector x with class ωi , if P (ωi |x) ≥ P (ωj |x) for all j and P (ωi |x) ≥ 1 − λr /λs , and
reject otherwise.
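The following Python sketch is not part of the original paper; it simply evaluates the decision rule stated above on some made-up posterior values, assuming the posteriors P(ωi|x) are already available.

```python
import numpy as np

def classify_with_reject(posteriors, lam_r, lam_s):
    """Minimum-risk decision with a reject option.

    posteriors : array of P(omega_i | x) for the c classes
    lam_r      : cost of rejection
    lam_s      : cost of a substitution error
    Returns the chosen class index, or None for rejection.
    """
    best = int(np.argmax(posteriors))
    # Reject when even the most probable class falls below 1 - lam_r/lam_s.
    if posteriors[best] < 1.0 - lam_r / lam_s:
        return None
    return best

# Illustrative posteriors and costs (hypothetical values, not from the question).
print(classify_with_reject(np.array([0.55, 0.30, 0.15]), lam_r=0.5, lam_s=1.0))  # -> 0
print(classify_with_reject(np.array([0.40, 0.35, 0.25]), lam_r=0.5, lam_s=1.0))  # -> None (reject)
```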
EM and Mixture Models
2. † For d-dimensional data compare the computational cost of calculating the log-likelihood
with a diagonal covariance matrix Gaussian distribution, a full covariance matrix Gaussian
distribution and an M-component diagonal covariance matrix Gaussian mixture model.
Clearly state any assumptions made.
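As a rough companion to the counting argument (not required by the question), the Python sketch below evaluates the per-sample log-likelihood under the three model types with made-up parameters; the operation counts follow the same pattern as the calculation itself: roughly O(d) for a diagonal Gaussian, O(d²) for a full-covariance Gaussian with a precomputed Cholesky factor, and O(Md) for an M-component diagonal mixture.

```python
import numpy as np

d, M = 5, 3
rng = np.random.default_rng(0)
x = rng.normal(size=d)

# Diagonal-covariance Gaussian: O(d) operations per sample.
mu_d = rng.normal(size=d)
var_d = rng.uniform(0.5, 2.0, size=d)
ll_diag = -0.5 * np.sum(np.log(2 * np.pi * var_d) + (x - mu_d) ** 2 / var_d)

# Full-covariance Gaussian: O(d^2) per sample once the Cholesky factor is cached.
mu_f = rng.normal(size=d)
B = rng.normal(size=(d, d))
Sigma = B @ B.T + d * np.eye(d)        # a valid (positive definite) covariance
L = np.linalg.cholesky(Sigma)          # O(d^3), but done once per model, not per sample
z = np.linalg.solve(L, x - mu_f)       # so that z'z = (x - mu)' Sigma^{-1} (x - mu)
ll_full = -0.5 * (d * np.log(2 * np.pi) + 2 * np.sum(np.log(np.diag(L))) + z @ z)

# M-component diagonal-covariance mixture: O(Md) per sample.
w = np.full(M, 1.0 / M)
mu_m = rng.normal(size=(M, d))
var_m = rng.uniform(0.5, 2.0, size=(M, d))
comp_ll = -0.5 * np.sum(np.log(2 * np.pi * var_m) + (x - mu_m) ** 2 / var_m, axis=1)
ll_mix = np.log(np.sum(w * np.exp(comp_ll)))

print(ll_diag, ll_full, ll_mix)
```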
3. A 1-dimensional 2-component mixture distribution has a common fixed known variance
of 1, initial mean values µ1 = 0, µ2 = 2 and mixture weights c1 = c2 = 0.5.
A data set of 9 training data points is provided:

−1.5, −0.5, 0.1, 0.3, 0.9, 1.3, 1.9, 2.3, 3.0

(a) Calculate the log likelihood of the training data for the mixture distribution with
the initial parameters.
(b) Calculate updated values for the mean and mixture weights for 1 iteration of the
E-M algorithm.
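The sketch below is a numerical check (not part of the paper) of parts (a) and (b), written in Python; it should reproduce the values quoted in the Answers section at the end of this paper.

```python
import numpy as np

x = np.array([-1.5, -0.5, 0.1, 0.3, 0.9, 1.3, 1.9, 2.3, 3.0])
mu = np.array([0.0, 2.0])          # initial means
c = np.array([0.5, 0.5])           # initial mixture weights
var = 1.0                          # common, fixed variance

def gauss(x, mu, var):
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

# (a) log-likelihood under the initial parameters
px = c[0] * gauss(x, mu[0], var) + c[1] * gauss(x, mu[1], var)
print("log-likelihood:", np.sum(np.log(px)))        # approximately -15.302

# (b) one EM iteration
resp = np.stack([c[m] * gauss(x, mu[m], var) for m in range(2)])  # unnormalised responsibilities
resp /= resp.sum(axis=0)                                          # E-step: P(omega_m | x_k)
mu_new = (resp * x).sum(axis=1) / resp.sum(axis=1)                # M-step: updated means
c_new = resp.sum(axis=1) / len(x)                                 # M-step: updated weights
print("means:", mu_new, "weights:", c_new)
```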
4. Consider an M component mixture model of d-dimensional binary data x of the form

\[
p(x) = \sum_{m=1}^{M} P(\omega_m)\, p(x|\omega_m)
\]

where the jth component PDF has parameters λj1 , . . . , λjd and

\[
p(x|\omega_j) = \prod_{i=1}^{d} \lambda_{ji}^{x_i} (1 - \lambda_{ji})^{1 - x_i}
\]

A set of training samples x1 , . . . , xn are used to train the mixture model. Using
the standard form of EM with mixture models show that the maximum likelihood
estimate for the “new” parameters, λ̂ji , is given by
\[
\hat{\lambda}_{ji} = \frac{\sum_{k=1}^{n} P(\omega_j|x_k)\, x_{ki}}{\sum_{k=1}^{n} P(\omega_j|x_k)}
\]

where P (ωj |xk ) is obtained using the “old” model parameters.
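A minimal Python sketch of one EM iteration for this Bernoulli mixture follows; it is an illustration of the update to be shown, run on made-up binary data and made-up starting parameters, not part of the paper.

```python
import numpy as np

def em_step_bernoulli_mixture(X, priors, lam):
    """One EM iteration for an M-component mixture of Bernoullis.

    X      : (n, d) binary data
    priors : (M,) current P(omega_m)
    lam    : (M, d) current Bernoulli parameters lambda_{mi}
    """
    # E-step: P(omega_j | x_k) under the "old" parameters.
    log_px = X @ np.log(lam).T + (1 - X) @ np.log(1 - lam).T   # (n, M)
    log_joint = log_px + np.log(priors)
    log_joint -= log_joint.max(axis=1, keepdims=True)          # for numerical stability
    post = np.exp(log_joint)
    post /= post.sum(axis=1, keepdims=True)                    # (n, M)

    # M-step: posterior-weighted relative frequencies, as in the result to be shown.
    new_lam = (post.T @ X) / post.sum(axis=0)[:, None]
    new_priors = post.mean(axis=0)
    return new_priors, new_lam

# Tiny illustrative run with made-up data and starting values.
rng = np.random.default_rng(1)
X = rng.integers(0, 2, size=(20, 4)).astype(float)
priors = np.array([0.5, 0.5])
lam = rng.uniform(0.3, 0.7, size=(2, 4))
print(em_step_bernoulli_mixture(X, priors, lam))
```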


5. ∗ A series of n independent, noisy measurements are taken, x1 , . . . , xn . The noise
is known to be Gaussian distributed with zero mean and unit variance. The “true”
data is also known to be Gaussian distributed.
(a) Find the maximum likelihood estimates of the mean, µ, and variance, σ 2 , of the
“true” data by equating the gradient to zero.
(b) A latent variable zi is introduced. It is the value of the noise for observation xi .
Show that the posterior probability of zi given the current model parameters is
\[
p(z_i|x_i, \theta) = \mathcal{N}\!\left(z_i;\ \frac{x_i - \mu}{1 + \sigma^2},\ \frac{\sigma^2}{1 + \sigma^2}\right)
\]
Using the expectation-maximisation algorithm derive re-estimation formulae for
the mean, µ, and variance, σ². Show that the iterative estimation scheme for
the mean converges to the correct answer; you may assume that the variance of
the true data is known and fixed at σ².
Discuss the merits of the two optimisation schemes for this task and for optimisation
tasks in general.
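The sketch below is one possible numerical check, not the required derivation: it assumes the E-step posterior quoted in part (b) and an M-step that sets µ to the average of the de-noised observations xi − E[zi|xi] (an assumption of this sketch), and shows the iteration settling on the sample mean, which is the direct maximum likelihood answer from part (a). All data values are made up.

```python
import numpy as np

rng = np.random.default_rng(2)
true_mu, true_var = 1.5, 0.8                     # made-up "true" data parameters
n = 200
# Noisy observations: true data plus zero-mean, unit-variance Gaussian noise.
x = rng.normal(true_mu, np.sqrt(true_var), n) + rng.normal(0.0, 1.0, n)

var = true_var                                   # variance assumed known and fixed
mu = 0.0                                         # initial guess for the mean
for it in range(20):
    Ez = (x - mu) / (1.0 + var)                  # E-step: posterior mean of the noise z_i
    mu = np.mean(x - Ez)                         # M-step: average of de-noised observations
    print(it, mu)

print("direct ML estimate (sample mean):", x.mean())
```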
Product of Experts
6. ∗ For parts of this question it is useful to use matlab/octave. A product of experts
system is to be used for speech synthesis. The data is known to be generated from
two classes ω1 and ω2 . Four Gaussian experts are to be used. These experts are:
p(xt |ω1 ) = N (xt ; 1, 1) Expert 1
p(xt − xt−1 |ω1 ) = N (xt − xt−1 ; 1, 1) Expert 2
p(xt |ω2 ) = N (xt ; 2, 1) Expert 3
p(xt − xt−1 |ω2 ) = N (xt − xt−1 ; −1, 1) Expert 4
A sequence of 3 samples are to be generated. The first two are known to come from
class ω1 , the final sample from class ω2 . The data is known to start in silence, which
has a value of 0.

(a) Show that the overall sequence of observations can be written in the following
form

\[
Ax = A \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix} =
\begin{pmatrix} x_1 \\ x_1 - 0 \\ x_2 \\ x_2 - x_1 \\ x_3 \\ x_3 - x_2 \end{pmatrix}
\]
(b) The transformed data, Ax, is Gaussian distributed, so

\[
p(x|\theta) = \frac{1}{Z}\, p(Ax|\theta) = \frac{1}{Z}\, \mathcal{N}(Ax; \mu, \Sigma)
\]

where Z is the appropriate normalisation term to ensure a valid PDF. Find
expressions for µ and Σ.
(c) By using the following expression (or otherwise)

\[
\exp\left(-\frac{1}{2}(Ax - \mu)'(Ax - \mu)\right) = \exp\left(-\frac{1}{2}\left(x'A'Ax - 2\mu'Ax + \mu'\mu\right)\right)
\]

find the mean of the distribution of x. How can this approach be used for speech
synthesis? What does x look like if experts 2 and 4 are not used (set A = I, an
identity matrix)?
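The question suggests matlab/octave may be helpful; the sketch below uses Python/NumPy instead (an assumption, not part of the paper). It builds A from part (a), stacks the expert means into µ with Σ = I, and finds the mean of p(x|θ) by least squares, i.e. by minimising (Ax − µ)′(Ax − µ) as in part (c).

```python
import numpy as np

# Rows of A produce [x1, x1-0, x2, x2-x1, x3, x3-x2]; the starting silence has value 0.
A = np.array([[ 1,  0, 0],
              [ 1,  0, 0],
              [ 0,  1, 0],
              [-1,  1, 0],
              [ 0,  0, 1],
              [ 0, -1, 1]], dtype=float)

# Expert means in the same order: static/delta for t = 1, 2 (class w1) and t = 3 (class w2).
mu = np.array([1, 1, 1, 1, 2, -1], dtype=float)
Sigma = np.eye(6)                      # every expert has unit variance

# Mean of p(x|theta): minimise (Ax - mu)'(Ax - mu) over x (Sigma = I here).
x_hat, *_ = np.linalg.lstsq(A, mu, rcond=None)
print("mean with all four experts:", x_hat)

# Dropping experts 2 and 4 leaves only the static experts, i.e. A = I on [x1, x2, x3],
# so the mean is just the static expert means.
print("mean with only static experts:", np.array([1.0, 1.0, 2.0]))
```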

Restricted Boltzmann Machine

7. A restricted Boltzmann machine is to be built where the observations, x, are
continuous variables and the hidden units, h, are binary. The energy function has the
following form:

\[
G(x, h|\theta) = \sum_{i=1}^{d} \frac{(x_i - a_i)^2}{2\sigma_i^2} - \sum_{j=1}^{J} b_j h_j - \sum_{i,j} \frac{x_i}{\sigma_i} h_j w_{ij}
\]

Show that the posterior probability of the hidden and observed variables can be
expressed as

\[
P(h_j = 1|x, \theta) = \frac{1}{1 + \exp\left(-b_j - \sum_{i=1}^{d} \frac{x_i}{\sigma_i} w_{ij}\right)}
\]

\[
p(x_i|h, \theta) = \mathcal{N}\!\left(x_i;\ a_i + \sigma_i \sum_{j} h_j w_{ij},\ \sigma_i^2\right)
\]

Why is this form of expression important when training Restricted Boltzmann machines?
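The Python sketch below is not part of the paper; it evaluates the two conditional expressions above with made-up parameters and draws one alternating sample from each, illustrating that each conditional factorises and is cheap to sample, which is one reason this form matters during training.

```python
import numpy as np

rng = np.random.default_rng(3)
d, J = 4, 3                                     # made-up sizes
a, b = rng.normal(size=d), rng.normal(size=J)   # visible and hidden biases
sigma = rng.uniform(0.5, 1.5, size=d)
W = rng.normal(scale=0.1, size=(d, J))

def sample_h_given_x(x):
    # P(h_j = 1 | x) = sigmoid(b_j + sum_i (x_i / sigma_i) w_ij)
    p = 1.0 / (1.0 + np.exp(-(b + (x / sigma) @ W)))
    return (rng.random(J) < p).astype(float), p

def sample_x_given_h(h):
    # p(x_i | h) = N(x_i; a_i + sigma_i * sum_j h_j w_ij, sigma_i^2)
    mean = a + sigma * (W @ h)
    return rng.normal(mean, sigma), mean

# One alternating (block Gibbs) step between the two conditionals.
x = rng.normal(size=d)
h, _ = sample_h_given_x(x)
x_new, _ = sample_x_given_h(h)
print(h, x_new)
```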

Single Layer Perceptrons

8. The standard single layer perceptron is used to discriminate between two classes.
There are two simple techniques for generalising this to a K class problem. The
first is to build a set of pairwise classifiers i.e. ωi versus ωj , j ̸= i. The second
is to build a set of classifiers of each class versus all other classes i.e. ωi versus
{ω1 , . . . , ωi−1 , ωi+1 , . . . , ωK }. Compare the two forms of classifier in terms of training and
testing computational cost. By drawing a specific example with K = 3 show that
both forms of classifier can result in an “ambiguous” region i.e. no decision can be
made. Describe how multiple binary classifiers may be trained so that no ambiguous
regions exist.
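The toy Python sketch below (weights chosen by hand for illustration, not trained perceptrons, and not part of the paper) shows how the one-versus-rest construction with K = 3 can leave points claimed by no classifier or by more than one classifier, i.e. ambiguous regions.

```python
import numpy as np

# Three hand-picked one-vs-rest linear discriminants for K = 3:
# class k is claimed when w_k . [x, y, 1] > 0.
W = np.array([[ 1.0, 0.0, -1.0],    # omega_1 vs rest: x > 1
              [-1.0, 0.0, -1.0],    # omega_2 vs rest: x < -1
              [ 0.0, 1.0, -1.0]])   # omega_3 vs rest: y > 1

def one_vs_rest_decision(point):
    scores = W @ np.append(point, 1.0)
    claimed = np.flatnonzero(scores > 0)
    return claimed[0] if len(claimed) == 1 else None   # None marks an ambiguous point

print(one_vs_rest_decision(np.array([2.0, 0.0])))   # claimed by omega_1 only
print(one_vs_rest_decision(np.array([0.0, 0.0])))   # claimed by nobody -> ambiguous
print(one_vs_rest_decision(np.array([2.0, 2.0])))   # claimed by omega_1 and omega_3 -> ambiguous
```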

Answers

3. (a) total log-likelihood of data (natural log (ln)) -15.302 (likelihood 2.262e-07); (b)
µ̂1 = −0.0426 ; µ̂2 = 1.878 ; ĉ1 = 0.5266; ĉ2 = 0.4734

M.J.F. Gales
P.C. Woodland
Oct 2003 - Jan 2007
