AE - Tema 3 - The Multivariate Gaussian Distribution

In this topic we study the multivariate Normal distribution, one of whose main applications is anomaly detection. Some examples are the detection of defective products, of anomalous behavior in computers, and of fraud. Before applying this important distribution, we briefly review its univariate version in order to better understand the components of the multivariate Gaussian distribution.

Univariate Gaussian Distribution


We say that a random variable X follows a univariate Normal or Gaussian
distribution if its density function is given by:

\[
f(X) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(X-\mu)^2}{2\sigma^2}},
\]
where µ represents the mean and σ the standard deviation. The following
figure shows the densities of four normal random variables with different parameters.

Figure 1: Plots of different univariate normal distributions: N(0,1), N(0,4), N(2,1), and N(2,4).

Let S = {X_1, ..., X_n} be independent and identically distributed observations from a random variable X that follows a Normal distribution. Its mean and variance can be obtained through maximum likelihood estimation using:

\[
\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i,
\qquad
\hat{\sigma}^2 = \frac{1}{n}\sum_{i=1}^{n} (X_i - \bar{X})^2.
\]

When the sample is small, the quasi-variance is usually used as an estimator of the variance because it is unbiased (E(ŝ²) = σ²). The quasi-variance is given by:
\[
\hat{s}^2 = \frac{1}{n-1}\sum_{i=1}^{n} (X_i - \bar{X})^2.
\]
However, we will assume that the sample is large enough so that both formulas
can be assumed to be identical.
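As a quick numerical illustration of the two estimators (a minimal sketch in Python; the data below are simulated, not from the course), both variances can be computed with NumPy by switching the `ddof` argument:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=3.0, size=1000)  # simulated sample with mu = 2, sigma = 3

x_bar = x.mean()              # maximum likelihood estimate of mu
sigma2_hat = x.var(ddof=0)    # ML estimate of the variance: divides by n
s2_hat = x.var(ddof=1)        # quasi-variance: divides by n - 1 (unbiased)

# for n = 1000 the two variance estimates are nearly identical
print(x_bar, sigma2_hat, s2_hat)
```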
Example. As an application of the one-dimensional Gaussian, let us look at an example of anomaly detection. Imagine a professional gambler who wants to identify the matches for which a bookmaker's odds are unusually advantageous (or disadvantageous), so that he can bet on them (or avoid them).
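A minimal sketch of this idea (the odds data and the threshold `eps` below are invented for illustration): fit µ and σ to the historical odds and flag any quote whose density under the fitted Gaussian is very low.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)
odds = rng.normal(loc=1.90, scale=0.05, size=500)   # hypothetical historical odds
odds = np.append(odds, [2.40, 1.55])                # two suspicious quotes

mu, sigma = odds.mean(), odds.std()                 # ML fit of the univariate Gaussian
density = norm.pdf(odds, loc=mu, scale=sigma)       # f(x) for every observed quote

eps = 1e-3                                          # anomaly threshold (chosen by hand)
print(odds[density < eps])                          # quotes worth a closer look
```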

Multivariate Gaussian Distribution


Definition. Let Z_1, ..., Z_p be independent random variables, each distributed according to a normal with mean zero and variance one, N(0,1). Then we say that Z = [Z_1, ..., Z_p]^T follows a standard multivariate normal distribution with mean 0_p and identity dispersion matrix I_p, that is,
\[
E(Z) = \begin{bmatrix} E(Z_1) \\ \vdots \\ E(Z_p) \end{bmatrix}
     = \begin{bmatrix} 0 \\ \vdots \\ 0 \end{bmatrix} = 0_p,
\qquad
D(Z) = \begin{bmatrix}
  \operatorname{var}(Z_1) & \cdots & \operatorname{cov}(Z_1, Z_p) \\
  \vdots & \ddots & \vdots \\
  \operatorname{cov}(Z_p, Z_1) & \cdots & \operatorname{var}(Z_p)
\end{bmatrix} = I_p.
\]
Definition. We say that the p-dimensional random variable X follows a normal distribution with parameters µ and Σ if X has the same distribution as µ + AZ, where A satisfies AA^T = Σ and Z ∈ N_p(0, I). It is denoted X ∈ N_p(µ, Σ). Consequently,
\[
E(X) = E(\mu + AZ) = \mu + A\,E(Z) = \mu,
\qquad
D(X) = D(\mu + AZ) = A\,D(Z)\,A^T = \Sigma.
\]
Remark. Since Σ is a symmetric positive semidefinite matrix, it is orthogonally diagonalizable, Σ = PΛP^T, where P is the matrix whose columns are the eigenvectors of Σ and Λ is the diagonal matrix formed by the eigenvalues of Σ. Then
\[
\Sigma = P\Lambda^{1/2}\Lambda^{1/2}P^T = P\Lambda^{1/2}(P\Lambda^{1/2})^T,
\]
so we can take A = PΛ^{1/2}.
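This remark gives a direct recipe for simulating from N_p(µ, Σ): draw Z with i.i.d. N(0,1) entries and return µ + AZ with A = PΛ^{1/2}. A minimal NumPy sketch (the µ and Σ below are arbitrary choices for illustration):

```python
import numpy as np

mu = np.array([1.0, -2.0])
Sigma = np.array([[2.0, 0.8],
                  [0.8, 1.0]])

lam, P = np.linalg.eigh(Sigma)        # Sigma = P diag(lam) P^T (symmetric PSD)
A = P @ np.diag(np.sqrt(lam))         # A = P Lambda^{1/2}, so A A^T = Sigma

rng = np.random.default_rng(0)
Z = rng.standard_normal((2, 10_000))  # columns are i.i.d. N_p(0, I) vectors
X = mu[:, None] + A @ Z               # columns are draws from N_p(mu, Sigma)

print(np.cov(X))                      # should be close to Sigma
```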

Let S = {X_1, ..., X_n} be independent and identically distributed observations from a p-dimensional random variable X ∈ N_p(µ, Σ), with n > p. Its mean and covariance matrix can be obtained through maximum likelihood estimation using:
\[
\hat{\mu} = \frac{1}{n}\sum_{i=1}^{n} X_i,
\qquad
\hat{\Sigma} = \frac{1}{n}\sum_{i=1}^{n} (X_i - \hat{\mu})(X_i - \hat{\mu})^T.
\]

Note that now X_i and µ are vectors. If we denote by X = [X_1 · · · X_n] the matrix containing the data in its columns and by Y = X − µ the centered data, then
\[
\hat{\Sigma} = \frac{1}{n} Y Y^T
= \frac{1}{n}\, [Y_1 \cdots Y_n]
\begin{bmatrix} Y_1^T \\ \vdots \\ Y_n^T \end{bmatrix}
= \frac{1}{n}\left( Y_1 Y_1^T + \cdots + Y_n Y_n^T \right).
\]
This is the formula we know from the first class except for the order of the transposes; the difference arises because real data matrices store observations as rows, not columns.
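A minimal sketch of these estimators in NumPy, with the data matrix laid out the usual way (observations as rows, which is exactly why the transposes swap as noted above):

```python
import numpy as np

def gaussian_mle(X):
    """ML estimates for N_p(mu, Sigma); X has one observation per row."""
    n = X.shape[0]
    mu_hat = X.mean(axis=0)     # (1/n) times the sum of the observations
    Y = X - mu_hat              # centered data, one row per Y_i^T
    Sigma_hat = (Y.T @ Y) / n   # (1/n) times the sum of Y_i Y_i^T
    return mu_hat, Sigma_hat

# quick check on simulated data with the same layout
rng = np.random.default_rng(0)
X = rng.multivariate_normal(mean=[0, 0], cov=[[2, 1], [1, 2]], size=435)
mu_hat, Sigma_hat = gaussian_mle(X)
print(mu_hat, Sigma_hat, sep="\n")
```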
Example. Let us now imagine that we want to build an application that detects anomalous behavior of our computer based on the computational load and the RAM used. The following graph shows a scatter plot of the 435 samples taken.

Figure 2: Detection of outliers. Scatter plot of X1 versus X2, with each point labeled as outlier or regular.
One option would be to build a confidence interval for each variable separately. However, as can be seen in Figure 2, neither of these intervals detects the anomaly. A possible solution is to use the multivariate normal distribution.

Theorem. Let X ∈ N_p(µ, Σ_{p×p}) and rg(Σ) = p. Then X has density function
\[
f(X) = \frac{1}{(\sqrt{2\pi})^p \sqrt{\det(\Sigma)}}\,
e^{-\frac{1}{2}(X-\mu)^T \Sigma^{-1} (X-\mu)}.
\]
Proof. Let Z ∈ N_p(0, I); then, since Z has independent components, its density is
\[
f(Z) = \prod_{i=1}^{p} \frac{1}{\sqrt{2\pi}} e^{-\frac{(Z_i-0)^2}{2\cdot 1}}
     = \prod_{i=1}^{p} \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2} Z_i^2}
     = \frac{1}{(\sqrt{2\pi})^p} e^{-\frac{1}{2} Z^T Z}.
\]

Consider the transformation from R^p → R^p given by
\[
X = \mu + AZ,
\]
where AA^T = Σ. In class 1 we saw that Σ having full rank implies that A is invertible, so
\[
Z = h(X) = A^{-1}(X - \mu).
\]
Then, by the change of variable theorem, we have
\[
f(X) = f(h(X))\,\lvert\det(h'(X))\rvert.
\]
This gives
\[
Z^T Z = (X-\mu)^T (A^{-1})^T A^{-1} (X-\mu)
      = (X-\mu)^T \Sigma^{-1} (X-\mu).
\]
Furthermore,
\[
\det(\Sigma) = \det(AA^T) = \det(A)^2
\;\Rightarrow\;
\lvert\det(A^{-1})\rvert = \frac{1}{\sqrt{\det(\Sigma)}},
\]
and therefore
\[
f(X) = \frac{1}{(\sqrt{2\pi})^p \sqrt{\det(\Sigma)}}\,
e^{-\frac{1}{2}(X-\mu)^T \Sigma^{-1} (X-\mu)}.
\]
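As a quick sanity check of this density formula (a sketch; µ, Σ, and the evaluation point are arbitrary), the expression evaluated by hand should agree with scipy.stats.multivariate_normal:

```python
import numpy as np
from scipy.stats import multivariate_normal

mu = np.array([0.0, 1.0])
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
x = np.array([0.3, 0.7])

d = x - mu
quad = d @ np.linalg.inv(Sigma) @ d     # (X - mu)^T Sigma^{-1} (X - mu)
manual = np.exp(-0.5 * quad) / np.sqrt((2 * np.pi) ** 2 * np.linalg.det(Sigma))

print(manual)                                          # same value from both lines
print(multivariate_normal(mean=mu, cov=Sigma).pdf(x))
```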

Theorem. Let X ∈ N(µ, Σ); then MX + b ∈ N(Mµ + b, MΣM^T).

Proof. Direct from the above: MX + b = (MA)Z + (Mµ + b), with (MA)(MA)^T = MΣM^T.
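A quick empirical check of this theorem by simulation (a sketch; M, b, µ, and Σ are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
mu = np.array([1.0, 2.0])
Sigma = np.array([[1.0, 0.3],
                  [0.3, 2.0]])
M = np.array([[2.0, 0.0],
              [1.0, -1.0]])
b = np.array([5.0, -5.0])

X = rng.multivariate_normal(mu, Sigma, size=100_000)  # rows are draws of X
W = X @ M.T + b                                       # rows are draws of M X + b

print(W.mean(axis=0))   # close to M mu + b = [7, -6]
print(np.cov(W.T))      # close to M Sigma M^T
```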
Theorem. Let X be formed by two multivariate Gaussian random vectors,
\[
X = \begin{bmatrix} X_1 \\ X_2 \end{bmatrix}
\in N\!\left(
\begin{bmatrix} \mu_1 \\ \mu_2 \end{bmatrix},
\begin{bmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{bmatrix}
\right).
\]
Then X_1 and X_2 are independent if and only if Σ_12 = Σ_21^T = 0.

Proof. Suppose that X_1 and X_2 are independent. Then
\[
\begin{aligned}
\Sigma_{12} = \operatorname{cov}(X_1, X_2)
&= E[(X_1 - \mu_1)(X_2 - \mu_2)^T] \\
&= E[X_1 X_2^T - X_1 \mu_2^T - \mu_1 X_2^T + \mu_1 \mu_2^T] \\
&= E(X_1 X_2^T) - \mu_1 \mu_2^T - \mu_1 \mu_2^T + \mu_1 \mu_2^T \\
&\overset{\text{independence}}{=} \mu_1 \mu_2^T - \mu_1 \mu_2^T = 0,
\end{aligned}
\]
since independence implies E(X_1 X_2^T) = E(X_1)E(X_2)^T = µ_1 µ_2^T. For the case of Σ_21, note that Σ_12 = Σ_21^T, because otherwise the covariance matrix would not be symmetric.

On the other hand, let us suppose now that Σ_12 = Σ_21^T = 0. Then
\[
f(X) = \frac{1}{(\sqrt{2\pi})^p \sqrt{\det(\Sigma)}}\,
e^{-\frac{1}{2}(X-\mu)^T \Sigma^{-1} (X-\mu)}.
\]

Let us work with the quadratic form:
\[
\begin{aligned}
(X-\mu)^T \Sigma^{-1} (X-\mu)
&= \begin{bmatrix} X_1 - \mu_1 \\ X_2 - \mu_2 \end{bmatrix}^T
   \begin{bmatrix} \Sigma_{11} & 0 \\ 0 & \Sigma_{22} \end{bmatrix}^{-1}
   \begin{bmatrix} X_1 - \mu_1 \\ X_2 - \mu_2 \end{bmatrix} \\
&= \begin{bmatrix} (X_1-\mu_1)^T & (X_2-\mu_2)^T \end{bmatrix}
   \begin{bmatrix} \Sigma_{11}^{-1} & 0 \\ 0 & \Sigma_{22}^{-1} \end{bmatrix}
   \begin{bmatrix} X_1 - \mu_1 \\ X_2 - \mu_2 \end{bmatrix} \\
&= \begin{bmatrix} (X_1-\mu_1)^T \Sigma_{11}^{-1} & (X_2-\mu_2)^T \Sigma_{22}^{-1} \end{bmatrix}
   \begin{bmatrix} X_1 - \mu_1 \\ X_2 - \mu_2 \end{bmatrix} \\
&= (X_1-\mu_1)^T \Sigma_{11}^{-1} (X_1-\mu_1) + (X_2-\mu_2)^T \Sigma_{22}^{-1} (X_2-\mu_2).
\end{aligned}
\]

Furthermore,
\[
\frac{1}{(\sqrt{2\pi})^p \sqrt{\det(\Sigma)}}
= \frac{1}{(\sqrt{2\pi})^{p_1+p_2} \sqrt{\det(\Sigma_{11})\det(\Sigma_{22})}}
= \frac{1}{(\sqrt{2\pi})^{p_1} \sqrt{\det(\Sigma_{11})}}
  \cdot
  \frac{1}{(\sqrt{2\pi})^{p_2} \sqrt{\det(\Sigma_{22})}},
\]
where p_1 and p_2 are the dimensions of X_1 and X_2, respectively.


Finally, by substituting the previous expressions into f(X), we obtain
\[
f(X) = f(X_1) f(X_2),
\]
so X_1 and X_2 are independent.
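A small numerical illustration of this factorization (a sketch with two one-dimensional blocks, p_1 = p_2 = 1, and arbitrary parameters):

```python
import numpy as np
from scipy.stats import multivariate_normal, norm

mu1, mu2 = 1.0, -2.0
s11, s22 = 2.0, 0.5                  # Sigma_11 and Sigma_22, with Sigma_12 = 0
Sigma = np.diag([s11, s22])          # block-diagonal covariance matrix

x = np.array([1.3, -1.6])
joint = multivariate_normal(mean=[mu1, mu2], cov=Sigma).pdf(x)
product = norm.pdf(x[0], mu1, np.sqrt(s11)) * norm.pdf(x[1], mu2, np.sqrt(s22))

print(joint, product)                # equal up to rounding: f(X) = f(X1) f(X2)
```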

Theorem. Let A be positive definite. Then the set of solutions of the equation X^T AX = c, c > 0, is an ellipsoid whose principal axes point in the directions of the eigenvectors of A.

Proof. Let P = [p_1 · · · p_n] be the matrix whose columns are orthonormal eigenvectors of A, so that A = PΛP^T with P^T P = I. Setting Y = P^T X, the following holds:
\[
\begin{aligned}
X^T A X &= X^T P \Lambda P^T X \\
&= (P^T X)^T \Lambda\, P^T X \\
&= Y^T \Lambda Y \\
&= \lambda_1 Y_1^2 + \cdots + \lambda_n Y_n^2 \\
&= \frac{Y_1^2}{(1/\sqrt{\lambda_1})^2} + \cdots + \frac{Y_n^2}{(1/\sqrt{\lambda_n})^2},
\end{aligned}
\]
which, set equal to c, is the equation of an ellipsoid in the rotated coordinates Y, with the semi-axis along p_i of length √(c/λ_i).
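In code this reads off directly (a sketch for an arbitrary 2×2 positive definite A and level c): the eigenvectors of A give the directions of the principal axes, and the semi-axis along p_i has length √(c/λ_i).

```python
import numpy as np

A = np.array([[3.0, 1.0],
              [1.0, 2.0]])    # positive definite
c = 4.0                       # level of the ellipse x^T A x = c

lam, P = np.linalg.eigh(A)    # A = P diag(lam) P^T
semi_axes = np.sqrt(c / lam)  # semi-axis length along each eigenvector

for length, direction in zip(semi_axes, P.T):
    print(length, direction)  # axis length and its (unit) direction
```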

Corollary. Let X ∈ N_p(µ, Σ); then the contour lines of the joint density function, f(X) = k, are ellipsoids.

Proof.
\[
f(X) = k_1 \, e^{-\frac{1}{2}(X-\mu)^T \Sigma^{-1} (X-\mu)} = k
\iff
(X-\mu)^T \Sigma^{-1} (X-\mu) = c > 0.
\]
Since Σ^{-1} is positive definite, the contour lines
\[
E_c = \{X : f(X) = k\}
\]
are ellipsoids.

Theorem. Let X ∈ N_p(µ, Σ). If Σ has full rank (rg(Σ) = p), then
\[
(X-\mu)^T \Sigma^{-1} (X-\mu) \in \chi^2(p).
\]

Proof. From the first part of the class we know that X ∈ N_p(µ, Σ) if and only if there exist µ ∈ R^p and A ∈ R^{p×p} such that X = AZ + µ, with Z_i ∈ N(0,1) i.i.d. Then, using A = PΛ^{1/2}, we have Z = A^{-1}(X − µ). Let us see what the expression above looks like:
\[
\begin{aligned}
(X-\mu)^T \Sigma^{-1} (X-\mu)
&= (X-\mu)^T (P\Lambda P^T)^{-1} (X-\mu) \\
&= (X-\mu)^T P \Lambda^{-1} P^T (X-\mu) \\
&= (X-\mu)^T P \Lambda^{-1/2} \Lambda^{-1/2} P^T (X-\mu) \\
&= (X-\mu)^T (A^{-1})^T A^{-1} (X-\mu) \\
&= Z^T Z = \sum_{i=1}^{p} Z_i^2,
\end{aligned}
\]
with Z_i ∈ N(0,1); and the chi-square distribution with p degrees of freedom is precisely the distribution of a sum of the squares of p independent standard normal random variables.
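This theorem gives a principled cutoff for the computer-monitoring example above: flag an observation as anomalous when its squared Mahalanobis distance exceeds a high quantile of χ²(p). A minimal sketch (the data here are simulated stand-ins for the 435 load/RAM samples):

```python
import numpy as np
from scipy.stats import chi2

rng = np.random.default_rng(0)
X = rng.multivariate_normal([0, 0], [[3, 2.5], [2.5, 3]], size=435)  # stand-in data

mu_hat = X.mean(axis=0)
Sigma_hat = np.cov(X, rowvar=False)        # observations as rows
Sinv = np.linalg.inv(Sigma_hat)

D = X - mu_hat
d2 = np.einsum("ij,jk,ik->i", D, Sinv, D)  # (X_i - mu)^T Sigma^{-1} (X_i - mu)

threshold = chi2.ppf(0.99, df=2)           # 99% quantile of chi-square, p = 2
print(np.where(d2 > threshold)[0])         # indices flagged as outliers
```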
