Extension of Several Random Variables
PAPER
In fulfillment of an assignment for
Mathematical Statistics I
taught by Mr. Susiswo and Mrs. Jamaliatul Badriyah
Written by:
1. Namira (140311606344)
2. Nur Rofidah Diyanah (140311604344)
3. Sri Prihatin (140311600162)
4. Trio Habibatur Rahma Utami (140311602695)
DEPARTMENT OF MATHEMATICS
THE FACULTY OF MATHEMATICS AND NATURAL SCIENCES
STATE UNIVERSITY OF MALANG
October, 2016
EXTENSION TO SEVERAL RANDOM VARIABLES
The notions about two random variables can be extended immediately to 𝑛 random
variables. We make the following definition of the space of 𝑛 random variables.
DEFINITION 2.6.1.
Consider a random experiment with the sample space 𝒞. Let the random variable 𝑋𝑖 assign to each element 𝑐 ∈ 𝒞 one and only one real number 𝑋𝑖(𝑐) = 𝑥𝑖, 𝑖 = 1, 2, …, 𝑛. We say that (𝑋1, 𝑋2, …, 𝑋𝑛) is an 𝑛-dimensional random vector. The space of this random vector is the set of ordered 𝑛-tuples 𝒟 = {(𝑥1, 𝑥2, …, 𝑥𝑛) : 𝑥1 = 𝑋1(𝑐), …, 𝑥𝑛 = 𝑋𝑛(𝑐), 𝑐 ∈ 𝒞}. Furthermore, let 𝐴 be a subset of the space 𝒟. Then 𝑃[(𝑋1, …, 𝑋𝑛) ∈ 𝐴] = 𝑃(𝐶), where 𝐶 = {𝑐 : 𝑐 ∈ 𝒞 and (𝑋1(𝑐), 𝑋2(𝑐), …, 𝑋𝑛(𝑐)) ∈ 𝐴}.
In this section, we often use vector notation. For example, we denote the 𝑛-dimensional column vector (𝑋1, …, 𝑋𝑛)′ by 𝑿 and the observed values (𝑥1, …, 𝑥𝑛)′ of the random variables by 𝒙. The joint cdf is defined to be
𝐹𝑿(𝒙) = 𝑃[𝑋1 ≤ 𝑥1, …, 𝑋𝑛 ≤ 𝑥𝑛].
We say that the 𝑛 random variables 𝑋1, 𝑋2, …, 𝑋𝑛 are of the discrete type or of the continuous type, and have a distribution of that type, accordingly as the joint cdf can be expressed as
𝐹𝑿(𝒙) = ∑ ⋯ ∑_{𝑤1 ≤ 𝑥1, …, 𝑤𝑛 ≤ 𝑥𝑛} 𝑝(𝑤1, …, 𝑤𝑛),
or as
𝐹𝑿(𝒙) = ∫_{−∞}^{𝑥𝑛} ⋯ ∫_{−∞}^{𝑥1} 𝑓(𝑤1, …, 𝑤𝑛) 𝑑𝑤1 ⋯ 𝑑𝑤𝑛,
respectively.
In accordance with the convention of extending the definition of a joint pdf, it is seen that a point function 𝑓 essentially satisfies the conditions of being a pdf if
(a) 𝑓 is defined and is nonnegative for all real values of its argument(s), and
(b) its integral over all real values of its argument(s) is 1.
Likewise, a point function 𝑝 essentially satisfies the conditions of being a joint pmf if
(a) 𝑝 is defined and is nonnegative for all real values of its argument(s), and
(b) its sum over all real values of its argument(s) is 1.
As in previous sections, it is sometimes convenient to speak of the support of a random vector. For the discrete case, this would be all points in 𝒟 which have positive mass, while for the continuous case these would be all points in 𝒟 that can be embedded in an open set of positive probability. We use 𝒮 to denote support sets.
EXAMPLE 2.6.1.
Let
𝑓(𝑥, 𝑦, 𝑧) = 𝑒^{−(𝑥+𝑦+𝑧)}, 0 < 𝑥, 𝑦, 𝑧 < ∞, zero elsewhere,
be the pdf of the random variables 𝑋, 𝑌, and 𝑍. Then the distribution function of 𝑋, 𝑌, and 𝑍 is given by
𝐹(𝑥, 𝑦, 𝑧) = 𝑃(𝑋 ≤ 𝑥, 𝑌 ≤ 𝑦, 𝑍 ≤ 𝑧)
= ∫_0^𝑧 ∫_0^𝑦 ∫_0^𝑥 𝑒^{−𝑢−𝑣−𝑤} 𝑑𝑢 𝑑𝑣 𝑑𝑤
= (1 − 𝑒^{−𝑥})(1 − 𝑒^{−𝑦})(1 − 𝑒^{−𝑧}), 0 ≤ 𝑥, 𝑦, 𝑧 < ∞,
and is equal to zero elsewhere.
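As a quick numerical cross-check of this example (our own sketch, not part of the original text), the triple integral can be evaluated with SciPy and compared with the closed form (1 − 𝑒^{−𝑥})(1 − 𝑒^{−𝑦})(1 − 𝑒^{−𝑧}); the evaluation point (1, 2, 3) is an arbitrary choice.

```python
# Numerical cross-check of Example 2.6.1 (illustrative sketch).
import numpy as np
from scipy import integrate

x, y, z = 1.0, 2.0, 3.0                               # arbitrary evaluation point
F_num, _ = integrate.nquad(lambda u, v, w: np.exp(-(u + v + w)),
                           [(0, x), (0, y), (0, z)])  # integral of the joint pdf over the box
F_closed = (1 - np.exp(-x)) * (1 - np.exp(-y)) * (1 - np.exp(-z))
print(F_num, F_closed)                                # should agree to quadrature accuracy
```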
Let 𝑌 = 𝑢(𝑋1, 𝑋2, …, 𝑋𝑛) for some function 𝑢. We say that 𝐸(𝑌) exists if the 𝑛-fold integral
∫_{−∞}^{∞} ⋯ ∫_{−∞}^{∞} |𝑢(𝑥1, 𝑥2, …, 𝑥𝑛)| 𝑓(𝑥1, 𝑥2, …, 𝑥𝑛) 𝑑𝑥1 𝑑𝑥2 ⋯ 𝑑𝑥𝑛
exists when the random variables are of the continuous type, or if the 𝑛-fold sum
∑_{𝑥𝑛} ⋯ ∑_{𝑥1} |𝑢(𝑥1, 𝑥2, …, 𝑥𝑛)| 𝑝(𝑥1, 𝑥2, …, 𝑥𝑛)
exists when the random variables are of the discrete type. If 𝐸(𝑌) exists, then its expectation is given by
𝐸(𝑌) = ∫_{−∞}^{∞} ⋯ ∫_{−∞}^{∞} 𝑢(𝑥1, 𝑥2, …, 𝑥𝑛) 𝑓(𝑥1, 𝑥2, …, 𝑥𝑛) 𝑑𝑥1 𝑑𝑥2 ⋯ 𝑑𝑥𝑛
for the continuous case, and by
𝐸(𝑌) = ∑_{𝑥𝑛} ⋯ ∑_{𝑥1} 𝑢(𝑥1, 𝑥2, …, 𝑥𝑛) 𝑝(𝑥1, 𝑥2, …, 𝑥𝑛)
for the discrete case. In particular, 𝐸 is a linear operator. That is, if 𝑌𝑗 = 𝑢𝑗(𝑋1, …, 𝑋𝑛) for 𝑗 = 1, 2, …, 𝑚 and each 𝐸(𝑌𝑗) exists, then
𝐸[∑_{𝑗=1}^{𝑚} 𝑘𝑗𝑌𝑗] = ∑_{𝑗=1}^{𝑚} 𝑘𝑗𝐸[𝑌𝑗].
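The linearity of 𝐸 can be illustrated by simulation (a sketch of our own; the uniform distribution, the functions 𝑢𝑗, and the constants 𝑘𝑗 are arbitrary choices):

```python
# Monte Carlo illustration of the linearity of E.
import numpy as np

rng = np.random.default_rng(0)
n = 10**6
X = rng.uniform(size=(n, 3))          # X1, X2, X3 ~ iid Uniform(0, 1)

Y1 = X[:, 0] * X[:, 1]                # Y1 = u1(X) = X1 * X2
Y2 = X[:, 2] ** 2                     # Y2 = u2(X) = X3^2
k1, k2 = 5.0, -2.0

lhs = np.mean(k1 * Y1 + k2 * Y2)      # estimate of E[k1 Y1 + k2 Y2]
rhs = k1 * np.mean(Y1) + k2 * np.mean(Y2)
print(lhs, rhs)                       # both estimate 5*(1/4) - 2*(1/3) = 0.5833...
```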
We next consider marginal distributions. The marginal pdf of 𝑋1 is obtained by integrating out the remaining variables:
𝑓1(𝑥1) = ∫_{−∞}^{∞} ⋯ ∫_{−∞}^{∞} 𝑓(𝑥1, 𝑥2, …, 𝑥𝑛) 𝑑𝑥2 ⋯ 𝑑𝑥𝑛.
Therefore, 𝑓1(𝑥1) is the pdf of the random variable 𝑋1, and 𝑓1(𝑥1) is called the marginal pdf of 𝑋1. The marginal probability density functions 𝑓2(𝑥2), …, 𝑓𝑛(𝑥𝑛) of 𝑋2, …, 𝑋𝑛, respectively, are similar (𝑛 − 1)-fold integrals.
Up to this point, each marginal pdf has been a pdf of one random variable. It is convenient to extend this terminology to joint probability density functions, which we shall do now. Let 𝑓(𝑥1, 𝑥2, …, 𝑥𝑛) be the joint pdf of the 𝑛 random variables 𝑋1, 𝑋2, …, 𝑋𝑛, just as before. Now, however, take any group of 𝑘 < 𝑛 of these random variables and find their joint pdf. This joint pdf is called the marginal pdf of this particular group of 𝑘 variables. To fix the ideas, take 𝑛 = 6, 𝑘 = 3, and select the group 𝑋2, 𝑋4, 𝑋5. Then the marginal pdf of 𝑋2, 𝑋4, 𝑋5 is the joint pdf of this particular group of three variables, namely,
∫_{−∞}^{∞} ∫_{−∞}^{∞} ∫_{−∞}^{∞} 𝑓(𝑥1, 𝑥2, 𝑥3, 𝑥4, 𝑥5, 𝑥6) 𝑑𝑥1 𝑑𝑥3 𝑑𝑥6.
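A marginal pdf of a group of variables can also be obtained numerically by integrating the joint pdf over the remaining variables. The following sketch is our own illustration; the six-variable pdf exp(−(𝑥1 + ⋯ + 𝑥6)) on the positive orthant and the evaluation point are hypothetical choices.

```python
# Sketch of a "group of k" marginal: integrate x1, x3, x6 out of a six-variable
# joint pdf at a fixed point (x2, x4, x5).
import numpy as np
from scipy import integrate

x2, x4, x5 = 0.5, 1.0, 1.5            # point at which the marginal is evaluated
marg, _ = integrate.nquad(
    lambda x1, x3, x6: np.exp(-(x1 + x2 + x3 + x4 + x5 + x6)),
    [(0, np.inf)] * 3,                # integrate x1, x3, x6 over (0, infinity)
)
print(marg, np.exp(-(x2 + x4 + x5)))  # both should be about 0.0498
```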
Conditional probability density functions are defined analogously to the two-variable case. For example, provided 𝑓1(𝑥1) > 0, the conditional pdf of 𝑋2, …, 𝑋𝑛 given 𝑋1 = 𝑥1 is
𝑓(𝑥2, …, 𝑥𝑛 | 𝑥1) = 𝑓(𝑥1, 𝑥2, …, 𝑥𝑛) / 𝑓1(𝑥1),
and, provided the integral converges (absolutely), a useful random variable is given by ℎ(𝑋1) = 𝐸[𝑢(𝑋2, …, 𝑋𝑛) | 𝑋1].
The above discussion of marginal and conditional distributions generalizes to random variables of the discrete type by using pmfs and summations instead of integrals. Let the random variables 𝑋1, 𝑋2, …, 𝑋𝑛 have the joint pdf 𝑓(𝑥1, 𝑥2, …, 𝑥𝑛) and the marginal probability density functions 𝑓1(𝑥1), 𝑓2(𝑥2), …, 𝑓𝑛(𝑥𝑛), respectively. The definition of the independence of 𝑋1 and 𝑋2 is generalized to the mutual independence of 𝑋1, 𝑋2, …, 𝑋𝑛 as follows: the random variables 𝑋1, 𝑋2, …, 𝑋𝑛 are said to be mutually independent if and only if
𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) ≡ 𝑓1 (𝑥1 )𝑓2 (𝑥2 ) … 𝑓𝑛 (𝑥𝑛 ),
for the continuous case. In the discrete case, 𝑋1, 𝑋2, …, 𝑋𝑛 are said to be mutually independent if and only if
𝑝(𝑥1, 𝑥2, …, 𝑥𝑛) ≡ 𝑝1(𝑥1)𝑝2(𝑥2) ⋯ 𝑝𝑛(𝑥𝑛).
Suppose 𝑋1, 𝑋2, …, 𝑋𝑛 are mutually independent. Then
𝑃(𝑎1 < 𝑋1 < 𝑏1, 𝑎2 < 𝑋2 < 𝑏2, …, 𝑎𝑛 < 𝑋𝑛 < 𝑏𝑛) = 𝑃(𝑎1 < 𝑋1 < 𝑏1)𝑃(𝑎2 < 𝑋2 < 𝑏2) ⋯ 𝑃(𝑎𝑛 < 𝑋𝑛 < 𝑏𝑛) = ∏_{𝑖=1}^{𝑛} 𝑃(𝑎𝑖 < 𝑋𝑖 < 𝑏𝑖).
The joint mgf of 𝑋1, 𝑋2, …, 𝑋𝑛, when it exists, is defined as in the case of two variables, and the factorization
𝑀(𝑡1, 𝑡2, …, 𝑡𝑛) = 𝑀(𝑡1, 0, …, 0)𝑀(0, 𝑡2, 0, …, 0) ⋯ 𝑀(0, …, 0, 𝑡𝑛)
is a necessary and sufficient condition for the mutual independence of 𝑋1, 𝑋2, …, 𝑋𝑛. Note that we can write the joint mgf in vector notation as
𝑀(𝒕) = 𝐸[exp(𝒕′𝑿)], for 𝒕 ∈ 𝐵 ⊂ 𝑅^𝑛,
where 𝐵 = {𝒕 : −ℎ𝑖 < 𝑡𝑖 < ℎ𝑖, 𝑖 = 1, …, 𝑛} for some ℎ𝑖 > 0.
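The factorization of the joint mgf under mutual independence can be illustrated numerically (a sketch of our own; the exponential components and the vector 𝒕 are arbitrary choices, with each marginal mgf equal to 1/(1 − 𝑡𝑖) for 𝑡𝑖 < 1):

```python
# Sketch: for mutually independent components, M(t) = E[exp(t'X)] factors into
# the product of the marginal mgfs.
import numpy as np

rng = np.random.default_rng(1)
n = 10**6
X = rng.exponential(scale=1.0, size=(n, 3))     # X1, X2, X3 iid Exp(1)
t = np.array([0.2, -0.5, 0.1])                  # arbitrary point with each t_i < 1

M_mc = np.mean(np.exp(X @ t))                   # Monte Carlo estimate of E[exp(t'X)]
M_prod = np.prod(1.0 / (1.0 - t))               # product of marginal mgfs 1/(1 - t_i)
print(M_mc, M_prod)
```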
EXAMPLE 2.6.2
Let 𝑋1, 𝑋2, and 𝑋3 be three mutually independent random variables and let each have the pdf
𝑓(𝑥) = 2𝑥, 0 < 𝑥 < 1, zero elsewhere. (2.6.7)
The joint pdf of 𝑋1, 𝑋2, 𝑋3 is 𝑓(𝑥1)𝑓(𝑥2)𝑓(𝑥3) = 8𝑥1𝑥2𝑥3, 0 < 𝑥𝑖 < 1, 𝑖 = 1, 2, 3, zero elsewhere. Then, for illustration, the expected value of 5𝑋1𝑋2^3 + 3𝑋2𝑋3^4 is
∫_0^1 ∫_0^1 ∫_0^1 (5𝑥1𝑥2^3 + 3𝑥2𝑥3^4) 8𝑥1𝑥2𝑥3 𝑑𝑥1 𝑑𝑥2 𝑑𝑥3 = 2.
Let 𝑌 denote the maximum of 𝑋1, 𝑋2, 𝑋3. Then, for instance, we have
𝑃(𝑌 ≤ 1/2) = 𝑃(𝑋1 ≤ 1/2, 𝑋2 ≤ 1/2, 𝑋3 ≤ 1/2) = (1/2)^2 (1/2)^2 (1/2)^2 = (1/2)^6 = 1/64.
In a similar manner, we find that the cdf of 𝑌 is
𝐺(𝑦) = 𝑃(𝑌 ≤ 𝑦) = { 0, 𝑦 < 0;  𝑦^6, 0 ≤ 𝑦 < 1;  1, 1 ≤ 𝑦 }.
Accordingly, the pdf of 𝑌 is
𝑔(𝑦) = 6𝑦^5, 0 < 𝑦 < 1, zero elsewhere.
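A Monte Carlo sketch (our own illustration; the sample size, seed, and evaluation point are arbitrary) can be used to check the two computations in Example 2.6.2:

```python
# Monte Carlo check of Example 2.6.2.
import numpy as np

rng = np.random.default_rng(2)
n = 10**6
# If U ~ Uniform(0,1), then sqrt(U) has pdf 2x on (0,1) (inverse of the cdf x^2).
X = np.sqrt(rng.uniform(size=(n, 3)))

# E[5 X1 X2^3 + 3 X2 X3^4] should be close to 2
print(np.mean(5 * X[:, 0] * X[:, 1]**3 + 3 * X[:, 1] * X[:, 2]**4))

# cdf of Y = max(X1, X2, X3) at y = 0.8: empirical value vs theoretical y^6
Y = X.max(axis=1)
y = 0.8
print(np.mean(Y <= y), y**6)
```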
REMARK 2.6.1
If 𝑋1 , 𝑋2 , and 𝑋3 are mutually independent, they are pairwise independent (that is, 𝑋𝑖 and 𝑋𝑗 ,
𝑖 ≠ 𝑗, where 𝑖, 𝑗 = 1,2,3 are independent). However, the following example, attributed to S.
Bernstein, shows that pairwise independence doesn’t necessarily imply mutual independence.
Let 𝑋1, 𝑋2, and 𝑋3 have the joint pmf
𝑓(𝑥1, 𝑥2, 𝑥3) = 1/4 for (𝑥1, 𝑥2, 𝑥3) ∈ {(1,0,0), (0,1,0), (0,0,1), (1,1,1)}, zero elsewhere.
The joint pmf of 𝑋𝑖 and 𝑋𝑗, 𝑖 ≠ 𝑗, is
𝑓𝑖𝑗(𝑥𝑖, 𝑥𝑗) = 1/4 for (𝑥𝑖, 𝑥𝑗) ∈ {(0,0), (1,0), (0,1), (1,1)}, zero elsewhere,
whereas the marginal pmf of 𝑋𝑖 is
𝑓𝑖(𝑥𝑖) = 1/2 for 𝑥𝑖 = 0, 1, zero elsewhere.
Obviously, if 𝑖 ≠ 𝑗 we have
𝑓𝑖𝑗 (𝑥𝑖 , 𝑥𝑗 ) ≡ 𝑓𝑖 (𝑥𝑖 )𝑓𝑗 (𝑥𝑗 )
and thus 𝑋𝑖 and 𝑋𝑗 are independent. However,
𝑓(𝑥1 , 𝑥2 , 𝑥3 ) ≢ 𝑓1 (𝑥1 )𝑓2 (𝑥2 )𝑓3 (𝑥3 )
Thus 𝑋1 , 𝑋2, and 𝑋3 are not mutually independent.
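The claim can be checked by direct enumeration of the pmf; the following sketch is our own illustration.

```python
# Enumeration check of Bernstein's example: pairwise independence holds,
# mutual independence fails.
from itertools import product

support = {(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)}
p = {xs: (0.25 if xs in support else 0.0) for xs in product((0, 1), repeat=3)}

def marginal(i, xi):
    # p_i(xi): sum the joint pmf over the other two coordinates
    return sum(pr for xs, pr in p.items() if xs[i] == xi)

def pair(i, j, xi, xj):
    # p_ij(xi, xj): sum the joint pmf over the remaining coordinate
    return sum(pr for xs, pr in p.items() if xs[i] == xi and xs[j] == xj)

pairwise = all(
    abs(pair(i, j, a, b) - marginal(i, a) * marginal(j, b)) < 1e-12
    for i in range(3) for j in range(3) if i != j
    for a in (0, 1) for b in (0, 1)
)
mutual = all(
    abs(p[xs] - marginal(0, xs[0]) * marginal(1, xs[1]) * marginal(2, xs[2])) < 1e-12
    for xs in p
)
print(pairwise, mutual)     # prints: True False
```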
Unless there is a possible misunderstanding between mutual and pairwise independence, we usually drop the modifier mutual. Accordingly, using this practice in Example 2.6.2, we say that 𝑋1, 𝑋2, and 𝑋3 are independent random variables, meaning that they are mutually independent. Occasionally, for emphasis, we use mutually independent so that the reader is reminded that this is different from pairwise independence.
In addition, if several random variables are mutually independent and have the same distribution, we say that they are independent and identically distributed, which we abbreviate as iid. So the random variables in Example 2.6.2 are iid with the common pdf given in expression (2.6.7).
2.6.1 VARIANCE-COVARIANCE MATRIX
Let 𝑿 = (𝑋1, …, 𝑋𝑛)′ be an 𝑛-dimensional random vector. Recall that we defined 𝐸(𝑿) = (𝐸(𝑋1), …, 𝐸(𝑋𝑛))′; that is, the expectation of a random vector is just the vector of the expectations of its components. Now suppose 𝑾 is an 𝑚 × 𝑛 matrix of random variables, say 𝑾 = [𝑊𝑖𝑗] for the random variables 𝑊𝑖𝑗, 1 ≤ 𝑖 ≤ 𝑚 and 1 ≤ 𝑗 ≤ 𝑛. Note that we can always string out the matrix into an 𝑚𝑛 × 1 random vector. Hence, we define the expectation of a random matrix as
𝐸[𝑾] = [𝐸(𝑊𝑖𝑗)]. (2.6.8)
THEOREM 2.6.1
Let 𝑾1 and 𝑾2 be 𝑚 × 𝑛 matrices of random variables, let 𝑨1 and 𝑨2 be 𝑘 × 𝑚 matrices of constants, and let 𝑩 be an 𝑛 × 𝑙 matrix of constants. Then
𝐸[𝑨1𝑾1 + 𝑨2𝑾2] = 𝑨1𝐸[𝑾1] + 𝑨2𝐸[𝑾2] (2.6.9)
𝐸[𝑨1𝑾1𝑩] = 𝑨1𝐸[𝑾1]𝑩. (2.6.10)
PROOF
Because of the linearity of the operator 𝐸 on random variables, we have for the (𝑖, 𝑗)th component of expression (2.6.9) that
𝐸[∑_{𝑠=1}^{𝑚} 𝑎1,𝑖𝑠 𝑊1,𝑠𝑗 + ∑_{𝑠=1}^{𝑚} 𝑎2,𝑖𝑠 𝑊2,𝑠𝑗] = ∑_{𝑠=1}^{𝑚} 𝑎1,𝑖𝑠 𝐸[𝑊1,𝑠𝑗] + ∑_{𝑠=1}^{𝑚} 𝑎2,𝑖𝑠 𝐸[𝑊2,𝑠𝑗],
which is the (𝑖, 𝑗)th component of the right side of expression (2.6.9). The proof of expression (2.6.10) is similar.
For an 𝑛-dimensional random vector 𝑿 with mean vector 𝝁 = 𝐸[𝑿], the variance-covariance matrix of 𝑿 is defined by Cov(𝑿) = 𝐸[(𝑿 − 𝝁)(𝑿 − 𝝁)′]; its (𝑖, 𝑗)th entry is cov(𝑋𝑖, 𝑋𝑗) = 𝜎𝑖𝑗 and its (𝑖, 𝑖)th entry is the variance 𝜎𝑖^2 = Var(𝑋𝑖).
THEOREM 2.6.2
Let 𝑿 = (𝑋1, …, 𝑋𝑛)′ be an 𝑛-dimensional random vector, such that 𝜎𝑖^2 = 𝜎𝑖𝑖 = Var(𝑋𝑖) < ∞. Let 𝑨 be an 𝑚 × 𝑛 matrix of constants. Then
Cov(𝑿) = 𝐸[𝑿𝑿′] − 𝝁𝝁′ (2.6.13)
Cov(𝑨𝑿) = 𝑨Cov(𝑿)𝑨′ (2.6.14)
PROOF
Use Theorem 2.6.1 to derive (2.6.13); i.e.,
𝐶𝑜𝑣 (𝑿) = 𝐸[(𝑿 − 𝝁)(𝑿 − 𝝁)′ ]
= 𝐸[𝑿𝑿′ − 𝝁𝑿′ − 𝑿𝝁′ + 𝝁𝝁′ ]
= 𝐸[𝑿𝑿′] − 𝝁𝐸[𝑿′] − 𝐸[𝑿]𝝁′ + 𝝁𝝁′
= 𝐸[𝑿𝑿′] − 𝝁𝝁′ − 𝝁𝝁′ + 𝝁𝝁′ = 𝐸[𝑿𝑿′] − 𝝁𝝁′,
which is the desired result. For (2.6.14), we have
𝐶𝑜𝑣(𝑨𝑿) = 𝐸[𝑨(𝑿 − 𝝁)(𝑨(𝑿 − 𝝁))′ ]
= 𝐸[𝑨(𝑿 − 𝝁)(𝑿 − 𝝁)′ 𝑨′ ]
= 𝐸[(𝑨𝑿 − 𝑨𝝁)(𝑿′ 𝑨′ − 𝝁′ 𝑨′ )]
= 𝐸[𝑨𝑿𝑿′ 𝑨′ − 𝑨𝑿𝝁′ 𝑨′ − 𝑨𝝁𝑿′ 𝑨′ + 𝑨𝝁𝝁′ 𝑨′ ]
= 𝑨𝐸(𝑿𝑿′ )𝑨′ − 𝑨𝐸(𝑿)𝝁′ 𝑨′ − 𝑨𝝁𝐸(𝑿′ )𝑨′ + 𝑨𝝁𝝁′ 𝑨′
= 𝑨[𝐸[𝑿𝑿′] − 𝝁𝐸[𝑿′] − 𝐸[𝑿]𝝁′ + 𝝁𝝁′]𝑨′ = 𝑨[𝐸[𝑿𝑿′] − 𝝁𝝁′]𝑨′ = 𝑨Cov(𝑿)𝑨′.
All variance-covariance matrices are positive semidefinite (psd) matrices; that is, 𝒂′Cov(𝑿)𝒂 ≥ 0 for all vectors 𝒂 ∈ 𝑅^𝑛. To see this, let 𝑿 be a random vector and let 𝒂 be an 𝑛 × 1 vector of constants. Then 𝑌 = 𝒂′𝑿 is a random variable and, hence, has nonnegative variance; i.e.,
0 ≤ Var(𝑌) = Var(𝒂′𝑿) = 𝒂′Cov(𝑿)𝒂.
Hence, Cov(𝑿) is psd.
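Equations (2.6.13) and (2.6.14) and the psd property can be illustrated on simulated data (our own sketch; the distribution of 𝑿 and the matrix 𝑨 are arbitrary choices):

```python
# Sketch verifying Cov(X) = E[XX'] - mu mu' and Cov(AX) = A Cov(X) A' numerically.
import numpy as np

rng = np.random.default_rng(3)
n = 10**6
# X: 3-dimensional random vector with dependent components and nonzero mean
Z = rng.normal(size=(n, 3))
X = Z @ np.array([[1.0, 0.5, 0.0],
                  [0.0, 1.0, 0.3],
                  [0.0, 0.0, 1.0]]) + np.array([1.0, -2.0, 0.5])

mu = X.mean(axis=0)
cov1 = (X.T @ X) / n - np.outer(mu, mu)        # E[XX'] - mu mu' (sample version)
cov2 = np.cov(X, rowvar=False, bias=True)      # sample variance-covariance matrix
print(np.allclose(cov1, cov2, atol=1e-8))      # (2.6.13)

A = np.array([[1.0, 2.0, 3.0],
              [0.0, 1.0, -1.0]])               # 2 x 3 matrix of constants
cov_AX = np.cov(X @ A.T, rowvar=False, bias=True)
print(np.allclose(cov_AX, A @ cov2 @ A.T, atol=1e-8))   # (2.6.14)

# psd check: all eigenvalues of Cov(X) are nonnegative
print(np.all(np.linalg.eigvalsh(cov2) >= -1e-12))
```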
EXERCISE
1. Let 𝑋, 𝑌, 𝑍 have the joint pdf 𝑓(𝑥, 𝑦, 𝑧) = 2(𝑥 + 𝑦 + 𝑧)/3, 0 < 𝑥 < 1, 0 < 𝑦 < 1, 0 < 𝑧 < 1, zero elsewhere.
a. The marginal pdf of 𝑌 is
𝑓𝑌(𝑦) = ∫_0^1 ∫_0^1 𝑓(𝑥, 𝑦, 𝑧) 𝑑𝑥 𝑑𝑧 = (2/3) ∫_0^1 ∫_0^1 (𝑥 + 𝑦 + 𝑧) 𝑑𝑥 𝑑𝑧
= (2/3) ∫_0^1 [ (1/2)𝑥^2 + 𝑥𝑦 + 𝑥𝑧 ]_0^1 𝑑𝑧 = (2/3) ∫_0^1 ( 1/2 + 𝑦 + 𝑧 ) 𝑑𝑧
= (2/3) [ (1/2)𝑧 + 𝑦𝑧 + (1/2)𝑧^2 ]_0^1 = (2/3)(𝑦 + 1) = (2/3)𝑦 + 2/3, 0 < 𝑦 < 1.
Similarly, the marginal pdf of 𝑍 is
𝑓𝑍(𝑧) = ∫_0^1 ∫_0^1 𝑓(𝑥, 𝑦, 𝑧) 𝑑𝑥 𝑑𝑦 = (2/3) ∫_0^1 ∫_0^1 (𝑥 + 𝑦 + 𝑧) 𝑑𝑥 𝑑𝑦
= (2/3) ∫_0^1 ( 1/2 + 𝑦 + 𝑧 ) 𝑑𝑦 = (2/3) [ (1/2)𝑦 + (1/2)𝑦^2 + 𝑦𝑧 ]_0^1 = (2/3)(𝑧 + 1) = (2/3)𝑧 + 2/3, 0 < 𝑧 < 1,
and, by the same computation, 𝑓𝑋(𝑥) = (2/3)(𝑥 + 1) = (2/3)𝑥 + 2/3, 0 < 𝑥 < 1. All three marginal pdfs are zero elsewhere.
b.
𝑃(0 < 𝑋 < 1/2, 0 < 𝑌 < 1/2, 0 < 𝑍 < 1/2) = ∫_0^{1/2} ∫_0^{1/2} ∫_0^{1/2} 𝑓(𝑥, 𝑦, 𝑧) 𝑑𝑥 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^{1/2} ∫_0^{1/2} ∫_0^{1/2} (𝑥 + 𝑦 + 𝑧) 𝑑𝑥 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^{1/2} ∫_0^{1/2} [ (1/2)𝑥^2 + 𝑥𝑦 + 𝑥𝑧 ]_0^{1/2} 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^{1/2} ∫_0^{1/2} ( 1/8 + (1/2)𝑦 + (1/2)𝑧 ) 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^{1/2} ( 1/16 + 1/16 + (1/4)𝑧 ) 𝑑𝑧
= (2/3) ( 1/32 + 1/32 + 1/32 ) = (2/3)(3/32) = 1/16.
c. Multiplying the marginal pdfs gives
𝑓𝑋(𝑥)𝑓𝑌(𝑦)𝑓𝑍(𝑧) = ( (2/3)𝑥 + 2/3 )( (2/3)𝑦 + 2/3 )( (2/3)𝑧 + 2/3 ).
Since 𝑓(𝑥, 𝑦, 𝑧) ≢ 𝑓𝑋(𝑥)𝑓𝑌(𝑦)𝑓𝑍(𝑧), the random variables 𝑋, 𝑌, 𝑍 are not independent.
d.
𝐸(𝑋^2𝑌𝑍 + 3𝑋𝑌^4𝑍^2) = (2/3) ∫_0^1 ∫_0^1 ∫_0^1 (𝑥^2𝑦𝑧 + 3𝑥𝑦^4𝑧^2)(𝑥 + 𝑦 + 𝑧) 𝑑𝑥 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^1 ∫_0^1 ∫_0^1 (𝑥^3𝑦𝑧 + 3𝑥^2𝑦^4𝑧^2 + 𝑥^2𝑦^2𝑧 + 3𝑥𝑦^5𝑧^2 + 𝑥^2𝑦𝑧^2 + 3𝑥𝑦^4𝑧^3) 𝑑𝑥 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^1 ∫_0^1 ( (1/4)𝑦𝑧 + 𝑦^4𝑧^2 + (1/3)𝑦^2𝑧 + (3/2)𝑦^5𝑧^2 + (1/3)𝑦𝑧^2 + (3/2)𝑦^4𝑧^3 ) 𝑑𝑦 𝑑𝑧
= (2/3) ∫_0^1 ( (1/8)𝑧 + (1/5)𝑧^2 + (1/9)𝑧 + (1/4)𝑧^2 + (1/6)𝑧^2 + (3/10)𝑧^3 ) 𝑑𝑧
= (2/3) ∫_0^1 ( (17/72)𝑧 + (37/60)𝑧^2 + (3/10)𝑧^3 ) 𝑑𝑧
= (2/3) ( 17/144 + 37/180 + 3/40 ) = (2/3)(287/720) = 287/1080.
e. The cdf of 𝑋, 𝑌, and 𝑍 is
𝐹(𝑥, 𝑦, 𝑧) = 𝑃(𝑋 ≤ 𝑥, 𝑌 ≤ 𝑦, 𝑍 ≤ 𝑧) = (2/3) ∫_0^𝑧 ∫_0^𝑦 ∫_0^𝑥 (𝑢 + 𝑣 + 𝑤) 𝑑𝑢 𝑑𝑣 𝑑𝑤
= (2/3) ∫_0^𝑧 ∫_0^𝑦 ( (1/2)𝑥^2 + 𝑥𝑣 + 𝑥𝑤 ) 𝑑𝑣 𝑑𝑤
= (2/3) ∫_0^𝑧 ( (1/2)𝑥^2𝑦 + (1/2)𝑥𝑦^2 + 𝑥𝑦𝑤 ) 𝑑𝑤
= (2/3) ( (1/2)𝑥^2𝑦𝑧 + (1/2)𝑥𝑦^2𝑧 + (1/2)𝑥𝑦𝑧^2 )
= (1/3)𝑥^2𝑦𝑧 + (1/3)𝑥𝑦^2𝑧 + (1/3)𝑥𝑦𝑧^2, for 0 < 𝑥, 𝑦, 𝑧 < 1.
f. The conditional pdf of 𝑋 and 𝑌, given 𝑍 = 𝑧, is
𝑓𝑋,𝑌|𝑍(𝑥, 𝑦|𝑧) = 𝑓(𝑥, 𝑦, 𝑧) / 𝑓𝑍(𝑧) = [ (2/3)(𝑥 + 𝑦 + 𝑧) ] / [ (2/3)(𝑧 + 1) ] = (𝑥 + 𝑦 + 𝑧)/(𝑧 + 1),
for 0 < 𝑥 < 1, 0 < 𝑦 < 1, zero elsewhere. Hence
𝐸(𝑋 + 𝑌 | 𝑧) = ∫_0^1 ∫_0^1 (𝑥 + 𝑦)(𝑥 + 𝑦 + 𝑧)/(𝑧 + 1) 𝑑𝑥 𝑑𝑦
= 1/(𝑧 + 1) ∫_0^1 ∫_0^1 ( 𝑥^2 + 2𝑥𝑦 + 𝑦^2 + 𝑥𝑧 + 𝑦𝑧 ) 𝑑𝑥 𝑑𝑦
= 1/(𝑧 + 1) ( 1/3 + 1/2 + 1/3 + (1/2)𝑧 + (1/2)𝑧 )
= (7/6 + 𝑧)/(𝑧 + 1) = (6𝑧 + 7)/(6𝑧 + 6).
g. The marginal pdf of 𝑌 and 𝑍 is
𝑓𝑌,𝑍(𝑦, 𝑧) = ∫_0^1 (2/3)(𝑥 + 𝑦 + 𝑧) 𝑑𝑥 = (2/3)( 1/2 + 𝑦 + 𝑧 ),
so the conditional pdf of 𝑋, given 𝑌 = 𝑦 and 𝑍 = 𝑧, is
𝑓𝑋|𝑌,𝑍(𝑥|𝑦, 𝑧) = 𝑓(𝑥, 𝑦, 𝑧) / 𝑓𝑌,𝑍(𝑦, 𝑧) = (𝑥 + 𝑦 + 𝑧)/(1/2 + 𝑦 + 𝑧), 0 < 𝑥 < 1, zero elsewhere.
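Several of the answers above can be spot-checked by numerical integration (our own sketch; the evaluation points are arbitrary):

```python
# Numerical spot checks for Exercise 1.
import numpy as np
from scipy import integrate

f = lambda x, y, z: 2.0 * (x + y + z) / 3.0          # joint pdf on the unit cube

# (a) marginal pdf of Y at y = 0.3: should equal (2/3)(0.3 + 1) = 0.8667
fy, _ = integrate.nquad(lambda x, z: f(x, 0.3, z), [(0, 1), (0, 1)])
print(fy)

# (b) P(0 < X, Y, Z < 1/2): should equal 1/16
p, _ = integrate.nquad(f, [(0, 0.5), (0, 0.5), (0, 0.5)])
print(p, 1 / 16)

# (d) E(X^2 Y Z + 3 X Y^4 Z^2): should equal 287/1080
g = lambda x, y, z: (x**2 * y * z + 3 * x * y**4 * z**2) * f(x, y, z)
e, _ = integrate.nquad(g, [(0, 1), (0, 1), (0, 1)])
print(e, 287 / 1080)

# (f) E(X + Y | z) at z = 0.25: should equal (7/6 + z)/(1 + z)
z0 = 0.25
num, _ = integrate.nquad(lambda x, y: (x + y) * f(x, y, z0), [(0, 1), (0, 1)])
den, _ = integrate.nquad(lambda x, y: f(x, y, z0), [(0, 1), (0, 1)])
print(num / den, (7 / 6 + z0) / (1 + z0))
```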
2. Let 𝑓(𝑥1 , 𝑥2 , 𝑥3 ) = exp[−(𝑥1 + 𝑥2 + 𝑥3 )], 0 < 𝑥1 < ∞, 0 < 𝑥2 < ∞, 0 < 𝑥3 < ∞,
zero elsewhere, be the joint pdf of 𝑋1 , 𝑋2 , 𝑋3.
a. Compute 𝑃(𝑋1 < 𝑋2 < 𝑋3 ) and 𝑃(𝑋1 = 𝑋2 < 𝑋3 )
Solution
𝑓(𝑥1, 𝑥2, 𝑥3) = 𝑒^{−(𝑥1+𝑥2+𝑥3)}, 0 < 𝑥1 < ∞, 0 < 𝑥2 < ∞, 0 < 𝑥3 < ∞, zero elsewhere.
a. Computing 𝑃(𝑋1 < 𝑋2 < 𝑋3) gives
𝑃(𝑋1 < 𝑋2 < 𝑋3) = ∫_0^∞ ∫_0^{𝑥3} ∫_0^{𝑥2} 𝑒^{−(𝑥1+𝑥2+𝑥3)} 𝑑𝑥1 𝑑𝑥2 𝑑𝑥3
= ∫_0^∞ ∫_0^{𝑥3} (1 − 𝑒^{−𝑥2}) 𝑒^{−(𝑥2+𝑥3)} 𝑑𝑥2 𝑑𝑥3
= ∫_0^∞ ( 1/2 − 𝑒^{−𝑥3} + (1/2)𝑒^{−2𝑥3} ) 𝑒^{−𝑥3} 𝑑𝑥3
= 1/2 − 1/2 + 1/6 = 1/6.
Since the set {𝑥1 = 𝑥2 < 𝑥3} has three-dimensional volume zero, 𝑃(𝑋1 = 𝑋2 < 𝑋3) = 0.
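A Monte Carlo sketch (our own illustration) agrees with this value; by the symmetry of iid continuous variables, each of the 3! orderings has probability 1/6.

```python
# Monte Carlo check for Exercise 2(a); seed and sample size are arbitrary.
import numpy as np

rng = np.random.default_rng(4)
X = rng.exponential(size=(10**6, 3))        # iid Exp(1) samples of (X1, X2, X3)
print(np.mean((X[:, 0] < X[:, 1]) & (X[:, 1] < X[:, 2])), 1 / 6)
```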
For the random variables 𝑋1, 𝑋2, 𝑋3 of Remark 2.6.1 (Bernstein's example), we now verify that the joint mgf factors for each pair of variables but not for all three. We have
𝑀(𝑡1, 𝑡2, 0) = ∑_{𝑥1} ∑_{𝑥2} 𝑒^{𝑡1𝑥1+𝑡2𝑥2} 𝑝1,2(𝑥1, 𝑥2) = ∑_{𝑥1} ∑_{𝑥2} 𝑒^{𝑡1𝑥1+𝑡2𝑥2} (1/4)
= ∑_{𝑥1} ∑_{𝑥2} 𝑒^{𝑡1𝑥1+𝑡2𝑥2} (1/2)(1/2) = [∑_{𝑥1} 𝑒^{𝑡1𝑥1} (1/2)] [∑_{𝑥2} 𝑒^{𝑡2𝑥2} (1/2)]
= [∑_{𝑥1} 𝑒^{𝑡1𝑥1} 𝑝1(𝑥1)] [∑_{𝑥2} 𝑒^{𝑡2𝑥2} 𝑝2(𝑥2)] = 𝑀(𝑡1, 0, 0)𝑀(0, 𝑡2, 0),
while
𝑀(𝑡1, 0, 𝑡3) = ∑_{𝑥1} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡3𝑥3} 𝑝1,3(𝑥1, 𝑥3) = ∑_{𝑥1} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡3𝑥3} (1/4)
= ∑_{𝑥1} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡3𝑥3} (1/2)(1/2) = [∑_{𝑥1} 𝑒^{𝑡1𝑥1} (1/2)] [∑_{𝑥3} 𝑒^{𝑡3𝑥3} (1/2)]
= [∑_{𝑥1} 𝑒^{𝑡1𝑥1} 𝑝1(𝑥1)] [∑_{𝑥3} 𝑒^{𝑡3𝑥3} 𝑝3(𝑥3)] = 𝑀(𝑡1, 0, 0)𝑀(0, 0, 𝑡3),
while
𝑀(0, 𝑡2, 𝑡3) = ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡2𝑥2+𝑡3𝑥3} 𝑝2,3(𝑥2, 𝑥3) = ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡2𝑥2+𝑡3𝑥3} (1/4)
= ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡2𝑥2+𝑡3𝑥3} (1/2)(1/2) = [∑_{𝑥2} 𝑒^{𝑡2𝑥2} (1/2)] [∑_{𝑥3} 𝑒^{𝑡3𝑥3} (1/2)]
= [∑_{𝑥2} 𝑒^{𝑡2𝑥2} 𝑝2(𝑥2)] [∑_{𝑥3} 𝑒^{𝑡3𝑥3} 𝑝3(𝑥3)] = 𝑀(0, 𝑡2, 0)𝑀(0, 0, 𝑡3).
On the other hand, the joint mgf of 𝑋1, 𝑋2, 𝑋3 is
𝑀(𝑡1, 𝑡2, 𝑡3) = ∑_{𝑥1} ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡2𝑥2+𝑡3𝑥3} 𝑝(𝑥1, 𝑥2, 𝑥3) = ∑_{𝑥1} ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡2𝑥2+𝑡3𝑥3} (1/4),
where the sum runs over the four support points, while
𝑀(𝑡1, 0, 0)𝑀(0, 𝑡2, 0)𝑀(0, 0, 𝑡3) = [∑_{𝑥1} 𝑒^{𝑡1𝑥1} (1/2)] [∑_{𝑥2} 𝑒^{𝑡2𝑥2} (1/2)] [∑_{𝑥3} 𝑒^{𝑡3𝑥3} (1/2)]
= ∑_{𝑥1} ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡2𝑥2+𝑡3𝑥3} 𝑝1(𝑥1)𝑝2(𝑥2)𝑝3(𝑥3) = ∑_{𝑥1} ∑_{𝑥2} ∑_{𝑥3} 𝑒^{𝑡1𝑥1+𝑡2𝑥2+𝑡3𝑥3} (1/8),
where the sum runs over all eight points of {0,1}^3. Since 𝑝(𝑥1, 𝑥2, 𝑥3) = 1/4 on its support while 𝑝1(𝑥1)𝑝2(𝑥2)𝑝3(𝑥3) = 1/8 everywhere, the two expressions are not identical, so
𝑀(𝑡1, 𝑡2, 𝑡3) ≠ 𝑀(𝑡1, 0, 0)𝑀(0, 𝑡2, 0)𝑀(0, 0, 𝑡3),
and 𝑋1, 𝑋2, 𝑋3 are not mutually independent, even though the mgf factors for each pair.
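The failure of the three-way factorization can also be seen numerically by evaluating both sides at an arbitrary point 𝒕 (our own sketch):

```python
# Numeric check that the joint mgf of Bernstein's pmf factors pairwise but not jointly.
import numpy as np

support = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)]   # each with probability 1/4

def M(t1, t2, t3):
    # joint mgf evaluated directly from the pmf
    return sum(0.25 * np.exp(t1 * x1 + t2 * x2 + t3 * x3) for x1, x2, x3 in support)

t1, t2, t3 = 0.3, -0.7, 1.1          # arbitrary nonzero arguments
print(M(t1, t2, 0), M(t1, 0, 0) * M(0, t2, 0))                  # these two agree
print(M(t1, t2, t3), M(t1, 0, 0) * M(0, t2, 0) * M(0, 0, t3))   # these two differ
```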
Finally, we verify that each off-diagonal entry of the variance-covariance matrix Cov(𝑿) = 𝐸[(𝑿 − 𝝁)(𝑿 − 𝝁)′] is cov(𝑋𝑖, 𝑋𝑗). Each off-diagonal entry has the form
𝐸[(𝑋𝑖 − 𝜇𝑖)(𝑋𝑗 − 𝜇𝑗)] = 𝐸(𝑋𝑖𝑋𝑗 − 𝑋𝑖𝜇𝑗 − 𝑋𝑗𝜇𝑖 + 𝜇𝑖𝜇𝑗).
By the linearity of 𝐸 established earlier,
𝐸(𝑋𝑖𝑋𝑗 − 𝑋𝑖𝜇𝑗 − 𝑋𝑗𝜇𝑖 + 𝜇𝑖𝜇𝑗) = 𝐸(𝑋𝑖𝑋𝑗) − 𝜇𝑗𝐸(𝑋𝑖) − 𝜇𝑖𝐸(𝑋𝑗) + 𝜇𝑖𝜇𝑗 = 𝐸(𝑋𝑖𝑋𝑗) − 𝜇𝑖𝜇𝑗 = cov(𝑋𝑖, 𝑋𝑗).
Thus every off-diagonal entry of Cov(𝑿) is indeed cov(𝑋𝑖, 𝑋𝑗).