
Chapter 3: Methods for Generating Random Variables

Lecturer: Zhao Jianhua

Department of Statistics
Yunnan University of Finance and Economics
Outline

3.1 Introduction
Random Generators of Common Probability Distributions in R
3.2 The Inverse Transform Method
3.2.1 Inverse Transform Method, Continuous Case
3.2.2 Inverse Transform Method, Discrete Case
3.3 The Acceptance-Rejection Method
The Acceptance-Rejection Method
3.4 Transformation Methods
3.5 Sums and Mixtures
3.6 Multivariate Distributions
3.6.1 Multivariate Normal Distribution
3.6.2 Mixtures of Multivariate Normals
3.6.3 Wishart Distribution
3.6.4 Uniform Dist. on the d-Sphere
Introduction

• One of the fundamental tools required in computational statistics is the ability to simulate random variables (r.v.) from specified probability (prob.) distributions (dist.).
• A suitable generator of uniform pseudo-random numbers is essential. Methods for generating r.v. from other prob. dist. all depend on the uniform random number generator.
runif: Generating uniform random numbers.
runif(n)                  # generate a vector of size n in [0,1]
runif(n, a, b)            # generate n Uniform(a, b) numbers
matrix(runif(n*m), n, m)  # generate an n by m matrix
The sample function can be used to sample from a finite population,
with or without replacement.
Example 3.1 (Sampling from a finite population)
> # toss some coins
> sample(0:1, size = 10, replace = TRUE)
> # choose some lottery numbers
> sample(1:100, size = 6, replace = FALSE)
> # permutation of letters a-z
> sample(letters)
> # sample from a multinomial dist.
> x <- sample(1:3, size = 100, replace = TRUE, prob = c(.2, .3, .5))
> table(x)
x
 1  2  3
17 35 48
Random Generators of Common Prob. Dist. in R
The prob. mass function (pmf) or density (pdf), cumulative dist. function
(cdf), quantile function, and random generator of many commonly used
prob. dist. are available. For example:

dbinom(x, size, prob, log = FALSE); rbinom(n, size, prob)
pbinom(q, size, prob, lower.tail = TRUE, log.p = FALSE)
qbinom(p, size, prob, lower.tail = TRUE, log.p = FALSE)
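The d, p, and q functions of a family are mutually consistent; a quick sanity check with the binomial family (a minimal sketch, values easily verified by hand):

pbinom(3, size = 10, prob = 0.5)         # 0.171875 = P(X <= 3)
sum(dbinom(0:3, size = 10, prob = 0.5))  # same value, summing the pmf
qbinom(0.171875, size = 10, prob = 0.5)  # 3, the quantile function inverts the cdf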

Table: Selected Univariate Prob. Functions Available in R


Distribution        cdf      Generator  Parameters
beta                pbeta    rbeta      shape1, shape2
binomial            pbinom   rbinom     size, prob
chi-squared         pchisq   rchisq     df
exponential         pexp     rexp       rate
F                   pf       rf         df1, df2
gamma               pgamma   rgamma     shape, rate or scale
geometric           pgeom    rgeom      prob
lognormal           plnorm   rlnorm     meanlog, sdlog
negative binomial   pnbinom  rnbinom    size, prob
normal              pnorm    rnorm      mean, sd
Poisson             ppois    rpois      lambda
Student's t         pt       rt         df
uniform             punif    runif      min, max
The Inverse Transform Method
Theorem: If X is a continuous r.v. with cdf F_X(x), then U = F_X(X) ∼ U(0, 1).
Proof: F_X(x) = P(X ≤ x) = P(F_X(X) ≤ F_X(x)) = P(U ≤ F_X(x)); the second equality holds because F_X is an increasing function. By the definition of the uniform cdf, U ∼ U(0, 1).
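The theorem is easy to check empirically; a minimal sketch using the standard normal cdf pnorm:

u <- pnorm(rnorm(1000))  # U = F_X(X) with X ~ N(0,1)
hist(u, prob = TRUE)     # should look approximately Uniform(0,1)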
Define the inverse transformation

F_X^{-1}(u) = inf{x : F_X(x) = u},  0 < u < 1.

If U ∼ U(0, 1), then for all x ∈ R,

P(F_X^{-1}(U) ≤ x) = P(inf{t : F_X(t) = U} ≤ x)
                   = P(U ≤ F_X(x)) = F_U(F_X(x)) = F_X(x),

and therefore F_X^{-1}(U) has the same dist. as X.


To generate a random observation X:
1. Derive the inverse function F_X^{-1}(u).
2. Generate a random u from U(0, 1).
3. Deliver x = F_X^{-1}(u).
The method is easy to apply, provided that F_X^{-1} is easy to compute.
Inverse Transform Method, Continuous Case

Example 3.2 (Inverse transform method, continuous case)


Simulate a random sample from the dist. with density f_X(x) = 3x², 0 < x < 1.
Here F_X(x) = x³ for 0 < x < 1, and F_X^{-1}(u) = u^{1/3}, then
n <- 1000; u <- runif(n); x <- u^(1/3)
hist(x, prob = TRUE)  # density histogram of sample
y <- seq(0, 1, .01); lines(y, 3*y^2)  # density curve f(x)
Fig.3.1: Prob. density histogram of the sample with the theoretical density f(x) = 3x² superimposed.
R note 3.1
In Figure 3.1, the title includes a math expression, obtained by
hist(x, prob = TRUE, main = expression(f(x) == 3*x^2))
Alternatively, main = bquote(f(x) == 3*x^2) produces the same title. Math annotation is covered in the help topic for plotmath. Also see the help topics for text and axis.

Example 3.3 (Exponential dist.)


Generate a random sample from the exponential dist. X ∼ Exp(λ). For x > 0, the cdf of X is F_X(x) = 1 − e^{−λx}, so F_X^{-1}(u) = −(1/λ) log(1 − u). Note that U and 1 − U have the same dist., so it is simpler to set x = −(1/λ) log(u). To generate a random sample of size n with parameter lambda:

-log(runif(n)) / lambda

A generator rexp is available in R.

Inverse Transform Method, Discrete Case

If X is a discrete r.v. and

· · · < x_{i−1} < x_i < x_{i+1} < · · ·

are the points of discontinuity of F_X(x), define

F_X^{-1}(u) = inf{x : F_X(x) ≥ u},

so the inverse transformation is F_X^{-1}(u) = x_i, where F_X(x_{i−1}) < u ≤ F_X(x_i).
For each random variate required:
1. Generate a random u from U(0, 1).
2. Deliver x_i where F_X(x_{i−1}) < u ≤ F_X(x_i).
The solution of F_X(x_{i−1}) < u ≤ F_X(x_i) may be difficult for some dist.
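When X has finitely many support points, the search in step 2 can be vectorized with findInterval; a minimal sketch (the support points xs and probabilities ps are hypothetical, for illustration only):

xs <- c(0, 1, 4); ps <- c(.1, .6, .3)  # a hypothetical pmf
u <- runif(1000)
Fx <- cumsum(ps)                  # cdf at the support points
x <- xs[findInterval(u, Fx) + 1]  # delivers x_i with F(x_{i-1}) < u <= F(x_i)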
Example 3.4 (Two point distribution)
Generate a random sample of Bernoulli(p = 0.4) variates.
In this example, F_X(0) = f_X(0) = 1 − p and F_X(1) = 1. Thus F_X^{-1}(u) = 1 if u > 0.6 and F_X^{-1}(u) = 0 if u ≤ 0.6.
n <- 1000; p <- 0.4; u <- runif(n)
x <- as.integer(u > 0.6)  # (u > 0.6) is a logical vector
> mean(x); var(x)
[1] 0.41
[1] 0.2421421

The sample mean and variance should be approximately p = 0.4 and p(1 − p) = 0.24.

R note 3.2
The rbinom (random binomial) function with size = 1 generates a Bernoulli sample. Another method is to sample from the vector (0, 1) with prob. (1 − p, p).
rbinom(n, size = 1, prob = p)
sample(c(0, 1), size = n, replace = TRUE, prob = c(.6, .4))
Example 3.5 (Geometric distribution)
Generate a random geometric sample with parameter p = 1/4.
The pmf is f(x) = p q^x, x = 0, 1, 2, · · ·, where q = 1 − p. At the points of discontinuity x = 0, 1, 2, · · ·, the cdf is F(x) = 1 − q^{x+1}. For each sample element, we generate a random uniform u and solve

1 − q^x < u ≤ 1 − q^{x+1},

which simplifies to x < log(1 − u)/log(q) ≤ x + 1. Thus

x + 1 = ⌈log(1 − u)/log(q)⌉, where ⌈t⌉ denotes the ceiling function.
n <- 1000
p <- 0.25
u <- runif(n)
k <- ceiling(log(1 - u) / log(1 - p)) - 1

Since U and 1 − U have the same dist. and the prob. that log(1 − u)/log(1 − p) equals an integer is zero, the last step simplifies to

k <- floor(log(u) / log(1 - p))

Poisson distribution
The basic method to generate a Poisson(λ) variate is to generate and store the cdf via the recursive formula

f(x + 1) = λ f(x)/(x + 1);  F(x + 1) = F(x) + f(x + 1).

For each Poisson variate, a random uniform u is generated, and the cdf vector is searched for the solution to F(x − 1) < u ≤ F(x).
Note: the R function rpois generates random Poisson samples.
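A minimal sketch of this recursive-cdf method (the helper name rpois.cdf and the initial cdf length N = 100 are assumptions for illustration; in practice use rpois):

rpois.cdf <- function(n, lambda, N = 100) {
  # build the pmf/cdf recursively: f(x+1) = lambda*f(x)/(x+1)
  fx <- numeric(N); fx[1] <- exp(-lambda)  # f(0)
  for (x in 1:(N - 1)) fx[x + 1] <- lambda * fx[x] / x
  Fx <- cumsum(fx)
  # inverse transform: count how many cdf values fall below each u
  u <- runif(n)
  vapply(u, function(ui) sum(ui > Fx), integer(1))
}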
Example 3.6 (Logarithmic distribution)
Simulate a Logarithmic(θ) random sample with pmf

f(x) = P(X = x) = a θ^x / x,  x = 1, 2, · · ·  (3.1)

where 0 < θ < 1 and a = (−log(1 − θ))^{−1}. A recursive formula for f(x) is

f(x + 1) = θ x f(x)/(x + 1),  x = 1, 2, · · ·  (3.2)

Generate random samples from the Logarithmic(0.5) dist.

n <- 1000; theta <- 0.5; x <- rlogarithmic(n, theta)
# compute density of Logarithmic(theta) for comparison
k <- sort(unique(x)); p <- -1 / log(1 - theta) * theta^k / k
se <- sqrt(p * (1 - p) / n)  # standard error

Sample relative frequencies (line 1) match the theoretical Logarithmic(0.5) dist. (line 2) within two standard errors.
> round(rbind(table(x)/n, p, se), 3)
       1     2     3     4     5     6     7
   0.741 0.169 0.049 0.026 0.008 0.003 0.004
p  0.721 0.180 0.060 0.023 0.009 0.004 0.002
se 0.014 0.012 0.008 0.005 0.003 0.002 0.001

rlogarithmic
• Initially choose a length N for the cdf vector and compute F(x), x = 1, 2, · · ·, N. If necessary, N will be increased.
rlogarithmic <- function(n, theta) {
  # returns a random Logarithmic(theta) sample of size n
  u <- runif(n)
  # set the initial length of the cdf vector
  N <- ceiling(-16 / log10(theta))
  k <- 1:N; a <- -1 / log(1 - theta)
  fk <- a * theta^k / k
  Fk <- cumsum(fk); x <- integer(n)
  for (i in 1:n) {
    x[i] <- as.integer(sum(u[i] > Fk))  # F^{-1}(u) - 1
    while (x[i] == N) {
      # if x == N we need to extend the cdf
      # very unlikely because N is large
      f <- a * theta^(N + 1) / (N + 1)  # f(N+1)
      fk <- c(fk, f)
      Fk <- c(Fk, Fk[N] + fk[N + 1])
      N <- N + 1
      x[i] <- as.integer(sum(u[i] > Fk))
    }
  }
  x + 1
}
The Acceptance-Rejection Method
Suppose that X and Y are r.v. with density or pmf f and g respectively, and there exists a constant c such that f(t)/g(t) ≤ c for all t such that f(t) > 0. Then the acceptance-rejection method can be applied to generate the r.v. X.
The acceptance-rejection method
1. Find a r.v. Y with density g satisfying f(t)/g(t) ≤ c for all t such that f(t) > 0.
2. Generate a random y from the dist. with density g.
3. Generate a random u from the U(0, 1) dist.
4. If u < f(y)/(c g(y)), accept y and deliver x = y; otherwise reject y and repeat steps 2-4.
Note that in step 4, P(accept | Y) = P(U < f(Y)/(c g(Y)) | Y) = f(Y)/(c g(Y)).
The total prob. of acceptance for any iteration is

Σ_y P(accept | y) P(Y = y) = Σ_y [f(y)/(c g(y))] g(y) = 1/c.
In the discrete case, for each k such that f(k) > 0,

P(Y = k | A) = P(A | k) g(k) / P(A) = [f(k)/(c g(k))] g(k) / (1/c) = f(k).
In the continuous case, we need to prove that P(Y ≤ y | U ≤ f(Y)/(c g(Y))) = F_X(y). In fact,

P(Y ≤ y | U ≤ f(Y)/(c g(Y))) = P(U ≤ f(Y)/(c g(Y)), Y ≤ y) / (1/c)
  = ∫_{−∞}^{y} [P(U ≤ f(ω)/(c g(ω)) | Y = ω) / (1/c)] g(ω) dω
  = c ∫_{−∞}^{y} [f(ω)/(c g(ω))] g(ω) dω = ∫_{−∞}^{y} f(ω) dω = F_X(y).
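Putting the steps together in code; a minimal generic sketch (the helper rejection and its argument names df, rg, dg are assumptions for illustration, not standard R):

rejection <- function(n, df, rg, dg, c) {
  # df: target density f; rg: sampler for g; dg: density g
  x <- numeric(n); k <- 0
  while (k < n) {
    y <- rg(1); u <- runif(1)
    if (u < df(y) / (c * dg(y))) { k <- k + 1; x[k] <- y }
  }
  x
}
# e.g. Beta(2,2) as in Example 3.7:
# rejection(1000, function(t) 6*t*(1-t), runif, dunif, c = 6)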
Example 3.7 (Acceptance-rejection method, Beta(α = 2, β = 2) dist.)
The Beta(2,2) density is f(x) = 6x(1 − x), 0 < x < 1. Let g(x) be the U(0, 1) density. Then f(x)/g(x) ≤ 6 for all 0 < x < 1, so c = 6. A random x from g(x) is accepted if

f(x)/(c g(x)) = x(1 − x) > u.

On average, cn = 6000 iterations will be required for a sample of size 1000.
n <- 1000; k <- 0     # counter for accepted
y <- numeric(n); j <- 0  # iterations
while (k < n) {
  u <- runif(1); j <- j + 1
  x <- runif(1)  # random variate from g
  if (x * (1 - x) > u) { k <- k + 1; y[k] <- x }  # we accept x
}
> j
[1] 5873

This simulation required 5873 iterations (11746 random numbers).


# compare empirical and theoretical percentiles
p <- seq(.1, .9, .1); Qhat <- quantile(y, p)  # quantiles of sample
Q <- qbeta(p, 2, 2)  # theoretical quantiles
se <- sqrt(p * (1 - p) / (n * dbeta(Q, 2, 2)))  # see Ch. 1
Sample percentiles (line 1) approximately match the Beta(2,2) percentiles computed by qbeta (line 2).
> round(rbind(Qhat, Q, se), 3)
       10%   20%   30%   40%   50%   60%   70%   80%   90%
Qhat 0.189 0.293 0.365 0.449 0.519 0.589 0.665 0.741 0.830
Q    0.196 0.287 0.363 0.433 0.500 0.567 0.637 0.713 0.804
se   0.010 0.011 0.012 0.013 0.013 0.013 0.012 0.011 0.010

With n = 10000 replicates the estimates are more precise.

> round(rbind(Qhat, Q, se), 3)
       10%   20%   30%   40%   50%   60%   70%   80%   90%
Qhat 0.194 0.292 0.368 0.436 0.504 0.572 0.643 0.716 0.804
Q    0.196 0.287 0.363 0.433 0.500 0.567 0.637 0.713 0.804
se   0.003 0.004 0.004 0.004 0.004 0.004 0.004 0.004 0.003

Large numbers of replicates are required for estimation of percentiles where the density is close to zero.
Remark 3.2
See Example 3.8 for a more efficient beta generator based on the
ratio of gammas method.
Transformation Methods
Many types of transformations other than the prob. inverse transformation can be applied to simulate r.v.:
1. If Z ∼ N(0, 1), then V = Z² ∼ χ²(1).
2. If U ∼ χ²(m) and V ∼ χ²(n) are independent, then F = (U/m)/(V/n) ∼ F(m, n).
3. If Z ∼ N(0, 1) and V ∼ χ²(n) are independent, then T = Z/√(V/n) ∼ t(n).
4. If U, V ∼ U(0, 1) are independent, then Z1 = √(−2 log U) cos(2πV) and Z2 = √(−2 log U) sin(2πV) are independent standard normal variables (the Box-Muller transformation; a sketch follows this list).
5. If U ∼ Gamma(r, λ) and V ∼ Gamma(s, λ) are independent, then X = U/(U + V) has the Beta(r, s) dist.
6. If U, V ∼ U(0, 1) are independent, then X = ⌊1 + log(V)/log(1 − (1 − θ)^U)⌋ has the Logarithmic(θ) dist., where ⌊x⌋ denotes the integer part of x.
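A minimal sketch of transformation 4 (Box-Muller); in practice rnorm is preferred:

u <- runif(1000); v <- runif(1000)
r <- sqrt(-2 * log(u))     # radius
z1 <- r * cos(2 * pi * v)  # two independent N(0,1) samples
z2 <- r * sin(2 * pi * v)
qqnorm(z1); qqline(z1)     # check normality of z1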
Example 3.8 (Beta distribution)

If U ∼ Gamma(r, λ) and V ∼ Gamma(s, λ) are independent, then X = U/(U + V) has the Beta(r, s) dist.
1. Generate u from Gamma(a, 1).
2. Generate v from Gamma(b, 1).
3. Deliver x = u/(u + v).
Beta(3,2) sample:
n <- 1000; a <- 3; b <- 2
u <- rgamma(n, shape = a, rate = 1)
v <- rgamma(n, shape = b, rate = 1)
x <- u / (u + v)

q <- qbeta(ppoints(n), a, b)
qqplot(q, x, cex = 0.25, xlab = "Beta(3, 2)", ylab = "Sample")
abline(0, 1)  # line x = q added for reference
The sample data can be compared with the Beta(3, 2) dist. using a
quantile-quantile (QQ) plot. If the sampled dist. is Beta(3, 2), the
QQ plot should be nearly linear.

Fig.3.2: QQ plot comparing the Beta(3, 2) distribution with simulated data in Example 3.8.
Remark: The QQ plot of the ordered sample vs the Beta(3, 2)
quantiles is very nearly linear.
Example 3.9 (Logarithmic dist., more efficient generator)
If U, V are independent U(0, 1) r.v., then X = ⌊1 + log(V)/log(1 − (1 − θ)^U)⌋ has the Logarithmic(θ) dist.
1. Generate u from U(0, 1).
2. Generate v from U(0, 1).
3. Deliver x = ⌊1 + log(v)/log(1 − (1 − θ)^u)⌋.

# compare the Logarithmic(0.5) dist. with
# a sample using the transformation
n <- 1000; theta <- 0.5
u <- runif(n)  # generate logarithmic sample
v <- runif(n)
x <- floor(1 + log(v) / log(1 - (1 - theta)^u))
k <- 1:max(x)  # calc. logarithmic probs.
p <- -1 / log(1 - theta) * theta^k / k
se <- sqrt(p * (1 - p) / n); p.hat <- tabulate(x) / n
> print(round(rbind(p.hat, p, se), 3))
       [,1]  [,2]  [,3]  [,4]  [,5]  [,6]  [,7]
p.hat 0.740 0.171 0.052 0.018 0.010 0.006 0.003
p     0.721 0.180 0.060 0.023 0.009 0.004 0.002
se    0.014 0.012 0.008 0.005 0.003 0.002 0.001

Below is a simple replacement for rlogarithmic in Example 3.6


rlogarithmic <- function(n, theta) {
  stopifnot(all(theta > 0 & theta < 1))
  th <- rep(theta, length = n)
  u <- runif(n)
  v <- runif(n)
  x <- floor(1 + log(v) / log(1 - (1 - th)^u))
  return(x)
}
Sums and Mixtures

Sums and mixtures of r.v. are special types of transformations.

Convolutions
Let X1, . . . , Xn be i.i.d. with dist. Xj ∼ X, and let S = X1 + · · · + Xn. The dist. function of the sum S is called the n-fold convolution of X. It is straightforward to simulate a convolution by directly generating X1, . . . , Xn and computing the sum.
Examples:
• If ν > 0 is an integer, the chisquare dist. with ν degrees of freedom is the convolution of ν iid squared standard normal variables.
• The negative binomial dist. NegBin(r, p) is the convolution of r iid Geom(p) r.v.
• The convolution of r independent Exp(λ) r.v. has the Gamma(r, λ) dist.
Note: the functions rchisq, rgeom and rnbinom generate chisquare, geometric and negative binomial random samples directly.
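As a quick check of the second fact above, a convolution of geometrics can be compared with rnbinom; a minimal sketch (r = 5 and p = 0.3 are arbitrary choices for illustration):

n <- 1000; r <- 5; p <- 0.3
s <- rowSums(matrix(rgeom(n * r, p), n, r))  # convolution of r Geom(p) samples
y <- rnbinom(n, size = r, prob = p)          # direct NegBin generator
c(mean(s), mean(y))  # both approximately r*(1-p)/p = 11.67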
Example 3.10 (Chisquare)
Generate a chisquare χ²(ν) random sample. If Z1, · · · , Zν are iid N(0, 1) r.v., then V = Z1² + · · · + Zν² has the χ²(ν) dist.
1. Fill an n × ν matrix with nν r.v. from N(0, 1).
2. Compute the row sums of the squared normals.
3. Deliver the vector of row sums.
n <- 1000; nu <- 2
X <- matrix(rnorm(n * nu), n, nu)^2  # matrix of sq. normals
# sum the squared normals across each row: method 1
y <- rowSums(X)
y <- apply(X, MARGIN = 1, FUN = sum)  # a vector of length n, method 2
> mean(y); mean(y^2)
[1] 2.027334
[1] 7.835872

Sample statistics agree with the theoretical moments E[Y] = ν = 2 and E[Y²] = 2ν + ν² = 8. The standard errors of the sample moments are 0.063 and 0.089.
R note 3.5
• apply: apply a function to the margins of an array. The function (FUN = sum) is applied to the rows (MARGIN = 1).
• For efficient programming in R, avoid unnecessary loops.
Discrete mixture
The dist. of X is a weighted sum F_X(x) = Σ_i θ_i F_{X_i}(x) for some sequence of r.v. X1, X2, · · · and θ_i > 0 such that Σ_i θ_i = 1.

Continuous mixture
The dist. of X is F_X(x) = ∫_{−∞}^{+∞} F_{X|Y=y}(x) f_Y(y) dy for a family X|Y = y indexed by the real numbers y and a weighting function f_Y such that ∫_{−∞}^{+∞} f_Y(y) dy = 1.

Example: 50% normal mixture

F_X(x) = p F_{X1}(x) + (1 − p) F_{X2}(x)

To simulate the mixture (see the sketch after these steps):
1. Generate an integer k ∈ {1, 2}, where P(1) = P(2) = 0.5.
2. If k = 1, deliver a random x from N(0, 1);
3. if k = 2, deliver a random x from N(3, 1).
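A minimal vectorized sketch of these steps, using 0/1 component labels from rbinom (the same generate-both-and-select pattern as Example 3.11 below):

n <- 1000; p <- 0.5
k <- rbinom(n, size = 1, prob = p)         # component labels
x <- k * rnorm(n) + (1 - k) * rnorm(n, 3)  # mixture of N(0,1) and N(3,1)
hist(x, prob = TRUE)                       # typically bimodal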
Example 3.11 (Convolutions and mixtures)
Let X1 ∼ Gamma(2, 2) and X2 ∼ Gamma(2, 4) be independent. Compare the histograms of the samples generated by the convolution S = X1 + X2 and the mixture F_X = 0.5 F_{X1} + 0.5 F_{X2}.
(Figure: density histograms, panels "Histogram of s" and "Histogram of x".)
Remark: The histograms of the convolution S and the mixture X are different.

n <- 1000; x1 <- rgamma(n, 2, 2)
x2 <- rgamma(n, 2, 4); s <- x1 + x2  # the convolution
u <- runif(n)
k <- as.integer(u > 0.5)  # vector of 0s and 1s
x <- k * x1 + (1 - k) * x2  # the mixture
par(mfcol = c(1, 2))  # two graphs per page
hist(s, prob = TRUE); hist(x, prob = TRUE)
par(mfcol = c(1, 1))  # restore display

R note 3.6: par: set (or query) certain graphical parameters. par(): return a list of all graphical parameters. par(mfcol = c(n, m)): configure the graphical device to display n × m graphs per screen (n rows and m columns).
Example 3.12 (Mixture of several gamma dist.)
There are several components to the mixture and the mixing weights are not uniform. The mixture is F_X = Σ_{j=1}^{5} θ_j F_{X_j}, where the X_j ∼ Gamma(r = 3, λ_j = 1/j) are independent and the mixing prob. are θ_j = j/15, j = 1, · · · , 5.
To simulate the mixture F_X:
1. Generate an integer k ∈ {1, 2, 3, 4, 5}, P(k) = θ_k, k = 1, . . . , 5.
2. Deliver a random Gamma(r, λ_k) variate.
This suggests using a for loop to generate a sample of size n, but for loops are really inefficient in R.
Efficient vectorized algorithm:
1. Generate a random sample k1, . . . , kn of integers in a vector k, where P(k) = θ_k, k = 1, . . . , 5. k[i] indicates which of the five gamma distributions will be sampled to get the i-th element of the sample (use sample).
2. Set rate equal to the length-n vector λ = (λ_k).
3. Generate a gamma sample of size n, with shape parameter r and rate vector rate (use rgamma).
n <- 5000
k <- sample(1:5, size = n, replace = TRUE, prob = (1:5)/15)
rate <- 1 / k; x <- rgamma(n, shape = 3, rate = rate)
# plot the density of the mixture
# with the densities of the components
plot(density(x), xlim = c(0, 40), ylim = c(0, .3),
     lwd = 3, xlab = "x", main = "")
for (i in 1:5)
  lines(density(rgamma(n, 3, 1/i)))
Fig.3.4: Density estimates from Example 3.12: a mixture (thick line) of several gamma densities (thin lines).
Example 3.14 (Plot density of mixture)
The density is f(x) = Σ_{j=1}^{5} θ_j f_j(x), x > 0, where f_j is the Gamma(3, λ_j) density, with rates λ = (1, 1.5, 2, 2.5, 3) and mixing prob. θ = (0.1, 0.2, 0.2, 0.3, 0.2).
A function to compute the density f(x):
f <- function(x, lambda, theta) {
  # density of the mixture at the point x
  sum(dgamma(x, 3, lambda) * theta)
}

Here dgamma(x, 3, lambda)*theta is the vector (θ_1 f_1(x), · · · , θ_5 f_5(x)).


lambda <- c(1, 1.5, 2, 2.5, 3); p <- c(.1, .2, .2, .3, .2)
x <- seq(0, 8, length = 200); dim(x) <- length(x)  # needed for apply
y <- apply(x, 1, f, lambda = lambda, theta = p)  # compute density f(x)
# plot the density of the mixture
plot(x, y, type = "l", ylim = c(0, .85), lwd = 3, ylab = "Density")
for (j in 1:5) {
  # add the j-th gamma density to the plot
  y <- apply(x, 1, dgamma, shape = 3, rate = lambda[j])
  lines(x, y)
}
Fig.3.5: Densities from Example 3.14: a mixture (thick line) of several gamma densities (thin lines).

R note 3.7
• The apply function requires a dimension attribute for x.
• A vector x does not have a dimension attribute, so one is assigned by dim(x) <- length(x). Alternatively, x <- as.matrix(x) converts x to a matrix (a column vector), which has a dimension attribute.
Example 3.15 (Poisson-Gamma mixture, continuous mixture)
If (X|Λ = λ) ∼ Poisson(λ) and Λ ∼ Gamma(r, β), then X has the negative binomial dist. with parameters r and p = β/(1 + β).
# generate a Poisson-Gamma mixture
n <- 1000; r <- 4; beta <- 3
lambda <- rgamma(n, r, beta)  # lambda is random
# now supply the sample of lambdas as the Poisson mean
x <- rpois(n, lambda)  # the mixture
# compare with negative binomial
mix <- tabulate(x + 1) / n
negbin <- round(dnbinom(0:max(x), r, beta/(1 + beta)), 3)
se <- sqrt(negbin * (1 - negbin) / n)
> round(rbind(mix, negbin, se), 3)
        [,1]  [,2]  [,3]  [,4]  [,5]  [,6]  [,7]  [,8]  [,9]
mix    0.334 0.305 0.201 0.091 0.042 0.018 0.005 0.003 0.001
negbin 0.316 0.316 0.198 0.099 0.043 0.017 0.006 0.002 0.001
se     0.015 0.015 0.013 0.009 0.006 0.004 0.002 0.001 0.001
3.6.1 Multivariate Normal Dist.
A random vector X = (X1, · · · , Xd) has a d-dimensional multivariate normal (MVN) dist., denoted Nd(µ, Σ), if the density of X is

f(x) = (2π)^{−d/2} |Σ|^{−1/2} exp{−(1/2)(x − µ)^T Σ^{−1} (x − µ)},  x ∈ R^d.

Suppose that Σ can be factored as Σ = C C^T for some matrix C. Let Z ∼ Nd(0, I_d); then CZ + µ ∼ Nd(µ, Σ). A factorization of Σ can be obtained by the spectral decomposition method (eigenvector decomposition, eigen), Choleski factorization (chol), or singular value decomposition (svd).
To generate a random sample of size n from Nd(µ, Σ):
1. Generate an n × d matrix Z containing nd random N(0, 1) variates.
2. Compute a factorization Σ = Q^T Q.
3. Apply the transformation X = ZQ + Jµ^T, where J is an n × 1 vector of ones, and deliver X.
Z <- matrix(rnorm(n*d), nrow = n, ncol = d)
X <- Z %*% Q + matrix(mu, n, d, byrow = TRUE)
Spectral decomposition method for generating Nd(µ, Σ) samples
Spectral decomposition: Σ^{1/2} = P Λ^{1/2} P^{−1}, Q = Σ^{1/2}, Q^T Q = Σ, where Λ is the diagonal matrix with the eigenvalues of Σ along the diagonal and P is the matrix whose columns are the corresponding eigenvectors. This method is also called the eigen-decomposition; in this case P^{−1} = P^T and Σ^{1/2} = P Λ^{1/2} P^T.
Generate a random sample from Nd(µ, Σ) with µ = (0, 0)^T and

Σ = ( 1.0  0.9
      0.9  1.0 )

# mean and covariance parameters
mu <- c(0, 0); Sigma <- matrix(c(1, .9, .9, 1), nrow = 2, ncol = 2)
rmvn.eigen <- function(n, mu, Sigma) {
  # generate n random vectors from MVN(mu, Sigma)
  # dimension is inferred from mu and Sigma
  d <- length(mu); ev <- eigen(Sigma, symmetric = TRUE)
  lambda <- ev$values; V <- ev$vectors
  R <- V %*% diag(sqrt(lambda)) %*% t(V)
  Z <- matrix(rnorm(n*d), nrow = n, ncol = d)
  X <- Z %*% R + matrix(mu, n, d, byrow = TRUE)
  X
}
Check results: print summary statistics and display a scatterplot.

# generate the sample
X <- rmvn.eigen(1000, mu, Sigma)
plot(X, xlab = "x", ylab = "y", pch = 20)
> print(colMeans(X))
[1] -0.001628189  0.023474775
> print(cor(X))
          [,1]      [,2]
[1,] 1.0000000 0.8931007
[2,] 0.8931007 1.0000000
Fig.3.6: Scatterplot of a random bivariate normal sample from Example 3.16.
Remark: The scatterplot exhibits the elliptical symmetry of the multivariate normal dist.
SVD Method of generating Nd(µ, Σ) samples
SVD generalizes the idea of eigenvectors to rectangular matrices.
• svd: X = U D V^T, where D is the diagonal matrix of singular values of X (returned by R's svd as the vector d), U is a matrix whose columns contain the left singular vectors of X, and V is a matrix whose columns contain the right singular vectors of X.
• Since Σ ≻ 0 (positive definite), U = V and U V^T = I, so Σ^{1/2} = U Λ^{1/2} U^T and svd is equivalent to the spectral decomposition.
• svd is less efficient because it does not take advantage of the fact that the matrix Σ is square and symmetric.

rmvn.svd <- function(n, mu, Sigma) {
  # generate n random vectors from MVN(mu, Sigma)
  # dimension is inferred from mu and Sigma
  d <- length(mu); S <- svd(Sigma)
  R <- S$u %*% diag(sqrt(S$d)) %*% t(S$v)  # sq. root of Sigma
  Z <- matrix(rnorm(n*d), nrow = n, ncol = d)
  X <- Z %*% R + matrix(mu, n, d, byrow = TRUE)
  X
}
Choleski factorization method

Choleski factorization: Σ = Q^T Q (Σ ≻ 0), where Q is an upper triangular matrix. The R syntax is Q = chol(Sigma).
rmvn.Choleski <- function(n, mu, Sigma) {
  d <- length(mu)
  Q <- chol(Sigma)  # Choleski factorization of Sigma
  Z <- matrix(rnorm(n*d), nrow = n, ncol = d)
  X <- Z %*% Q + matrix(mu, n, d, byrow = TRUE)
  X
}

Generate 200 random observations from a 4-D normal using chol:

y <- subset(x = iris, Species == "virginica")[, 1:4]
mu <- colMeans(y); Sigma <- cov(y)
# now generate MVN data with this mean and covariance
X <- rmvn.Choleski(200, mu, Sigma); pairs(X)
Fig.3.7: Pairs plot of the bivariate marginal distributions of a simulated multivariate normal random sample in Example 3.18 (variables Sepal.Length, Sepal.Width, Petal.Length, Petal.Width).
Remark: The joint distribution of each pair of marginals is theoretically bivariate normal. The plot can be compared with Figure 4.1, which displays the iris virginica data.
Comparing Performance of Generators
• Compare the timing of the above three generators and two other generators (mvrnorm from MASS and rmvnorm from mvtnorm).
• Generate data in R^30. The system.time function is used.
library(MASS); library(mvtnorm)
n <- 100   # sample size
d <- 30    # dimension
N <- 2000  # iterations
mu <- numeric(d)
set.seed(100); system.time(for (i in 1:N)
  rmvn.eigen(n, mu, cov(matrix(rnorm(n*d), n, d))))
set.seed(100); system.time(for (i in 1:N)
  rmvn.svd(n, mu, cov(matrix(rnorm(n*d), n, d))))
set.seed(100); system.time(for (i in 1:N)
  rmvn.Choleski(n, mu, cov(matrix(rnorm(n*d), n, d))))
set.seed(100); system.time(for (i in 1:N)
  mvrnorm(n, mu, cov(matrix(rnorm(n*d), n, d))))
set.seed(100); system.time(for (i in 1:N)
  rmvnorm(n, mu, cov(matrix(rnorm(n*d), n, d))))
set.seed(100); system.time(for (i in 1:N)
  cov(matrix(rnorm(n*d), n, d)))
3.6.2 Mixtures of Multivariate Normals
A multivariate normal mixture is denoted

p Nd(µ1, Σ1) + (1 − p) Nd(µ2, Σ2).  (3.7)

To simulate the mixture:
1. Generate U ∼ U(0, 1) (equivalently, N ∼ Bernoulli(p)).
2. If U ≤ p (N = 1), generate X from Nd(µ1, Σ1); otherwise generate X from Nd(µ2, Σ2).
Generate a multivariate normal mixture with two components:
library(MASS)  # for mvrnorm
# inefficient version loc.mix.0 with loops
loc.mix.0 <- function(n, p, mu1, mu2, Sigma) {
  # generate sample from BVN location mixture
  X <- matrix(0, n, 2)
  for (i in 1:n) {
    k <- rbinom(1, size = 1, prob = p)
    if (k) X[i, ] <- mvrnorm(1, mu = mu1, Sigma)
    else   X[i, ] <- mvrnorm(1, mu = mu2, Sigma)
  }
  return(X)
}
More efficient version:
loc.mix <- function(n, p, mu1, mu2, Sigma) {
  # generate sample from BVN location mixture
  n1 <- rbinom(1, size = n, prob = p); n2 <- n - n1
  x1 <- mvrnorm(n1, mu = mu1, Sigma)
  x2 <- mvrnorm(n2, mu = mu2, Sigma)
  X <- rbind(x1, x2)  # combine the samples
  return(X[sample(1:n), ])  # mix them
}

Generate a sample of n = 1000 from a 50% 4-dimensional normal mixture with µ1 = (0, 0, 0, 0), µ2 = (2, 3, 4, 5) and Σ = I4.
x <- loc.mix(1000, .5, rep(0, 4), 2:5, Sigma = diag(4))
r <- range(x) * 1.2; par(mfrow = c(2, 2))
for (i in 1:4)
  hist(x[, i], xlim = r, ylim = c(0, .3), freq = FALSE,
       main = "", breaks = seq(-5, 10, .5))
par(mfrow = c(1, 1))
Fig.3.8: Histograms of the marginal distributions of the MVN mixture data generated in Example 3.20.

Remark: It is difficult to visualize data in R^4, so we display only the histograms of the marginal distributions. All of the 1-D marginal distributions are univariate normal location mixtures. Methods for visualization of multivariate data are covered in Chapter 4.
3.6.3 Wishart Distribution
Suppose M = X^T X, where X is an n × d data matrix of a random sample from the Nd(µ, Σ) dist. Then M has a Wishart dist. with scale matrix Σ and n degrees of freedom, denoted M ∼ Wd(Σ, n).
When d = 1, the data are a sample from N(µ, σ²), so W1(σ², n) = σ² χ²(n) (equality in dist.).
To generate Wishart r.v.:
Generate an MVN sample X and compute the matrix product X^T X. This is computationally expensive because nd r.v. must be generated to determine the d(d + 1)/2 distinct entries in M.
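A minimal sketch of this naive method (the helper name rwishart.naive is hypothetical; mvrnorm is from MASS, and a zero mean is used as is standard for the central Wishart):

library(MASS)
rwishart.naive <- function(n, Sigma) {
  X <- mvrnorm(n, mu = rep(0, nrow(Sigma)), Sigma)  # n x d MVN sample
  crossprod(X)  # X^T X ~ W_d(Sigma, n)
}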
More efficient method based on Bartlett's decomposition
Let T = (T_ij) be a lower triangular d × d random matrix with independent entries satisfying
1. T_ij ∼ N(0, 1) iid, for i > j.
2. T_ii = √V_i, where V_i ∼ χ²(n − i + 1), i = 1, · · · , d.
Then the matrix A = T T^T has a Wd(I_d, n) dist.
To generate Wd(Σ, n) r.v.: obtain the Choleski factorization Σ = L L^T, where L is lower triangular. Then L A L^T ∼ Wd(Σ, n).
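A minimal sketch of this construction (the helper name rwishart is hypothetical):

rwishart <- function(Sigma, n) {
  d <- nrow(Sigma)
  Tm <- matrix(0, d, d)                          # the matrix T
  Tm[lower.tri(Tm)] <- rnorm(d * (d - 1) / 2)    # T_ij ~ N(0,1), i > j
  diag(Tm) <- sqrt(rchisq(d, df = n - 1:d + 1))  # T_ii = sqrt(chisq(n-i+1))
  A <- Tm %*% t(Tm)    # A ~ W_d(I_d, n)
  L <- t(chol(Sigma))  # lower triangular, Sigma = L L^T
  L %*% A %*% t(L)     # ~ W_d(Sigma, n)
}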
3.6.4 Uniform Dist. on the d-Sphere

The d-sphere is the set of all points x ∈ R^d such that ||x|| = (x^T x)^{1/2} = 1. If X1, ..., Xd are iid N(0, 1), then U = (U1, ..., Ud) is uniformly distributed on the unit sphere in R^d, where

U_j = X_j / (X1² + · · · + Xd²)^{1/2},  j = 1, · · · , d.  (3.8)

To generate uniform r.v. on the d-sphere, for each variate u_i, i = 1, · · · , n, repeat:
1. Generate a random sample x_{i1}, . . . , x_{id} from N(0, 1).
2. Compute the Euclidean norm ||x|| = (x_{i1}² + · · · + x_{id}²)^{1/2}.
3. Set u_{ij} = x_{ij}/||x||, j = 1, · · · , d.
4. Deliver u_i = (u_{i1}, . . . , u_{id}).
runif.sphere <- function(n, d) {
  # return a random sample uniformly distributed
  # on the unit sphere in R^d
  M <- matrix(rnorm(n*d), nrow = n, ncol = d)
  L <- apply(M, MARGIN = 1,
             FUN = function(x) { sqrt(sum(x * x)) })
  D <- diag(1 / L); U <- D %*% M
  U
}

Generate a sample of 200 points uniformly distributed on the circle.


# generate a sample in d = 2 and plot
X <- runif.sphere(200, 2)
par(pty = "s")
plot(X, xlab = bquote(x[1]), ylab = bquote(x[2]))
par(pty = "m")
Fig.3.9: A random sample of 200 points uniformly distributed on the unit circle in Example 3.21.

R note 3.9
• par(pty = "s"): set the square plot type so the circle is round rather than elliptical;
• par(pty = "m"): restore the maximal plotting region.
