EPS - Chapter - 5 - Continuous Distributions - JNN - OK
The normal distribution is also referred to as the Gaussian distribution in honor of Gauss
(1777 − 1855), who also derived its equation from a study of errors in repeated measurements
of the same quantity.
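Definition 5.1.1 A random variable X is said to have a Normal distribution with parameters
µ and $\sigma^2$ if its pdf is given by
\[ f(x) = \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{1}{2}(\mu - x)^2/\sigma^2} \tag{5.1} \]
or restated as
\[ f(x, \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left[-\frac{1}{2}\,\frac{(x-\mu)^2}{\sigma^2}\right], \qquad -\infty < x < \infty,\ \sigma^2 > 0 \]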
Remark 5.1.1 Any distribution defined by a density function of the form given above is a normal
distribution. Thus, X is said to be normally distributed with mean µ and variance $\sigma^2$, written
as
$X \sim N(\mu, \sigma^2)$
The name normal originated in connection with the theory of errors of measurement in the
eighteenth century, when it was found that errors ‘normally’ had the distribution given by
the equation for f (x) above.
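As a quick numerical illustration of Equation (5.1), the sketch below evaluates the density directly and compares it with Python's standard-library `statistics.NormalDist`. The helper name `normal_pdf` and the parameter values are illustrative assumptions, not part of the text.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma2):
    """Density of N(mu, sigma2) at x, written term by term as in Equation (5.1)."""
    return math.exp(-0.5 * (mu - x) ** 2 / sigma2) / math.sqrt(2 * math.pi * sigma2)

mu, sigma2 = 6.0, 4.0  # illustrative values only (assumed, not from the text)

peak = normal_pdf(mu, mu, sigma2)          # value at the centre of the curve
left = normal_pdf(mu - 1.5, mu, sigma2)    # 1.5 units below the mean
right = normal_pdf(mu + 1.5, mu, sigma2)   # 1.5 units above the mean

# NormalDist takes the standard deviation, not the variance.
reference = NormalDist(mu, math.sqrt(sigma2)).pdf(mu - 1.5)

print(peak, left, right, reference)
```

The two off-centre values agree, reflecting the symmetry of the curve about µ, and the hand-written formula matches the library value.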
5.1. NORMAL DISTRIBUTION
Example 5.1.1 Each different choice of specific numerical values for the pair µ and σ gives
a different bell curve. The value of µ determines the location of the curve, as shown in Figure
5.1. In each case the curve is symmetric about µ.
Example 5.1.2 The value of σ determines whether the bell curve is tall and thin or short
and squat, subject always to the condition that the total area under the curve be equal to 1.
This is shown in Figure 5.2, where we have arbitrarily chosen to center the curves at µ = 6.
Figure 5.3: Density Function for a Normally Distributed Random Variable with Mean µ and
Standard Deviation σ
Example 5.1.4 Heights of 25-year-old men in a certain region have mean 69.75 inches and
standard deviation 2.59 inches. These heights are approximately normally distributed. Thus
the height X of a randomly selected 25-year-old man is a normal random variable with mean
µ = 69.75 and standard deviation σ = 2.59. Sketch a qualitatively accurate graph of the density
function for X. Find the probability that a randomly selected 25-year-old man is more than
69.75 inches tall.
Solution : The distribution of heights looks like the bell curve in Figure 5.4. The
important point is that it is centered at its mean, 69.75, and is symmetric about the
mean.
Since the total area under the curve is 1, by symmetry the area to the right of 69.75
is half the total, or 0.5. But this area is precisely the probability P (X > 69.75), the
probability that a randomly selected 25-year-old man is more than 69.75 inches tall.
■
Note 5.1.1 The density function for a standard normal random variable is shown in Figure
5.5.
Remark 5.1.2 To compute probabilities for Z we will not work with its density function
directly but instead read probabilities from the positive normal table in Section 8.3.2. The
tables are tables of cumulative probabilities; their entries are probabilities of the form P (Z < z).
The use of the tables will be explained by the following series of examples.
Example 5.1.5 Find the probabilities indicated, where as always Z denotes a standard normal
random variable.
1.) P (Z < 1.48).
Solution : Figure 5.6 shows how this probability is read directly from the
positive normal table without any computation required. The digits in the ones
and tenths places of 1.48, namely 1.4, are used to select the appropriate row of the
table; the hundredths part of 1.48, namely 0.08, is used to select the appropriate
column of the table. The four decimal place number in the interior of the table that
lies in the intersection of the row and column selected, 0.9306, is the probability
sought:
P (Z < 1.48) = 0.9306
■
2.) P (Z < −0.25).
Solution : The minus sign in −0.25 makes no difference in the procedure; the
table is used in exactly the same way as in the previous part, except that we now use
the negative normal table in Section 8.3.1. The probability sought is the number
in the intersection of the row with heading −0.2 and the column with heading
0.05, namely 0.4013. Thus P (Z < −0.25) = 0.4013. ■
Geometrically, since the total area under the curve is 1 and the area of the region
to the left of 1.60 is (from the table) 0.9452, the area of the region to the right of
1.60 must be 1 − 0.9452 = 0.0548. ■
2.) P (Z > −1.02).
Solution : The minus sign in −1.02 makes no difference in the procedure;
the negative normal table is used in exactly the same way. The number in the
intersection of the row with heading −1.0 and the column with heading 0.02 is
0.1539. This means that P (Z < −1.02) = P (Z ≤ −1.02) = 0.1539. Hence
P (Z > −1.02) = 1 − P (Z ≤ −1.02) = 1 − 0.1539 = 0.8461
■
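The table lookups above can be cross-checked numerically. The following is a minimal sketch using `statistics.NormalDist` from the Python standard library in place of the printed tables; the variable names are illustrative.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal: mean 0, standard deviation 1

# Cumulative probabilities P(Z < z), rounded to four places like the tables.
p_a = round(Z.cdf(1.48), 4)    # P(Z < 1.48)
p_b = round(Z.cdf(-0.25), 4)   # P(Z < -0.25)

# Right-tail probabilities use the complement rule P(Z > z) = 1 - P(Z < z).
p_c = round(1 - Z.cdf(1.60), 4)    # P(Z > 1.60)
p_d = round(1 - Z.cdf(-1.02), 4)   # P(Z > -1.02)

print(p_a, p_b, p_c, p_d)
```

The four values reproduce the table-based answers 0.9306, 0.4013, 0.0548 and 0.8461.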
Example 5.1.7 Find the probabilities indicated.
1. P (0.5 < Z < 1.57).
Solution :
P (0.5 < Z < 1.57) = P (Z < 1.57) − P (Z < 0.50) = 0.9418 − 0.6915 = 0.2503
■
2. P (−2.55 < Z < 0.09).
Solution :
P (−2.55 < Z < 0.09) = P (Z < 0.09)−P (Z < −2.55) = 0.5359−0.0054 = 0.5305
■
Example 5.1.8 Find the probabilities indicated.
1.) P (1.13 < Z < 4.16).
Solution : We attempt to compute the probability exactly by looking up the
numbers 1.13 and 4.16 in the table. We obtain the value 0.8708 for the area of
the region under the density curve to left of 1.13 without any problem, but when
we go to look up the number 4.16 in the table, it is not there. We can see from
the last row of numbers in the table that the area to the left of 4.16 must be so
close to 1 that to four decimal places it rounds to 1.0000. Therefore
P (1.13 < Z < 4.16) = P (Z < 4.16) − P (Z < 1.13) = 1.0000 − 0.8708 = 0.1292
■
2.) P (−5.22 < Z < 2.15).
Solution : Similarly, here we can read directly from the table that the area
under the density curve and to the left of 2.15 is 0.9842, but −5.22 is too far to
the left on the number line to be in the table. We can see from the first line of the
table that the area to the left of −5.22 must be so close to 0 that to four decimal
places it rounds to 0.0000. Therefore
P (−5.22 < Z < 2.15) = P (Z < 2.15)−P (Z < −5.22) = 0.9842−0.0000 = 0.9842
■
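Interval probabilities such as those in Examples 5.1.7 and 5.1.8 are differences of two cumulative probabilities. The sketch below assumes `statistics.NormalDist` as a stand-in for the tables; `interval_prob` is a hypothetical helper name.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal

def interval_prob(a, b):
    """P(a < Z < b), computed as P(Z < b) - P(Z < a)."""
    return Z.cdf(b) - Z.cdf(a)

# The four intervals treated in Examples 5.1.7 and 5.1.8.
for a, b in [(0.5, 1.57), (-2.55, 0.09), (1.13, 4.16), (-5.22, 2.15)]:
    print(f"P({a} < Z < {b}) = {interval_prob(a, b):.4f}")
```

Values far out in the tails, such as 4.16 and −5.22, need no special handling here: the cumulative probability is simply very close to 1 or 0, which is the same reasoning used when they fall outside the printed table.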
Remark 5.1.3 The next Example 5.1.9 of this section explains the origin of the proportions
given in the Empirical Rule.
Since Z has mean 0 and standard deviation 1, for Z to take a value between
−1 and 1 means that Z takes a value that is within one standard deviation of
the mean. Our computation shows that the probability that this happens is about
0.68, the proportion given by the Empirical Rule for histograms that are mound
shaped and symmetrical, like the bell curve. ■
This corresponds to the proportion 0.95 for data within two standard deviations
of the mean. ■
Solution : Similarly,
P (−3 < Z < 3) = P (Z < 3) − P (Z < −3) = 0.9987 − 0.0013 = 0.9974
which corresponds to the proportion 0.997 for data within three standard deviations
of the mean. ■
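The three Empirical Rule proportions can be reproduced in one pass. This is a small sketch using `statistics.NormalDist`; the dictionary name `within` is an illustrative choice.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal

# P(-k < Z < k) for k = 1, 2, 3 standard deviations from the mean.
within = {k: Z.cdf(k) - Z.cdf(-k) for k in (1, 2, 3)}

for k, p in within.items():
    print(f"P(-{k} < Z < {k}) = {p:.4f}")
```

The results are close to 0.68, 0.95 and 0.997; small differences in the fourth decimal place relative to table-based answers come from the rounding of the table entries.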
To derive the pdf of a uniform random variable, we consider a constant function f (x) = c on
a ≤ x ≤ b.
[Figure: graph of the constant density f (x) = c over the interval from a to b.]
Example 5.2.1 The number of goals scored during the African Cup of Nations 2013 is uniformly
distributed with probability density function
\[ f(x) = \frac{1}{6}, \qquad 1 \le x \le 7 \]
Compute the
1.) Probability that the number of goals scored in any game is less than or equal to 3.
\[ P(X \le 3) = \int_1^3 \frac{1}{6}\,dx = \frac{2}{6} \]
2.) Expected number of goals in any game at the African Cup of Nations 2013.
\[ E[X] = \int_{-\infty}^{\infty} x f(x)\,dx = \int_1^7 \frac{x}{6}\,dx = \left[\frac{x^2}{12}\right]_1^7 = \frac{49}{12} - \frac{1}{12} = \frac{48}{12} = 4 \]
3.) Probability that the final is nil-nil at the end of the game, P (X = 0). This probability is
zero, since the density is zero below x = 1.
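The calculations in Example 5.2.1 can be checked with exact fractions. The sketch below is only an illustration; `uniform_cdf` and `uniform_mean` are hypothetical helper names, and integer endpoints are assumed so that `Fraction` can be used.

```python
from fractions import Fraction

def uniform_cdf(x, a, b):
    """P(X <= x) for X uniform on [a, b], with integer a, b and x."""
    if x < a:
        return Fraction(0)
    if x >= b:
        return Fraction(1)
    return Fraction(x - a, b - a)

def uniform_mean(a, b):
    """E[X] = (a + b) / 2 for X uniform on [a, b]."""
    return Fraction(a + b, 2)

a, b = 1, 7                      # support of the density f(x) = 1/6
p_le_3 = uniform_cdf(3, a, b)    # equals 2/6, stored in lowest terms
mean = uniform_mean(a, b)
p_below_support = uniform_cdf(0, a, b)

print(p_le_3, mean, p_below_support)
```

`Fraction` reduces 2/6 to 1/3 automatically, so the printed probability is the same number as in part 1.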
Exercise 5.1 For a uniform distribution, show that the CDF (cumulative distribution
function) is given by
\[ F(x) = \begin{cases} 0, & x < a \\ \dfrac{x-a}{b-a}, & x \in [a, b) \\ 1, & x \ge b \end{cases} \]
Exercise 5.2 The continuous random variable X is uniformly distributed over the interval
[−2, 7].
Example 5.2.2 A packing line consistently packages 200 cartons per hour. After weighing
every package, variation was found in the weights, which ranged from 18.2 kg to 20.4 kg
measured to the nearest tenth. The customer requires < 20.0 kg for ergonomic reasons. Find the
1.) mean
\[ \mu = \frac{a+b}{2} = \frac{1}{2}(18.2 + 20.4) = 19.3\ \text{kg} \]
2.) standard deviation
\[ \sigma = \frac{b-a}{\sqrt{12}} = \frac{1}{\sqrt{12}}(20.4 - 18.2) = 0.635\ \text{kg} \]
3.) probability that a package exceeds the customer requirement
\[ P(X \ge 20.0) = \int_{20.0}^{20.4} \frac{1}{b-a}\,dx = \int_{20.0}^{20.4} \frac{1}{20.4 - 18.2}\,dx = \frac{20.4 - 20.0}{20.4 - 18.2} = 0.1818 \]
There is an 18.2% chance that a package fails to conform to the customer's specification,
otherwise called the Voice of the Customer.
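The three quantities in Example 5.2.2 follow directly from the uniform formulas, as the short sketch below shows. The variable names are illustrative; the numbers are those of the example.

```python
import math

a, b = 18.2, 20.4   # observed weight range in kg
limit = 20.0        # customer requirement in kg

mean = (a + b) / 2
std_dev = (b - a) / math.sqrt(12)
# Area of the rectangle to the right of the limit, with height 1 / (b - a).
p_exceeds = (b - limit) / (b - a)

print(f"mean = {mean:.1f} kg, sd = {std_dev:.3f} kg, P(X >= {limit}) = {p_exceeds:.4f}")
```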
Exercise 5.3 A coffee machine dispenses coffee into cups. It is controlled electronically to
randomly cut off the flow of coffee between 180 ml to 190 ml. Find the probability that the
machine dispenses
1.) less than 188 ml
2.) exactly 188 ml
3.) between 182 ml and 186 ml
Exercise 5.4 The time in minutes that Elaine takes to checkout at her local supermarket
follows a continuous uniform distribution over the interval [3, 9]. Find
1.) Elaine’s expected checkout time,
2.) the variance of the time taken to checkout at the supermarket,
3.) the probability that Elaine will take more than 7 minutes to checkout.
4.) Given that Elaine has already spent 4 minutes at the checkout, find the probability that
she will take a total of less than 6 minutes to checkout.
Exercise 5.5 In a game, players select sticks at random from a box containing a large number
of sticks of different lengths. The length, in cm, of a randomly chosen stick has a continuous
uniform distribution over the interval [7, 10].
A stick is selected at random from the box.
1.) Find the probability that the stick is shorter than 9.5 cm.
To win a bag of sweets, a player must select 3 sticks and wins if the length of the longest
stick is more than 9.5 cm.
2.) Find the probability of winning a bag of sweets.
To win a soft toy, a player must select 6 sticks and wins the toy if more than four of the
sticks are shorter than 7.6 cm.
3.) Find the probability of winning a soft toy.
Example 5.2.3 A random variable X has the uniform distribution on the interval [0, 1]: the
density function is f (x) = 1 if x is between 0 and 1 and f (x) = 0 for all other values of x, as
shown in Figure 5.8.
(a) Find P (X > 0.75), the probability that X assumes a value greater than 0.75.
Solution : P (X > 0.75) is the area of the rectangle of height 1 and base length
1 − 0.75 = 0.25, hence is base × height = (0.25) · (1) = 0.25. See Figure 5.9 part (a).
■
(b) Find P (X ≤ 0.2), the probability that X assumes a value less than or equal to 0.2.
Solution : P (X ≤ 0.2) is the area of the rectangle of height 1 and base length
0.2 − 0 = 0.2, hence is base × height = (0.2) · (1) = 0.2. See Figure 5.9 part (b).
■
(c) Find P (0.4 < X < 0.7), the probability that X assumes a value between 0.4 and 0.7.
Solution : P (0.4 < X < 0.7) is the area of the rectangle of height 1 and length
0.7 − 0.4 = 0.3, hence is base × height = (0.3) · (1) = 0.3. See Figure 5.9 part
(c). ■
Example 5.2.4 A man arrives at a bus stop at a random time (that is, with no regard for the
scheduled service) to catch the next bus. Buses run every 30 minutes without fail, hence the
next bus will come any time during the next 30 minutes with evenly distributed probability (a
uniform distribution). Find the probability that a bus will come within the next 10 minutes.
Solution : The graph of the density function is a horizontal line above the interval
from 0 to 30 and is the x-axis everywhere else. Since the total area under the curve
must be 1, the height of the horizontal line is 1/30 (Figure 5.10). The probability
sought is P (0 ≤ X ≤ 10). By definition, this probability is the area of the rectangular
region bounded above by the horizontal line f (x) = 1/30, bounded below by the x-axis,
bounded on the left by the vertical line at 0 (the y-axis), and bounded on the
right by the vertical line at 10. This is the shaded region in Figure 5.10. Its area is
the base of the rectangle times its height, 10 · (1/30) = 1/3.
5.) Waiting time for the next person to be served at the bank counter
We note that
1.) $P(a < X < b) = \int_a^b f(x)\,dx$
2.) $P(X < a) = \int_0^a \lambda e^{-\lambda x}\,dx = \left[-e^{-\lambda x}\right]_0^a = 1 - e^{-\lambda a}$
4.)
Thus
Also
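Example 5.3.1 Suppose that the length of a phone call in minutes is an exponential random
variable with parameter $\lambda = \frac{1}{10}$. If someone arrives immediately ahead of you at a public
telephone booth, find the probability that you will have to wait more than 10 minutes.
\[ P(X > 10) = \int_{10}^{\infty} \frac{1}{10}\, e^{-\frac{1}{10}x}\,dx = \left[-e^{-\frac{1}{10}x}\right]_{10}^{\infty} = e^{-1} = 0.3679 \]
Similarly, the probability of waiting between 10 and 20 minutes is
\[ P(10 < X < 20) = e^{-1} - e^{-2} = 0.233 \]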
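The exponential probabilities in Example 5.3.1 can be verified with a few lines of code. This is a minimal sketch; `exp_cdf` and `exp_sf` are hypothetical helper names for the cumulative and survival functions of an exponential random variable with rate λ.

```python
import math

def exp_cdf(x, lam):
    """P(X <= x) for an exponential random variable with rate lam."""
    return 1 - math.exp(-lam * x) if x >= 0 else 0.0

def exp_sf(x, lam):
    """P(X > x), the complement of the cdf."""
    return 1 - exp_cdf(x, lam)

lam = 1 / 10  # rate parameter from the example (calls last 10 minutes on average)

p_more_than_10 = exp_sf(10, lam)                    # e^{-1}
p_10_to_20 = exp_cdf(20, lam) - exp_cdf(10, lam)    # e^{-1} - e^{-2}

print(f"{p_more_than_10:.4f} {p_10_to_20:.4f}")
```

The first value is e^{-1}; the second, the probability of a wait between 10 and 20 minutes, rounds to the 0.233 that appears in the example.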
Example 5.3.2 The waiting time of the BSc External students for their next class is
exponentially distributed with λ = 5. Find the probability that any BSc External student will
wait for less than 4 hours.
\[ f(x) = \begin{cases} 5e^{-5x}, & x \ge 0 \\ 0, & x < 0 \end{cases} \]
\[ \Rightarrow P(X < 4) = \int_0^4 5e^{-5x}\,dx = \left[-e^{-5x}\right]_0^4 = 1 - e^{-20} \approx 0.999999998 \]
By definition,
\[ F(x) = \int_{-\infty}^{x} f(t)\,dt = \int_0^x \lambda e^{-\lambda t}\,dt = \left[-e^{-\lambda t}\right]_0^x = 1 - e^{-\lambda x}, \qquad x \ge 0 \]
Example 5.3.3 The lifetime of a particular type of bulb has an exponential distribution with
mean lifetime of 1000 hours.
1.) Find the probability that a bulb is still working after 1300 hours.
\[ f(x) = \lambda e^{-\lambda x}, \qquad x \ge 0. \]
Now
\[ E(X) = \frac{1}{\lambda} \]
And given E(X) = 1000 then
\[ \frac{1}{\lambda} = 1000 \Rightarrow \lambda = 0.001. \]
And so,
\[ f(x) = 0.001\,e^{-0.001x} \]
\[ P(X > 1300) = e^{-1.3} = 0.2725 \]
2.) Given that it is still working after 1300 hours, find the probability that it is still working
after 1500 hours.
\[ P[(X > 1500) \mid (X > 1300)] = P(X > 200) \quad \text{(Exponential identity 5.7)} \]
\[ = e^{-0.001 \times 200} = e^{-0.2} = 0.819 \]
3.) Find the standard deviation of the lifetime of this type of light bulb.
\[ \text{Standard Deviation} = \sqrt{\frac{1}{\lambda^2}} = \frac{1}{\lambda} = 1000\ \text{hours} \]
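The memoryless step used in part 2 of Example 5.3.3 can be checked by computing the conditional probability from its definition and comparing it with P(X > 200). The sketch below is illustrative; `survival` is a hypothetical helper name.

```python
import math

mean_lifetime = 1000.0     # hours, from the example
lam = 1 / mean_lifetime    # rate parameter, since E(X) = 1 / lambda

def survival(x):
    """P(X > x) = e^{-lambda x} for the exponential lifetime."""
    return math.exp(-lam * x)

p_1300 = survival(1300)
# Conditional probability from its definition: P(X > 1500) / P(X > 1300).
p_cond = survival(1500) / survival(1300)
# Memoryless shortcut: the same as surviving a fresh 200 hours.
p_200 = survival(200)
std_dev = 1 / lam          # for the exponential, the sd equals the mean

print(f"{p_1300:.4f} {p_cond:.4f} {p_200:.4f} {std_dev:.0f}")
```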
Example 5.3.4 Suppose that the amount of time one spends in a bank is exponentially dis-
tributed with mean 10 minutes.
1.) What is the probability that a customer will spend more than 15 minutes in the bank?
2.) What is the probability that a customer will spend more than 15 minutes in the bank given
that he is still in the bank after 10 minutes?
Example 5.3.5 The time required to repair a machine is an exponential random variable
with rate λ = 0.5 downs/hour.
$P(T \ge 2) = e^{-1} = 0.36788$
2.) What is the probability that the repair time will take at least 4 hours given that the repair
man has been working on the machine for 3 hours?
$P(T \ge 4 \mid T \ge 3) = P(T \ge 1) = e^{-0.5} = 0.60653$
Example 5.3.6 Buses arrive to a bus stop according to an exponential distribution with rate
λ = 4 buses/hour.
1.) If you arrived at 8:00 am to the bus stop, what is the expected time of the next bus?
2.) Assume you asked one of the people waiting for the bus about the arrival time of the last
bus and he told you that the last bus left at 7:40 am. What is the expected time of the
next bus?
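By the memoryless property, the expected waiting time from 8:00 am is 1/λ hours regardless
of when the last bus left, so in both cases the expected arrival time of the next bus is
\[ 8{:}00 + \frac{1}{4}\ \text{hour} = 8{:}00 + 15\ \text{min} = 8{:}15\ \text{am} \]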
Exercise 5.6 On the average, a certain computer part lasts 10 years. The length of time the
computer part lasts is exponentially distributed.
1.) What is the probability that a computer part lasts more than 7 years? [0.4966]
2.) On the average, how long would 5 computer parts last if they are used one after another?
[50 years]
3.) Eighty percent of computer parts last at most how long? [16.1 years]
4.) What is the probability that a computer part lasts between 9 and 11 years? [0.0737]
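Properties of Γ(α)
\[ \Gamma(\alpha) = (\alpha - 1)\,\Gamma(\alpha - 1), \qquad \alpha > 1 \]
\[ \Gamma(\alpha) = (\alpha - 1)!, \qquad \alpha \text{ a positive integer} \]
\[ \Gamma\left(\tfrac{1}{2}\right) = \sqrt{\pi}, \text{ thus for a positive integer } \alpha \]
\[ \Gamma\left(\alpha + \tfrac{1}{2}\right) = \left(\alpha - \tfrac{1}{2}\right)\left(\alpha - \tfrac{3}{2}\right) \cdots \tfrac{1}{2}\,\Gamma\left(\tfrac{1}{2}\right) = \frac{1 \cdot 3 \cdot 5 \cdots (2\alpha - 1)}{2^{\alpha}}\,\Gamma\left(\tfrac{1}{2}\right) \]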
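These properties are easy to spot-check with `math.gamma` from the Python standard library. The sketch below is illustrative only; `half_integer_gamma` is a hypothetical helper implementing the odd-number product formula for integer arguments.

```python
import math

def half_integer_gamma(n):
    """Gamma(n + 1/2) for integer n >= 1 via 1*3*5*...*(2n-1) / 2^n * sqrt(pi)."""
    odd_product = math.prod(range(1, 2 * n, 2))   # 1 * 3 * 5 * ... * (2n - 1)
    return odd_product / 2 ** n * math.sqrt(math.pi)

# Recurrence: Gamma(a) = (a - 1) * Gamma(a - 1)
print(math.gamma(4.5), 3.5 * math.gamma(3.5))
# Factorial property for a positive integer: Gamma(5) = 4!
print(math.gamma(5), math.factorial(4))
# Half-integer formula compared with the library value
print(half_integer_gamma(3), math.gamma(3.5))
```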
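Consider the function
\[ G(\alpha, \beta) = \int_0^{\infty} x^{\alpha-1} e^{-\beta x}\,dx, \qquad \beta > 0,\ \alpha > 0 \]
Let $u = \beta x \Rightarrow du = \beta\,dx$,
so
\[ G(\alpha, \beta) = \int_0^{\infty} \left(\frac{u}{\beta}\right)^{\alpha-1} e^{-u}\,\frac{du}{\beta} = \frac{1}{\beta^{\alpha}} \int_0^{\infty} u^{\alpha-1} e^{-u}\,du = \frac{\Gamma(\alpha)}{\beta^{\alpha}} \]
Thus
\[ \frac{\beta^{\alpha}}{\Gamma(\alpha)}\,G(\alpha, \beta) = \int_0^{\infty} \frac{\beta^{\alpha}}{\Gamma(\alpha)}\, x^{\alpha-1} e^{-\beta x}\,dx = 1 \]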
Exercise 5.8
\[ \int_0^{\infty} x^4 e^{-\theta x}\,dx = \frac{\Gamma(5)}{\theta^5} = \frac{24}{\theta^5} \]
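The identity G(α, β) = Γ(α)/β^α can be checked numerically for particular values. The sketch below uses a plain midpoint rule on a truncated interval; the function name `gamma_integral`, the truncation point, the step count and the choice θ = 2 are all assumptions made for illustration, and the routine assumes α ≥ 1 so the integrand is bounded at 0.

```python
import math

def gamma_integral(alpha, beta, upper=60.0, steps=60_000):
    """Midpoint-rule approximation of the integral of x^(alpha-1) e^(-beta x) on [0, upper]."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h                      # midpoint of the i-th subinterval
        total += x ** (alpha - 1) * math.exp(-beta * x)
    return total * h

theta = 2.0                                    # illustrative value
numeric = gamma_integral(5, theta)             # integral of x^4 e^{-theta x}
closed_form = math.gamma(5) / theta ** 5       # Gamma(5) / theta^5 = 24 / theta^5

print(numeric, closed_form)
```

The truncation at `upper` is harmless here because the integrand decays exponentially; for smaller β a larger upper limit would be needed.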
Exercise 5.9 Use the above result to find the third central moment of X.
Definition 5.4.3 A random variable with probability density function f (x, n) in Equation
(5.15) is said to have a Chi-Squared ($\chi^2$) distribution with n degrees of freedom. We write
$X \sim \chi^2_{(n)}$. Verify that
E(X) = n
Var(X) = 2n
Sometimes, we find that the variance of the population from which we are sampling is not
known. For samples of size n ≥ 30, a good estimate for $\sigma^2$ is $S^2$, the sample variance. Then,
by estimating $\sigma^2$ with $S^2$, the Z-value in the Central Limit Theorem is still approximately
normally distributed and
\[ Z = \frac{\bar{X} - \mu}{S/\sqrt{n}} \]
If n < 30, the values of $S^2$ fluctuate considerably from sample to sample, and as such, when
$\sigma^2$ is estimated by $S^2$, the values of Z are no longer normally distributed. We then refer to
this distribution as the t-distribution.
Definition 5.6.1 If $\bar{X}$ and $S^2$ are the mean and variance, respectively, of a random sample of
size n taken from a population that is normally distributed with mean µ and variance $\sigma^2$, then
\[ t = \frac{\bar{X} - \mu}{S/\sqrt{n}} \tag{5.18} \]
is a value of a random variable T having the t-distribution with ν = n − 1 degrees of freedom.
The pdf of the t-distribution is defined by
\[ f(x) = C_{\nu} \left(1 + \frac{x^2}{\nu}\right)^{-\frac{1}{2}(\nu+1)}, \qquad -\infty < x < \infty \tag{5.19} \]
where ν is the number of degrees of freedom and $C_{\nu}$ depends on ν.
It is important to note that the total area under an F -curve is 1; an F -curve starts from
0 and then behaves like a $\chi^2$-curve; and an F -curve is not symmetric but is skewed to the right.
From the above, we note that the F -distribution has an f -value defined by
\[ f = \frac{\chi_1^2/\nu_1}{\chi_2^2/\nu_2} = \frac{S_1^2/\sigma_1^2}{S_2^2/\sigma_2^2} = \frac{\sigma_2^2 S_1^2}{\sigma_1^2 S_2^2} \tag{5.21} \]
where $\chi_1^2$ is a value of a chi-square distribution with $\nu_1 = n_1 - 1$ degrees of freedom and $\chi_2^2$ is
a value of a chi-square distribution with $\nu_2 = n_2 - 1$ degrees of freedom, and we write $f(\nu_1, \nu_2)$.
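\begin{tabular}{lllll}
Distribution & Notation & pmf & Mean & Variance \\
Negative Binomial & $Nb(n, r, p)$ & $\binom{n-1}{r-1} p^r (1-p)^{n-r}$ & $\frac{r}{p}$ & $\frac{r(1-p)}{p^2}$ \\
Geometric & $Geo(p)$ & $p(1-p)^{x-1},\ x \in \mathbb{N}^+$ & $\frac{1}{p}$ & $\frac{1-p}{p^2}$ \\
Poisson & $Po(\lambda)$ & $\frac{e^{-\lambda}\lambda^x}{x!}$ & $\lambda$ & $\lambda$ \\
\end{tabular}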
[Figure: three pmf plots, each showing pmf against x.]
The pmf plots indicate that some distributions can be symmetric for specific parameter values,
but others, such as the Geometric, fail to become bell shaped.
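\begin{tabular}{lllll}
Distribution & Notation & pdf & Mean & Variance \\
Uniform & $\mathrm{Unif}(a, b)$ & $\frac{1}{b-a}$ & $\frac{a+b}{2}$ & $\frac{(b-a)^2}{12}$ \\
Normal & $N(\mu, \sigma^2)$ & $\frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$ & $\mu$ & $\sigma^2$ \\
Student's t & $\mathrm{Student}(\nu)$ & $\frac{\Gamma\left(\frac{\nu+1}{2}\right)}{\sqrt{\nu\pi}\,\Gamma\left(\frac{\nu}{2}\right)} \left(1 + \frac{x^2}{\nu}\right)^{-(\nu+1)/2}$ & $0$ & $\frac{\nu}{\nu-2}$ for $\nu > 2$; $\infty$ for $1 < \nu \le 2$ \\
Chi-square & $\chi^2_k$ & $\frac{1}{2^{k/2}\,\Gamma\left(\frac{k}{2}\right)}\, x^{\frac{k}{2}-1} \exp\left(-\frac{x}{2}\right)$ & $k$ & $2k$ \\
F & $F(d_1, d_2)$ & $\frac{\sqrt{\frac{(d_1 x)^{d_1}\, d_2^{d_2}}{(d_1 x + d_2)^{d_1+d_2}}}}{x\,B\left(\frac{d_1}{2}, \frac{d_2}{2}\right)}$ & $\frac{d_2}{d_2-2}$ & $\frac{2 d_2^2 (d_1 + d_2 - 2)}{d_1 (d_2-2)^2 (d_2-4)}$ \\
Exponential & $\mathrm{Exp}(\lambda)$ & $\lambda e^{-\lambda x}$ & $\frac{1}{\lambda}$ & $\frac{1}{\lambda^2}$ \\
Gamma & $\mathrm{Gamma}(\alpha, \beta)$ & $\frac{1}{\Gamma(\alpha)\beta^{\alpha}}\, x^{\alpha-1} e^{-x/\beta}$ & $\alpha\beta$ & $\alpha\beta^2$ \\
 & $\mathrm{Gamma}(\alpha, \lambda)$ & $\frac{\lambda^{\alpha}}{\Gamma(\alpha)}\, x^{\alpha-1} e^{-\lambda x}$ & $\frac{\alpha}{\lambda}$ & $\frac{\alpha}{\lambda^2}$ \\
Beta & $\mathrm{Beta}(\alpha, \beta)$ & $\frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, x^{\alpha-1} (1-x)^{\beta-1}$ & $\frac{\alpha}{\alpha+\beta}$ & $\frac{\alpha\beta}{(\alpha+\beta)^2 (\alpha+\beta+1)}$ \\
Weibull & $\mathrm{Weibull}(\lambda, k)$ & $\frac{k}{\lambda} \left(\frac{x}{\lambda}\right)^{k-1} \exp\left(-\left(\frac{x}{\lambda}\right)^k\right)$ & $\lambda\,\Gamma\left(1 + \frac{1}{k}\right)$ & $\lambda^2\,\Gamma\left(1 + \frac{2}{k}\right) - \mu^2$ \\
\end{tabular}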
For the MATLAB code used to generate the distribution graphs above, see the file “mat
distributions.m” in the appendix or the prob zip folder.
[Figure: pdf plots, each showing pdf against x, in four panels: Chi-square (k = 1, 2, 3, 4);
Normal (µ = 0 with σ² = 0.2, 1, 5); Student’s t (ν = 1, 2, 5, ∞); Beta ((α, β) = (0.3, 0.3),
(5, 1), (1, 3), (2, 2), (2, 5)).]
Exercise 5.10 Each time customers visit a restaurant they are given a game card. Suppose
the probability of winning a prize with the game card is 0.2. Let X represent the number of
visits to a restaurant before winning a prize with the game card. What is the probability that
a customer will win a prize for the first time on the 6th visit? [0.06554]
Exercise 5.11 An oil company conducts a geological study that indicates that an exploratory
oil well should have a 0.20 chance of striking oil. What is the probability that the third strike
comes on the seventh well drilled? [0.0492]
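\[ \rho(x) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right), \]
\[ F(x) = \frac{1}{\sigma\sqrt{2\pi}} \int_{-\infty}^{x} \exp\left(-\frac{(t-\mu)^2}{2\sigma^2}\right) dt \]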
Example 5.7.3 The cumulative distribution function (cdf) of the Gamma distribution with
parameters α > 0, λ > 0 is
\[ F(x, \alpha, \lambda) = \frac{\lambda^{\alpha}}{\Gamma(\alpha)} \int_0^x t^{\alpha-1} e^{-\lambda t}\,dt. \]
Exercise 5.12 Suppose you are watching a radioactive source that emits particles at a rate
described by the exponential density
f (t) = λe−λt , t ≥ 0
where λ = 1, so that the probability P (0, T ) that a particle will appear in the next T seconds
is $P([0, T]) = \int_0^T \lambda e^{-\lambda t}\,dt$. Find the probability that a particle (not necessarily the first) will
appear
1.) within the next second.
Exercise 5.13 Assume that a new light bulb will burn out after t hours, where t is chosen
from [0, ∞) with an exponential density
f (t) = λe−λt .
Exercise 5.14 Use the Binomial Probabilities to find the probability that, in 100 tosses of a
fair coin, the number of heads that turns up lies between 35 and 65, between 40 and 60, and
between 45 and 55.[Hint: For large n, n ≥ 30, apply Normal approximation to Binomial ]
Exercise 5.15 Charles claims that he can distinguish between beer and ale 75 percent of the
time. Ruth bets that he cannot and, in fact, just guesses. To settle this, a bet is made: Charles
is to be given ten small glasses, each having been filled with beer or ale, chosen by tossing a fair
coin. He wins the bet if he gets seven or more correct. Find the probability that Charles wins
if he has the ability that he claims. Find the probability that Ruth wins if Charles is guessing.
Exercise 5.16 A die is rolled 30 times. What is the probability that a 6 turns up exactly 5
times? What is the most probable number of times that a 6 will turn up?
Exercise 5.17 Find integers n and r such that the following equation is true:
\[ \binom{13}{5} + 2\binom{13}{6} + \binom{13}{7} = \binom{n}{r}. \]
Exercise 5.18 In a ten-question true-false exam, find the probability that a student gets a
grade of 70 percent or better by guessing. Answer the same question if the test has 30 questions,
and if the test has 50 questions.
Exercise 5.19 A restaurant offers apple and blueberry pies and stocks an equal number of
each kind of pie. Each day ten customers request pie. They choose, with equal probabilities,
one of the two kinds of pie. How many pieces of each kind of pie should the owner provide so
that the probability is about 0.95 that each customer gets the pie of his or her own choice?
Exercise 5.20 A poker hand is a set of 5 cards randomly chosen from a deck of 52 cards. Find
the probability of a
1.) royal flush (ten, jack, queen, king, ace in a single suit).
2.) straight flush (five in a sequence in a single suit, but not a royal flush).
3.) four of a kind (four cards of the same face value).
4.) full house (one pair and one triple, each of the same face value).
5.) flush (five cards in a single suit but not a straight or royal flush).
6.) straight (five cards in a sequence, not all the same suit). (Note that in straights, an ace
counts high or low.)
Example 5.7.4 A fair coin is tossed until the second time a head turns up. The distribution
for the number of tosses is u(x, 2, p).
\[ u(x, k, p) = \binom{x-1}{k-1} p^k q^{x-k} \Rightarrow u\left(x, 2, \tfrac{1}{2}\right) = \binom{x-1}{1} \frac{1}{2^x} \]
for x = 2, 3, . . . . This is a Negative Binomial problem.
Example 5.7.5 Two players A and B flip a biased coin alternately and the first player to
obtain a head wins. The probability of obtaining a head is p > 0 at each toss. If A flips first,
find the probability that A wins the game.
Then we have
\[ P(A \text{ wins}) = p + (1-p)^2 p + (1-p)^4 p + \cdots = \frac{p}{1 - (1-p)^2} \]
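The closed form in Example 5.7.5 is the sum of a geometric series with ratio (1 − p)². As a sanity check, the sketch below compares a long partial sum with the closed form for a few values of p; the function names and the number of terms are illustrative choices.

```python
def p_first_player_wins(p, terms=10_000):
    """Partial sum of p + (1-p)^2 p + (1-p)^4 p + ... with the given number of terms."""
    ratio = (1 - p) ** 2   # both players must miss once before A flips again
    return sum(p * ratio ** k for k in range(terms))

def closed_form(p):
    """Sum of the geometric series: p / (1 - (1-p)^2)."""
    return p / (1 - (1 - p) ** 2)

for p in (0.1, 0.5, 0.9):
    print(p, p_first_player_wins(p), closed_form(p))
```

For a fair coin the result is 2/3, so flipping first is a real advantage.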
Example 5.7.6 A fair coin is tossed independently n times (n > 3). Find the probability that
at least three of the tosses yield heads.
\[ P(\text{at least 3 H's}) = 1 - P(\text{at most 2 H's}) = 1 - \frac{1}{2^n} - \binom{n}{1}\frac{1}{2^n} - \binom{n}{2}\frac{1}{2^n} \]
Example 5.7.7 Find a value c that minimizes $E[(X - c)^2]$ for a discrete random variable X.
Let
\[ f(c) = E[(X - c)^2] = c^2 - 2E[X]\,c + E[X^2] \]
Then f (c) is minimum at a point where $f'(c) = 0$. Since $f'(c) = 2c - 2E[X] = 0$, we get
c = E[X]
Example 5.7.8 Suppose the probability of containing at least one typographical error in a
page of a book is 0.005. Assuming typographical errors occur independently from page to page,
what is the probability that a 400 page novel will contain exactly one page with errors?
Example 5.7.9 The length of time required to complete a college test is found to be normally
distributed with mean 50 minutes and standard deviation 12 minutes.
1.) When should the test be terminated if we wish to allow sufficient time for 90% of the
students to complete the test?
Let X be the length of time to complete the test. Then $Z = \frac{X - 50}{12} \sim N(0, 1)$, the
standard normal distribution. We need to find c such that
\[ P(X < c) = P\left(Z < \frac{c - 50}{12}\right) = 0.9 \iff \frac{c - 50}{12} = 1.28 \]
So at least c = 65.36 minutes should be given.
2.) What proportion of students will finish the test between 30 and 60 minutes?
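Both parts of Example 5.7.9 can be computed without tables. The sketch below uses `statistics.NormalDist`, whose `inv_cdf` method returns the percentile directly; the variable names are illustrative.

```python
from statistics import NormalDist

# Completion time in minutes: mean 50, standard deviation 12.
test_time = NormalDist(mu=50, sigma=12)

# Part 1: the time by which 90% of students have finished.
cutoff = test_time.inv_cdf(0.90)

# Part 2: proportion finishing between 30 and 60 minutes.
p_30_to_60 = test_time.cdf(60) - test_time.cdf(30)

print(f"cutoff = {cutoff:.2f} minutes, P(30 < X < 60) = {p_30_to_60:.4f}")
```

The cutoff differs slightly from 65.36 because the table value 1.28 is a two-decimal rounding of the 90th percentile of Z.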
2.) Find the cumulative distribution function of X and plot its graph.
Integrating f , we get
\[ F(x) = \begin{cases} 0, & x \le 1, \\ 1 - e^{1-x}, & x > 1. \end{cases} \]
F is a non-decreasing function and F (∞) = 1.
\[ P(-1 < X \le 3 \mid X \ge 3) = \frac{P(X = 3)}{P(X \ge 3)} = 0 \]
since the probability that a continuous random variable takes any single value is zero.
Exercise 5.21 Bob is a high school basketball player with probability of making a free throw
as 0.70. During the season, what is the probability that Bob makes his third free throw on his
fifth shot?
Example 5.7.11 The speed of a molecule in a uniform gas at equilibrium is a random variable
whose probability density function is given by
\[ f(x) = \begin{cases} a x^2 e^{-b x^2}, & x \ge 0 \\ 0, & x < 0 \end{cases} \]
where $b = \frac{m}{2\kappa T}$ and κ, T , and m denote, respectively, Boltzmann’s constant, the absolute
temperature of the gas, and the mass of the molecule. Evaluate a in terms of b.
\[ a = \frac{4 b^{3/2}}{\sqrt{\pi}}, \]
using integration by parts and the fact $\frac{1}{\sqrt{2\pi}} \int_0^{\infty} e^{-y^2/2}\,dy = \frac{1}{2}$.
Example 5.7.12 Suppose X is a random variable following a normal distribution with variance
$\sigma^2$, where σ is the standard deviation of X. Show that the standard deviation of −3X + 2
is 3σ.
\[ SD(-3X + 2) = \sqrt{\mathrm{Var}(-3X + 2)} = \sqrt{9\sigma^2 + 0} = 3\sigma \]
Example 5.7.13 Let f (x) denote the probability density function of a normal random variable
with mean µ and variance σ 2 . Show that µ−σ and µ+σ are points of inflection of this function.
That is, show that f ′′ (x) = 0 when x = µ − σ or x = µ + σ.
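Let $c = \frac{1}{\sqrt{2\pi}\,\sigma}$, such that
\[ f(x) = c\,e^{-(x-\mu)^2/2\sigma^2} \Rightarrow f''(x) = c\,\sigma^{-4} e^{-(x-\mu)^2/2\sigma^2} (x-\mu)^2 - c\,\sigma^{-2} e^{-(x-\mu)^2/2\sigma^2} \]
Setting $f''(x) = 0$ gives $(x-\mu)^2 = \sigma^2$, that is, x = µ − σ or x = µ + σ.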