Week - 5 Lecture
Week - 5 Lecture
Note this differs from common English usage where the word
variable implies something that varies from individual to individual.
y P(y)
outcome probability
0 1/4
1 2/4
2 1/4
• Binomial Distribution
• Poisson Distribution
Binomial Distribution
• The experiment consists of n identical trials (simple experiments).
• Each trial results in one of two outcomes (success or failure)
• The probability of success on a single trial is equal to and
remains the same from trial to trial.
• The trials are independent, that is, the outcome of one trial does not
influence the outcome of any other trial.
• The random variable y is the number of successes observed during
n trials.
n!
P( y ) y (1 ) n y
y!(n y )!
n!=1x2x3x…x n
n Mean
Standard deviation
n (1 )
RVDist-13
RVDist-14
RVDist-15
Poisson Distribution
A random variable is said to have a Poisson Distribution with rate
parameter , if its probability function is given by:
e=2.718…
y
P( y ) e , for y 0,1,2,...
y!
, 2
Mean and variance for a Poisson
RVDist-19
Probability Density Function
A function which integrates to 1 over its range and from which
event probabilities can be determined.
A mathematical abstraction
Normal Probability Density Function
Recall: continuous
Figure: Age distribution
random variables are of a pediatric population
described with with overlying Normal
probability density pdf
function (pdfs)
curves
Normal pdfs are
recognized by their
typical bell-shape
23
Area Under the Curve
pdfs should be viewed
almost like a histogram
Top Figure: The darker
bars of the histogram
correspond to ages ≤ 9
(~40% of distribution) f ( x)
1
x
12
e
2
2
Bottom Figure: shaded
area under the curve
(AUC) corresponds to
ages ≤ 9 (~40% of area)
24
Parameters μ and σ
Normal pdfs have two parameters
μ - expected value (mean “mu”)
σ - standard deviation (sigma)
25
Mean and Standard Deviation of Normal
Density
μ
26
Two types of means and
standard deviations
• The mean and standard deviation from
the pdf (denoted μ and σ) are
parameters
• The mean and standard deviation from
a sample (“xbar” and s) are statistics
• Statistics and parameters are related,
but are not the same thing!
27
t
y
D
i
s
t
r
i
b
u
t
i
o
n
s
28
Example: 68-95-99.7 Rule
Wechsler adult 68% of scores within
intelligence scores: μ±σ
Normally distributed = 100 ± 15
with μ = 100 and σ = 15; = 85 to 115
X ~ N(100, 15) 95% of scores within
μ ± 2σ
= 100 ± (2)(15)
= 70 to 130
99.7% of scores in
μ ± 3σ =
100 ± (3)(15)
= 55 to 145
7: Normal Probability Distributions 29
Symmetry in the Tails
Because the Normal
curve is symmetrical
and the total AUC is
exactly 1…
… we can easily
determine the AUC in
95%
tails
7: Normal Probability Distributions 30
Example: Male Height
Male height: Normal with μ = 70.0˝ and σ = 2.8˝
68% within μ ± σ = 70.0 2.8 = 67.2 to 72.8
32% in tails (below 67.2˝ and above 72.8˝)
16% below 67.2˝ and 16% above 72.8˝ (symmetry)
D
i
s
t
r
i
b
u
t
i
o
n
s
Example: Logarithmic
Reexpression
Prostate Specific Antigen Take exponents of “95% range”
(PSA) is used to screen e−1.9,1.3 = 0.15 and 3.67
Thus, 2.5% of non-diseased
for prostate cancer
population have values greater
In non-diseased than 3.67 use 3.67 as
populations, it is not screening cutoff
Normally distributed, but
its logarithm is:
ln(PSA) ~N(−0.3, 0.8)
95% of ln(PSA) within
= μ ± 2σ
= −0.3 ± (2)(0.8)
= −1.9 to 1.3
33
7.4 Assessing Departures
from Normality
Approximately Same distribution on
Normal histogram Normal “Q-Q” Plot