02 Normal Distribution
02 Normal Distribution
Junzhe Bao
Department of Biostatistics and Epidemiology
Email: statistics_bjz@126.com
1
Outline
2
Probability density curve
i: interval width
(4)
n: sample size(
1402)
3
Probability density curve
Probability density
weight
4
Probability density curve
5
Normal distribution and it’s general features
When the data values are evenly distributed about the mean, high in the middle and low on
both sides, showing a bell shape, which is similar to a normal curve, such distribution could
be called a normal distribution.
1 −( X − ) 2 / 2 2
f (X ) = e ,− X +
2
f(X): the probability density.
σ : standard deviation.
π ≈ 3.14 (≈ means “is approximately equal to”)
e ≈ 2.718
μ : mean value.
σ2 :variance
X ~ N(μ, σ2 )
8
Normal distribution and it’s general features
The shapes and locations of normal distributions with different values of μ and σ
μ: location; σ: shape
9
Normal distribution and it’s general features
a1 a2
x
a μ
The calculation of the probability of the values falling into the area smaller than a specific value
(such as “a”) or falling into a range (such as “a1~a2”) is equal to calculate the corresponding
area under the normal distribution curve.
10
Normal distribution and it’s general features
➢ The distribution has two parameters. One is μ ( location parameter), the other is σ
(variability parameter).
➢ The curve never touches the x axis, and the total area under a normal distribution
curve is equal to 1.
11
Outline
12
Standard normal distribution
➢ Since each normally distributed variable has its own mean and standard
deviation, as stated earlier, the shape and location of these curves will vary.
➢ In practical applications, then, you would have to have a table of areas under
the curve for each variable, so as to predict the probability of a specific value
or range.
➢ To simplify this situation, statisticians use what is called the standard normal
distribution, with a mean of 0 and a standard deviation of 1.
13
Standard normal distribution
All normally distributed variables can be transformed into the standard normally
distributed variable :
1 −( X − ) 2 / 2 2
f (X ) = e ,− X +
2
X −
Z=
1 −Z 2 / 2
(Z ) = e ,− Z +
2
14
Standard normal distribution
X −
Normal
Z = Standard normal
distribution distribution
=1
Z
x =
15
Standard normal distribution
Example 1:
Suppose that X~N(μ, σ2 ), calculate the probability of the values of X falling into
the range between μ - 1.96σ and μ + 1.96σ.
16
The area
for
Negative
infinity to
Z (-∞ ~ Z)
17
Standard normal distribution
Example 2:
Supposed that X~N(μ, σ2 ), please calculate the probability of the values of X
falling into the range between μ – 2.58σ and μ + 2.58σ, as well as between μ –
1.64σ and μ + 1.64σ.
18
Standard normal distribution
Example 3:
Suppose that diastolic blood pressure (X) in hypertensive
women centers is normally distributed, with the mean value
of 100mmHg and standard deviation of 16mmHg. Please
calculate P(X<90mmHg), P(X>124mmHg), as well as
P(90mmHg < X <124mmHg).
20
Standard normal distribution
-0.63 0 1.50 21
The area
for
Negative
infinity to
Z (-∞ ~ Z)
P(Z>1.5)=
1-P(Z<1.5)
=
1-0.9332
=
0.0668=
6.68%=
P(Z<-1.5)
22
Standard normal distribution
Example 3:
P(X<90mmHg)=P(Z<-0.63)=0.2643=26.43%
P(X>124mmHg)=P(Z>1.50)=P(Z<-1.50)=0.0668=6.68%
23
Outline
24
The applications of normal distributions
25
The applications of normal distributions
26
The applications of normal distributions
27
The applications of normal distributions
28
The applications of normal distributions
Two-sided values
2.5% of the time a sample value will be less than the lower limit
of this interval, and 2.5% of the time it will be larger than the
upper limit of this interval.
Examples:
29
The applications of normal distributions
➢ One-sided values
In some cases, only one side of the range is usually of interest, such
as with markers of pathology including cancer antigen, where it is
generally without any clinical significance to have a value below
what is usual in the general population.
30
The applications of normal distributions
➢ Example 4
Suppose the concentration of hemoglobin in 120 health women is
normally distributed with a mean of 117.4g/L and standard
deviation 10.2g/L. What is the 95% reference range of hemoglobin?
31
The applications of normal distributions
Because it is abnormal whether hemoglobin is too higher or too lower. We should use
two-sided range.
In practical applications, the sample mean(𝑥)ҧ is often used to represent the population
mean (μ), and the sample standard deviation (s) is the population standard deviation (σ).
➢ Example 5
Measure the vital capacity of 110 health adult men and get a mean
of 4.2/L and standard deviation of 0.7/L.
What is 95% reference range of vital capacity ?
34
The applications of normal distributions
35
The applications of normal distributions
X + 1.64S
One-sided
normal distribution X −1.64S
Two-sided X 1.96S
Distribution Plot Distribution Plot
Normal, Mean=0, StDev=1 Normal, Mean=0, StDev=1
0.4 0.4
0.3 0.3
Density
Density
0.2 0.2
95% 95%
0.1 0.1
0.0 0.0
-1.960 0 1.960 -1.645 0
X X
36
The applications of normal distributions
37
The applications of normal distributions
One-sided X + 1.64S
X −1.64S
normal distribution
two-sided X 1.96S
One-sided <P Or >P
95 5
Percentile method
Example 6
Suppose that the scores on CET-4 are normally distributed with a mean of 70 and
standard deviation of 6.
What is the probability that a randomly selected score is larger than 80?
The applications of normal distributions
X − 80 − 70
Z= = = 1.67
6
P(X>80)=P(Z>1.67)=0.0475
So the probability that a randomly selected score is larger than 80
is 0.0475. that is to say, the scores of 4.75% people is higher than
80
The applications of normal distributions
Example 7
An instructor is administering a final examination. She tells her class that she will
give an A grade to the 10% of the students who earn the highest marks. Past
experience with the same examination has yielded grades that are normally
If the same present class runs true to form, what numerical score would a student
Step one
P (One-sided area)=0.1 Z?
X − 80 − 70
Z= = = 1.67
6
10%
43
The applications of normal distributions
Step two
Z=1.28 x?
x−x x − 70
Z= = 1.28
s 10
x=82.8
Thanks!
45