15chap 3.1 Sampling Distribution
15chap 3.1 Sampling Distribution
Sampling Distribution
Population Parameter and Sample Statistic
Sampling is the process to obtain useful information on a large
target group. The target group is called the population, the
portion of the population which is selected and observed is
called the sample, and the number of members in the sample
is called the sample size. The members in the sample are
called sampling units. The listing of all possible sampling units
is called sampling frame.
The numerical measures which describe certain characteristics of a
population are called population parameters and the numerical
measures calculated from the sample are called sample statistic.
Example:
1. The mean height of all Grade 11 students in Metro Manila is a
population parameter.
2. The mean height of the 1000 Grade 11 students in the sample is a
sample statistic.
2. Testing:
In this type of inference, we formulate a decision about the value
of a population parameter.
3. Regression:
In this type of inference, we make predictions or forecasts about
the value of a population parameter.
Random Sample
2. Without replacement
This means that once a datum is chosen, it has no further chance of being
chosen. In this case, the size of the population for sampling is reduced by
one as each sampling unit is chosen.
sample mean
It is the mean computed from the randomly selected samples.
sampling distribution
A sample statistic together with its corresponding probability distribution
forms a sampling distribution.
Sampling Distribution
sampling error
It is the difference between the results obtained from a sample and the results
obtained from the population from which the sample was selected.
sampling error = 𝑥ҧ − 𝜇
Sampling Distribution:
𝑼𝒔𝒆 𝒕𝒉𝒆 𝒄𝒂𝒍𝒄𝒖𝒍𝒂𝒕𝒐𝒓 𝒕𝒐 𝒇𝒊𝒏𝒅
ഥ
𝒙 6 7 8 9 10
𝑷(ഥ
𝒙) 1/9 2/9 3/9 2/9 1/9 ഥ =𝟖
𝑬 𝑿
ഥ = 𝟏. 𝟑𝟑
𝑽𝒂𝒓 𝑿
Example 2:
Suppose a population consists of four data, namely 5, 8, 12, 16 and each
has an equal probability to be chosen. Suppose a sample of size 2 is
chosen from the population with replacement.
a. Find the mean 𝜇 and variance 𝜎 2 of the population.
ത
b. Find the probability distribution of the sample mean 𝑋.
c. Find 𝐸 𝑋ത and 𝑉𝑎𝑟 𝑋ത
Samples 5,5 5,8 5,12 5,16 8,8 8,12 8,16 12,12 12,16 16,16
ഥ
𝑿 5 6.5 8.5 10.5 8 10 12 12 14 16
Example 2:
a. Find the mean 𝜇 and variance 𝜎 2 of the population.
5+8+12+16
𝜇= = 10.25 𝜎 2 = 17.19
4
b. ഥ
𝒙 2 2.7 3.3 4 4.7 5.3 6
𝑷(ഥ
𝒙) 0.1 0.1 0.2 0.2 0.2 0.1 0.1
c. 𝑃 𝑋ത = 4 = 0.2
Example 4:
Suppose a random sample of size 200 is taken from a population with a
mean of 510 kg and a standard deviation of 15 kg;
a. Find the mean and the variance of the sample mean.
b. If it is required to reduce the standard error of the mean to less than
0.5 kg, what is the minimum sample size?
Solution:
𝝈𝟐 𝟏𝟓𝟐
a. 𝑆𝑖𝑛𝑐𝑒 𝜇 = 510, 𝑡ℎ𝑒𝑛 𝐸 𝑋ത = 𝜇𝑥ҧ = 𝟓𝟏𝟎 𝒌𝒈 and 𝑽𝒂𝒓 𝑿
ഥ = = = 𝟏. 𝟏𝟑 𝒌𝒈𝟐
𝒏 𝟐𝟎𝟎
𝜎 15
b. < 0.5, 𝑛> , 𝑛 > 30, n > 900 , Answer: 901
𝑛 0.5
Sampling Distribution of the sample
mean from a Normal Distribution
a. Find the probability that the sample mean is greater than 52.
52 − 51
𝑃 𝑋ത > 52 = 𝑃 = 𝑃 𝑍 > 1 = 𝟎. 𝟏𝟓𝟖𝟕
1
a. Find the probability that the sample mean lies between 49.5 and
51.5.
49.5 − 51 51.5 − 51
𝑃 49.5 < 𝑋ത < 51.5 = 𝑃 <𝑍< = 𝑃 −1.5 < 𝑍 < 0.5 = 𝟎. 𝟔𝟐𝟒𝟕
1 1
Sampling Distribution of the sample mean from
other Distribution
If the population from which the sample is taken does not
follow a normal distribution, then the sampling distribution of
the sample mean is also not normal.
ҧ
𝑥−𝜇
Formula: 𝑍= 𝜎
𝑛
Example 6:
A sample size of 400 is taken from a population X that
follows a Poisson distribution with 𝜇 = 4. Find the
probability that the sample mean is less than 3.85.
Solution:
𝑿~𝑷𝒐 𝝁 𝒘𝒉𝒆𝒓𝒆 𝝁 = 𝝈𝟐 = 𝟒
3.85−4
𝑃 𝑋ത < 3.85 = 𝑃 𝑍 < 2 = 𝑃 𝑍 < −1.5 = 𝟎. 𝟎𝟔𝟔𝟖
400
Example 7:
The mean and standard deviation of the number of
phone calls that a company receives in an hour are found
to be 25 and 15 respectively. A sample of n different
hours is randomly selected. If the probability that the
sample mean number of phone calls received is greater
than 24 per hour and is 97.5%, find the least possible
value of n assuming that n is sufficiently large.
Example 7:
Solution:
Let X be the random variable representing the number of phone calls received in an
hour.
Central Limit Theorem can be used to approximate the distribution.
−29.4 − 𝑛
𝑃 𝑋ത > 24 > 0.975 >
−1 −1
24−25
𝑍> 15 > 0.975 (29.4)2 < ( 𝑛)2
𝑛
−𝟏
-1.96 ≻ 𝟏𝟓 𝒏 > 𝟖𝟔𝟒. 𝟑𝟔
𝒏
− 𝑛
−1.96 > Answer: The least possible sample size is 865.
15
Assessment 1:
The following table shows the probability distribution of a
discrete random variable X. (recall: its not sample mean)
𝒙 0 2 4 6 8
𝑷(𝒙) 0.1 0.3 0.2 0.25 0.15
b. Use Central Limit theorem to solve for the probability of 𝑋ത > 4.5.
𝑥ҧ − 𝜇 4.5 − 4.10 0.4
𝑧= 𝜎 = = = 1.61
2.49 0.249
𝑛 100
𝑃 𝑍 > 1.61 = 0.0537
Assessment 2:
A random sample of size 20 is taken from a normal population
with mean 10 and variance 6. Find the probabilities that the
sample mean 𝑋ത is:
a. greater than 10.5,
b. less than 9.7; and
c. between 9.8 and 10.2
Assessment 2:
ത
Use the Central Limit theorem to solve for the probabilities of the sample mean 𝑋.
𝜇 = 10 𝜎2 = 6 hence 𝜎 = 2.45
a. greater than 10.5
ҧ
𝑥−𝜇 10.5−10 0.5
𝑧= 𝜎 = 2.45 = = 0.91 𝑃 𝑍 > 0.91 = 0.1814
0.5478
𝑛 20
Question 4:
A random sample of size 50 is taken from a normal population with mean
120 cm and standard deviation of 10 cm. Find the probabilities that :
a. the sample mean is less than or equal to 117 cm.
b. the sample mean lies between 121 cm and 124 cm.
Next are…
Point Estimation
Confidence Interval for Population Mean
Confidence Interval for Population Proportion