Chapter 07
Chapter 07
Learning Objectives
1. Understand the importance of sampling and how results from samples can be used to
2. Know what simple random sampling is and how simple random samples are selected.
4. Understand the central limit theorem and the important role it plays in sampling.
5. Specifically know the characteristics of the sampling distribution of the sample mean ( )
6. Learn about a variety of sampling methods, including stratified random sampling, cluster
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Solutions
c. E and C because 8 and 0 do not apply; 5 identifies E; 7 does not apply; 5 is skipped
because E is already in the sample; 3 identifies C; 2 is not needed because the sample of
size 2 is complete.
2. Using the last three digits of each five-digit grouping provides the random numbers:
Numbers greater than 350 do not apply, and the 147 can only be used once. Thus, the simple
4. a. 5, 0, 5, 8
b.
5. 283, 610, 39, 254, 568, 353, 602, 421, 638, 164
7. 108, 290, 201, 292, 322, 9, 244, 249, 226, 125, (continuing at the top of column 9) 147, and
113.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
9. 102, 115, 122, 290, 447, 588, 351, 157, 835, 498, 55, 801
10. a. Finite population. A frame could be constructed obtaining a list of licensed drivers from
b. Infinite population. Sampling from a process. The process is the production line
c. Infinite population. Sampling from a process. The process is one of generating arrivals
e. Infinite population. Sampling from a process. The process is one of generating orders
11. a.
b.
s=
b. = 55/150 = .3667
13. a.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
b.
94 +1 1
100 +7 49
85 –8 64
94 +1 1
92 –1 1
b. Seventeen of the 40 stocks in the sample are rated Above Average with respect to risk.
c. There are eight stocks in the sample that are rated one star or two stars.
15. a.
b.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
16. a. The sampled population is U. S. adults that are 50 years of age or older.
b. We would use the sample proportion for the estimate of the population proportion.
c. The sample proportion for this issue is .74, and the sample size is 426. The number of
d. We would use the sample proportion for the estimate of the population proportion.
e. The inferences in parts b and d are being made about the population of U.S. adults who
are age 50 or older. So the population of U.S. adults who are age 50 or older is the target
population. The target population is the same as the sampled population. If the sampled
population was restricted to members of AARP who were 50 years of age or older, the
sampled population would not be the same as the target population. The inferences
made in parts b and d would only be valid if the population of AARP members age 50
or older was representative of the U.S. population of adults age 50 and older.
b. = 2,770/4,135 = .6699
c. = 2,233/4,135 = .5400
18. a.
b.
d. It shows the probability distribution of all possible sample means that can be observed
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
with random samples of size 100. This distribution can be used to compute the
E = μ = 200
For +5,
At = 205, = .8413
At = 195, = .1587
b. For + 10,
At = 210, = .9772
At = 190, = .0228
20.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The standard error of the mean decreases as the sample size increases.
21. a.
b. n / N = 50 / 50,000 = .001
Use
c. n / N = 50 / 5000 = .01
Use
d. n / N = 50 / 500 = .10
Note: Only case d where n /N = .10 requires the use of the finite population correction
factor.
b. For n = 120, E ( ) remains $71,800, and the sampling distribution of can still be
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
approximated by a normal distribution. However, is reduced to = 365.15.
c. As the sample size is increased, the standard error of the mean, , is reduced. This
appears logical from the point of view that larger samples should tend to provide sample
means that are closer to the population mean. Thus, the variability in the sample mean,
At = 72,300,
At = 71,300,
b.
At = 72,300,
At = 71,300,
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
24. a. Normal distribution,
25. This value for the standard error can be used for parts a and b.
c. The probability of being within 10 of the mean on the evidence-based reading and
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
writing portion of the test is exactly the same as the probability of being within 10 on
the Mathematics portion of the SAT. This is because the standard error is the same in
both cases. The fact that the means differ does not affect the probability calculation.
26. a.
The z value for – 16,642 = –200 is the negative of the z value for – 16,642
b. A larger sample increases the probability that the sample mean will be within a
specified distance of the population mean. In this instance, the probability of being
within +200 of μ ranges from .3544 for a sample of size 30 to .9050 for a sample of size
400.
27. a.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(36.39 ≤ ≤ 38.39) = .9382 – .0618 = .876
b.
c. In part b we have a higher probability of obtaining a sample mean within $1.00 of the
population mean because the standard error for female graduates (.5798) is smaller than
d. With n = 120,
At = 27.23,
The probability the sample mean will be within 1 inch of the population mean of 22
is .8294.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
c.
The probability the sample mean will be within 1 inch of the population mean of 42
is .9070.
The probability of being within 1 inch is greater for New York in part c because
a. n = 30 Within 16 means
b. n = 50 Within 16 means
d. None of the sample sizes in parts a, b, and c is sufficiently large. The sample size will
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
need exceed n = 100, which was used in part c.
30. a. n / N = 40 / 4000 = .01 < .05; therefore, the finite population correction factor is not
necessary.
Including the finite population correction factor provides only a slightly different value
31. a. E( ) = p = .40
b.
32. a. E( ) = .40
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(z ≤ .87) = .8078
33.
34. a.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(.26 ≤ ≤ .34) = .8078 – .1922 = .6156
b.
c.
e. With a larger sample, there is a higher probability will be within + .04 of the
population proportion p.
35. a.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
p(1 p) .30(.70)
p .0458
n 100
p
.30
b. P (.20 ≤ ≤ .40) = ?
c. P (.25 ≤ ≤ .35) = ?
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(.50 .60) = P(–1.42 ≤ z ≤ 1.42) = .9222 – .0778 = .8444
d.
e. No, the probabilities are exactly the same. This is because , the standard error, and the
width of the interval are the same in both cases. Notice the formula for computing the
f. For n = 400, .
The probability is larger than in part b. This is because the larger sample size has
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
reduced the standard error from .0352 to .0249.
37. a. np = 525(.10) = 52.5 and n(1 – p) = 525(1 – .10) = 472.5, so is normal distributed with
and
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(.39 ≤ ≤ .44) = .9599 – .0401 = .9198
d. The probabilities would increase. This is because the increase in the sample size makes
39. a. np = 200(.80) = 160 and n(1 – p) = 200(1 – .80) = 40, so is normal distributed normal
b.
c. np = 450(.80) = 360 and n(1 – p) = 450(1 – .80) = 90, so is normal distributed normal
d.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
e. The probability of the sample proportion being within .04 of the population mean is
increased .8410 to .9660. The likelihood of the sample proportion being within 0.04 of
the population proportion increases by .125 by increasing the sample size from 200 to
450. If the extra cost of using the larger sample size is not too great, we should probably
do so.
40. a. E ( ) = .76
c.
41. a. E( ) = .17
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Distribution is approximately normal because np = 800(.17) = 136 > 5 and n(1 – p) =
c.
– .0166 = .9668
42. a. E ( x )=μ=400
b. σ x =σ / √ n=100 / √ 100,000=.3162
d. It shows the probability distribution of all possible sample means that can be observed
with random samples of size 100,000. This distribution can be used to compute the
43.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The standard error of the mean decreases and becomes extremely small as the sample
44. a. E( p) = p = .75
b.
d. It shows the probability distribution of all possible sample proportions that can be
observed with random samples of size 100,000. This distribution can be used to
45.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The standard error of the sample proportion, decreases and becomes extremely small as
c. In part a we found that P(99 ≤ x ≤ 101) = .9892, so the probability the mean annual
number of hours of vacation time earned for a sample of 15,000 blue collar and service
employees who work for small private establishments and have at least 10 years of
service differs from the population mean μ by more than 1 hour is 1 – . 9892 – .0108, or
approximately 1%.
Because these sample results are unlikely if the population mean is 100, the
sample results suggest either (a) the mean annual number of hours of vacation time
earned by blue-collar and service employees who work for small private establishments
and have at least 10 years of service is not 100 or (b) the sample is not representative of
the population of blue-collar and service employees who work for small private
establishments and have at least 10 years of service. The sampling procedure should be
carefully reviewed.
47. . This value for the standard error can be used for parts a
and b.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
a. P (25.15 ≤ x ≤ .25.25) = ?
b. P (25.19 ≤ x ≤ .25.21) = ?
For samples of the same size, the probability of being within 05 of the mean miles per
gallon is much higher than the probability of being within .01 miles per gallon.
c. P (25.19 ≤ x ≤ .25.21) = ?
The standard error is now smaller because the sample size is larger.
The probability is larger here than it is in part b because the larger sample size has made
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
d. P (24.2 ≤ x ≤ .26.2) = ?
Because the probability the mean miles per gallon for a sample of 70,000 new cars and
light trucks sold in the United States in 2017 differs from the population mean μ by less
than 1 gallon is approximately 1.0000, the probability the mean miles per gallon for a
sample of 70,000 new cars and light trucks sold in the United States in 2017 differs
from the population mean μ by more than 1 gallon is approximately 1 – 1.0000 = .0000.
These sample results are unlikely if the population mean is 25.2, so these sample
results would suggest either (1) the population mean miles per gallon for new cars and
light trucks sold in the United States in 2017 is not 25.2 or (2) the sample is not
representative of the population of new cars and light trucks sold in the United States in
48. a.
p (1 p ) .42(.58)
p .0015
n 108, 700
p
.30
.42
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
b. P (.419 ≤ p ≤ .421) = ?
c. P (.4175 ≥ p ≥ .4225) = ?
For samples of the same size, the probability of being within 1% of the population
d. P (.41 ≤ p ≤ .43) = ?
Because the probability the proportion of orders placed by repeat customers for a
sample of 108,700 orders from the past six months differs from the population
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
orders placed by repeat customers for a sample of 108,700 orders from the past six
1.0000 = .0000.
These sample results are unlikely if the population proportion is .42, so these
sample results would suggest either (1) the population proportion of orders placed by
repeat customers from the past six months is not 42% or (2) the sample is not
representative of the population of orders placed by repeat customers from the past six
49. a. For a sample of 207,000 U.S. homes, the normal distribution is appropriate because np
= 207,000(.492) = 101,844 and n(1 – p) =207,000(.508) = 105156 are both greater than
c. For a sample of 86,800 U.S. homes, the normal distribution is appropriate because np =
86,800(.492) = 42,705.6 and n(1 – p) =86,800(.508) = 44,094.4 are both greater than 5.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(.490 p .494) = P(–1.18 ≤ z ≤ 1.18) = .8810 – .1190 = .7620
e. Yes, the probability in d is smaller than the probability in b because the sample size in d
50. a. Sorting the list of companies and random numbers to identify the five companies
associated with the five smallest random numbers provides the following sample.
b. Step 1: Generate a new set of random numbers in column B. Step 2: Sort the random
numbers and corresponding company names into ascending order and then select the
companies associated with the five smallest random numbers. It is extremely unlikely
51. a.With n = 100, we can approximate the sampling distribution with a normal distribution
having
E( ) = 10,348
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
b. P(z ≤ .80) = .7881
The probability that the sample mean will be within $200 of the population mean
is .5762.
c. At 12,000,
Yes, the research firm should be questioned. A sample mean this large is extremely
E = 406
c. At = 380,
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P( ≤ 380) = P(z ≤ –2.60) = .0047
Yes, this is an unusually low performing group of 64 stores. The probability of a sample
mean annual sales per square foot of $380 or less is only .0047.
53. With n = 60, the central limit theorem allows us to conclude the sampling distribution for
E = 320.51
a.
b.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P( > 27,175) = P(z > 0) = .50
Note: This could have been answered easily without any calculations; 27,175 is the
d.
55. a.
N = 2,000
N = 5,000
N = 10,000
Note: With n / N ≤ .05 for all three cases, common statistical practice would be to ignore
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
the finite population correction factor and use for each case.
b. N = 2,000
N = 5,000
N = 10,000
All probabilities are approximately .78, which indicates that a sample of size 50 will
56. a.
b. For + 25,
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(z < –1.25) = .1056
x
n 30
0.05 0.05
x
1.9 2.1
The area below = 2.1 must be 1 – .05 = .95. An area of .95 in the standard normal table
shows z = 1.645.
Thus,
Solve for .
58. p = .15
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
b. Within ± .04 means .11 ≤ ≤ .19
59.
P ( ≥ .375) = ?
60. a.
b. We want P( .45).
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P( .45) = 1 – P( ≤ .45) = 1 – P(z ≤ 1.99) = 1 – .9767 = .0233
b. P (.12 .18) = ?
62. a.
Solve for n
c. P ( ≥ .30) = ?
= .2119
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
b. Within ±4 hours means 7,996 8,006.
x
At = 8,001, P(z ≤ 0.39) = .6517
x
At = 7,999, P(z < –0.39) = .3483
d. In part b we found that P(7,996 ≤ x ≤ 8,004) = .8812, so the probability the mean life of
a sample of 35,000 14-watt CFLs will differ from the population mean life μ by more
than 4 hours is 1 – .8812 = .1188. If the population mean is 8,000, sample results as
extreme as this would occur more than one time out of 10 samples. The results therefore
do not provide strong evidence the population mean is different from 8,000.
b. Within ±3 minutes of the mean is the same as within 3/60 – .05 hours of the mean, i.e.,
17.55 17.65.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
At = 17.55, z = –2.86. P(z < –2.86) = .0021
c. In part b we found that if we assume the U.S. population mean and standard deviation
are appropriate for Florida, then P(17.55 ≤ x ≤ 17.65) = .9958, so the probability the
mean for a sample of 85,020 Floridians will differ from the U.S. population mean by
more than 3 minutes is 1 – .9958 = .0042. This result is would be highly unlikely and
would suggest that the home Internet usage of Floridians differs from home Internet
65. a.
p (1 p ) .043(.957)
p .0006
n 114, 250
p
.30
.043
b. P (.042 ≤ p ≤ .044) = ?
c. P (.0425 ≤ p ≤ .0435) = ?
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
P(z ≤ 0.83) = .7967
For samples of the same size, the probability of being within .1% of the population
proportion of undeliverable mail pieces is much larger than the probability of being
66. a.
p (1 p ) .58(.42)
p .0035
n 20, 000
p
.30
.58
b. P (.57 ≤ p ≤ .59) = ?
c. In part b we found that P(.57 ≤ x ≤ .59) = .9958, so the probability the proportion of a
sample of 20,000 drivers that is speeding will differ from the U.S. population proportion
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
of drivers that is speeding by more than 1% is 1 – .9958 = .0042.
These sample results are unlikely if the population proportion is .58, so these
sample results would suggest either (1) the population proportion of drivers who are
speeding is not 58% or (2) the sample is not representative of the population of U.S.
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Case Solutions
1. We know that
• E( x ) = μ regardless of the sample size, so the mean of all possible values of x is equal
to 80.
σ 25
• The sample size is 50, so the standard error of the mean is σ x = = =3.5355 .
√ n √50
• The simple random sample of 50 customers, so the central limit theorem enables us to
conclude that the sampling distribution of the mean can be approximated by a normal
distribution.
At x = 75, we have
75−80
z= =−1.41
3.5355
The area under the curve to the left of z = –1.41 is .0793. Therefore, P( x ≤ 75) = .0793
when n = 50. The probability the mean score of Blugurt given by a simple random
2. We know that:
• E( x ) = μ regardless of the sample size, so the mean of all possible values of x is again
equal to 80.
• The sample size is now 150, so the standard error of the mean is
σ 25
σ x= = =2.0412.
√ n √150
• The simple random sample of 150 customers and the central limit theorem enable us
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
normal distribution.
At x = 75, we have
75−80
z= =−2.45
2.0412
The area under the curve to the left of z = –2.45 is .0071. Therefore, P( x ≤ 75) = .0071
when n = 150. The probability the mean score of Blugurt given by a simple random
3. Whenever the sample size is increased, the standard error of the sample mean decreases.
With n = 50, the standard error of the sample mean for Blugurt scores is 3.5355. With the
increase in the sample size to n = 150, the standard error of the sample mean is decreased
to 2.0412. Because the sampling distribution of the sample mean with n = 150 has a
smaller standard error, the potential values of x that the sample can yield have less
variation and tend to be closer to the population mean than the potential values of x when
n = 50. With a smaller standard error, it is less likely that a sample will have a mean
© 2019 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.