Stat 8
Stat 8
2019
Tests of Hypothesis II
F Distribution
Analysis of Variance
We describe (1) a test for variances and then a (2) test that
simultaneously compares several means to determine if they
came from equal populations.
Characteristics of the F Distribution
There is a family of F distributions. Each time the degrees of
freedom in either the numerator or the denominator change, a
new distribution is created.
The larger of the two sample variances is placed in the numerator, forcing
the ratio to be at least 1.00
▪ We calculate the standard deviation, 𝑠, and square the
standard deviations to get the variance, 𝑠 2 , for each
population
Comparing Two Population Variances
Example:
A health services corporation manages two hospitals in Knoxville:
St. Mary’s North and St. Mary’s South.
The mean waiting time in both Emergency Departments is 42
minutes.
The hospital administrator believes St. Mary’s North has more
variation than St. Mary’s South.
Compare Two Population Variances Example
Lammers Limos offers limousine service
from Government Center in downtown
Toledo, Ohio, to Metro Airport in Detroit.
The president of the company is
considering two routes: via U.S. 25 and
via Interstate-75.
He wants to study the time it takes to get
to the airport using each route and
compare the results.
He collected the following sample data.
Compare Two Population Variances Example
Using the .10 significance level, is there a difference in the
variation in the driving times for the two routes?
Compare Two Population Variances Example
Using the .10 significance level, is there a difference in the
variation in the driving times for the two routes?
▪ Step 1: State the null and alternate hypothesis
𝐻0 : 𝜎12 = 𝜎22
𝐻1 : 𝜎12 ≠ 𝜎22
Analysis of Variance
▪ Step 6: Interpret the result, we conclude the populations are not all equal
Pairs of Means
If a null hypothesis of equal treatment means is rejected, we can identify the
pairs of means that differ with the following confidence interval
If the confidence interval includes zero, there is not a difference between the
treatment means
Pairs of Means Analysis Example
▪ Recall in the previous example of airline
satisfaction, we rejected the null hypothesis
that the population means were equal; at
least one of the airline’s mean level of
satisfaction is different from the others. But
we do not know which pairs.
Analysis of Variance
▪ The SSE term, or sum of squares error, is found with the following
equation
ANOVA Test Example
▪ WARTA, the Warren Area Regional Transit Authority, is expanding bus service from the
suburb of Starbrick to the business district of Warren. There are four routes being
considered, U.S. 6, West End, Hickory St. , and Rte. 59.
▪ WARTA conducted tests to determine whether there is a difference in the mean travel
times along the four routes; each driver drove each route.
ANOVA Test Example
Questions
▪ At the .05 significance level, is there a difference in the mean travel time
along the four routes?
▪ If we remove the effects of the drivers, is there a difference in the mean
travel time?
ANOVA Test Example
▪ Step 1: State the null and alternate hypothesis
𝐻0 : 𝜇2 = 𝜇3 = 𝜇1 = 𝜇1
𝐻1 : 𝑁𝑜𝑡 𝑎𝑙𝑙 𝑡𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡 𝑚𝑒𝑎𝑛𝑠 𝑎𝑟𝑒 𝑡ℎ𝑒 𝑠𝑎𝑚𝑒
▪ Step 2: Select the level of significance, we decide to use .05
▪ Step 3: Select the test statistic, we use F
▪ Step 4: State the decision rule, Reject H0 if F > 3.24
▪ Step 5: Make decision, F = 2.483, we do not reject the null hypothesis
▪ Step 6: Interpret, there is no reason to conclude that any one of the
routes is faster than any other.
ANOVA Test Example
▪ Step 5: Make decision, F = 2.483, we do not reject the null hypothesis
▪ To do so, requires that we calculate the SSB, the sum of the squares due
to blocks
Two-Way Analysis of Variance
Including the variance of the drivers, here is a table of the drivers
respective means with an overall mean of 22.8 minutes.
Two-Way Analysis of Variance
Including the variance of the drivers, here is a table of the drivers
respective means with an overall mean of 22.8 minutes.
Substituting this information in formula 12-6, we determine SSB is 119.7
SSB = kΣ(തxb – xതG)2
= 4(19.5 – 22.8)2 + 4(21.0 – 22.8)2 + 4(22.5 – 22.8)2 + 4(24.75 –22.8)2 +4(26.25 – 22.8)2
= 119.7
Then use formula 12-7 to find SSE
SSE = SS total – SST – SSB
= 229.2 – 72.8 – 119.7
= 36.7
A Second Treatment Variable Continued
Determine the F statistics for the treatment variable and the blocking variable from the
following ANOVA table
Hypothesis Test of Equal Block Means
▪ Step 1: State the null and alternate hypothesis
𝐻0 : 𝜇2 = 𝜇3 = 𝜇1 = 𝜇1
𝐻1 : 𝑁𝑜𝑡 𝑎𝑙𝑙 𝑡𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡 𝑚𝑒𝑎𝑛𝑠 𝑎𝑟𝑒 𝑡ℎ𝑒 𝑠𝑎𝑚𝑒
▪ Step 2: Select the level of significance, we decide to use .05
▪ Step 3: Select the test statistic, we use F
▪ Step 4: State the decision rule, Reject H0 if F > 3.24
▪ Step 5: Make decision, F = 2.483, we do not reject the null hypothesis
▪ Step 6: Interpret, there is no reason to conclude that any one of the
routes is faster than any other.
Hypothesis Test of Equal Block Means
Next, we test to find if the travel times for the various drivers are equal.
▪ State the decision rule for the second set of hypotheses, reject 𝐻0 if F > 3.26
▪ Make a decision, the computed F ratio is 9.78 so we reject the null hypothesis
𝑀𝑆𝐵 29.93
𝐹= = = 9.78
𝑀𝑆𝐸 3.06
▪ Interpret, we conclude at least one driver’s mean travel time is different from
the others. WARTA management can conclude, based on the sample results,
that there is a difference in the mean travel times of drivers.
Interaction Plot
▪ An interaction plot illustrates the interaction of the two factors,
route and driver
▪ Travel time is the response variable
Routes
Drivers U.S. 6 West End Hickory Rte. 59
Deans 18 17 21 22
Snaverly 16 22.33 23 22
Ormson 18 23 26 22
Zollaco 23 22 29 23.67
Filbeck 25 24 28 28
Hypothesis Tests for Interaction