Anova
Anova
Samprit Chakrabarti
Topics for Discussion
• ANOVA
Example
• Gulfstream Aerospace Company produced three different prototypes
as candidates for mass production as the company’s newest large-
cabin business jet, the Gulfstream IV. Each of the three prototypes
has slightly different features, which may bring about differences in
performance. Therefore, as part of the decision-making process
concerning which model to produce, company engineers are
interested in determining whether the three proposed models have
about the same average flight range. Each of the models is assigned
a random choice of 10 flight routes and departure times, and the
flight range on a full standard fuel tank is measured (the planes carry
additional fuel on the test flights, to allow them to land safely at
certain destination points). Range data for the three prototypes, in
nautical miles (measured to the nearest 10 miles), are as follows.
Data
One Independent
Variable One or More
Independent Variables
Categorical: Categorical
Binary Interval
Factorial and Interval
Analysis of Analysis of
t Test Variance Covariance Regression
More than
One Factor One Factor
SUMMARY
Groups Count Sum Average Variance
Column 1 6 132 22 0.2
Column 2 6 117.5 19.58333 0.641667
Column 3 6 106 17.66667 0.566667
Column 4 6 123.5 20.58333 0.441667
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 59.70833 3 19.90278 43.03303 6.54E-09 3.098391
Within Groups 9.25 20 0.4625
Total 68.95833 23
df for SST is (4-1) = 3, df for SSE is (24-4) = 20, df for total is (24-1) =
23. F theoretical is 3.098 where as F calculated is 43.03. As F
calculated greater than F theoretical, F falls in the rejection region.
Null hypothesis is rejected.
Conclusion
• There is enough evidence to believe that there
is a significant difference in the prices across
four cities
Two Way ANOVA
• A company which produces stationery items wants to diversify
into the photocopy paper manufacturing business. The company
has decided to first test market the product in three areas termed
as the north area, central area, and the south area. The company
takes a random sample of five salesmen S1, S2, S3, S4 and S5 for
this purpose. The sales volume generated by these five salesmen,
in thousand rupees, and total sales in different regions are given
in the table.
• Use a randomized block design analysis to examine:
– Whether the salesmen significantly differ in performance?
– Whether there is a significant difference in terms of sales capacity
between the regions?
– Take 95% confidence level
Data
Region Salesmen
S1 S2 S3 S4 S5 Region’s
Total
North 24 30 26 23 32 135
Central 22 32 27 25 31 137
South 23 28 25 22 32 130
Salesmen’s 69 90 78 70 95 402
Total
Hypotheses
• Divided into two parts
• For treatments (columns)
• For blocks (rows)
• For treatments: Null hypothesis: all the
treatment means are equal; Alternative
hypothesis: all the treatment means are unequal
• For blocks: Null hypothesis: all the block means
are equal; Alternative hypothesis: all the block
means are unequal
Excel Path
• Select Data Analysis
• From Data Analysis dialogue box, select Anova:
Two Factor Without Replication
• Click OK
• In the Anova: Two Factor dialogue box, enter
the location of the samples in the variable Input
Range box
• Place the value of α
• Click OK
Output Sheet: Excel
Anova: Two-Factor Without Replication
Column 1 3 69 23 1
Column 2 3 90 30 4
Column 3 3 78 26 1
Column 4 3 70 23.33333 2.333333
Column 5 3 95 31.66667 0.333333
ANOVA
Source of Variation SS df MS F P-value F crit
Rows 5.2 2 2.6 1.714286 0.2401 4.45897
Columns 183.0667 4 45.76667 30.17582 7.09E-05 3.837853
Error 12.13333 8 1.516667
Total 200.4 14
Store Num ber Coupon Level In-Store Prom otion Sales Clientel Rating
1 1.00 1.00 10.00 9.00
2 1.00 1.00 9.00 10.00
3 1.00 1.00 10.00 8.00
4 1.00 1.00 8.00 4.00
5 1.00 1.00 9.00 6.00
6 1.00 2.00 8.00 8.00
7 1.00 2.00 8.00 4.00
8 1.00 2.00 7.00 10.00
9 1.00 2.00 9.00 6.00
10 1.00 2.00 6.00 9.00
11 1.00 3.00 5.00 8.00
12 1.00 3.00 7.00 9.00
13 1.00 3.00 6.00 6.00
14 1.00 3.00 4.00 10.00
15 1.00 3.00 5.00 4.00
16 2.00 1.00 8.00 10.00
17 2.00 1.00 9.00 6.00
18 2.00 1.00 7.00 8.00
19 2.00 1.00 7.00 4.00
20 2.00 1.00 6.00 9.00
21 2.00 2.00 4.00 6.00
22 2.00 2.00 5.00 8.00
23 2.00 2.00 5.00 10.00
24 2.00 2.00 6.00 4.00
25 2.00 2.00 4.00 9.00
26 2.00 3.00 2.00 4.00
27 2.00 3.00 3.00 6.00
28 2.00 3.00 2.00 10.00
29 2.00 3.00 1.00 9.00
30 2.00 3.00 2.00 8.00
16-41
One-Way ANOVA: Effect of In-store Promotion on
Store Sales
Cell means
16-42
Illustrative Applications of One-Way
Analysis of Variance