Anova
Anova
(anova)
What is anova?
- It is a statistical test used to determine if there is a
statistically significant difference between two or
more categorical groups by testing for difference of
means using a variance.
- It splits the IV into two or more groups (e.g. one or
more groups might be expected to influence the DV,
while the other group is used as a control group and is
not expected to influence the DV)
2
What is anova?
- ANOVA test helps you to figure out if you need to
reject the null hypothesis or accept the alternate
hypothesis.
- ANOVA determines whether the groups created by
the levels of the independent variable are statistically
different by calculating whether the means of the
treatment levels are different from the overall mean of
the dependent variable.
3
How to state hypothesis
4
Assumptions of anova
1. There must be no relationship between the subjects
in each sample. This means that subjects in the first
group cannot also be in the second group.
2. The different groups/levels must have equal sample
sizes.
3. The DV should be normally distributed so that the
middle scores are the most frequent and the extreme
scores are the least frequent.
5
Assumptions of anova
1. Population variances must be equal (i.e.,
homoscedastic). Homogeneity of variance means that
the deviation of scores (measured by the range or
standard deviation, for example) is similar between
populations.
6
TYPES OF ANOVA TEST
ONE-WAY TWO-WAY
Has one categorical IV Has two or more
and a normally distributed categorical IVs and a
continuous (i.e. interval or normally distributed
ratio) DV continuous DV
7
ANOVA VS. T-TEST
- T-tests and ANOVA tests are both statistical
techniques used to compare differences in means and
spreads of the distributions across populations.
- The t-test determines whether two populations are
statistically different from each other, whereas
ANOVA tests are used when an individual wants to
test more than two levels within an independent
variable.
8
ANOVA VS. T-TEST
- Example would be, testing Non-Organic vs. Organic
would require a t-test while adding in Free Range as a
third option demands ANOVA. Rather than generate a
t-statistic, ANOVA results in an f-statistic to
determine statistical significance.
9
1
ONE-WAY ANOVA
EXAMPLES OF WHEN TO USE A ONE-WAY ANOVA
Situation 1: You have a group of individuals randomly
split into smaller groups and completing different tasks.
For example, you might be studying the effects of tea on
weight loss and form three groups: green tea, black tea,
and no tea.
11
EXAMPLES OF WHEN TO USE A ONE-WAY ANOVA
Situation 2: Similar to situation 1, but in this case the
individuals are split into groups based on an attribute
they possess. For example, you might be studying leg
strength of people according to weight. You could split
participants into weight categories (obese, overweight
and normal) and measure their leg strength on a weight
machine.
12
LIMITATIONS OF ONE-wAY ANOVa
A one way ANOVA will tell you that at least two groups
were different from each other. But it won’t tell you
which groups were different. If your test returns a
significant f-statistic, you may need to run an ad hoc test
to tell you exactly which groups had a difference in
means.
13
2
two-WAY ANOVA
What is two-way anova
- A Two Way ANOVA is an extension of the One Way
ANOVA.
- It is used to estimate how the mean of a quantitative
variable changes according to the levels of two
categorical variables.
- Use a two-way ANOVA when you want to know how
two independent variables, in combination, affect a
dependent variable.
15
Assumptions of two-way anova
1. Your DV should be measured at the continuous level.
2. Your two IVs should each consist of two or more
categorical, independent groups.
3. You should have independence of observations,
which means that there is no relationship between
the observations in each group or between the
groups themselves.
16
Assumptions of two-way anova
1. There should be no significant outliers.
2. Your DV should be approximately normally
distributed for each combination of the groups of the
two IVs
3. There needs to be homogeneity of variances for each
combination of the groups of the two independent
variables.
17