What Is Inferential Statistics
What Is Inferential Statistics
Inferential statistics is mainly used to derive estimates about a large group (or
population) and draw conclusions on the data based on hypotheses testing
methods.
Inferential statistics uses sample data because it is more cost-effective and less
tedious than collecting data from an entire population. It allows one to come
to reasonable assumptions about the larger population based on a sample’s
characteristics. Sampling methods need to be unbiased and random for
statistical conclusions and inferHypothesis Testing
Hypothesis testing falls under the “statistical tests” category. Statistical tests
account for sampling errors and can either be parametric (includes
assumptions made regarding population distribution parameters) or non-
parametric (does not include assumptions made regarding population
distribution parameters).
Parametric tests tend to be more trusted and reliable because they enable the
detection of potential effects. Parametric tests assume that the population
from which sample data is derived is normally distributed. The sample size
provides an adequate representation of the population from which it was
derived. The groups, variances, and measures of spread are comparable.
ences to be validated.
Summary
Hypothesis testing is a form of inferential statistics that allows us to draw conclusions about an
entire population based on a representative sample. You gain tremendous benefits by working
with a sample. In most cases, it is simply impossible to observe the entire population to
understand its properties. The only alternative is to collect a random sample and then use
statistics to analyze it.
Use a hypothesis test to help determine whether the differences between these means are random
error or a real effect.
Hypothesis Testing
Hypothesis testing is a statistical analysis that uses sample data to assess two mutually
exclusive theories about the properties of a population. Statisticians call these theories
the null hypothesis and the alternative hypothesis. A hypothesis test assesses your
sample statistic and factors in an estimate of the sample error to determine which
hypothesis the data support.
Hypothesis testing is a formal procedure for investigating our ideas about the world
using statistics. It is most often used by scientists to test specific predictions, called
hypotheses, that arise from theories.
Though the specific details might vary, the procedure you will use when testing a
hypothesis will always follow some version of these steps.
Table of contents
1.
2.
3.
4.
5.
Step 1: State your null and alternate hypothesis
After developing your initial research hypothesis (the prediction that you want to
investigate), it is important to restate it as a null (H o) and alternate (Ha) hypothesis so
that you can test it mathematically.
You want to test whether there is a relationship between gender and height. Based on
your knowledge of human physiology, you formulate a hypothesis that men are, on
average, taller than women. To test this hypothesis, you restate it as:
To test differences in average height between men and women, your sample should
have an equal proportion of men and women, and cover a variety of socio-economic
classes and any other variables that might influence average height.
You should also consider your scope (Worldwide? For one country?) A potential data
source in this case might be census data, since it includes data from a variety of regions
and social classes and is available for many countries around the world.
Based on the type of data you collected, you perform a one-tailed t-test to test whether
men are in fact taller than women. This test gives you:
Your t-test shows an average height of 175.4 cm for men and an average height of
161.7 cm for women, with an estimate of the true difference ranging from 10.2cm to
infinity. The p-value is 0.002.
In most cases you will use the p-value generated by your statistical test to guide your
decision. And in most cases, your cutoff for refuting the null hypothesis will be 0.05 –
that is, when there is a less than 5% chance that you would see these results if the null
hypothesis were true.
In your analysis of the difference in average height between men and women, you find that
the p-value of 0.002 is below your cutoff of 0.05, so you decide to reject your null hypothesis of
no difference.
In the results section you should give a brief summary of the data and a summary of the
results of your statistical test (for example, the estimated difference between group
means and associated p-value). In the discussion, you can discuss whether your initial
hypothesis was supported or refuted.
In the formal language of hypothesis testing, we talk about refuting or accepting the null
hypothesis. You will probably be asked to do this in your statistics assignments.
If your null hypothesis was refuted, this result is interpreted as being consistent with
your alternate hypothesis.
You might notice that we don’t say that we accept or reject the alternate
hypothesis. This is because hypothesis testing is not designed to prove or disprove
anything. It is only designed to test whether a pattern we measure could have arisen by
chance.
If we reject the null hypothesis based on our research (i.e., we find that it is unlikely that
the pattern arose by chance), then we can say our test lends support to our
hypothesis. But if the pattern does not pass our decision rule, meaning that it could
have arisen by chance, then we say the test is inconsistent with our hypothesis.