Inferential Statistics
Inferential Statistics
Descriptive Inferential
statistics statistics
Measure of
Measure of Hypothesis
central Estimation
dispersion testing
tendency
Mean,
Variance, SD, Point Confidence P-value
Median, Ho/Ha
IQR,etc. estimate interval calculation
Mode
Inferential Statistics
• Statistical method used to infer results of sample
(statistic) to population (parameter)
Population
Sample
Statistical conclusion
Notations of
parameter & statistic
Sample Population
Statistic Parameter
Sample size n N
Mean x µ
Standard deviation s σ
IMPORTANT CONCEPTS
IN INFERENTIAL STATISTICS
Important concepts
• Binomial distribution
• Poisson distribution
• Normal distribution
• Sampling distribution
1. Binomial distribution
2. Poisson distribution
Binomial distribution
• One of the most widely used probability
• Follow Bernoulli Process
• Each trial results in one of two possible mutually
exclusive outcomes. One denoted as success and
another is failure
• the probability of a success denoted by p remains
constant from trials to trials and probability of
failures , 1-p is denoted by q
• the trials are independent
Poisson distribution
• Usually used if the events under observation is
rarely happen
• E.g.
• Suicides cases
Poisson probability =
Normal distribution
Important characteristics:
• Symmetrical about its mean, µ
• Mirror image
• Mean=median=mode
• Total area under the curve above x axis =
1
• Area enclosed under the curve 68%,
95% and 99% by 1SD, 2SD and 3SD both
direction
• Changing µ shift the graph at x axis
• Changing σ influence the peak/flatness
of graph
Sampling distribution
• Suppose that we draw all possible samples of
size n from a given population, compute
a statistic (e.g., a mean, proportion, SD) for each
sample.
If the sample size is large, parametric statistics (e.g t-test or ANOVA) should
15
be used, because the averages used in the tests, are reasonably close to a
normal distribution.
Standard normal distribution
• Normal distribution with mean 0 and SD 1
• Random variable, z = (x - µ)/σ
Inferential Statistics
2 components of inferential statistics:
• Hypothesis testing
• Comparing two means
• Comparing two proportion
• Association/relationship between one variable and another
variable
Confidence Interval (C.I)
• One of the main objective of statistics is
estimation of population parameters based on
the information contained in the sample.
Definition
A null hypothesis is a claim (or statement) about
a population parameter that is assumed to be
true until it is declared false.
Definition
An alternative hypothesis is a claim about a
population parameter that will be true if the null
hypothesis is false.
P value
• Example:
• Our study has beta error of 20%
• It means that there is 20% chance that we will get no
significant association even though there is an
association in reality.
• In other words, there is 80% chance that we will get
significant association if there is an association in
reality.
• It means our study has power of 80%.
Two procedures to make tests of hypothesis
• Example:
• Is a new drug superior to standard drug?
• Does the air pollution level exceed safe limits?
• Example:
• Is there a difference between the BMI of men and women?
• Does the mean age of volunteers differ from that of general
population?
(α=0.05)
Z value 0 1.645
(α/2=0.025) (α/2=0.025)
- - One Sample t -