0% found this document useful (0 votes)
188 views15 pages

Hypothesis Testing 1 PDF

- Hypothesis testing involves forming a null hypothesis (H0) and alternative hypothesis (HA) based on a question with two possible answers. A test is conducted to collect data to either reject or fail to reject the null hypothesis. - When testing a population proportion, the sample proportion is used as the test statistic. The p-value is calculated as the probability of obtaining a test statistic as extreme or more extreme than what was observed, assuming the null hypothesis is true. - For a one-sided test, the p-value is the probability of observing a sample proportion in the tail specified by the alternative hypothesis. For a two-sided test, the p-value is twice the probability of observing
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
188 views15 pages

Hypothesis Testing 1 PDF

- Hypothesis testing involves forming a null hypothesis (H0) and alternative hypothesis (HA) based on a question with two possible answers. A test is conducted to collect data to either reject or fail to reject the null hypothesis. - When testing a population proportion, the sample proportion is used as the test statistic. The p-value is calculated as the probability of obtaining a test statistic as extreme or more extreme than what was observed, assuming the null hypothesis is true. - For a one-sided test, the p-value is the probability of observing a sample proportion in the tail specified by the alternative hypothesis. For a two-sided test, the p-value is twice the probability of observing
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Hypothesis Testing

The idea of hypothesis testing is:


● Ask a question with two possible answers
● Design a test, or calculation of data
● Base the decision (answer) on the test

Example: In 2010, 24% of children were


dressed as Justin Bieber for Halloween. We
want to test whether or not this proportion
increased in 2011.
Constructing a Hypothesis Test
● Define your Null and Alternative Hypotheses
● H0 (pronounced “H naught”) is the null
hypothesis. This is typically the default
assumption – assuming no change, or that a
new drug is no better than a placebo.
● HA is the alternative hypothesis. This is
typically your hunch, that there has been a
change, or that the drug works better than the
placebo, or a claim that you are trying to
debunk.
Example: Justin Bieber Costumes
H0: In 2011, 24% of Halloween costumes were
Bieber costumes. (p=.24)
HA: In 2011, the proportion of Bieber costumes
was greater than 24%; (p>.24)
Alternative Hypothesis
● The Alternative Hypothesis is chosen to match
a claim that is being tested, or something you
hope is true.
● Say we are testing for a proportion p. Null
Hypothesis is p=p0.
● If the Alternative Hypothesis is p>p 0, or if it is
p<p0, these are examples of a one-sided test.
● If the alternative is p≠p0, this is called a two-
sided test.
Calculate a Test Statistic
● For a hypothesis test about population
proportion, sample proportion is a good test
statistic (if the conditions of the CLT are met,
we can use the normal distribution)

Example: We randomly poll 1000 children who


dressed up for Halloween in 2011. 254 of them
dressed up as Justin Bieber, so our sample
proportion is .254
P-Value
● For a hypothesis test of a proportion, we use a P-
Value. This is the conditional probability of the tails
assuming H0 is true.
● The smaller the P-value, the more strong the
evidence in favor of our alternative Hypothesis.
● If the P-Value is less than or equal to a certain
predefined threshold (the significance level), we
will reject the null Hypothesis.
Calculating P-Values (1-sided tests)
● p^ is normal with μ=p0, σ=√(p0(1-p0)/n)
● Calculate x/n, the proportion from our observed
sample.
● Case1: H0: p=p0 vs. HA: p>p0
^
P-Value= P[p>x/n | p=p0] =normalcdf(x/n,1,μ,σ)
● Case2: H0: p=p0 vs. HA: p<p0
^
P-Value= P[p<x/n | p=p0] =normalcdf(0,x/n,μ,σ)
Example: Calculate P-Value
We are using a 5% significance level.
● ^ our sample proportion, is
If we assume H0 true, then p,
normal with μ=.24 and σ=√(.24*.76/1000)=.0135
● ^
P[p>.254|H ] =normalcdf(.254,1,.24,.0134)
0
=.1481
● This is not less than our .05 significance level, so we
do not reject the null hypothesis
The evidence is not strong enough to support the
claim that Justin Bieber costumes were more
popular in 2011 than in 2010.
Visual Hypothesis Test
H0: p=p0 vs. HA: p>p0 5% significance level
By CLT, p^ is normal with μ=p0, σ=√(p0q0/n)
(Assuming H0 is true)

Calculate x/n, our observed sample proportion


Distribution of p^

95%

5%

p0

If x/n is here, do not reject H0 If x/n is here, we


WILL reject H0
Visual Hypothesis Test
H0: p=p0 vs. HA: p<p0 5% significance level
By CLT, p^ is normal with μ=p0, σ=√(p0q0/n)
(Assuming H0 is true)

Calculate x/n, our observed sample proportion


Distribution of p^

95%

5%

p0

If x/n is here, we If x/n is here, do not reject H0


WILL reject H0
2-Sided Test
● A Two tailed test looks like this:
H0: p=p0 vs. HA:p≠p0, 5% significance level
Distribution of ^p (Assuming H0 is true)

95%
2.5% 2.5%

p0

If x/n is here, we If x/n is here, If x/n is here, we


WILL reject H0 do not reject H0 WILL reject H0
Calculating P-Values (2-sided tests)
● p^ is normal with μ=p0, σ=√(p0q0/n)
● Calculate x/n, the proportion from our observed
sample.
H0: p=p0 vs. HA:p≠p0 x x
−μ − p0
n n
use standardized z value z= σ
=

p0 q 0
n


^
P-Value= 2P[Z>|z| |p=p ] =2*normalcdf(|z|,6)
0

● Notice we use |z| - the absolute value of z


Example- 2-sided test
In 1996, 25% of students who had perfect attendance one
month would also have perfect attendance the following
month. In 2000, the school wants to see if the proportion has
changed. The proportion of a random sample of 6543
students is 23.4%. With a 5% significance level, should the
school conclude that there has been a change?
● H0: proportion is still 25%, HA: proportion is not 25%
● Under the null hypothesis, p^ is normal with mean 0.25 and
s.d √(.25*.75/6453)=.00535
● Standardized proportion is z=(.234-.25)/.00535 =-2.9907
● P value is 2*P[Z>2.9907] = 2*normalcdf(2.9907,6)=.0028
● This is less than .05, so we reject the null hypothesis;
there has been a change in attendance.
Test using Z-Statistic
You can calculate the P-Value using a z-score:
Assume null hypothesis: μ=p0, σ=√(p0q0/n)
X/n is our sample proportion. z=(x-μ)/σ
The P-value depends on the form of HA:
● HA:p>p0 → P-Value = P(Z>z)=normalcdf(z,6)
● HA:p<p0 → P-Value = P(Z<z)=normalcdf(-6,z)
● HA:p≠p0 → P-Value = 2P(Z>|z|)=2*normalcdf(|z|,6)
Remember, for 2-tailed test use |z| (absolute value)
One-Sided Test using Z-statistic
A Magazine wants to launch an online version, but only if
more than 20% of its subscribers would subscribe to it. A
random survey of 400 subscribers indicated that 90 would be
interested.
● H0: p=20%, HA: p>20%, assume 5% significance level
● Sample proportion is 90/400=.225
● Under the null hypothesis, p^ is normal with mean 0.20 and
s.d √(.20*.80/400)=.02
● Standardized proportion is z=(.225-.20)/.02 =1.25
● P value is P[Z>1.25] = normalcdf(1.25,6)=.10565
● This is higher than our .05 significance level; evidence is
not strong enough to reject the null hypothesis. The
magazine should not launch the online version.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy