Power Analysis

Anne Segonds-Pichon
v2020-09
[Figure: experimental workflow diagram with the steps Question, Experimental design, Sample Size, Choice of statistical tests, Experiment, Data Collection/Storage, Data Exploration, Data Analysis, Results]
Sample Size: Power Analysis

• Definition of power: probability that a statistical test will reject a false null hypothesis (H0).
• Translation: the probability of detecting an effect, given that the effect is really there.

• In a nutshell: the bigger the experiment (big sample size), the bigger the power (more likely
to pick up a difference).
• Main output of a power analysis:
• Estimation of an appropriate sample size
• Too big: waste of resources,
• Too small: may miss the effect (p>0.05) + waste of resources,
• Grants: justification of sample size,
• Publications: reviewers ask for power calculation evidence,
• Home Office: the 3 Rs: Replacement, Reduction and Refinement.
What does Power look like?
What does Power look like? Null and alternative hypotheses
Control Treatment

• Probability that the observed result occurs if H0 is true

• H0 : Null hypothesis = absence of effect
• H1: Alternative hypothesis = presence of an effect
What does Power look like? Type I error α

• Type I error (α) is the rejection of a true H0

• Claiming an effect which is not there.
• p-value: probability of observing a statistic at least as extreme as the one obtained if H0 is true
• i.e. probability that a difference as big as the one observed could be found even if there is no effect.
• Statistical significance: comparison between α and the p-value
• p-value < 0.05: reject H0
• p-value > 0.05: fail to reject H0
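
The meaning of α can be checked by simulation. The sketch below is an illustration only: the group size (15), the number of simulated experiments (2000) and the seed are assumptions, not values from the slides. Both groups are drawn from the same distribution, so H0 is true and every significant result is a Type I error.

```r
# Simulate many experiments in which H0 is true (no real effect).
set.seed(42)          # arbitrary seed, for reproducibility only
n_sims <- 2000        # assumed number of simulated experiments
n_per_group <- 15     # assumed group size

p_values <- replicate(n_sims, {
  control   <- rnorm(n_per_group, mean = 0, sd = 1)
  treatment <- rnorm(n_per_group, mean = 0, sd = 1)  # same mean: no effect
  t.test(control, treatment, var.equal = TRUE)$p.value
})

# Proportion of false positives; expected to be close to alpha = 0.05
type1_rate <- mean(p_values < 0.05)
print(type1_rate)
```

The proportion should sit near 0.05 whatever the sample size, because α is fixed by the analyst rather than by the data.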
What does Power look like? Power and Type II error β

Area = 1

• Type II error (β) is the failure to reject a false H0

• Probability of missing an effect which is really there.
• Power: probability of detecting an effect which is really there.

• Direct relationship between Power and type II error:

• Power = 1 – β
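
Power and β can be estimated the same way, by simulating experiments in which the effect really exists. This is a minimal sketch under assumed settings (17 per group, a true difference of 1 standard deviation, 2000 repeats); none of these numbers come from the slides.

```r
# Simulate many experiments in which H0 is false (a real effect exists).
set.seed(1)           # arbitrary seed, for reproducibility only
n_sims <- 2000        # assumed number of simulated experiments
n_per_group <- 17     # assumed group size
true_diff <- 1        # assumed true difference, in standard deviation units

p_values <- replicate(n_sims, {
  control   <- rnorm(n_per_group, mean = 0, sd = 1)
  treatment <- rnorm(n_per_group, mean = true_diff, sd = 1)
  t.test(control, treatment, var.equal = TRUE)$p.value
})

power_hat <- mean(p_values < 0.05)  # effect detected
beta_hat  <- 1 - power_hat          # effect missed (Type II error)
print(c(power = power_hat, beta = beta_hat))
```

With these settings the estimated power should come out close to 80%, so the real effect is missed in roughly one experiment out of five.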
What does Power look like? Power = 80%

• General convention: 80% but could be more

• if Power = 0.8 then β = 1- Power = 0.2 (20%)

• Hence a true difference will be missed 20% of the time

• Jacob Cohen (1962):

• For most researchers: Type I errors are four times more serious than Type II errors so:
0.05 * 4 = 0.2
• Compromise: for 2-group comparisons, relative to 80% power:
• 90% power = about +30% sample size
• 95% power = about +60% sample size
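
The cost of higher power can be checked with power.t.test(). The sketch below assumes a standardised difference of 1 (delta = 1, sd = 1) purely for illustration; the helper n_for_power() is hypothetical.

```r
# Sample size per group for a two-sample, two-sided t-test at a given power.
# delta = 1 and sd = 1 are assumed values chosen for illustration.
n_for_power <- function(target_power) {
  power.t.test(delta = 1, sd = 1, sig.level = 0.05, power = target_power,
               type = "two.sample", alternative = "two.sided")$n
}

n80 <- n_for_power(0.80)
n90 <- n_for_power(0.90)
n95 <- n_for_power(0.95)

# Percentage increase in sample size relative to 80% power
increase_90 <- 100 * (n90 / n80 - 1)
increase_95 <- 100 * (n95 / n80 - 1)
print(round(c(increase_90 = increase_90, increase_95 = increase_95)))
```

The increases should land close to the +30% and +60% quoted above.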
The critical value
[Figure: six panels plotting a quantitative variable (axis 0 to 70) for Sample 1 and Sample 2, going from a small difference (not significant: p>0.05) to a big difference (significant: p<0.05)]

Critical value = size of difference + sample size + significance

What does Power look like? Example with the t-test
Example: 2-tailed t-test with n=15 (df=14)
[Figure: t distribution, t(14): central area 0.95 with 0.025 in each tail, critical values t=-2.1448 and t=2.1448]

• In hypothesis testing:
• test statistic is compared to the critical value to determine significance
• Example of test statistic: t-value

• If |test statistic| > critical value: statistical significance and rejection of the null hypothesis
• Example: |t-value| > critical t-value
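
The critical value in the example can be reproduced with qt(). In this sketch the observed t-value (2.5) is hypothetical, chosen only to show the comparison.

```r
# Critical t-value for a 2-tailed test at alpha = 0.05 with df = 14
alpha <- 0.05
df <- 14
t_crit <- qt(1 - alpha / 2, df = df)
print(round(t_crit, 4))  # 2.1448, as in the example above

# Compare a hypothetical observed t-value with the critical value
t_observed <- 2.5                            # assumed value, for illustration
significant <- abs(t_observed) > t_crit
p_value <- 2 * pt(-abs(t_observed), df = df) # matching two-sided p-value
print(c(significant = significant, p_value = p_value))
```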
To recapitulate:
• The null hypothesis (H0): H0 = no effect
• The aim of a statistical test is to decide whether or not to reject H0.

True state of H0 in columns:
Statistical decision | H0 True (no effect)              | H0 False (effect)
Reject H0            | Type I error α (False Positive)  | Correct (True Positive)
Do not reject H0     | Correct (True Negative)          | Type II error β (False Negative)

• High specificity = low False Positives = low Type I error

• High sensitivity = low False Negatives = low Type II error

https://github.com/allisonhorst/stats-illustrations#other-stats-artwork
Sample Size: Power Analysis

The power analysis depends on the relationship between 6 variables:

• the difference of biological interest
• the variability in the data (standard deviation)
  (the two items above together make up the effect size)
• the significance level (5%)
• the desired power of the experiment (80%)
• the sample size
• the alternative hypothesis (i.e. one or two-sided test)
The difference of biological interest
• This is to be determined scientifically, not statistically.
• minimum meaningful effect of biological relevance

• the larger the effect size, the smaller the experiment will need to be to detect it.
• How to determine it?
• Previous research, pilot study …

The Standard Deviation (SD)

• Variability of the data
• How to determine it?
• Data from previous research on WT or baseline …
The effect size: what is it?
• The effect size: Absolute difference + variability

• How to determine it?

• Substantive knowledge
• Previous research
• Conventions

• Jacob Cohen
• Defined small, medium and large effects for different tests
The effect size: how is it calculated?
The absolute difference
• It depends on the type of difference and the data
• Easy example: comparison between 2 means
Absolute difference

• The bigger the effect (the absolute difference), the bigger the power
= the bigger the probability of picking up the difference

http://rpsychologist.com/d3/cohend/
The effect size: how is it calculated?
The standard deviation
• The bigger the variability of the data, the smaller the power

[Figure: H0 and H1 distributions with the critical value marked]
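
Both components of the effect size can be explored with power.t.test(). The values below (10 per group, differences of 5 to 15, standard deviations of 5 to 15) are assumptions for illustration, and power_for() is a hypothetical helper.

```r
# Power of a two-sample, two-sided t-test for a given difference and SD.
power_for <- function(delta, sd, n = 10) {
  power.t.test(n = n, delta = delta, sd = sd, sig.level = 0.05,
               type = "two.sample", alternative = "two.sided")$power
}

# Bigger absolute difference (SD fixed at 10): power goes up
power_by_delta <- sapply(c(5, 10, 15), power_for, sd = 10)

# Bigger variability (difference fixed at 10): power goes down
power_by_sd <- sapply(c(5, 10, 15), function(s) power_for(delta = 10, sd = s))

print(round(power_by_delta, 2))
print(round(power_by_sd, 2))
```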
Power Analysis
The power analysis depends on the relationship between 6 variables:

• the difference of biological interest

• the standard deviation
• the significance level (5%) (p< 0.05): α
• the desired power of the experiment (80%): 1 – β
• the sample size
• the alternative hypothesis (i.e. one or two-sided test)
The sample size

• Most of the time, the output of a power calculation.

• The bigger the sample, the bigger the power

• but how does it work actually?

• In reality it is difficult to reduce the variability in data, or the contrast between means,
• most effective way of improving power:
• increase the sample size.
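
How power grows with the sample size can be tabulated directly. This sketch assumes a fixed standardised difference of 1 (delta = 1, sd = 1) and a few arbitrary group sizes.

```r
# Power of a two-sample, two-sided t-test for increasing group sizes.
# delta = 1 and sd = 1 are assumed values chosen for illustration.
sample_sizes <- c(5, 10, 20, 40)

power_by_n <- sapply(sample_sizes, function(n) {
  power.t.test(n = n, delta = 1, sd = 1, sig.level = 0.05,
               type = "two.sample", alternative = "two.sided")$power
})

print(data.frame(n_per_group = sample_sizes, power = round(power_by_n, 2)))
```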
The sample size
[Figure: from a population of a continuous variable, an 'infinite' number of samples is drawn; panels show the resulting sample means (x̄) for n=3 and n=30]
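
The reason a bigger sample gives more power is that sample means become less variable as n grows. The simulation below is a sketch under assumed settings (a standard normal population, 5000 samples per scenario); it compares the spread of the sample means for n=3 and n=30 with the theoretical value sd/sqrt(n).

```r
# Draw many samples from the same population and keep each sample mean.
set.seed(123)       # arbitrary seed, for reproducibility only
n_samples <- 5000   # assumed number of samples (stand-in for 'infinite')

means_n3  <- replicate(n_samples, mean(rnorm(3,  mean = 0, sd = 1)))
means_n30 <- replicate(n_samples, mean(rnorm(30, mean = 0, sd = 1)))

# Spread of the sample means: observed vs theoretical sd / sqrt(n)
sd_n3  <- sd(means_n3)
sd_n30 <- sd(means_n30)
print(c(observed_n3 = sd_n3,  expected_n3 = 1 / sqrt(3),
        observed_n30 = sd_n30, expected_n30 = 1 / sqrt(30)))
```

Sample means from n=30 cluster much more tightly around the population mean, so a given difference between two groups is easier to tell apart from sampling noise.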
The sample size: the bigger the better?

• It takes huge samples to detect tiny differences but tiny samples to detect huge differences.

• What if the tiny difference is meaningless?

• Beware of overpower
• Nothing wrong with the stats: it is all about
interpretation of the results of the test.

• Remember the important first step of power analysis

• What is the effect size of biological interest?
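
The contrast between tiny and huge differences can be put into numbers. In this sketch the two standardised differences (0.05 and 2 standard deviations) are assumed values chosen to represent a "tiny" and a "huge" effect.

```r
# Sample size per group needed for 80% power (two-sample, two-sided t-test)
n_tiny <- power.t.test(delta = 0.05, sd = 1, sig.level = 0.05, power = 0.8,
                       type = "two.sample", alternative = "two.sided")$n
n_huge <- power.t.test(delta = 2, sd = 1, sig.level = 0.05, power = 0.8,
                       type = "two.sample", alternative = "two.sided")$n

print(ceiling(c(tiny_difference = n_tiny, huge_difference = n_huge)))
```

Thousands of subjects per group are needed for the tiny difference, against a handful for the huge one, which is why the effect size of biological interest has to be decided first.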
Power Analysis
The power analysis depends on the relationship between 6 variables:

• the effect size of biological interest

• the standard deviation
• the significance level (5%)
• the desired power of the experiment (80%)
• the sample size
• the alternative hypothesis (i.e. one or two-sided test)
The alternative hypothesis: what is it?
• One-tailed or 2-tailed test? One-sided or 2-sided test?

• Is the question:
• Is there a difference? (two-sided)
• Is it bigger than or smaller than? (one-sided)

• Can rarely justify the use of a one-tailed test

• Two times easier to reach significance with a one-tailed than a two-tailed test: for an effect in the predicted direction, the one-tailed p-value is half the two-tailed one
• Suspicious reviewer!
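
The factor of two between one-tailed and two-tailed tests is easy to verify. The observed t-value (1.9) and the degrees of freedom (14) below are hypothetical, picked so that the two tests disagree.

```r
# Same observed statistic, two different alternative hypotheses
t_observed <- 1.9   # assumed t-value, in the predicted direction
df <- 14            # assumed degrees of freedom

p_two_sided <- 2 * pt(-abs(t_observed), df = df)
p_one_sided <- pt(t_observed, df = df, lower.tail = FALSE)

print(c(two_sided = p_two_sided, one_sided = p_one_sided))
```

Here the one-tailed test is significant at the 5% level while the two-tailed test is not, which is exactly what makes reviewers suspicious.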
• Fix any five of the variables and a mathematical relationship can be used
to estimate the sixth.
e.g. What sample size do I need to have an 80% probability (power) to detect this particular
effect (difference and standard deviation) at a 5% significance level using a 2-sided test?
• Good news:
there are packages that can do the power analysis for you ... providing you have some prior
knowledge of the key parameters!
difference + standard deviation = effect size
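
In R, leaving one argument out of power.t.test() makes it the quantity to be estimated. The sketch below uses assumed values (delta = 1, sd = 1) to show the same relationship solved in two directions: first for the sample size, then for the power.

```r
# Direction 1: fix effect, alpha, power and sidedness; estimate the sample size
res_n <- power.t.test(delta = 1, sd = 1, sig.level = 0.05, power = 0.8,
                      type = "two.sample", alternative = "two.sided")
n_needed <- ceiling(res_n$n)   # round up to whole subjects per group
print(n_needed)

# Direction 2: fix the sample size instead; estimate the power
res_power <- power.t.test(n = n_needed, delta = 1, sd = 1, sig.level = 0.05,
                          type = "two.sample", alternative = "two.sided")
achieved_power <- res_power$power
print(round(achieved_power, 3))
```

Because the sample size is rounded up, the achieved power is slightly above the 80% target.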

• Free packages:
• R
• G*Power
• InVivoStat

• Cheap package: StatMate (~ $95)

• Not so cheap package: MedCalc (~ $495)

Power Analysis
Let’s do it

• Examples of power calculations:

• Comparing 2 proportions: Exercise 1

• Comparing 2 means: Exercise 2

Exercises 1 and 2

• Use the functions below to answer the exercises

• Clue: exactly one of the parameters must be passed as NULL, and that parameter is determined from the others.

• Use R Help to find out how to use the functions

• e.g. ?power.prop.test in the console

Exercise 1
power.prop.test(n=NULL, p1=NULL, p2=NULL,
sig.level=0.05, power=NULL, alternative=c("two.sided", "one.sided"))

Exercise 2
power.t.test(n=NULL, delta=NULL, sd=1, sig.level=0.05, power=NULL,
type=c("two.sample", "one.sample", "paired"),
alternative=c("two.sided", "one.sided"))
Exercise 1:
• Scientists have come up with a solution that will reduce the number of lions being shot by farmers in Africa:
painting eyes on cows’ bottoms.
• Early trials suggest that lions are less likely to attack livestock when they think they’re being watched
• Fewer livestock attacks could help farmers and lions co-exist more peacefully.
• Pilot study over 6 weeks:
• 3 out of 39 unpainted cows were killed by lions, none of the 23 painted cows from the same herd were killed.

• Questions:
• Do you think the observed effect is meaningful to the extent that such a ‘treatment’ should be applied?
Consider ethics, economics, conservation …
• Run a power calculation to find out how many cows should be included in the study.
• Clue 1: power.prop.test()
• Clue 2: exactly one of the parameters must be passed as NULL, and that parameter is determined from the others.

http://www.sciencealert.com/scientists-are-painting-eyes-on-cows-butts-to-stop-lions-getting-shot
Exercise 1: Answer
• Scientists have come up with a solution that will reduce the number of lions being shot by farmers in Africa:
• Painting eyes on the butts of cows
• Early trials suggest that lions are less likely to attack livestock when they think they’re being watched
• Fewer livestock attacks could help farmers and lions co-exist more peacefully.

• Pilot study over 6 weeks:

• 3 out of 39 unpainted cows were killed by lions, none of the 23 painted cows from the same herd were killed.

power.prop.test(p1 = 3/39, p2 = 0, sig.level = 0.05, power = 0.8, alternative="two.sided")
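
A short sketch of how the result of this call can be read: the returned n is the number of cows per group and is rounded up to whole animals. The variable names are illustrative only.

```r
# Pilot proportions: 3/39 unpainted cows killed, 0/23 painted cows killed
cow_power <- power.prop.test(p1 = 3/39, p2 = 0, sig.level = 0.05,
                             power = 0.8, alternative = "two.sided")

n_per_group <- ceiling(cow_power$n)  # cows needed in each group
total_cows  <- 2 * n_per_group       # balanced design: two equal groups
print(c(per_group = n_per_group, total = total_cows))
```

This gives 97 cows per group, 194 in total for a balanced design.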

Exercise 2:
• Pilot study: 10 arachnophobes were asked to perform 2 tasks:
Task 1: Group1 (n=5): to play with a big hairy tarantula spider with big fangs and an evil look in its eight eyes.
Task 2: Group 2 (n=5): to look at pictures of the same hairy tarantula.
• Anxiety scores were measured for each group (0 to 100).

• Use R to calculate the values for a power calculation

• Get the data in R (spider.csv)
• Hint: you can use group_by()and summarise()
• Or you can do it in Excel!
• Run a power calculation (assume balanced design and parametric test)
• Clue 1: power.t.test()
• Clue 2: choose the sd that makes more sense.
Exercise 2: Answer
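library(tidyverse)  # provides %>%, read_csv(), group_by() and summarise()
spider.data <- read_csv("spider.csv")  # file named in the exercise

# Mean and standard deviation of the anxiety scores in each group
spider.data %>%
  group_by(Group) %>%
  summarise(mean=mean(Scores), sd=sd(Scores))

# Sample size per group for 80% power at the 5% significance level
power.t.test(delta = 52 - 39, sd = 9.75, sig.level = 0.05, power = 0.8,
             type = "two.sample", alternative = "two.sided")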

• With 80% power and a 5% significance level, and providing the preliminary results are to be trusted,
we need about 10 arachnophobes in each group to detect the difference between the 2 groups with a t-test.
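
The figure of about 10 per group can be recovered from the output of power.t.test(). This sketch reuses the means (52 and 39) and standard deviation (9.75) from the answer above; the variable names and the standardised effect size line are additions for illustration.

```r
# Difference between group means and chosen SD, taken from the answer above
spider_power <- power.t.test(delta = 52 - 39, sd = 9.75, sig.level = 0.05,
                             power = 0.8, type = "two.sample",
                             alternative = "two.sided")

n_per_group <- ceiling(spider_power$n)  # round up to whole participants
effect_size <- (52 - 39) / 9.75         # difference expressed in SD units
print(c(n_per_group = n_per_group, effect_size = round(effect_size, 2)))
```

The difference is about 1.3 standard deviations, a large effect, which is why such a small sample is enough.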
Unequal sample sizes
• Scientists often deal with unequal sample sizes
• No simple trade-off:
• if one needs 2 groups of 30, going for 20 and 40 will be associated with decreased power.
• Unbalanced design = bigger total sample
• Solution:
Step 1: power calculation for equal sample size
Step 2: adjustment
• Cow example: balanced design: n = 97 per group
but this time: unpainted group: 2 times bigger than painted one (k=2):
• Using the formula, we get a total:
$N = \frac{2n(1+k)^2}{4k} = \frac{2 \times 97 \times (1+2)^2}{4 \times 2} = 218.25$, rounded up to 219
Painted butts (n1)=73 Unpainted butts (n2)=146

• Balanced design: n = 2*97 = 194

• Unbalanced design: n = 73+146 = 219
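
The adjustment for unequal group sizes can be wrapped in a small function. This is a sketch: unbalanced_sizes() is a hypothetical helper that applies the total N = 2n(1+k)^2/(4k), with n the per-group size from the balanced calculation and k the ratio between the two groups, and then splits the total in the ratio 1:k.

```r
# Adjust a balanced per-group sample size (n_balanced) for a design in
# which one group is k times bigger than the other.
unbalanced_sizes <- function(n_balanced, k) {
  total   <- ceiling(2 * n_balanced * (1 + k)^2 / (4 * k))
  n_small <- ceiling(total / (1 + k))  # smaller group
  n_large <- k * n_small               # larger group, k times bigger
  c(n_small = n_small, n_large = n_large, total = n_small + n_large)
}

# Cow example: 97 per group when balanced, unpainted group twice as big
cows <- unbalanced_sizes(n_balanced = 97, k = 2)
print(cows)
```

With k = 1 the function returns the balanced design unchanged (97, 97, 194).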
Non-parametric tests

• Non-parametric tests: do not assume data come from a Gaussian distribution.

• Non-parametric tests are based on ranking values from low to high
• Non-parametric tests are almost always less powerful

• Proper power calculation for non-parametric tests:

• Need to specify which kind of distribution we are dealing with
• Not always easy

• Non-parametric tests never require more than 15% additional subjects providing that the
distribution is not too unusual.

• Very crude rule of thumb for non-parametric tests:

• Compute the sample size required for a parametric test and add 15%.
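
The rule of thumb is a one-line calculation. In this sketch nonparametric_n() is a hypothetical helper; the inputs are the per-group sizes obtained in Exercise 2 (10) and Exercise 1 (97).

```r
# Very crude rule of thumb: parametric sample size plus 15%, rounded up
nonparametric_n <- function(n_parametric, inflation = 0.15) {
  ceiling(n_parametric * (1 + inflation))
}

n_spiders <- nonparametric_n(10)  # Exercise 2: 10 per group for a t-test
n_cows    <- nonparametric_n(97)  # Exercise 1: 97 per group
print(c(spiders = n_spiders, cows = n_cows))
```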
Sample Size: Power Analysis

• What happens if we ignore the power of a test?

• Misinterpretation of the results

• p-values: never ever interpreted without context:

• Significant p-value (<0.05): exciting! Wait: what is the difference?
• >= smallest meaningful difference: exciting
• < smallest meaningful difference: not exciting
• very big sample, too much power

• Not significant p-value (>0.05): no effect! Wait: how big was the sample?
• Big enough = enough power: no effect means no effect
• Not big enough = not enough power
• Possible meaningful difference but we miss it
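
Whether a sample was "big enough" can be checked by estimating the power it provides. As a sketch, the call below reuses the spider pilot study from Exercise 2 (5 per group, means of 52 and 39, standard deviation 9.75) and asks how much power that design has.

```r
# Power of the pilot design: 5 arachnophobes per group
pilot_power <- power.t.test(n = 5, delta = 52 - 39, sd = 9.75,
                            sig.level = 0.05, type = "two.sample",
                            alternative = "two.sided")$power
print(round(pilot_power, 2))
```

The power is well below 80%, so a non-significant result from a study of this size would not be evidence that there is no effect.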
