PharmaSUG 2012 SP04
PharmaSUG 2012 SP04
ABSTRACT
Drug development is a long and expensive process where each step should be carefully planned. During design
stage of a clinical trial, the sample size calculation has to be performed based on a primary objective of the trial and
target to achieve the desired power for detecting a clinically meaningful difference between test drug and
standard/control at a fixed Type I Error rate. However, information about test drug in pilot studies is limited and most
of the times statisticians do “the best guess” about effect size of a new drug that leads to wrong sample size
calculation and then to failure of the trail.
Meta-analysis can help. Combining all existing information about the test drug, meta-analyses intent to give better
estimation of effect size for a new drug that determines required sample. In current years, one of the obstacles was
that meta-analysis is not an easy task and special software is usually required to perform it.
The goal of this paper is to demonstrate that concept of meta-analyses is apprehensible, and SAS® software can be
used to perform meta-analysis on regular basis. User-friendly SAS macro calculates the effect size and determines
what sample size is needed to reach the goal of clinical trial. For visual presentation of the result, macro generates
accompanying forest plots of effect sizes and plot of anticipated sample size. For the sake of validation, the results
from meta-analysis were compared with analysis of subject-level (pooled) data. The conclusions from two
approaches came out the same. It demonstrates the validity and strength of meta-analysis.
This paper suggests that when data sets have been accumulated with ongoing research, the effect size calculated in
meta-analysis can be treated as “best evidence” and should be taking in consideration while designing the next
clinical trial that will lead to successful NDA submission.
INTRODUCTION
“Five clinical trials were conducted in my company for a new drug with one success and four failures…” sounds
familiar for everyone who worked long enough in Pharmaceutical industry. “Hmm…” is your answer: “There is
something wrong with expectations from a new drug, and sample size was probably too small. Why didn’t you check
the effect size of your company’s new drug while accumulating data from study to study?” Let’s do it. It is never too
late.
Effect size is measured as standardized difference between two groups. Along with Confidence Limit Intervals (CLI),
it gives information how different two samples are. In other words, effect size estimates the magnitude of treatment
effect for a new drug. Effect size should be used to calculate sample size required to meet a p-value with level of
significance α.
For this paper, data for ten studies were simulated using RAND function and normal distribution. Assumption was
made that there are equal variances in the groups. Analysis for each study was done by PROC MIXED. Meta-
analysis, and individual data analysis was performed for overall effect size calculations. The results from two
approaches were compared. Sample size was proposed for a new clinical trial using estimated effect size of a new
treatment. SAS® 9.2 was used for analyses.
DESCRIPTION
Assume that company is concentrated on developing a drug that reduces pain. Pain is measured in a continuous
scale 0-100 mm. The more drop in score from baseline, the better is the treatment. The score assessment was
performed at visits 1,2,3,4, and 5(end of study). The primary objective is to demonstrate the difference in the change
from baseline between Active treatment and Placebo.
Ten studies have been conducted with different sample sizes. For meta-analysis, difference in means, standard
deviation and 95% confidence limit intervals should be already in the final clinical study reports of the completed
studies. If full data sets are available, the results of studies should be reproducible.
1
APPLICATION OF META-ANALYSIS IN CLINICAL TRIALS, CONTINUED
SAS language is used in the text instead of mathematical formulas to be able to use highlighted lines right away into
the program.
Cohen’s d
SIGMA_pooled=Stderr_Diff/sqrt((1/N_Placebo)+(1/N_Trt));
(d) Effect_Size = Means_Diff/ SIGMA_pooled;
Hedges’ g
sp=sqrt(((Stderr_Trt**2)*(N_Trt-1)+(Stderr_Placebo**2)*(N_Placebo-1))/(N_Trt +
N_Placebo-2));
(g) Effectsize_g=Mean_Diff/sp;
Stderr_g=sqrt((N_Placebo+N_Trt)/(N_Placebo*N_Trt) +Effectsize_g^2/(2*(N_Placebo +
N_Trt)));
Glass’ delta
(∆) Effectsize_delta = Means_Diff/ Stderr_Placebo;
Stderr_delta=sqrt((N_Placebo+N_Trt)/(N_Placebo*N_Trt) +
Effectsize_delta^2/(2*(N_Placebo -1)));
Fixed-effect model
The weight of the individual study results is proportional to the square of standard errors:
Wi=1/(stderr_effectsize^2);
2
APPLICATION OF META-ANALYSIS IN CLINICAL TRIALS, CONTINUED
Q-statistic has chi-square distribution with k-1 degree of freedom (k is the number of effect sizes in the sample). High
values of Q (above the critical point for a given significance level α) enable us to reject the null hypothesis about
homogeneity and conclude that there is statistically significant variation between studies.
If the value of Q is unusually large then there is substantial heterogeneity, because there is more variation among the
studies than expected just by looking at the individual standard errors. If there is no heterogeneity, then Q should be
approximately equal to k-1.
Knowing Q, the estimator of between-study variance tau can be calculated using Method of Moments. [4]
If Q > k-1 then tau^2= (Q-(k-1)) / C;
If Q ≤ k-1 then tau^2= 0 ;
Where C = sum (Wi) – (sum (Wi^2))/ (sum(Wi));
The extent of heterogeneity can be estimated by I^2 index by comparing the Q value with its expected value
assuming homogeneity. It measures proportion of inconsistency in individual studies, and values close to 100% imply
very high degrees of heterogeneity. I^2 equal to 25%, 50%, and 75% represent low, moderate, and high
heterogeneity accordingly.
If Q > k-1 then I^2=100 * (Q-(k-1)) /Q;
If Q ≤ k-1 then I^2= 0;
Random-effect model
Random effects model can be considered as the case of fixed effects model with addition of between-studies
variability. The weights will be calculated by adding tau^2 in the formula.
Wi_random=1/( tau^2 + stderr_effectsize^2 );
Confidence intervals for random effects models are usually wider than a fixed effects model. It happens because the
estimated study heterogeneity adds uncertainty to the confidence interval calculations.
Sample Size
The sample size required for calculated Effect size in units and 2-tail p-value with level of significance α will be
calculated by formula: N= (Z * GroupSD/Effect_size)^2 ; where Z value=Mean_Diff/GroupSE.
EXAMPLE
In this paper example, our new drug is supposed to reduce a headache. Subjects complain about the pain and report
its level (scale: 0 to 100) every day in the diary. Study drug is compared to placebo at visit 1, 2, 3, 4, and 5(the end of
clinical trial). The change from baseline in the level of pain demonstrates that study drug had significantly high (let’s
hope) reduction in pain compared to placebo subjects at the end of study (Visit 5) at level of significance α=0.05. LS
Means were used in this example, because of the adjustment for baseline in the model. Data for 10 studies were
simulated where each study has two arms with different number of subjects.
Study N0 (placebo) N1 (Active) Total
1 25 27 52
2 50 53 103
3 75 79 154
4 100 105 205
5 125 131 256
6 150 157 307
7 175 183 358
8 200 209 409
9 225 235 460
10 259 261 511
3
APPLICATION OF META-ANALYSIS IN CLINICAL TRIALS, CONTINUED
1) Difference in LS Means (95% CI) is calculated for each study. The results are summarized in Table 1
Table 1: LS Means* difference along with 95% CL Intervals between Active and Placebo
STD 95% CL
STD LS STD ERR interval
N LS Mean ERR N Mean ERR LS Means Diff t- Lower
Study Visit Placebo Placebo Placebo Active Active Active Difference Value Upper
1 5 25 -0.44 5.24 27 6.85 5.04 7.29 7.27 -1.00 -7.02 21.61
2 5 50 9.83 3.56 53 10.97 3.46 1.15 4.96 -0.23 -8.60 10.90
3 5 75 3.26 3.09 79 8.87 3.01 5.61 4.32 -1.30 -2.86 14.08
4 5 100 0.39 2.47 105 6.94 2.41 6.55 3.45 -1.90 -0.22 13.31
5 5 125 4.21 2.14 131 6.95 2.09 2.74 2.99 -0.92 -3.13 8.61
6 5 150 0.40 2.04 157 3.07 2.00 2.67 2.85 -0.94 -2.93 8.27
7 5 175 0.11 1.86 183 12.18 1.81 12.07 2.59 -4.65 6.98 17.16
8 5 200 4.38 1.76 209 8.10 1.72 3.72 2.46 -1.51 -1.10 8.55
9 5 225 2.45 3.04 235 9.14 3.02 6.69 2.34 -2.86 2.10 11.27
10 5 250 1.79 1.60 261 9.84 1.57 8.06 2.24 -3.60 3.66 12.45
*Adjusted for baseline in the model
Any effect size can be calculated using formulas from the description section. This example use Cohen’s effect size.
Table 2 presents Cohen’s effect sizes together with 95% CL intervals.
Figure 1 depicts LS Mean Difference together with 95% CL intervals for all 10 studies using original units. It can be
seen, that there is a difference between active treatment and placebo to reduce pain in favor of a new drug; but most
studies (#1-6, and 8) didn’t demonstrate the significant reduction.
4
APPLICATION OF META-ANALYSIS IN CLINICAL TRIALS, CONTINUED
Lower Upper
Effect Size Std Err Bound Bound
Θo Θo Θo Θo Effect Std Err Lower Upper
(original (original (original (original Size Θ Bound Bound
Visit units) units) units) units) Θ Θ Θ
3) Test for Homogeneity generates Q and I^2 statistics, as well as Tau^2 for LS means differences and
Cohen’s effect sizes (d). Q-statistic values that close to k-1 suggest homogeneity across studies (Table 4).
I2 is equals to 26% that indicates the low level of heterogeneity among the studies.
4) Knowing Tau^2, the overall effect size for LS means difference and Cohen’s effect size (d), the results from
random-effects model can be easily calculated (Table 5). Since there is homogeneity across studies, overall
effect sizes for fixed and random models are very close.
5
APPLICATION OF META-ANALYSIS IN CLINICAL TRIALS, CONTINUED
Table 6: Individual Data Analysis: LS Mean Difference along with 95% CL interval
(Random Model)
Overall effect size from random-effects model was used as difference in means equals to 6.00. The group
sample size of 550 ( 275 per arm) will achieve 80% power to detect means difference of 6.0 between the two
groups with known standard deviations of 25.0 for both groups, and with a significance level (alpha) of 0.05 using
a two-sided two-sample t-test (Figure 2: Sample Size and Power).
This can explain why studies #1-6 and 8 were not successful. Their sample sizes were too small to detect small
effect size of a new drug.
VALIDATION
Validation of the macro was performed by comparing the results using validated Comprehensive Meta-Analysis
software (Copyright ©2006 Biostat, Inc.) on the same data sets. Results matched.
6
APPLICATION OF META-ANALYSIS IN CLINICAL TRIALS, CONTINUED
LIMITATION
Macro was written for comparing means of two samples. Another step should be taken for other choices.
CONCLUSION
It is very important to estimate the effect size of a new drug correctly. When data sets have been accumulated with
ongoing research, meta-analysis can be very useful. The effect size calculated in meta-analysis can be treated as
“best evidence” and should be taking in consideration while designing next clinical trial and performing a sample size
calculation.
Example in this paper demonstrated that results from meta-analysis are very similar to individual data analysis.
Individual data analysis requires more time and resources to pool data together from various studies. Moreover, very
often subject-level data is not available at all. Meta-Analysis can be performed with less effort using SAS® and help
drug companies with proper effect size estimation that leads to successful NDA submission.
ACKNOLEDGEMENT
I want to thank my supervisor Sailaja Bhaskar, PhD for motivation and valuable support.
SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS
Institute Inc. in the USA and other countries. ® indicates USA registration.
Other brand and product names are trademarks of their respective companies.
REFERENCES
1. “Computing Effect Size Measures with ViSta ‐ The Visual Statistics System” Rubén Daniel Ledesma,
Guillermo Macbeth, Nuria Cortada de Kohan. Tutorials in Quantitative Methods for Psychology 2009, Vol.
5(1), p. 25‐34.
2. “Evidence-based sample size calculations based upon updated meta-analysis” Alexander J. Sutton, Nicola
J. Cooper, David R. Jones, Paul C. Lambert, John R. Thompson and Keith R. Abrams. Statist. Med. 2007; 26:2479–
2500
3. “Meta-analysis Notes” DeCoster, J. (2004) from http://www.stat-help.com/notes.html
4. “Assessing Heterogeneity in Meta-Analysis: Q Statistic or I2 Index?” Tania B. Huedo-Medina, Julio
Sa´nchez-Meca, and Fulgencio Marı´n-Martı´nez, Juan Botella; Psychological Methods 2006, Vol. 11, No. 2, 193–
206.
5. “ How to calculate effect sizes from published research: A simplified methodology” Will Thalheimer
Samantha Cook; A Work-Learning Research Publication; Copyright 2002 by Will Thalheimer
CONTACT INFORMATION
Marina Komaroff, MPH, MS, DrPH Student
Senior Manager, Biostatistics,
Clinical/Regulatory Operations
Noven Pharmaceuticals, Inc.
Empire State Building
350 Fifth Avenue, 37th Floor
New York, NY 10118
Tel: 212 299 4202
Email: Mkomaroff@noven.com