Innovative sample size methods for adaptive clinical trials webinar web version (0.1)

 Head of Statistics
 nQuery Lead Researcher
 FDA Guest Speaker
 Guest Lecturer
Webinar Host
HOSTED BY:
Ronan
Fitzpatrick

Webinar Overview
Adaptive Designs and Sample Size Re-estimation
(SSR)
Blinded Sample Size Re-estimation
GSD, Conditional Power & Unblinded SSR
Discussion and Conclusions

Worked Examples Overview
Blinded Sample Size Re-
estimation
Two Means Group Sequential
Conditional Power
Unblinded SSR Example
WORKED EXAMPLES

In 2017, 90% of organizations with clinical trials
approved
by the FDA used nQuery for sample size and power
calculation

Context
SSD finds the appropriate sample size for
your study
 Common metrics are statistical power, interval
width or cost
SSD seeks to balance ethical and practical
issues
 Crucial to arrive at valid conclusions, Type M/S
errors
High cost of failed clinical trials drug
development

Adaptive Trials Overview
Adaptive Trials are any trial where a change
or decision is made to a trial while still on-
going
Encompasses a wide variety of potential
adaptions
 E.g. Early stopping, SSR, enrichment, seamless,
dose-finding
Adaptive trials seek to give control to trialist
to improve trial based on all available
information

Adaptive Trials Pros & Cons
Advantages
1. Earlier Decisions
2. Reduced Potential
Cost
3. Higher Potential
Success
4. Greater Control
5. Better Seamless
Designs
Disadvantages
1. More Complex
2. Logistical Issues
(IDMC)
3. Modified Test
Statistics
4. Greater Expertise
5. Regulatory
Approval?

Adaptive Trials Regulatory
Background
Draft FDA CBER/CDER Guidance published in
2010
 “Well-understood” and “Less well-understood”
Designs
 EMA published similar reflection paper (2007)
Increase in interest in encouraging adaptive
design
 US: Innovative Cures Act, EU: Adaptive Pathways
 New FDA Guidance expected later this year
Will likely to see proliferation of new designs

Sample Size Re-estimation
(SSR)
Will focus here on specific adaptive design of
SSR
Adaptive Trial focused on higher sample size
if needed
 Strong adaption target due to intrinsic SSD
uncertainty
 Note that more suited to knowable/short follow-
up
 Note that could lower sample size but not
encouraged

Blinded Sample
Size Re-estimation
PART II

Blinded Sample Size Re-
estimation
BSSR uses interim blinded nuisance
parameter estimate
 Use of blinded data reduces logistical/regulatory
issues
 Considered a “well understood” type of adaptive
design
Multiple methods but focus on internal pilot
approach
 Update N based on parameter estimate from
internal pilot
 Use same methods as fixed term trial incl. pilot

Blinded SSR nQuery Summary
(Autumn 2018)
Blinded SSR Means
SSR Criteria: Variance
Three σ2 Estimate
Methods
1. Two Sample
Inequality
2. Two Sample NI
3. Two Sample Equiv
Blinded SSR Props
SSR Criteria: Overall
Success Rate
Assumes effect size
true
1. Two Sample
Inequality
2. Two Sample NI

Two Sample Mean Blinded SSR
Example
Source: nejm.com
Parameter Value
Significance Level (2-
Sided)
0.05
Mean Difference (%) -9
Standard Deviation (%) 16
Dropout Rate 15%
Target Power 90%
Nuisance Parameter? Standard
Deviation
“We estimated that we
would need to enrol 160
patients, given an
expected mean (±SD)
annual decline in the FVC
of 9±16 percent of the
predicted value and a
dropout rate of 15 percent,
to achieve a two-sided
alpha level of 0.05 and a
statistical power of 90

Two Sample Proportion BSSR
Example
“For our sample size
calculations, we assumed that
20% of women in the control arm
would be using LARC after their
6-week postpartum visit, … An
analysis population with at least
626 women (313 in each arm)
was required to provide 80%
power (using a two-sided alpha
of 0.05) to detect an absolute
10% increase to 30% in LARC use
in the intervention arm [14].
Anticipating a maximum drop-
out rate of 20% at the time of
Follow-Up Survey #2, we planned
to randomize 800 participants”
Source: Contraception
Parameter Value
Sided)
0.05
Control Rate 0.2
Intervention Rate 0.3
n per Group 313
Target Power (%) 80%
Nuisance Parameter Overall Success
Rate

GSD, Conditional
Power & Unblinded
SSR
PART III

Group Sequential Designs (GSD)
GSD facilitate interim analyses
 Interim analyses occur while trial
on-going
Accrued data analysed at pre-
specified times
 E.g. After 1/2 subjects have been
measured
Can stop for benefit or futility
 If neither found, continue trial
until end/next interim
Need to account for effect of
multiple analyses
 Do this by “spending” α and/or β
errors
GSD Changes
1. Futility Only Designs
2. Additional Outputs
3. New Two Sample
TTE
4. One Sample Mean
GSD
5. One Sample Prop
GSD

Error Spending (Lan & DeMets)
Two Criteria for early stopping
1. Efficacy (α-spending)
2. Futility (β-spending)
Multiple Error Spending
Functions
O’Brien Fleming, Pocock etc.
Both α and β spending work
similarly
Can be very liberal or conservative
At each interim analysis,
spending a proportion of the
total error
 Makes analysis at endpoint more
conservative
𝛼 𝜏 = 2 1 − Φ
𝑧 𝛼/2
𝜏

Group Sequential Example
“A sample size of 242
subjects (121 per treatment
group) provides at least 80%
power to detect a relative
difference of 53% between
botulinum toxin A and
standardized anticholinergic
therapy, assuming a
treatment difference of -0.80
and a common SD of 2.1
(effect size = 0.381), and a
two-sided type I error rate of
5%. Sample size has been
adjusted to allow for a 10%
loss to follow-up over the 6-
months of treatment as well
Parameter Value
sided)
0.05
OnabotulinumtoxinA
Mean
-2.3
Anticholinergic Mean -1.5
Standard Deviation
(Both)
2.1
Power 80%
# Interim Analyses 1
α Spending Function
O’Brien-
Source: NEJM (2012)

Conditional Power (CP)
CP gives prob. of rejecting null given interim
test statistic
 Calculation still depends on what “true” difference
set to
Often used as ad-hoc criteria for futility testing
in GSD
 More flexible than β-spending but less error guarantee
Focus here on CP as measure of “promising”
results
 “Promising” meaning less than target but close to
target power

Conditional Power & Unblinded SSR
Most common criteria proposed for unblinded SSR
is CP
SSR suggested when interim results “promising”
(Chen et al)
 Gives third option vs GSD: continue, stop early,
increase N
 “Promising” user-defined but based on unblinded effect
size
 Power for optimistic effect but increase N for lower
relevant effects?
2 methods here: Chen, DeMets & Lan; Cui, Hung &
Wang
 1st uses GSD statistics but only penultimate look & high
CP
nd

Innovative sample size methods for adaptive clinical trials webinar web version (0.1)

PART 4
Discussion &
Conclusions

Discussion and Conclusions
Adaptive Trials expected to become more
common
 Reduction of costs, greater regulatory interest
etc.
SSR will be one common type of adaptive
trial
 Blinded SSR already widely accepted, unblinded
growing
Blinded SSR targets initial variance under-
estimates

nQuery Spring 2018 Update
Initial release focused on Survival & Bayesian tables.
April release adds 72 new tables in following areas:
New Bayes tables in April
update
New tables in April
update
Epidemiology Non-inferiority/
Equivalence
Correlation/ROC
Bayesian
Sample Size
52 20

nQuery Autumn 2018 Update
Autumn 2018 release adds nQuery Adapt module, 32 new tables
& undo/redo
New Core
Tables
Proportions +
Crossover Assurance
Conditional
Power
GST + SSR
20
nQuery Bayes
Tables
12
nQuery Adapt
Tables
15

Q&A
Any Questions?
For further details,
contact at:
info@statsols.com
Thanks for listening!

References
Friede, T., & Kieser, M. (2006). Sample size recalculation in internal pilot study designs: a
review. Biometrical Journal: Journal of Mathematical Methods in Biosciences, 48(4), 537-555.
Tang, J. H., Dominik, R. C., Zerden, M. L., Verbiest, S. B., Brody, S. C., & Stuart, G. S. (2014).
Effect of an educational script on postpartum contraceptive use: a randomized controlled
trial. Contraception, 90(2), 162-167.
Tashkin, D. P., Elashoff, R., Clements, P. J., Goldin, J., Roth, M. D., Furst, D. E., ... & Seibold,
J. R. (2006). Cyclophosphamide versus placebo in scleroderma lung disease. New England
Journal of Medicine, 354(25), 2655-2666.
Jennison, C., & Turnbull, B. W. (1999). Group sequential methods with applications to clinical
trials. CRC Press.
Visco, A. G., et al (2012). Anticholinergic therapy vs. onabotulinumtoxina for urgency
urinary incontinence. New England Journal of Medicine, 367(19), 1803-1813.
Chen, Y. J., DeMets, D. L., & Gordon Lan, K. K. (2004). Increasing the sample size when the
unblinded interim result is promising. Statistics in medicine, 23(7), 1023-1038.
Cui, L., Hung, H. J., & Wang, S. J. (1999). Modification of sample size in group sequential
clinical trials. Biometrics, 55(3), 853-857.

Innovative sample size methods for adaptive clinical trials webinar web version (0.1)

More Related Content

What's hot (20)

Similar to Innovative sample size methods for adaptive clinical trials webinar web version (0.1) (20)

Recently uploaded (20)

Innovative sample size methods for adaptive clinical trials webinar web version (0.1)

Editor's Notes