Efficient and Flexible Designs for Vaccine Efficacy Trials

Abstract

Vaccine efficacy trials can often need large sample sizes as well as long follow-up times. The factors influencing the total trial duration are: vaccine efficacy and incidence, recruitment and lost-to-follow-up rates. At the planning stage assumptions on these factors are made to finalize the sample size. However, if the observed data in the target trial deviate from these assumptions then there is an increased risk of an under/over powered trial. We describe an efficient and flexible Bayesian trial design for testing for the vaccine efficacy of a new Tuberculosis (TB) vaccine in India. We first describe the Bayesian statistical framework and highlight key differences from the classical framework.

The Bayesian Statistical Framework

Unlike the traditional Frequentist framework, the Bayesian framework treats parameters of interest like Vaccine Efficacy (VE) as random (not fixed) and as such are assigned a probability distribution. Before the data is revealed, the chosen distribution is called the Prior which is then updated to the Posterior once the data is available using the Bayes Theorem. Today’s posterior becomes tomorrow’s prior. All inference are then made using the posterior. When there is no a priori information on the parameters of interest then we assume a flat or a weakly-informative prior.

Consider vaccine efficacy (𝑉 𝐸). This is traditionally measured in terms of 1 - incidence rate ratio (vaccine vs. placebo) or as 1- hazard ratio. The latter is more efficient when anticipating long and varying follow-up times across study subjects. Another way to measure VE is in terms of the case proportion $(𝜃 = \frac{𝜋_{𝑣}}{𝜋_{𝑣} + 𝜋_{𝑐}})$ as $𝑉 𝐸 = 100 \times (1 - \frac{𝜋_{𝑣}}{𝜋_{𝑐}}) = 100 \times \frac{1 - 2 𝜃}{1 - 𝜃}$

where 𝜋_𝑣 and 𝜋_𝑐 are the proportion of cases in the 𝑣accine and 𝑐ontrol arms respectively. The choice of such a parametrization facilitates sequential testing via multiple pre-specified interim looks. With large sample sizes and 1:1 randomisation, 𝜃 can be estimates using $\frac{𝑛_{𝑣}}{𝑛_{𝑣} + 𝑛_{𝑐}}$ , where 𝑛𝑣 and 𝑛𝑐 are the observed number of cases. We present an adaptive and sequential vaccine efficacy trial design in the Bayesian framework and highlight important advantages. Testing a hypothesis of the form 𝐻₀ : 𝑉𝐸 ≤ 0.3 vs. 𝐻_𝑎 : 𝑉𝐸>0.3 would be equivalent to testing 𝐻₀ : 𝜃 ≥ 0.411 vs. 𝐻_𝑎 : 𝜃<0.411. We will consider 𝜃 as our parameter of interest (primary estimand) and assume a flat uniform prior (see Figure 1).

Statistical Significance (Success) Criteria: Once the target trial data is available, we update the prior to the posterior distribution using the Bayes Theorem. Since 𝜃 is random, we can make probability statements about 𝜃 or on any hypothesis regarding 𝜃 based on the posterior. The trial would be deemed successful in showing efficacy (with a lower bound 30%) if Pr{𝜃 < 0.411 | 𝐷} is high, i.e. the posterior probability of case-proportion being less than 0.411, given the data (𝐷), exceeds a high threshold (𝛾). For a fixed design (no interim looks) 𝛾 = 0.975 ensures a type-I error (𝛼) of 2.5% (one-sided). For an adaptive and sequential design like the one we propose, 𝛾 will need to be determined via simulations (under 𝐻₀).

Sequential Adaptive Design

Initial Sample Size using the Frequentist log-rank test is around 148 cases required to reject 𝑉 𝐸 ≤ 0.3 with 90% power at 2.5% 𝛼-level (one-sided) if true (underlying) 𝑉 𝐸 = 0.6. ‣ Assuming an incidence of 0.2/100 person-year (PY) in the first two years and 0.1/100 PY in subsequent years for the IGRA+ (TB infection test) cohort, a constant 0.1/100 PY for the IGRA- cohort, a 50-50 split between IGRA+/- and min. (max.) follow-up of 3(5) years, this would require around 42,000 subjects to be randomised. Sample size requirement will almost be halved if true VE = 0.7.
Bayesian Design: To gain speed and to mitigate the risk of an underpowered trial due to uncertainty around VE, incidence, IGRA+/- split, etc. we propose a 7-look group sequential design:
‣ Plan for accruing a maximum of 140 cases and have 6-interim analyses (IA) starting at 80 cases and in increments of 10. The final look is at 140 cases.
‣ Using alpha-spending approach, define varying thresholds 𝛾1 ≥ 𝛾2 ≥ … ≥ 𝛾7 ≥ 0.975 for the posterior probability of VE > 0.3 (equivalently, 𝜃 < 0.411) computed at each look.
‣ Set 𝛾-s above via simulations under the null scenario (𝑉 𝐸 ≤ 0.3): For type-I error of 2.5%, 𝛾𝑘 = 0.997 for 𝑘 = 1,2, 𝛾𝑘 = 0.995 for 𝑘 = 3,4,5,6 and for final 𝛾7 = 0.98. Higher thresholds needed for interim stops. Figure 1, using simulated data, illustrates the sequential testing using posterior probabilities calculated at each interim and where the trial can be deemed successful only at the final look. Table 1 shows power and duration for sample sizes of 30K and 40K.

Table 1: Power and Duration (in years) for proposed design, assuming VE = 0.6 for IGRA+ and 0.7 for IGRA -ve, 60:40 split, incidence rate as in initial sample size.

N	Var./Cases ->	80	90	100	110	120	130	Final
30000	Power Duration	0.46 4.5	0.61 5.0	0.74 5.4	0.83 5.9	0.86 6.4	0.91 6.6	0.96 7.5
40000	Power Duration	0.47 4.2	0.61 4.5	0.72 4.9	0.83 5.3	0.86 5.7	0.91 6.0	0.97 6.4

Figure 1: Sequential testing example after 100, 110, … , 140 cases accrued. Flat prior (―), posterior (―). The area under the shaded regions (■) are the posterior probabilities of VE > 0.3.

Advantages and Discussions

Major advantage with the proposed Bayesian design is that it covers for VE between 0.6 and 0.7, with faster efficacy readout with higher VE.
‣ Relative to a traditional group sequential design, power/sample size advantage here comes from the ability to carry out sequential testing without the high alpha-spending that would be incurred in the traditional setting due to the reliance on the notion of extremism for calculating p-values.
‣ Use of case proportion as the primary estimand reduces data management burden and thus facilitates multiple interim looks.
Adaptive features like blinded sample size re-estimation and population enrichment (dropping IGRA-) can be easily incorporated into the design.
Rigorous statistical and operational planning needed as well as early communication with the regulatory bodies for such innovative designs.