Simulating trials with survival endpoints
John Aponte
Source: vignettes/simulation_trials.Rmd
Introduction
We can use objects of class SURVIVAL to simulate survival times in clinical trials. This example estimates the empirical power to detect a statistically significant vaccine efficacy.
The empirical power is defined as the percentage of simulations in which the p-value for the treatment coefficient is equal to or below 0.05.
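As a minimal sketch of this definition, the percentage can be computed directly from a vector of p-values. The vector `pvals` below is made up for illustration and does not come from the simulation in this vignette.

```r
# Hypothetical p-values from five simulated trials (illustrative only)
pvals <- c(0.001, 0.20, 0.04, 0.05, 0.03)

# Empirical power: percentage of p-values equal to or below 0.05
alpha <- 0.05
empirical_power_pct <- 100 * mean(pvals <= alpha)
empirical_power_pct
#> [1] 80
```

Four of the five hypothetical p-values are at or below 0.05, so the sketch returns 80%.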
Empirical power for superiority
Assumptions:
We run 1000 simulations
There are 250 participants in each group; one group is the control and the other is vaccinated
The vaccine efficacy is 40% (i.e., the hazard ratio is 1 - 40/100 = 0.6)
The control group follows an exponential distribution with 40% of subjects having an event by 12 months (365.25 days)
The simulated data is analyzed using Cox regression.
We estimate the empirical power as the percentage of simulations in which the p-value of the coefficient for the group is 0.05 or lower. We present the empirical power, the distribution of the total number of events, and the distribution of the estimated vaccine efficacy.
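The assumptions above already determine how many events to expect, which gives a useful sanity check on the simulated event counts. The following base-R sketch works this out analytically; it assumes proportional hazards, so that the survival function of the vaccinated group is the control survival function raised to the power of the hazard ratio. The names `surv_control`, `fail_vaccine` and `expected_events` are introduced here for illustration.

```r
# Assumptions taken from the list above
nsubjects    <- 250          # participants per group
fail_control <- 0.4          # proportion of controls with an event by month 12
hr           <- 1 - 40 / 100 # hazard ratio for a vaccine efficacy of 40%

# Under proportional hazards, S_vaccine(t) = S_control(t)^hr
surv_control <- 1 - fail_control
fail_vaccine <- 1 - surv_control^hr

# Expected events per group and in total at the end of follow-up
expected_events <- nsubjects * c(control = fail_control, vaccine = fail_vaccine)
expected_events
sum(expected_events)
```

Under these assumptions the expected total is roughly 166 events (100 in controls and about 66 in the vaccinated group), in line with the simulated event counts summarized at the end of this vignette.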
# Packages used in this example
library(survobj)
library(survival)
# Number of simulations
nsim = 1000
# Participants in each group
nsubjects = 250
# Vaccine efficacy
ve = 40
# Hazard ratio
hr = 1-ve/100
# Follow-up time
ftime <- 12
# Fail events in controls
fail_control = 0.4
# Define Object with exponential distribution for events in controls
s_events <- s_exponential(fail = fail_control, t = ftime)
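For readers who want to see what this parameterization implies, the following base-R sketch derives the exponential rate from the same two numbers. It does not call `s_exponential()`; it assumes only that the object describes an exponential distribution whose cumulative probability of an event at time `t` equals `fail`. The name `lambda` is introduced here for illustration.

```r
# Same inputs as above
ftime        <- 12
fail_control <- 0.4

# For an exponential distribution F(t) = 1 - exp(-lambda * t),
# so lambda = -log(1 - fail) / t
lambda <- -log(1 - fail_control) / ftime
lambda

# Check: the cumulative probability of an event at month 12 is 0.4
pexp(ftime, rate = lambda)

# Median time to event in controls, in months
log(2) / lambda
```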
Simulation
set.seed(12345)
# Define the group for the subjects
group = c(rep(0, nsubjects), rep(1, nsubjects))
# Define the hazard ratio according to the group
hr_vector <- ifelse(group ==0,1,hr)
# Loop
sim <- lapply(
  1:nsim,
  function(x){
    # Simulate survival times for event
    sim_time_event <- s_events$rsurvhr(hr_vector)
    # Censor events at end of follow-up.
    cevent <- censor_event(censor_time = ftime, time = sim_time_event, event = 1)
    ctime <- censor_time(censor_time = ftime, time = sim_time_event)
    # Analyze the data using cox regression
    reg <- summary(coxph(Surv(ctime, cevent) ~ group))
    # Collect the information
    pval = reg$coefficients["group","Pr(>|z|)"]
    ve = (1 - exp(reg$coefficients["group","coef"]))*100
    nevents = reg$nevent
    # return values
    return(data.frame(simid = x, pval, ve, nevents))
  }
)
# Join all the simulations in a single data frame
sim_df <- do.call(rbind, sim)
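To make the steps inside the loop concrete, here is a base-R sketch of the data generation for a single trial that does not use the SURVIVAL object. It relies on the fact that a hazard ratio applied to an exponential distribution multiplies its rate, and it assumes that censoring at the end of follow-up means truncating times at `ftime` and marking later events as censored. The names `lambda`, `time_event`, `obs_time` and `obs_event`, and the seed, are introduced here for illustration only.

```r
set.seed(2024)  # arbitrary seed for this sketch

nsubjects <- 250
ftime     <- 12
hr        <- 0.6
lambda    <- -log(1 - 0.4) / ftime  # control rate: 40% events by month 12

group     <- c(rep(0, nsubjects), rep(1, nsubjects))
hr_vector <- ifelse(group == 0, 1, hr)

# Exponential event times: the hazard ratio multiplies the rate
time_event <- rexp(length(group), rate = lambda * hr_vector)

# Administrative censoring at the end of follow-up
obs_time  <- pmin(time_event, ftime)
obs_event <- as.integer(time_event <= ftime)

# Events observed in each group
table(group, obs_event)
```

The resulting `obs_time` and `obs_event` vectors play the same role as `ctime` and `cevent` in the loop above and could be passed to `Surv()` in the same way.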
Analyze the simulation
empirical_power = binom.test(sum(sim_df$pval <= 0.05), length(sim_df$pval))
empirical_power$estimate
#> probability of success
#> 0.902
empirical_power$conf.int
#> [1] 0.8818715 0.9197225
#> attr(,"conf.level")
#> [1] 0.95
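The interval returned by `binom.test()` is the exact (Clopper-Pearson) interval. As a sketch of where these limits come from, they can be reproduced with beta quantiles; the counts below are the ones implied by the estimate of 0.902 over 1000 simulations, and the names `x`, `n`, `lower` and `upper` are introduced here for illustration.

```r
# Counts implied by the estimate above: 902 of 1000 simulations with p <= 0.05
x <- 902
n <- 1000
conf_level <- 0.95
a <- (1 - conf_level) / 2

# Clopper-Pearson limits expressed as beta quantiles
lower <- qbeta(a, x, n - x + 1)
upper <- qbeta(1 - a, x + 1, n - x)
c(lower, upper)
```

The width of this interval reflects only the Monte Carlo error of the simulation; increasing `nsim` narrows it.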
# Distribution of the simulated VEs
summary(sim_df$ve)
#>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
#>    2.22   33.37   40.06   39.33   46.16   68.22
# Distribution of the simulated number of events
summary(sim_df$nevents)
#>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
#>   134.0   159.0   166.0   166.5   174.0   206.0
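As a cross-check on the simulated power, one can compare it with an analytical approximation. The sketch below uses Schoenfeld's approximation for a two-sided test with 1:1 allocation, which is a different technique from the simulation above and is not part of the original analysis. It takes the total number of events as roughly 166, based on the mean reported above; the names `d` and `approx_power` are introduced here for illustration.

```r
alpha <- 0.05
hr    <- 0.6
d     <- 166  # approximate total number of events (mean above is 166.5)

# Schoenfeld approximation for 1:1 allocation:
# power = pnorm(sqrt(d) / 2 * |log(hr)| - qnorm(1 - alpha / 2))
approx_power <- pnorm(sqrt(d) / 2 * abs(log(hr)) - qnorm(1 - alpha / 2))
approx_power
```

Under these assumptions the approximation gives a power of about 0.91, close to the empirical estimate of 0.902.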