Dynamic Treatment Regime, Sequentially Randomized Designs, and Semi-parametric Theory Abdus S. Wahed, PhD University of Pittsburgh [email protected] July 13, 2013 1 / 35 Table of Contents 1 Introduction Background and Motivation Objective 2 Estimation IPW Estimator Semiparametric Efficient Estimator 3 Simulation Scenarios Results of Simulation 4 Application CALGB 8923 Trial Results of Application 5 Conclusion 2 / 35 Dynamic Treatment Regime An algorithm that chooses treatment for patients based on the patient characteristics, previous treatment history and response to previous treatments. Examples: Depression Sertraline for 8 weeks, if the patient does not respond, treat the patient with Sertraline as well as with cognitive behavioral therapy; if the patient responds, continue Sertraline HIV Start on a ARV regimen, e.g., containing Efavirenz, switch to second line regimen less than 8 weeks after confirmed virologic failure Cancer Treat with chemotherapy followed by cis-RA if there is no disease progression 3 / 35 Dynamic Treatment Regime An algorithm that chooses treatment for patients based on the patient characteristics, previous treatment history and response to previous treatments. Examples: Depression Sertraline for 8 weeks, if the patient does not respond, treat the patient with Sertraline as well as with cognitive behavioral therapy; if the patient responds, continue Sertraline HIV Start on a ARV regimen, e.g., containing Efavirenz, switch to second line regimen less than 8 weeks after confirmed virologic failure Cancer Treat with chemotherapy followed by cis-RA if there is no disease progression 3 / 35 Dynamic Treatment Regime An algorithm that chooses treatment for patients based on the patient characteristics, previous treatment history and response to previous treatments. Examples: Depression Sertraline for 8 weeks, if the patient does not respond, treat the patient with Sertraline as well as with cognitive behavioral therapy; if the patient responds, continue Sertraline HIV Start on a ARV regimen, e.g., containing Efavirenz, switch to second line regimen less than 8 weeks after confirmed virologic failure Cancer Treat with chemotherapy followed by cis-RA if there is no disease progression 3 / 35 Dynamic Treatment Regime An algorithm that chooses treatment for patients based on the patient characteristics, previous treatment history and response to previous treatments. Examples: Depression Sertraline for 8 weeks, if the patient does not respond, treat the patient with Sertraline as well as with cognitive behavioral therapy; if the patient responds, continue Sertraline HIV Start on a ARV regimen, e.g., containing Efavirenz, switch to second line regimen less than 8 weeks after confirmed virologic failure Cancer Treat with chemotherapy followed by cis-RA if there is no disease progression 3 / 35 Dynamic Treatment Regime Usual goal is To develop optimal treatment regime to achive maximum patient benefit. To compare a fixed number of treatment regimes arising from a sequence of treatments available at different stages of therapy. 4 / 35 Sequentially Randomized Trials Patients are randomized to treatment options sequentially as they become eligible to receive treatments. Useful for comparing treatment regimes with discrete treatment options and intermediate responses Has been emloyed in various reseach settings such as cancer depression drug and alcohol addiction education 5 / 35 CALGB 8923 Trial Figure : Patient flow and study design for CALGB 8923 trial Cytarabine n=37 Consent n=79 Cytarabine +Mitoxantrone n=42 Response n=99 GM-CSF n=193 No Consent n=20 No Response n=94 CALGB 8923 n=388 Cytarabine n=45 Consent n=90 Cytarabine +Mitoxantrone n=45 Response n=106 Placebo n=195 No Consent n=16 No Response n=89 6 / 35 Motivation: CALGB 8923 Trial Previous analyses1 treated non-responders and non-consenters alike. Cytarabine n=37 Response and Consent n=79 GM-CSF n=193 Cytarabine +Mitoxantrone n=42 No Response or No Consent n=114 CALGB 8923 n=388 Cytarabine n=45 Response and Consent n=90 Cytarabine +Mitoxantrone n=45 Placebo n=195 No Response or No Consent n=105 1 Lunceford et al. (2002), Wahed & Tsiatis (2006), and Lokhnygina & Helterbrand (2007) 7 / 35 Objective However, the goal of such design is to determine which of the treatment regimes result in longer survival. Examples of regimes: “Treat with GM-CSF followed by Cytarabine if respond” “Treat with GM-CSF followed by Cytarabine+Mitoxantrone if respond” “Treat with placebo followed by Cytarabine if respond” “Treat with placebo followed by Cytarabine+Mitoxantrone if respond” In general, let us define regime Aj Bk as “treat with Aj followed by Bk if respond”. 8 / 35 Objective But combining response and consent together does not allow us to answer that question. 9 / 35 Evolution of a Paitent’s Status in CALGB Trial D X NR D X NR R/NC D R/NC R/C R/C R/C R/C R: responded C: consented Z: 2nd randomization X D X Z Z D: dead X: censored D X NC: not consented NR: not responded 10 / 35 Evolution of a Paitent’s Status - dealt in previous analyses D X NR D X NR R/C/Z R/C/Z R: responded C: consented Z: 2nd randomization D X D: dead X: censored NC: not consented NR: not responded 11 / 35 Goal of this work Consistent and efficient estimation of mean survival under a given treatment regime taking into account the actual data structure. Specifically, We will treat non-responders and non-consenters separately. We will take into account the delay from becoming eligible for second stage treatment to the assignment of treatment. 12 / 35 For the rest of the work, assume: Two stage 1 treatments Aj , j = 1, 2 Two stage 2 treatments Bk , k = 1, 2 for responders Consider data from patients who received treatment A1 . 13 / 35 For the rest of the work, assume: Two stage 1 treatments Aj , j = 1, 2 Two stage 2 treatments Bk , k = 1, 2 for responders Consider data from patients who received treatment A1 . 13 / 35 Counterfactual Variables Each patient i has an associated set of random variables Ri , (1 − Ri )TiNR , Ri TiR , Ri Ci , Ri (1 − Ci )TiNC , Ri Ci TiZ , Ri Ci T1i∗ , Ri Ci T2i∗ Ri : response status TiNR : survival time for a patient who did not respond TiR : time from initial treatment, A1 , to the time a patient responded to A1 Ci : consent status TiNC : survival time for a patient who responded but did not consent to Bk TiZ : time from initial treatment, A1 , to the initiation of second-stage treatment T1i∗ : survival time if patient responded to A1 and received treatment B1 T2i∗ : survival time if patient responded to A1 and received treatment B2 14 / 35 Survival Time Under regime A1 Bk In terms of counterfactual variables, the survival time under regime A1 Bk can be expressed as Tki = (1 − Ri )TiNR + Ri Tki∗ , k = 1, 2 The goal is to draw inference on the distribution of Tk , k = 1, 2. For example, we may be interested in µk = E [Tki ] 15 / 35 Observed Data The observed data in the presence of right censoring can be charaterized as the collection of i.i.d random vectors h i Ri , Ri Ci , Ri TiR , Ri Ci TiZ , Ri Ci Zki , Ui , ∆i , {GiH (u), u 6 Ui } , i = 1, 2, . . . , n; k = 1, 2. Ri , Ci , TiR , and TiZ are defined as before. Z1i : the B1 treatment assignment indicator Z2i : the B2 treatment assignment indicator Ti : observed survival time Xi : censoring time Ui : min(Ti , Xi ) ∆i : I (Ti < Xi ) GiH (u): data history collected on patient i prior to time u 16 / 35 Assumptions In order to draw conclusion about counterfacters, we need to relate them to the observed data, which is done through some assumptions. Consistency, or Stable Unit Treatment Value Assumption (SUTVA) Ti = (1 − Ri )TiNR + Ri (1 − Ci )TiNC + Ri Ci (Z1i T1i∗ + Z2i T2i∗ ), where Z2i = 1 − Z1i . No unmeasured confounding given data history, or Sequential Randomization Assumption (SRA) n o πk = P Zki = 1 Ri = 1, Ci = 1, GiH (TiZ ) = P {Zki = 1 |Ri = 1, Ci = 1 } 17 / 35 IPW Estimator Given a treatment regime, A1 Bk , the inverse-probability-weighted (IPW) estimator for mean survival time is µ̂IPW = n−1 k n X i=1 ∆i K (Ui ) 1 − Ri + Ri Ci Zki πCi πk Ui Note that there are two nuisance parameters, K (.) and πCi . K (.) is estimated by the Kaplan-Meier method. πCi , the consent rate, we considered two estimators: πCi is calculated as the proportion of responders who .P Pn n ∆i i consented, e.g. π̂Ci = i=1 K̂∆ R C i=1 K̂ (U ) Ri (U ) i i i i πCi is modeled by logistic regression on baseline and the 1st stage covariate, e.g. logit(π̂Ci ) = α̂ + β̂ × TiR + γ̂ × Wi , where Wi is the collection of baseline covariates. 18 / 35 Variance of IPW Estimator When πCi is estimated empirically: var ˆ (µ̂IPW )= k − Ri Ci − πCi π̂R Ê 1 n ( n 1 X ∆i n i=1 K̂ (Ui ) ! Ri Ci Zki Ti πC2 πk i " ! Ri Ci Zki Ui − µ̂IPW k π̂Ci πk ! !#2 Ri Ci Ri Ci Zki + (Ri − π̂R )Ê Ê Ti πR2 πC2 πk i ) Z L 0 2 dN c (u) Ê Li (u, π̂Ci ) + 0 K̂ (u)Y (u) 1 − Ri + (1) 19 / 35 Variance of IPW Estimator where Ê 2 L0i (u, π̂Ci ) "( ) n 1 X ∆i Ri Ci Zki = 1 − Ri + Ui − Ĝ (u, π̂Ci ) n i=1 K̂ (Ui ) π̂Ci πk !) ( h i Ri Ci Ri Ci − Ĝ 0 (u, π̂R ) + Ri − Ĝ 00 (u) Ê + π̂R πR2 !#2 Ri Ci Zki ×I (TiR ≥ u)Ê Ti I (Ti > u) πC2 πk i Ĝ (u, π̂Ci ) = Ĝ 0 (u, π̂R ) = 00 Ĝ (u) = n X 1 nŜ(u −1 ) 1 nŜ(u −1 ) 1 nŜ(u − ) i=1 n X i=1 ∆i K̂ (Ui ) ∆R i ( 1 − Ri + Ri K̂ (TiR ) π̂R n X ∆R i i=1 K̂ (TiR ) Ri Ci Zki π̂Ci πk ) Ui × I (Ui > u) Ci × I (TiR ≥ u) Ri × I (TiR ≥ u) 20 / 35 Variance of IPW Estimator When πCi is modeled by logistic regression: var ˆ (µ̂IPW ) k ( − ∆R i Ri 1 = n " n 1 X ∆i n i=1 K̂ (Ui ) Ci − π̂Ci K (TiR ) π̂Ci (1 − π̂Ci ) ) Ê ( Ri Ci Zki 1 − Ri + π̂Ci πk ) Ui − µ̂IPW k !2 Ri Ci Zki Ê Ti 2 πC2 πk πCi (1 − πCi ) i # Z L 2 dN c (u) + Ê L00 (u, π̂ ) Ci i 0 K̂ (u)Y (u) (πCi − Ci )2 !−1 (2) 21 / 35 Variance of IPW Estimator where n X 2 Ê L00 = n−1 i (u, π̂Ci ) ∆i K̂ (Ui ) ! Ri C i Ê πR2 "( 1 − Ri + i=1 00 +Ĝ (u)Ê Ri Ci Zki Ti πC2 πk Ri Ci Zki π̂Ci πk ! I (TiR ) Ui − Ĝ (u, π̂Ci ) #2 ≥ u) I (Ti > u) i 22 / 35 Semiparametric Efficient Estimator The IPW estimator is consistent, but not efficient1 . Fails to take into account information from the censored patients as well as patients who fail to receive the treatment Bk due to randomization to other B-treatment or refusal to take B-treatment. 1 Robins, Rotnitzky, & Zhao (1994) 23 / 35 Semiparametric Efficient Estimator Semi-parametric efficient estimator1 : µ̂EFF k " n 1X ∆i = n i=1 K̂ (Ui ) Ri Ci Zki 1 − Ri + π̂Ci πk − 1 Robins, ! ∆R i Ri (Ci − π̂Ci )ν̂Ri K̂ (UiR )π̂Ci dMic (u) H τ̂ Gi (u) K (u) 0 # ∆C i Ri Ci (Zki − πk )ν̂Ci Z Ui + − L K̂ (UiC )π̂Ci πk (3) Rotnitzky, & Zhao (1994) 24 / 35 Semiparametric Efficient Estimator τ (.), νRi , and νCi are estimated from the data based on suitable regression models. τ GiH (u) = E o Ti | Ri = 1, Zki = 1, TiR , GiH (TiR ) = α + βTiR n o = E Ti | Ri = 1, Ci = 1, Zki = 1, TiZ , GiH (TiZ ) = α + β1 TiR + β2 (TiZ − TiR ) νRi = E νC i ∆i Ui GiH (u) K (Ui ) n where (TiZ − TiR ) is the delay of the second randomization 25 / 35 Variance of Semiparametric Efficient Estimator var ˆ (µ̂EFF )= k " ( ) Z L n dMic (u) H 1 X ∆i Ri Ci Zki τ̂ Gi (u) 1 − R + U + i i 2 n i=1 K̂ (Ui ) π̂Ci πk K (u) 0 #2 R C ∆i Ri (Ci − π̂Ci )ν̂Ri ∆i Ri Ci (Zki − πk )ν̂Ci EFF − − − µ̂k (4) K̂ (UiR )π̂Ci K̂ (UiC )π̂Ci πk 26 / 35 Simulation Scenarios Population: R ∼ Bernoulli(πr ); T R ∼ EXP(αr ) truncated at br ; T NR ∼ EXP(α0 ) truncated at b0 ; T Z ∼ T R + Ur ; Ur ∼ uniform(0, θr ); Tk∗ ∼ T Z + (β1 + β2 × T Z )Uk ; Uk ∼ uniform(0, θk ), k = 1, 2; Tk = (1 − R)T NR + RTk∗ , k = 1, 2; Data generation: C ∼ Bernoulli(πC ); logit(πC ) = α + β × T R ; T NC ∼ T R + U NC ; U NC ∼ uniform(0, θNC ); Z1 ∼ Bernoulli(π1 ), Z2 = 1 − Z1 ; T = min((1 − R)T NR + R(1 − C )T NC + RC {Z1 T1∗ + Z2 T2∗ }, L); X ∼ uniform(0, ξ); U = min(T , X ); 27 / 35 Simulation Results Table : Results of Monte-Carlo simulations for treatment regimes A1 B1 and A1 B2 under 50% response rate and 51%, and 25% censoring rate Simulation=2000, n=500, 50% response 51% censoring Estimator IPW (emp1 ) IPW (mod2 ) Efficient (emp1 ) Efficient (mod2 ) A1 B1 (µ1 = 386) A1 B2 (µ2 = 359) µ̂1 SEMC (µ̂1 ) SE (µ̂1 ) CP% µ̂2 SEMC (µ̂2 ) SE (µ̂2 ) CP% 386.3 385.9 389.2 386.4 29.3 29.6 18.5 19.4 30.1 30.2 19.6 20.8 95.4 95 96.4 97 360.1 360.7 362.0 359.9 27.4 28.1 18.0 19.2 28.3 28.6 19.2 20.5 95.7 95.4 96.7 96.9 1 empirical estimator for πC 2 logistic regression estimator for πC with covariates of TiR 28 / 35 Simulation Results Table : Results of Monte-Carlo simulations for treatment regimes A1 B1 and A1 B2 under 50% response rate and 51%, and 25% censoring rate Simulation=2000, n=500, 50% response 51% censoring Estimator IPW (emp1 ) IPW (mod2 ) Efficient (emp1 ) Efficient (mod2 ) A1 B1 (µ1 = 386) SEMC (µ̂1 ) SE (µ̂1 ) CP% µ̂2 SEMC (µ̂2 ) SE (µ̂2 ) CP% 386.3 385.9 389.2 386.4 29.3 29.6 18.5 19.4 30.1 30.2 19.6 20.8 95.4 95 96.4 97 360.1 360.7 362.0 359.9 27.4 28.1 18.0 19.2 28.3 28.6 19.2 20.5 95.7 95.4 96.7 96.9 µ̂1 SEMC (µ̂1 ) SE (µ̂1 ) CP% µ̂2 SEMC (µ̂2 ) SE (µ̂2 ) CP% 386 386.7 388.2 386.7 20.8 21.0 12.4 12.7 21.3 21.1 12.8 13.1 95.5 95.3 95.0 95.3 358.2 359.7 361.6 360.2 18.9 19.3 12.1 12.5 19.8 19.6 12.3 12.7 96.0 95.4 94.9 95.1 25% censoring Estimator 1 IPW (emp ) IPW (mod2 ) Efficient (emp1 ) Efficient (mod2 ) A1 B2 (µ2 = 359) µ̂1 A1 B1 (µ1 = 386) A2 B2 (µ2 = 359) 1 empirical estimator for πC 2 logistic regression estimator for πC with covariates of TiR 29 / 35 Simulation Results Both IPW and efficient estimators consistently estimated the mean survival time, but the efficient estimators improved the efficiency significantly, in some cases by as large as 69%. The Monte-Carlo and estimated variances increase with the increase in censoring rate. Decreasing the sample size from 500 to 250 and increasing the response rate from 50% to 70% (results not shown here) showed similar results. When consent rate was estimated empirically, the precision of the estimators was better than when consent rate was estimated using logistic regression. 30 / 35 CALBG 8923 Trial Double-blind, placebo-controlled two-stage trial; The effects of infusions of granulocyte-macrophage colony-stimulating factor (GM-CSF) after initial chemotherapy in 388 elderly patients with acute myelogenous leukemia (AML); Patients were randomized initially to GM-CSF or placebo following standard chemotherapy. Later, patients meeting the criteria for complete remission were offered a second randomization to one of two intensification treatments (Cytarabine or Cytarabine+Mitoxantrone); The treatment regimes: “treat with GM-CSF or placebo followed by Cytarabine or Cytarabine+Mitoxantrone if respond” 31 / 35 Results of Analysis of CALGB 8923 Trial Table : Estimated mean survival time (days) and its variance for the CALGB 8923 study A1 B1 3 Estimator 1 IPW (emp ) IPW (mod2 ) Efficient (emp1 ) Efficient (mod2 ) A 1 B2 4 A2 B 1 5 A2 B2 6 µ̂ SE (µ̂) µ̂ SE (µ̂) µ̂ SE (µ̂) µ̂ SE (µ̂) 404.1 414.9 452.5 461.1 59.6 66.4 49.1 48.7 406.3 395.6 421.8 417.1 69.3 68.9 62.1 60.1 363.2 358.9 414.0 415.6 45.7 46.8 41.5 41.4 760.0 767.8 717.0 722.2 192.4 195.4 227.5 222.1 1 empirical estimator for πC 2 logistic regression estimator for πC with covariates of TiR , gender, race, age, and white blood cell counts 3 treatment regime “treat with GM-CSF followed by Cytarabine if respond” 4 treatment regime “treat with GM-CSF followed by Cytarabine+Mitoxantrone if respond” 5 treatment regime “treat with placebo followed by Cytarabine if respond” 6 treatment regime “treat with placebo followed by Cytarabine+Mitoxantrone if respond” 32 / 35 Results of Analysis of CALGB 8923 Trial The IPW estimators tend to provide smaller estimates for the mean survival time compared to the efficient estimators. Standard errors of efficient estimators are smaller than that of IPW estimators, as expected by theory. Based on the estimates, it seems that treatment regime “treat with placebo followed by Cytarabine + Mitoxantrone if respond” provides the largest mean survival time among four regimes. However, considering the large SE for the corresponding estimator, a formal test of hypothesis might not be able to show statistical significance. Treatment regime “treat with placebo followed by Cytarabine if respond” provides the shortest mean survival time among four regimes. Adding GM-CSF to chemotherapy does not improve regime mean survival time. 33 / 35 Conclusion We have presented several approaches to estimate mean survival time and its variance in two-stage randomization designs when some patients did not consent after responding to the initial treatment. The efficient (semi-parametric) estimator shows considerable gain in efficiency over IPW estimator. This method allows us to use information from auxiliary covariates. We have demonstrated our methods through application of a leukemia clinical trial data set. In some cases, it might be of interest to estimate survival probability at certain time point other than mean survival time. In such case, an entirely similar strategy can be followed by replacing Ti by I (Ti > t) for survival probability. 34 / 35 THANK YOU! 35 / 35
© Copyright 2026 Paperzz