Covariate Selection and Balance in Propensity Score Methods M Sanni Ali University Medical Center Utrecht, the Netherlands Outline Confounding Propensity Score (PS) Methods Covariate Selection Balance Diagnostics and their Applications Balance Assessment PS Methods in Time-varying Treatment Reporting of PS Analysis Confounding Non-intermediate common cause of exposure and outcome Unblocked backdoor path in causal diagrams Observed, unobserved, unknown Propensity Score Methods Propensity Score - Summary Score – Variables included? - Balancing Score – Covariate balance? Often derived using logistic regression - Selection of variables and forms Ensemble methods - Automated – Interactions/polynomials Matching, stratification, weighting, and covariate adjustment Rubin & Rosenbaum 1983; Westreich et al 2010; Lee 2010 Advantages of PS Confidence on the causal inference Transparency/Easy to communicate Design a study separate from analysis Summary Scores – rare outcome Covariate Selection Clinical knowledge - Sufficient? Variable with Instrumental Variable like properties - Exacerbate the imbalance in unmeasured confounders = Amplify the bias When could bias be amplified - Unmeasured Confounding – Strong!!! - Strong IV !!! - Linear models Balance diagnostics are powerful tools - Absolute standardized difference is robust Bhattacharya & Vogt 2007; Pearl 2010&2011; Myers et al 2011; Austin 2009; Belitser et al 2011; M S Ali et al 2014 Covariate Selection A simulation study using binary covariates, treatment, and outcome. In different scenarios, confounding variables, risk factors, instrumental variables, their interaction and square terms were considered. X9 X3 X8 A Y X6 X7 X1, X2 , X5 X4 M S Ali et al. 2014 Covariate Balance PS model versus covariate sets (balance) Four sets of covariates - Full PS Model - True PS Model - Outcome PS model (Prognostic score) - Confounder PS model Main terms versus Main + Interaction/squares 9 Covariate Selection Description of the Different Propensity Score Models Covariate Selection Treatment effects estimates (risk ratios) were derived using Poisson models. PS model selection was made based on the balance achieved on different sets of covariates, their interaction/square terms. Covariate balance was assessed using the absolute standardized difference. Covariate sets in balance assessment were compared with respect to bias and precision of the treatment-outcome relation as well as the PS model selected. 11 Covariate Selection - Balance Table 1. Balance of Covariates Measured Using Standardized Difference When Different Sets of Covariates Were Included in the PS Model Data was unmatched* and matched using Full PS model (PS_2), True PS model (PS_18), Risk Factor PS model (PS_25), Confounder PS model (PS_11), Confounder PS model omitting confounder, X4 (PS_15), Confounder PS model omitting confounder, X4, but risk factors were included (PS_29), Confounder PS model omitting confounder, X4, but instrumental variables were included (PS_22), Confounder PS model omitting confounder, X4, but risk factors and instrumental variables were included (PS_8). 12 Covariate Selection - Balance Table 2. Balance of Covariates Measured Using Standardized Difference When Different Sets of Covariates Were Included in the PS Model Data was unmatched* and matched using Full PS model (PS_2), True PS model (PS_18), Risk Factor PS model (PS_25), Confounder PS model (PS_11), Confounder PS model omitting confounder, X4 (PS_15), Confounder PS model omitting confounder, X4, but risk factors were included (PS_29), Confounder PS model omitting confounder, X4, but instrumental variables were included (PS_22), Confounder PS model omitting confounder, X4, but risk factors and instrumental variables were included (PS_8). 13 Covariate Selection: balance Table 3. Balance of Covariates Measured Using Standardized Difference When Different Sets of Covariates Were Included in the PS Model Data was unmatched* and matched using Full PS model (PS_2), True PS model (PS_18), Risk Factor PS model (PS_25), Confounder PS model (PS_11), Confounder PS model omitting confounder, X4 (PS_15), Confounder PS model omitting confounder, X4, but risk factors were included (PS_29), Confounder PS model omitting confounder, X4, but instrumental variables were included (PS_22), Confounder PS model omitting confounder, X4, but risk factors and instrumental variables were included (PS_8). Table 4. Median (Interquartile Range, IQR) of Estimated Treatment Effect Using Different PS Models in Different Scenarios (True RR=1.75) Model All Covariates were Independent X4 and X5 were Correlated X4 and X7 were Correlated RR (IQR) RR (IQR) RR (IQR) Crude 0.93 0.15 0.89 0.17 0.91 0.14 Full PS Model 1.73 0.40 1.78 0.68 1.72 0.59 True PS Model 1.71 0.47 1.73 0.63 1.67 0.58 Outcome PS Model 1.71 0.39 1.77 0.48 1.72 0.51 Conf PS Model 1.77 0.40 1.79 0.56 1.71 0.50 Omit.Conf PS Model 1* 1.55 0.34 1.57 0.40 1.46 0.35 Omit.Conf PS Model 2** 1.54 0.32 1.59 0.39 1.45 0.34 Omit.Conf PS Model 3† 1.50 0.36 1.50 0.50 1.51 0.47 Omit.Conf PS Model 4†† 1.50 0.34 1.55 0.52 1.52 0.45 15 Covariate Selection PS matching improved balance of measured covariates included in the PS model It exacerbated the imbalance in the unmeasured covariate that was unrelated to measured covariates In choosing covariates for a PS model, the pattern of association among covariates has substantial impact on other covariates’ balance and the bias of the treatment effect estimate. Table 5. Median (IQR) Treatment Effect When Propensity Score Model was Chosen Based on Balance Calculations on Different Sets of Covariates (True RR =1.75) Calliper All Covariates Confounding Factors Confounding and Risk Factors Confounding and Treatment Related Factors 0.05 1.51 0.42 1.77 0.64 1.75 0.62 1.48 0.43 0.10 1.50 0.43 1.75 0.63 1.75 0.58 1.48 0.42 0.15 1.50 0.44 1.73 0.67 1.76 0.65 1.48 0.43 0.20 1.48 0.42 1.75 0.62 1.75 0.60 1.46 0.43 0.25 1.46 0.43 1.75 0.61 1.72 0.58 1.43 0.42 0.30 1.47 0.43 1.74 0.62 1.72 0.60 1.43 0.42 0.35 1.44 0.41 1.69 0.61 1.69 0.57 1.41 0.39 0.40 1.40 0.41 1.68 0.59 1.67 0.61 1.37 0.39 0.50 1.38 0.36 1.68 0.54 1.63 0.50 1.35 0.35 0.60 1.33 0.37 1.58 0.51 1.55 0.52 1.31 0.34 *Balance was assessed on main terms as well as interaction/square terms Table 6. Median (IQR) Treatment Effect When Propensity Score Model was Chosen Based on Balance Calculations on Different Sets of Covariates (True RR =1.75) Calliper All Covariates Confounding Factors Confounding and Risk Factors Confounding and Treatment Related Factors 0.05 1.73 0.59 1.74 0.47 1.73 0.51 1.77 0.67 0.10 1.76 0.64 1.78 0.52 1.75 0.54 1.75 0.62 0.15 1.72 0.68 1.75 0.50 1.73 0.54 1.75 0.63 0.20 1.74 0.60 1.74 0.50 1.74 0.45 1.74 0.61 0.25 1.70 0.65 1.74 0.52 1.71 0.50 1.72 0.61 0.30 1.71 0.59 1.72 0.53 1.72 0.53 1.71 0.58 0.35 1.67 0.56 1.70 0.48 1.68 0.47 1.68 0.58 0.40 0.50 1.67 1.60 0.60 0.53 1.70 1.65 0.49 0.49 1.68 1.62 0.48 0.48 1.66 1.61 0.56 0.55 0.60 1.54 0.48 1.65 0.45 1.64 0.46 1.53 0.50 *Balance was assessed only main terms not interaction/square terms Table 7. Median Number of Subjects Included When Propensity Score Model was Chosen Based on Balance Calculations on Different Sets of Covariates (True RR =1.75, n=2000) Confounding and Risk Factors Confounding and Treatment Related Factors Calliper All Covariates Confounding Factors 0.05 1388 1388 1388 1388 0.10 1390 1389 1389 1390 0.15 1391 1391 1391 1391 0.20 1393 1392 1392 1393 0.25 1394 1394 1394 1394 0.30 1397 1396 1396 1397 0.35 1397 1398 1398 1398 0.40 1401 1401 1401 1401 0.50 1408 1408 1406 1408 0.60 1415 1417 1415 1415 *Balance was assessed main and interaction/square terms Table 8. Median Number of Subject Included When Propensity Score Model was Chosen Based on Balance Calculations on Different Sets of Covariates (True RR =1.75, n=2000) Confounding and Risk Factors Confounding and Treatment Related Factors Calliper All Covariates Confounding Factors 0.05 1388 1481 1479 1388 0.10 1390 1481 1479 1389 0.15 1392 1479 1478 1391 0.20 1393 1479 1479 1392 0.25 1393 1479 1478 1393 0.30 1396 1480 1480 1396 0.35 1398 1482 1481 1397 0.40 1401 1484 1484 1401 0.50 1407 1488.5 1489 1407 0.60 1415 1496 1496 1415 *Balance was assessed only main terms not interaction/square terms PS Model Selection When PS model was chosen based on balance calculation on confounding variables, the confounder PS is often selected followed by the Risk factor PS models. When balance calculation involves all covariates or treatment related covariates, the full PS and true PS model are often selected compared to the confounder PS and Risk factor PS models. PS Model Selection In selecting PS model based on covariate balance, the choice of covariates, interaction/square terms in balance calculation has substantial impact on bias and precision of the treatment effect. PS model selection based on the balance achieved on confounding variables, risk factors and important interaction terms among confounders and risk factors is optimal approach. Balance Diagnostics Quantitative Falsification of Instrumental Variable Assumptions Instrumental Variable Methods three Basic IV assumption - Associated with Exposure - Independent of Confounders - Independent of outcome except through exposure Association of IV with observed confounders could be quantified with balance metrics M S Ali et al 2014 IV Methods and assumptions PS and time-varying treatment In the presence of treatment switching or channeling - “Intension to treat analysis”? Time-varying Cox model or Time-varying propensity score or Inverse probability weighting or others Time-varying treatment Example Associations Between Current SSRI Use and the Risk of Hip Fracture Study using two EU databases (Mondriaan and BIFAP) A cohort of patients with a first prescription for Antidepressant (SSRI or tricyclic AD, TCA) period 2001-2009 Data sources (GP databases) : - the Dutch Mondriaan - the Spanish BIFAP M S Ali et al 2015 Time-varying treatment Cox models with time-varying coefficients to control for time-varying nature of covariates SSRIt-1 Benzot-1 SSRIt Benzot HF Time-varying treatment When time-varying confounders are themselves affected by the previous treatment (SSRI use at t-1), conventional time-varying Cox model gives biased estimate Adjusting-away part of treatment effect? SSRIt-1 Benzot-1 SSRIt Benzot HF Time-varying treatment In the presence of unmeasured common causes of confounders and outcome (U): collider-stratification bias? SSRIt-1 Benzot-1 SSRIt HF Benzot U Time - varying treatment Table 9. Associations Between SSRI Use and the Risk of Hip Fracture Using Time-Varying Cox Models Mondriaan Adjusted for BIFAP HR 95% CI HR 95% CI Crude 1.75 1.12, 2.72 2.09 1.89, 2.32 Gender 1.73 1.10, 2.69 2.07 1.87, 2.30 Gender +Age 2.36 1.51, 3.68 1.51 1.37, 1.68 Gender +Age + TCAt 2.59 1.63, 4.12 1.56 1.40, 1.73 Gender +Age + TCAt + Benzot 2.60 1.63, 4.16 1.54 1.38, 1.71 All Confounders* 2.62 1.63, 4.19 1.52 1.37, 1.69 Time-varying treatment Table 10. Associations Between SSRI Use and the Risk of Hip Fracture Using Propensity Score Based Cox Analyses Mondriaan Adjusted for HR 95% CI HR 95% CI 1.75 1.12, 2.72 2.09 1.89, 1.71 Quintiles 2.64 1.63, 4.25 1.54 1.39, 1.71 Deciles 2.72 1.63, 4.54 1.53 1.38, 1.70 2.82 1.63, 4.25 1.61 1.45, 1.78 Crude PS* Stratification PS* Adjustment BIFAP Time-varying treatment Table 11. Associations Between SSRI Use and the Risk of Hip Fracture Using Propensity Score Based Cox Analyses Mondriaan Adjusted for Without Accounting for Censoring* Accounting for Censoring** BIFAP HR 95% CI HR 95% CI Crude 1.69 1.05, 2.67 2.14 1.91, 2.39 Gender 1.68 1.06, 2.67 2.11 1.89, 2.38 Gender +Age 2.46 1.55, 3.99 1.54 1.37, 1.72 Crude 1.73 1.08, 2.77 2.05 1.83, 2.30 Gender 1.71 1.07, 2.74 2.03 1.81, 2.28 Gender +Age 2.47 1.53, 3.98 1.51 1.35, 1.70 * Only inverse probability of treatment weights were used ** Combined inverse probability of treatment and censoring weights were used + Trimming at 1% and 99% Time-varying treatment Differences between the various methods to adjust for time-dependent confounding (i.e., time-varying Cox and PS as well as inverse probability weighting of marginal structural models) were small. The observed differences in treatment effects estimates between the datasets are likely attributable to different confounding information in the datasets. Adequate information on (time-varying) confounding. Conduct and Reporting PS Analysis: Literature review 388 articles PubMed search: 92 Articles were excluded: 296 Articles available for review 63 20 6 2 1 Non clinical Methodological Other language Systematic reviews Editorials/letters Drug-related intervention: 108 (36.5%) Clinical* 50 (16.9%) Surgical intervention: 138 (46.6%) Figure 1. Flow chart of abstracts or articles extraction for the systematic review M S Ali et al 2015 Reporting PS Analysis PS matching is the commonly used approach. Balance is often checked with PS matching followed by inverse probability weighting P-values are the most commonly used tests. Absolute standardized difference was used only in 25% of the studies in which balance was assessed. M S Ali et al 2015 Reporting PS Analysis Transparency in conducting the analysis What needs to be reported?? Guidelines: STROBE or ENCePP M S Ali et al 2015 Variable Selection for PS Model - Method used for variable selection - Whether empirical knowledge was considered - Whether variable’s association with treatment and/or outcome Propensity Score Estimation - Method used to estimate the PS, e.g. logistic regression Variables and/or interaction terms included in the PS PS Matching - Matching algorithm, caliper width, and matching ratio used. - Whether matching was with replacement. and matched nature of the data accounted in the analysis. - Number of patients at the start and after matching. - Number and characteristics of excluded patients (versus matched ones) - The distribution of baseline characteristics between treated and control patients in the matched and starting population. -Balance of Covariates between matched groups Applying the PS methods ♠ - Type of PS method employed. PS Adjustment - Overlaps of the PS distribution between treatment groups. * Whether linear relationship was checked between outcome and the PS. Balance Assessment - - Balance measure used. Quantifying the balance and whether imbalance on covariates was detected after the final PS model. Treatment Effect Estimation - Statistical method used. - Whether additional adjustment was made for covariates. - Whether sensitivity analysis was performed. Interpretation of Effect estimate - The interpretation of the effect estimates in relation to the research question, target population & type of PS method used (ATE, ATT) PS Stratification - The quantile used for stratification. - The overlap of PS with quantiles of PS using plots or balance measures. - Balance of covariates with in quintiles of the PS. PS Weighting - The range (mean, max, min) of unstabilzed and stabilized weights. - Variables included in the PS models for - both numerator and denominator of the weights. Whether weights were truncated and method used. Balance of Covariates in the weighted sample groups. M Sanni Ali 2015 Limitations of PS Unmeasured Confounding Treatment Modification/Interaction Multilevel Treatment Modeling Dynamic Prescription Patterns/rare Exposure Pooling Data on PS? Treatment guidelines Alternatives: Disease Risk Score? Acknowledgement Olaf H. Klungel, PharmD, PhD1,2 Anthonius de Boer, MD, PhD1 Rolf HH Groenwold, MD, PhD1,2 Arno W. Hoes, MD, PhD2 Svetlana V. Belitser, MSc1 Kit C.B. Roes, PhD2 1 Division of Pharmacoepidemiology and Clinical Pharmacology, Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht. 2 Julius Centre for Health Science and Primary Care, University Medical Centre Utrecht, Utrecht. Thank You!
© Copyright 2026 Paperzz