Appendix: Intuitive ideas behind essential heterogeneity, marginal treatment effects and personcentered treatment (PeT) effects: To provide the intuition behind these concepts, we start with a stylized example. It is widely known that the effectiveness of surgery is heterogeneous across men with prostate cancer. Let us assume that surgery is more effective for younger men and for men with high baseline PSA values. Moreover, choice of treatment (surgery or watchful waiting) is influenced by age and baseline PSA of the patient. However, if one obtains a sample of men diagnosed with prostate cancer from SEER-Medicare, “PSA values” will not be measured. Suppose the available dataset contains for each patient: age, treatment, whether the patient was cured, and a characteristic that all agree is a powerful and valid instrument. Let the instrument be a continuous variable (e.g. distance to hospital). Greater the distance to the hospital, less likely is the use of surgery. In this case, a traditional naïve regression would produce biased estimate of the average treatment effect (ATE) and also the conditional average treatment effects (CATEs) for old and young groups due to the endogeneity of the treatment status caused by the missingness of PSA values in the analysis. A traditional instrumental variable analysis, using a strong and valid IV, will also produce biased estimates for ATE and CATEs due to the presence of essential heterogeneity, which suggests that the treatment effects vary over unobserved confounders, which in this case are the PSA values. A local instrumental variable (LIV) approach can be used to overcome these issues when a continuous instrument is available. LIV methods are used to estimate the marginal treatment effects (MTEs) parameters. MTEs are the effects for individuals for whom the influence of the observed characteristics (old age and distance to hospital) balance with the influence of the unobserved confounders (PSA level) on treatment choice such that they are indifferent to choosing between using surgery or watchful waiting. To estimate an MTE, LIV methods compare the outcomes of two groups of, say, young patients, where one group is staying at a distance d from the hospital and the other at a distance d+ε, ε representing an epsilon (very small) change in distance. These two group of patients should be identical with respect to the distribution of their risk factors (observed and unobserved to the analysts). This is because if distance is a valid instrumental variable, then by definition, it is independent of all risk factors affecting outcomes. Treatment choices are affected differentially between these two groups only through the costs of traveling the extra distance. Therefore any difference between the average outcomes between these two groups must be driven by the fact that some people in these groups, who are identical in their risk factor distributions, are changing treatment choices brought about by the difference in distance. However, since the difference in distance is very small, the effect can be attributed to a margin of patients who were indifferent between two treatment options and were moved to change treatment choice by the small perturbation of the instrumental variable, i.e., distance. For this margin of patients, we can quantify a normalized level of unobserved confounders as they must balance the observed levels. Here, normalized means a scalar score that represents a balancing score for unobserved risk factors, irrespective of their empirical distributions. Technically, equation A3 below shows that such a scalar score is always distributed Uniform(0,1). The conceptualization of this scalar score is in the spirit of constructing a scalar propensity score with many observed risk factors with varying empirical distributions. Similarly, for another dyad of distances, d’ and d’+ ε, one can estimate another MTE, which reflects the causal treatment effect of patients at another level of unobserved confounder. In this way, a full schedule of MTEs can be estimated that vary over the unobserved confounder levels (i.e. PSA values) given any level of the observed confounder (i.e. age). Intuitively, LIV methods estimates these MTEs by first estimating a control function, which models how the observed outcome varying over observed risk factors, IV-dependent estimated propensity to choose treatment, interactions between them and non-linear polynomials of the propensity score. The partial derivate of the outcome, as characterized by the control function, with respect to the IVdependent propensity score (reflecting epsilon changes) estimates the marginal treatment effect evaluated at specific values of the scalar unobserved risk factor levels. Once MTEs are estimated over the range of observed and unobserved levels, they can then be easily aggregated to form meaningful treatment effect parameters such as the ATE, CATEs, TT and TUT and also study heterogeneity in effects using person-centered treatment (PeT) effects. The PeT effect for a patient is this stylized sample is conditioned not only on that individual’s age but his age-specific MTEs are also averaged over a distribution of PSA levels that conforms with the individual’s observed choice of surgery or watchful waiting. Thus these are deemed to be personalized effect for this patient. Formal models behind essential heterogeneity, marginal treatment effects and personcentered treatment (PeT) effects: We start by formally developing structural models of outcomes and treatment choice. Error! Bookmark not defined.,Error! Bookmark not defined. For the sake of simplicity we will restrict our discussion to two treatment states – the surgery receipt (treated) state denoted by j = 1 and the WW receipt (untreated) state denoted by j = 0. The corresponding potential individual outcomes in these two states are denoted by Y1 and Y0. We assume, (A1) Y1 1( XO , XU ,) and Y0 0 ( XO , XU , ) , where X0 is a vector of observed random variables, XU is a vector of unobserved random variables which are also believed to influence treatment selection (they are the unobserved confounders) and is an unobserved random variable that capture all remaining unobserved random variables. ( XO , XU ) and XO XU where denotes statistical independence. We assume individual choose to be in state 1 or 0 (prior to the realization of the outcome of interest) according to the following equation: (A2) D = 1 if D ( X0, Z) UD 0 , where Z is a (non-degenerate) vector of observed random variables (instruments) influencing the decision equation but not the potential outcome equations, D is an unknown functions of X0 and Z, and UD is a random variable that captures XU and all remaining unobserved random variables influencing choice. By definition, UD , which also defines the distinction between XU and in (A1). Equations (A1) and (A2) represent the nonparametric models that conform to the Imbens and Angrist’s independence and monotonicity assumptionsError! Bookmark not defined. needed to interpret instrumental variable estimates in a model of heterogeneous returns.Error! Bookmark not defined. As in Heckman and Vytlacil,Error! Bookmark not defined.,Error! Bookmark not defined.,Error! Bookmark not defined. we can rewrite (A2) as (A3) D = 1 if P( x0 , Z ) V , where V = FUD |XO ,Z UD | X O , Z , P(x0,Z) = FUD |XO ,Z D ( xO , Z ) | XO , Z and F represents a cumulative distribution function. Therefore, for any arbitrary distribution of UD conditional on XO and Z, by definition, V ~ Unif[0, 1] conditional on XO and Z. Under regular IV assumptions, Heckman and Vytlacil show that Marginal Treatment EffectsError! Bookmark not defined. can be identified by (A4) E (Y | x0 , Z ) E ((Y1 Y0 ) | X O ,V v ) MTE ( xO ,v ) , p where Y = D*Y1 + (1 - D)*Y0 is the observed outcomes. A MTE is perhaps the most nuanced estimable effect. It identifies an effect for an individual who is at the margin of choice such that one’s levels of XO and Z are just balanced by one’s level of V (which includes XU), i.e. P( x0 , Z ) V . Basu extends the LIV methods to identify PeT effects, which, for persons who choose treatment,Error! Bookmark not defined. follow (A5) EXu |XO ,P ( Z ),DE (Y1 Y0 | x0 , P(z), D 1) = E(Y1 Y0 | x0,V P(z)) P (z)1 0 P(z) = MTE ( xO ,v )dv . Similarly, conditional effect for a person who did not choose treatment is obtained by integrating MTEs over values of V greater than p. Conceptually, a PeT effect is also a weighted version of MTEs. For any given individual, the PeT effect identifies the specific margins where that individual may belong given its individual values of XO, P(Z) and D. It then averages the MTEs over those margins, but not all as in CATE. Therefore, a PeT effect is basically the X-Z-conditional Effect on the Treated for persons undergoing treatment and is the X-Z-conditional Effect on the Untreated for persons not undergoing treatment. Further details can be found in Basu (2014).Error! Bookmark not defined. PeT effects on survival outcomes in the presence of administrative censoring: Let St represents the cumulative probability of survival up to time period t. Therefore, MTE(x,u) in the cumulative survival scale is given as dS / dPˆ . Let the discrete time periods be t pˆ ( x ,z ) u denoted by k = 1,2..t. Then, (A6) St kt 1 (1 hk ) , where hk is the probability of death in time period k conditional on surviving k-1 periods. When both sides are logged, we have: ln St kt 1 ln(1 hk ) . It follows then, (A7) dSt / dPˆ St kt 1 dhk / dP (1 hk ) . An estimator for the Marginal Treatment Effect on cumulative survival is obtained as: (A8) dSt / dPˆ Sˆt kt 1 dhˆk / dP (1 hˆk ) , where indicates that the quantities were estimated from the data at hand. With the formulation of the discrete hazard model given as (A9) logit(hk) = g( Pˆ , X, k), all of the components Sˆt , hˆk ,and dhˆk / dP can be readily computed to form an estimator for the Marginal Treatment Effect on cumulative survival as shown in (A8). PeT estimates are then derived based on conditional aggregation of the MTE estimates as shown in (A5). Table A1: Exclusions table Criteria Substantive Exclusions: Patients on the PEDSF file (1995-2005 diagnosis) Site1 is not 619 First diagnosis is after 2002 Not eligible to both Part A & Part B or eligible for HMO. Age >=66 & <=85 Restrict to first diagnosis of cancer (multsite=”1”) No. of primary side on first diagnosis =1 (numprims=”01”) Restrict to clinically localized stage (tstage in (“1”, “2”)) Medicare status code (med_stcd=”10”) ESRD (chr_esrd==”0”) Seq1 (seq1=”00”) Empirical exclusions Choose surgery with radiation therapy Did not chose surgery or watchful waiting Unknown death flag (dod_flg in (0”, “1”)) Must survive at least 6 months from diagnosis Unknown grade Unknown race Unknown zip-code level characteristics Less than two PSA test in year following diagnosis Drop if diagnosis year =1995 HRRs do not have enough patients to calculate IV rates Age restriction, keep 66-79 year olds, mirror PIVOT trial Excluded Total 220,049 15,028 1,557 115,127 5,570 32 8,470 11,949 183 33 1,893 205,021 203,464 1,413 32,781 23 350 69 255 744 7,201 1,974 2,415 1,946 58,794 26,013 25,990 25,640 25,571 25,316 24,572 17,371 15,397 12,982 11,036 88,337 82,767 82,735 74,265 62,316 62,133 62,100 60,207
© Copyright 2026 Paperzz