Incidence Densities in a Competing Events Analysis

American Journal of Epidemiology
ª The Author 2010. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of
Public Health. All rights reserved. For permissions, please e-mail: [email protected].
Vol. 172, No. 9
DOI: 10.1093/aje/kwq246
Advance Access publication:
September 3, 2010
Practice of Epidemiology
Incidence Densities in a Competing Events Analysis
Nadine Grambauer*, Martin Schumacher, Markus Dettenkofer, and Jan Beyersmann
* Correspondence to Nadine Grambauer, Department of Medical Biometry and Statistics, University Medical Center Freiburg,
Stefan-Meier-Str. 26, 79104 Freiburg, Germany (e-mail: [email protected]).
Initially submitted January 19, 2010; accepted for publication June 30, 2010.
Epidemiologists often study the incidence density (ID; also known as incidence rate), which is the number of
observed events divided by population-time at risk. Its computational simplicity makes it attractive in applications,
but a common concern is that the ID is misleading if the underlying hazard is not constant in time. Another difficulty
arises if competing events are present, which seems to have attracted less attention in the literature. However,
there are situations in which the presence of competing events obscures the analysis more than nonconstant
hazards do. The authors illustrate such a situation using data on infectious complications in patients receiving stem
cell transplants, showing that a certain transplant type reduces the infection ID but eventually increases the
cumulative infection probability because of its effect on the competing event. The authors investigate the extent
to which IDs allow for a reasonable analysis of competing events. They suggest a simple multistate-type graphic
based on IDs, which immediately displays the competing event situation. The authors also suggest a more formal
summary analysis in terms of a best approximating effect on the cumulative event probability, considering another
data example of US women infected with human immunodeficiency virus. Competing events and even more
complex event patterns may be adequately addressed with the suggested methodology.
competing risks; event-specific hazard; incidence rate; model misspecification; proportional hazards; survival
analysis
Abbreviations: AIDS, acquired immunodeficiency syndrome; CIF, cumulative incidence function; HAART, highly active antiretroviral therapy; HIV, human immunodeficiency virus; ID, incidence density.
Incidence density (ID; also known as incidence rate), the
number of observed events divided by the population-time
at risk, is often calculated in applications. Examples include
studies of cardiovascular diseases (1, 2), human immunodeficiency virus (HIV) infection (3), and hospital-acquired
infection (4–6). IDs are attractive because they are computationally simple and allow for right-censoring as well as
left-truncation (7). Right-censoring is present if, for some
individuals, only the minimum time until the event is
known. For example, a patient surviving after a study has
been closed has a censored survival time. Left-truncation
occurs when individuals are not followed from time zero
but from some time afterward. For example, event times
are often left-truncated if the time scale of interest is age.
Under a constant hazards assumption, IDs are the standard
maximum likelihood estimators. However, investigators
such as Kraemer (8) recently emphasized that the validity
of ID analyses might be compromised if the underlying
hazards are not constant.
Another difficulty occurs if competing risks (or competing
events—both terms are used synonymously in the following)
are present (7, 9). In this case, when performing an ID analysis, the competing risks should be accounted for separately,
meaning that event-specific IDs should be calculated, which
is rarely found in the literature (refer, for example, to Glynn
and Rosner (2) and Dignam and Kocherginsky (10)). Lau
et al. (11) recently pointed out the relevance of competing
risks in epidemiologic studies. The authors introduced and
compared the standard semiparametric regression models for
competing risks. They illustrated the difficulties with pertinent risk set plots that display different estimation techniques.
Similar risk set plots have also been studied by Latouche (12).
1077
Am J Epidemiol 2010;172:1077–1084
1078 Grambauer et al.
Lau et al. concluded that some modeling approaches are
better suited to investigating the etiology of a disease,
whereas others provide a better synthesis in terms of an individual’s outcome.
In this paper, we profit from the computational simplicity
of IDs in that we are able to give a simple graphic illustration of the estimation results. Our graphic both displays
results in terms of the etiology of a disease and lends itself
to a synthesis interpretation. The graphic is useful in understanding how the expected proportion of the respective
competing events over the course of time evolves—even if
the constant hazards assumption fails.
As Lau et al. (11) point out, the course of this so-called
cumulative incidence function (CIF) is often difficult to predict from standard analyses. For instance, quite differential
effects on the CIF for the event of interest can be observed in
the ONKO-KISS study (13, 14), our first application example. ONKO-KISS considers patients with hematologic malignancies who have undergone peripheral blood stem cell
transplantation. After transplantation, patients enter a critical
condition called ‘‘neutropenia,’’ characterized by a low
number of white blood cells, which primarily avert infections. Bloodstream infection during neutropenia is a severe
complication. An event-specific ID analysis considers the
time until ‘‘bloodstream infection during neutropenia’’ or
‘‘end of neutropenia (without prior bloodstream infection),’’
whichever comes first, and the type of that first event.
Results are in line with the semiparametric analyses of
Beyersmann et al. (15), who found similar effects of female
gender and allogeneic transplants on the ID for bloodstream
infection but observed quite differential effects on the CIF
for the event of interest because of different effects on the
competing IDs.
To further cope with potentially difficult interpretation,
we develop in this paper a more formal summary analysis
of the different event-specific effects of one risk factor
based on IDs, which is similar to that given by Lau et al.
(11). To illustrate this analysis, we perform a second
competing risks analysis, considering data from the
Women’s Interagency HIV Study on HIV-infected US
women (refer, for example, to Lau et al. and Barkan
et al. (16)). Here, we find the parametric approach based
on IDs yielding competitive results compared with the
more complex parametric mixture model approach performed by Lau et al., which results in close to nonparametric ID-based CIF estimates.
This paper is organized as follows: The next section introduces competing risks and expands on the parametric
approach using event-specific IDs that allow one to perform
further inference easily. Consequently, the methodology is
illustrated in detail by means of the ONKO-KISS study;
results from ID analyses are connected to interpretation of
the estimated CIFs. A second data example from the
Women’s Interagency HIV Study is considered next to illustrate the potential of obtaining a summary analysis. Finally, we close with a discussion.
To make the proposed methodology easily applicable, we
provide corresponding functions for the software environment for statistical computing R (17) and a worked example
in the supplementary Web material (this information is
posted on the Journal’s Web site (http://aje.oxfordjournals.
org/)).
COMPETING RISKS AND STATISTICAL INFERENCE
Event-specific IDs and CIF
Competing risks model time to first event and event type
(18), whereas standard survival analysis considers time to
a single, usually combined, endpoint only. Inference for
competing risks can be based completely on the eventspecific hazard rates ah(t), where ah(t)dt ¼ P(T 2 dt,
event ¼ h j T t), T is the time until any first event, and
dt refers to the probability of experiencing event h in the
small time interval [t, t þ dt). These rates are similar to the
standard hazard rates but are marked with the respective
event-type h. For ease of presentation, we assume 2 competing risks, h 2 {1, 2}, but the findings can be transferred to
any number of competing risks. This approach does not
make any independent competing risks assumption, which
has been subject to pointed critique (19–21). Note, however,
that one would need to make an assumption about whether
and how one event-specific hazard changes if the aim were
inference for outcome probabilities when the other eventspecific hazard is modified or even supposed to vanish (20).
In assuming the event-specific hazards to be constant, it is
straightforward to estimate them by calculating the corresponding event-specific IDs:
number of type-h-events
:
âh ¼ P
patient-time at risk
ð1Þ
However, although parametric survival analysis provides
a powerful tool (refer, for example, to Cox et al. (22)), the
constant hazards assumption may seldom apply well
to medical research data (8). To determine whether the constant hazards assumption may still be appropriate for the
particular data, one may consider
the cumulative eventRt
specific hazards Ah ðtÞ ¼ 0 ah ðuÞdu by comparing its
parametric plug-in estimates with the corresponding nonparametric Nelson-Aalen estimates (18). The latter are defined as
Âh ðtÞ ¼
X
st
number of type-h-events at s
;
number of patients at risk before s
ð2Þ
where the sum is calculated over all observed event times s
of any type, s t.
Another important quantity in a competing risks setting is
the CIF for the particular event type h:
Fh ðtÞ ¼ RPðT t; event ¼ hÞ
ð3Þ
t
¼ 0 PðT > u Þ3ah ðuÞdu;
Ru
where PðT > uÞ ¼ expð 0 ða1 ðvÞ þ a2 ðvÞÞdvÞ is the survival probability, indicating that the CIF depends on all
event-specific hazards. For example, if interest focuses on
a certain event of interest 1, this is the expected proportion
of type 1 events over the course of time.
If constant event-specific hazards are assumed, ah(t) ¼ ah,
the CIF of event type h is explicitly given as
Am J Epidemiol 2010;172:1077–1084
Incidence Densities in a Competing Events Analysis
Fh ðtÞ ¼
Rt
0 expð
ða1 þ a2 ÞuÞ3ah du
ah R t
expð ða1 þ a2 ÞuÞdu
a1 þ a2 0
ah
ð1 expð ða1 þ a2 ÞtÞÞ
¼
a1 þ a2
¼
ð4Þ
By plugging in both event-specific IDs, the CIF (equation 4)
can be estimated parametrically under a constant hazards
assumption.
Alternatively, one may estimate the CIF (equation 3) nonparametrically via the Aalen-Johansen estimator (18),
which generalizes the Kaplan-Meier estimator to the case
of more than one outcome state. For example, for 2 event
types, it is F̂1 ðtÞ þ F̂2 ðtÞ ¼ 1 Kaplan-MeierðtÞ ¼
P
P̂ðT > s Þ3DÂ:ðsÞ, where the sum is calculated over
st
all observed event times s of any type, s t, and DÂ:ðsÞ is
the increment of the Nelson-Aalen estimator (equation 2)
for the cumulative all-event hazard A.(s) = A1(s) þ A2(s).
Following this, the Aalen-Johansen estimate for the CIF
of event type h is
X
P̂ðT > s Þ3DÂh ðsÞ:
F̂h ðtÞ ¼
st
In the following section, we take the nonparametric CIF
estimate as a reference to judge the performance of the
parametric, ID-based CIF estimator, for example, by checking whether it is within the corresponding nonparametric
95% confidence intervals.
Regression analysis
To study potential risk factor effects, a standard approach
would be to use Cox proportional hazards models (refer, for
example, to Lau et al. (11), Kalbfleisch and Prentice (19),
and Cox (23)). In a competing events situation, Cox models
for all event-specific hazards have to be considered. In
addition, Poisson regression models may be used in a
person-time setting, which will typically provide maximum
likelihood estimates that closely agree with those of the Cox
model if the baseline event-specific hazards are assumed to
be piecewise constant (9, 24, 25).
However, even from a constant event-specific hazards (or
ID) analysis with, for example, a binary risk factor, the
course of the CIF may already be difficult to predict because
it is an involved function of both event-specific hazards. To
this end, for example, Lau et al. (11), Beyersmann et al.
(15), and Fine and Gray (26) discuss assessing a potential
covariate effect on the subdistribution hazard k(t), which is
implicitly defined
by a 1-to-1 relation with the CIF, Fh ðtÞ ¼
Rt
1 expð 0 kðuÞduÞ. This assessment may be performed
by using a Cox model for the subdistribution hazard (26).
However, if proportionality holds for the event-specific hazards, the proportional subdistribution hazards model will
generally be misspecified, since the latter hazard will be
time dependent (27) even if the former hazards are assumed
to be constant.
Am J Epidemiol 2010;172:1077–1084
1079
Still, the subdistribution analysis offers a summary analysis in that it estimates the least false parameter, a timeaveraged effect on the cumulative event probabilities
(27–30). The parameter is called least false in the sense that
it gives the best approximation within the misspecified model
class (here, proportional subdistribution hazards model) of
the true model (here, proportional event-specific hazards
models) in terms of the Kullback-Leibler distance (refer to
Claeskens and Hjort (29, chapter 2.2) for a formal definition
and discussion). As a solution to an explicit formula, this
time-averaged effect can be easily calculated numerically
for a given time interval (31). For further details, refer to
the Appendix and the supplementary Web material (R code).
APPLICATION TO BLOODSTREAM INFECTION DATA
Study population
The German multicenter ONKO-KISS surveillance study
population consisted of 1,699 patients undergoing peripheral blood stem cell transplantation for hematologic malignancies, observed during January 2000 and June 2004 (14).
After transplantation, patients become neutropenic. One
purpose of the ONKO-KISS study was to investigate the
impact of risk factors for the occurrence of bloodstream
infection during neutropenia. Among other factors, the binary risk factor transplant type (either allogeneic or autologous) was recorded. A total of 913 (56.5%) of 1,616 patients
received allogeneic transplants, of whom 193 (21.1%) acquired bloodstream infection; of the patients with an autologous transplant, 126 (17.9%) acquired bloodstream
infection. Because patients may also leave the critical phase
without a prior bloodstream infection, either alive or dead,
this constitutes a competing risks setting. A total of 319
(19.7%) events were observed for bloodstream infection,
and 1,280 (79.2%) events were observed for ‘‘end of neutropenia without prior bloodstream infection (alive or
dead)’’; only 17 (1.1%) patients were censored. Follow-up
terminated with the end of neutropenia. Only 20 (1.2%) of
the patients died without having a prior bloodstream infection. To keep things as simple as possible, we do not consider them separately in this paper.
In the following section, we perform a competing risks
analysis with the primary endpoints of bloodstream infection and end of neutropenia. Following Beyersmann et al.
(15), we concentrate on the binary risk factor transplant type
(Z ¼ 1 for allogeneic, Z ¼ 0 for autologous) because it
provides an interpretationally challenging example.
Statistical analysis
The complete competing risks situation is best visualized
and explained by Figure 1, a multistate-type graphic, which
shows the competing risks situation for both allogeneic (left
panel) and autologous (right panel) transplant types; the
arrows, which are thicker for higher values, represent
the particular event-specific IDs according to the results of
the analyses given in Table 1. As evident from Figure 1,
allogeneic transplant has a reducing effect on both becoming infected and leaving the critical phase, but the reduction
1080 Grambauer et al.
Allogeneic
α^ 1 = 0.014
Autologous
BSI
α^ 1 = 0 . 0 2
BSI
α^ 2 = 0.09
End of
Neutropenia
Neutropenia
Neutropenia
α^ 2 = 0.051
End of
Neutropenia
Figure 1. Multistate-type graphic illustrating the competing-risks ONKO-KISS study data analysis by means of event-specific incidence densities.
The arrow thickness describes the particular amount of every incidence density. BSI, bloodstream infection.
is fairly different for the different event types. Compared
with the bloodstream-infection ID, the competing end-ofneutropenia ID is much more pronounced, especially for the
autologous groups. Receiving allogeneic transplants reduces the end-of-neutropenia ID more than it reduces the
bloodstream-infection ID (the exact amount of reduction is
given by ID ratio analysis in Table 1).
In addition to the instantaneous risk of experiencing one
of the competing events, embodied by the IDs, interest also
focuses on the expected proportion of the particular events
along time, that is, the CIF. This quantity may be estimated
parametrically by plugging in the event-specific IDs
(equation 4).
Applied to the ONKO-KISS data, these estimates are
given for the event of interest, bloodstream infection, by
the thick continuous lines in panel A of Figure 2, stratified
by transplant type. To judge the performance of the estimates, nonparametric Aalen-Johansen estimates for bloodstream infection (thick step curves) and corresponding
log-log transformed 95% confidence intervals (thin step
curves) are also depicted. Irrespective of the estimation
method, the estimated CIFs for bloodstream infection indicate that patients with allogeneic transplants initially ap-
pear to be at a lower risk of bloodstream infection; the CIFs
then cross, and eventually there are more patients with allogeneic transplants who acquire bloodstream infection than
there are patients with autologous transplants who
acquire bloodstream infection (15). Both approaches result
in the same plateau. Crossing of the CIFs results from the
higher reduction in allogeneic transplants on the end-ofneutropenia hazard, even amplified by the fact that the baseline end-of-neutropenia hazard is much more pronounced
compared with the bloodstream-infection hazard (also refer
to our proposed multistate-type graphic, Figure 1).
However, the ID-based CIFs may be considered slightly
misspecified because they are not fully contained in the 95%
confidence intervals and therefore do not fully capture the
temporal dynamics. We find the reason for the differing time
dynamic to be the strong time-dependent course of the endof-neutropenia hazard (refer to the Nelson-Aalen estimates
in panel B of Figure 2), so the IDs might not offer an appropriate fit. Furthermore, according to the goodness-of-fit
plot in panel C of Figure 2, the end-of-neutropenia hazards
do not appear to be proportional; this curve should be an
approximately straight line with intercept zero under a proportional hazards model. However, even if the constant
Table 1. Event-specific Incidence Densities and Ratios Among Patients in the ONKO-KISS
Study, 2000–2004, Germany
Allogeneic
Autologous
Estimate
95% CI
Bloodstream infection
0.68
0.61, 0.76
End of neutropenia
0.55
0.52, 0.58
Estimate
95% CI
Estimate
95% CI
Bloodstream infection
0.014
0.012, 0.015
0.020
0.018, 0.023
End of neutropenia
0.051
0.048, 0.054
0.090
0.087, 0.097
Event-specific
incidence densities
Event-specific
incidence density
ratios (allogeneic
vs. autologous)
Abbreviation: CI, confidence interval.
Am J Epidemiol 2010;172:1077–1084
Incidence Densities in a Competing Events Analysis
B)
Nelson−Aalen Estimates
Estimated CIF for BSI
C)
6
0.25
0.20
0.15
0.10
0.05
0.00
5
EndNP
4
3
2
1
BSI
0
0
20
40
60
80
0
20
Days
40
60
Nelson−Aalen Estimate for
Allogeneic Transplant
A)
1081
6
5
EndNP
4
3
2
BSI
1
0
0
80
Days
1
2
3
4
5
6
Nelson−Aalen Estimate for
Autologous Transplant
Figure 2. A) Nonparametrically (thick step curves) and parametrically (thick lines) estimated cumulative incidence functions (CIFs) for bloodstream infection (BSI) according to allogeneic (dashed lines) and autologous (solid lines) transplant type, and nonparametric 95% confidence
intervals (thin lines, corresponding to transplant type). B) Estimated cumulative event-specific hazards for BSI (lower 2 curves) and end of
neutropenia (EndNP; upper 2 curves) for each transplant type. C) Goodness-of-fit plot for BSI and EndNP.
hazards assumption fails here, the event-specific IDs are still
useful because they offer good qualitative insights into the
competing risks situation.
In the following section, we consider another data example (provided in the Web material by Lau et al. (11)), in
which an ID analysis even offers good quantitative insight
in terms of a time-averaged effect on the CIF.
APPLICATION TO HIV DATA
death, or administrative censoring (September 28, 2006).
Among recorded covariates were history of injection drug
use at enrollment, age, CD4 nadir, and race. Following Lau
et al. (11), we considered the study sample of 1,164 women
who were alive, infected with HIV, and free of clinical AIDS
on December 6, 1995 (baseline), the day on which the first
protease inhibitor was approved by the US Federal Drug
Administration. The following statistical analysis concentrates on the effect of injection drug use on time until initiation of HAART without prior occurrence of clinical AIDS.
Study population
Statistical analysis
The multicenter Women’s Interagency HIV Study population consists of 2,058 HIV-positive and 567 HIV-negative
US women enrolled between October 1994 and November
1995. The aim of the study was to assess the impact of HIV
infection on US women. Details are given elsewhere (11,
16). Women were followed until the first of the following
events: initiation of highly active antiretroviral therapy
(HAART), acquired immunodeficiency syndrome (AIDS)/
From the multistate-type graphic (Figure 3, R code given
in the Web material), it is evident that history of injection
drug use reduces the HAART ID (ID ratio ¼ 0.744, 95%
confidence interval: 0.688, 0.800) but has a strong increasing effect on the AIDS/death ID (ID ratio ¼ 1.868, 95%
confidence interval: 1.675, 2.061). Especially here, the baseline HAART ID of interest is much more pronounced
Non-IDU
IDU
^ = 0.16
α
1
HAART
^ 1 = 0.22
α
HAART
^ 2 = 0.08
α
AIDS/Death
HIV
HIV
^ = 0.15
α
2
AIDS/Death
Figure 3. Multistate-type graphic illustrating the competing-risks Women’s Interagency HIV Study data analysis by means of event-specific
incidence densities. The arrow thickness describes the particular amount of every incidence density. AIDS, acquired immunodeficiency syndrome;
HAART, highly active antiretroviral therapy; HIV, human immunodeficiency virus; IDU, injection drug use. (This graphic is part of the worked
example outlined in the supplementary material posted on the Journal ’s Web site (http://aje.oupjournals.org/).)
Am J Epidemiol 2010;172:1077–1084
1082 Grambauer et al.
B)
0.6
0.4
0.2
0.0
0
2
4
6
8
10 12
C)
2.5
2.5
2.0
1.5
HAART
1.0
AIDS/Death
0.5
0.0
Nelson−Aalen Estimate
for IDU
0.8
Nelson−Aalen Estimates
Estimated CIF for HAART
A)
2.0
HAART
1.5
AIDS/Death
1.0
0.5
0.0
0
2
Years
4
6
8
Years
10 12
0.0
0.5
1.0
1.5
2.0
2.5
Nelson−Aalen Estimate
for Non−IDU
Figure 4. A) Nonparametrically (thick step curves) and parametrically (thick lines) estimated cumulative incidence functions (CIFs) for highly
active antiretroviral therapy (HAART) according to injection drug use (IDU; solid lines) and non-IDU (dashed lines), and nonparametric 95%
confidence intervals (thin lines, corresponding to IDU and non-IDU, respectively). B) Estimated cumulative event-specific hazards for HAART
(upper 2 curves) and acquired immunodeficiency syndrome (AIDS)/death (lower 2 curves) according to IDU and non-IDU. C) Goodness-of-fit plot
for HAART and AIDS/death.
compared with the baseline AIDS/death ID, which contributes even more to the estimated CIFs of interest (Figure 4,
panel A). Here, the CIF according to history of injection
drug use follows a course below the one without such a history for most of the time, meaning that the expected proportion of those receiving HAART is smaller for individuals
with an injection drug use history.
The Nelson-Aalen estimates of the cumulative eventspecific hazards (Figure 4, panel B), especially for HAART,
show a slight time-dependent course. However, at least the
proportional hazards assumption appears to hold rather well
because both curves in panel C of Figure 4 are approximately linear, indicating that the event-specific ID analysis
here is reasonable. Because of this latter fact, the ID-based
CIF estimates are close to the Aalen-Johansen estimates.
In addition, Lau et al. (11, refer to Figure 3 or Table 4)
report a time-averaged effect on HAART-CIF based on either a proportional subdistribution hazards model or their
parametric mixture model for nonproportional hazards (with
subsequent averaging). Our proposed summary analysis—the least false parameter determined using event-specific IDs (refer to the Appendix and Web material)—results
in a similar estimate of 0.496, corresponding to 0.61 ¼
exp(–0.496) (bootstrapped 95% confidence interval: 0.52,
0.7), which equals the respective estimate reported by Lau
et al. (11, Figure 3).
DISCUSSION
The ID is often calculated because it is computationally
simple and allows for right-censoring as well as lefttruncation. However, it is also questionable because of potentially nonconstant hazards; piecewise IDs (32) or even
semiparametric approaches, for example, proportional hazards models (23), might be preferred. However, in this paper, we point out that, although an ID analysis might be
questionable, the analysis might be even more obscured
when potentially existing competing events are not taken
into account. When dealing with competing risks, it is im-
portant to analyze event-specific IDs for every event instead
of considering just the ID of interest. However, accounting
for 2 or more risks simultaneously makes interpretation
much more difficult.
We examined 2 different data examples to address 2
different issues. First, analysis of the ONKO-KISS data
demonstrated that a covariate may have quite differential
effects on the competing hazards, possibly leading to a nonproportional (i.e., crossing) course of the respective estimated CIFs of interest. In addition, the Nelson-Aalen
estimates of the competing hazard showed a pronounced
time dependency, suggesting that an ID analysis is questionable. This was reflected in the ID-based CIF estimates,
which missed the time dynamic of the Aalen-Johansen
estimates. Nevertheless, the ID-based CIFs still reflected
the empirical course, and the eventual proportion of infected patients, that is, the plateau of the nonparametric
CIFs, was well captured. Here, although the constant hazards assumption failed, the ID analysis offered qualitative
insight into the underlying process. Furthermore, interpretation was facilitated by the proposed multistate-type
graphic that illustrates the underlying data structure by
displaying the existing risks and its particular IDs. The
graphic offers qualitative insights and lends itself to a synthesis interpretation. Competing events may be adequately
addressed with the suggested methodology. Moreover, it
can straightforwardly be extended to more general multistate models (20), which are realized as a series of nested
competing risks experiments.
In contrast, when we considered the Women’s Interagency HIV Study data, the ID-based CIF estimates were
close to their nonparametric counterparts for most of
the time, a consequence of the underlying proportional
course of the respective cumulative hazards. Although at
least one hazard appeared to be time dependent, the ID
analysis here is an appropriate approach while concurrently
also being the simplest. It was also highlighted that a subdistribution hazard analysis, although potentially misspecified, offered a summary analysis in terms of an average
Am J Epidemiol 2010;172:1077–1084
Incidence Densities in a Competing Events Analysis
effect on the CIF, similar to Lau et al. (11). This so-called
least false parameter can be easily estimated numerically
whenever all event-specific IDs are known, together with
an estimate of the censoring distribution (33). Current work
is allowing the least false parameter to also take lefttruncation into account.
Note that our approach is not restricted to binary covariates, which we used to illustrate the multistate-type
graphic. However, continuous covariates can also be considered, following, for example, Aalen et al. (18), Makuch
(34), and Shen and Fleming (35). That is, in a more involved
regression setup, one would need to compute the individual
ID predictions, which could then be averaged and subsequently used in the multistate-type graphic.
ACKNOWLEDGMENTS
Author affiliations: Department of Medical Biometry and
Statistics, Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Freiburg, Germany (Nadine Grambauer, Martin Schumacher, Jan
Beyersmann); Freiburg Center for Data Analysis and Modeling, University of Freiburg, Freiburg, Germany (Nadine
Grambauer, Jan Beyersmann); and Institute of Environmental Medicine and Hospital Epidemiology, University
Medical Center Freiburg, Freiburg, Germany (Markus
Dettenkofer).
This work was supported by the Deutsche Forschungsgemeinschaft (DFG Forschergruppe FOR 534). The funding
sources had no involvement in this manuscript.
Conflict of interest: none declared.
REFERENCES
1. Chen PC, Sung FC, Chien KL, et al. Red blood cell distribution width and risk of cardiovascular events and mortality in
a community cohort in Taiwan. Am J Epidemiol. 2010;171(2):
214–220.
2. Glynn RJ, Rosner B. Comparison of risk factors for the
competing risks of coronary heart disease, stroke, and
venous thromboembolism. Am J Epidemiol. 2005;162(10):
975–982.
3. Collin F, Duval X, Le Moing V, et al. Ten-year incidence and
risk factors of bone fractures in a cohort of treated HIV1infected adults. AIDS. 2009;23(8):1021–1024.
4. Kola A, Schwab F, Bärwolff S, et al. Is there an association
between nosocomial infection rates and bacterial cross transmissions? Crit Care Med. 2010;38(1):46–50.
5. Ammerlaan HS, Troelstra A, Kruitwagen CL, et al. Quantifying changes in incidences of nosocomial bacteraemia caused
by antibiotic-susceptible and antibiotic-resistant pathogens. J
Antimicrob Chemother. 2009;63(5):1064–1070.
6. Edwards JR, Peterson KD, Andrus ML, et al. National
Healthcare Safety Network (NHSN) report, data summary for
2006 through 2007, issued November 2008. Am J Infect
Control. 2008;36(9):609–626.
7. Rothman K, Greenland S, Lash TL, eds. Modern Epidemiology. 3rd ed. Philadelphia, PA: Lippincott-Raven Publishers;
2008.
Am J Epidemiol 2010;172:1077–1084
1083
8. Kraemer HC. Events per person-time (incidence rate): a misleading statistic? Stat Med. 2009;28(6):1028–1039.
9. Andersen P, Borgan Ø, Gill RD, et al. Statistical Models Based
on Counting Processes. Springer series in statistics. New York,
NY: Springer-Verlag Inc; 1993.
10. Dignam JJ, Kocherginsky MN. Choice and interpretation of
statistical tests used when competing risks are present. J Clin
Oncol. 2008;26(24):4027–4034.
11. Lau B, Cole SR, Gange SJ. Competing risk regression models for
epidemiologic data. Am J Epidemiol. 2009;170(2):244–256.
12. Latouche A. Regression Modeling in the Presence of Competition [in French] [dissertation]. Paris, France: University of
Paris 6; 2004.
13. Dettenkofer M, Wenzler-Röttele S, Babikir R, et al. Surveillance of nosocomial sepsis and pneumonia in patients with a
bone marrow or peripheral blood stem cell transplant: a multicenter project. Clin Infect Dis. 2005;40(7):926–931.
14. Meyer E, Beyersmann J, Bertz H, et al. Risk factor analysis of
blood stream infection and pneumonia in neutropenic patients
after peripheral blood stem-cell transplantation. Bone Marrow
Transplant. 2007;39(3):173–178.
15. Beyersmann J, Dettenkofer M, Bertz H, et al. A competing
risks analysis of bloodstream infection after stem-cell transplantation using subdistribution hazards and cause-specific
hazards. Stat Med. 2007;26(30):5360–5369.
16. Barkan SE, Melnick SL, Preston-Martin S, et al. The Women’s
Interagency HIV Study. WIHS Collaborative Study Group.
Epidemiology. 1998;9(2):117–125.
17. R Development Core Team. R: A Language and Environment
for Statistical Computing. Vienna, Austria: R Foundation for
Statistical Computing; 2008. (http://www.R-project.org).
(Accessed January 10, 2010).
18. Aalen O, Borgan O, Gjessing H. Survival and Event History
Analysis: A Process Point of View. Berlin, Germany: SpringerVerlag; 2008.
19. Kalbfleisch J, Prentice R. The Statistical Analysis of Failure
Time Data. 2nd ed. Wiley series in probability and mathematical statistics. Hoboken, NJ: John Wiley & Sons; 2002.
20. Andersen PK, Abildstrom SZ, Rosthøj S. Competing risks as
a multi-state model. Stat Methods Med Res. 2002;11(2):
203–215.
21. Mackenbach JP, Kunst AE, Lautenbach H, et al. Competing
causes of death: an analysis using multiple-cause-of-death
data from the Netherlands. Am J Epidemiol. 1995;141(5):
466–475.
22. Cox C, Chu H, Schneider M, et al. Parametric survival analysis
and taxonomy of hazard functions for the generalized gamma
distribution. Stat Med. 2007;26(23):4352–4374.
23. Cox DR. Regression models and life tables (with discussion).
J R Stat Soc (B). 1972;34(2):187–220.
24. Frome EL. The analysis of rates using Poisson regression
models. Biometrics. 1983;39(3):665–674.
25. Breslow NE, Day NE. Statistical methods in cancer research.
Vol. II—The design and analysis of cohort studies. IARC Sci
Publ. 1987(82):1–406.
26. Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999;
94(446):496–509.
27. Latouche A, Boisson V, Chevret S, et al. Misspecified regression model for the subdistribution hazard of a competing
risk. Stat Med. 2007;26(5):965–974.
28. Hjort N. On inference in parametric survival data models. Int
Stat Rev. 1992;60(3):355–387.
29. Claeskens G, Hjort N. Model Selection and Model Averaging.
Cambridge series in statistical and probabilistic mathematics.
1084 Grambauer et al.
Cambridge, United Kingdom: Cambridge University Press;
2008.
30. Struthers C, Kalbfleisch J. Misspecified proportional hazard
models. Biometrika. 1986;73(2):363–369.
31. Grambauer N, Schumacher M, Beyersmann J. Proportional
subdistribution hazards modeling offers a summary analysis,
even if misspecified. Stat Med. 2010;29(7-8):875–884.
32. Miettinen O. Estimability and estimation in case-referent
studies. Am J Epidemiol. 1976;103(2):226–235.
33. Kaplan E, Meier P. Nonparametric estimation from incomplete
observations. J Am Stat Assoc. 1958;53:457–481.
34. Makuch RW. Adjusted survival curve estimation using covariates. J Chronic Dis. 1982;35(6):437–443.
35. Shen Y, Fleming TR. Large sample properties of some survival
estimators in heterogeneous samples. J Stat Plan Inference.
1997;60(1):123–138.
36. Andersen P, Gill R. Cox’s regression model for counting processes: a large sample study. Ann Stat. 1982;10(4):1100–1120.
APPENDIX
This appendix briefly addresses the meaning of the least false parameter as a time-averaged effect on the CIF and how it can
be easily obtained in the setting considered in this paper.
The subdistribution hazard is build such that it directly relates to the CIF for the event of interest. Therefore, interest is
directed toward a potential covariate effect on this quantity. By assuming event-specific IDs to hold, the Cox model for the
subdistribution hazard is generally misspecified. However, the maximizer, ĉ, of the partial log-likelihood, ‘n(c), for the
subdistribution hazard is still asymptotically consistent for the least false parameter c* (28, 36).
In our considerations, assuming a binary covariate Z with p ¼ P(Z ¼ 1), the least false parameter is the solution to the
equation b(c) ¼ 0, with
bðcÞ ¼ pð1 pÞ3
Z
N
0
ð1 F1 ðu; Z ¼ 0ÞÞf1 ðu; Z ¼ 1Þ expðcÞð1 F1 ðu; Z ¼ 1ÞÞf1 ðu; Z ¼ 0Þ
PðC > uÞdu;
ð1 pÞð1 F1 ðu; Z ¼ 0ÞÞ þ p expðcÞð1 F1 ðu; Z ¼ 1ÞÞ
where F1(u;Z) is the CIF for the event of interest 1 according to covariate Z, f1(u;Z) ¼ dF1(u;Z)/du its derivative, and P(C > u)
is the censoring survival probability (31).
Whenever all event-specific IDs and an estimate of the censoring survival probability (33) are available, the least false
parameter can be estimated numerically by plugging in the respective estimates and using, for example, the function uniroot of
the software environment for statistical computing R (17); the respective R code is provided in the Web material. Corresponding standard errors can be easily obtained via the bootstrap.
Am J Epidemiol 2010;172:1077–1084