Prognostic Score for Predicting Risk of Dementia Over 10 Years

American Journal of Epidemiology
© The Author 2014. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health.
All rights reserved. For permissions, please e-mail: [email protected].
Vol. 180, No. 8
DOI: 10.1093/aje/kwu202
Advance Access publication:
September 4, 2014
Practice of Epidemiology
Prognostic Score for Predicting Risk of Dementia Over 10 Years While Accounting
for Competing Risk of Death
Hélène Jacqmin-Gadda*, Paul Blanche, Emilie Chary, Lucie Loubère, Hélène Amieva, and
Jean-François Dartigues
* Correspondence to Dr. Hélène Jacqmin-Gadda, INSERM U897, Institut de Santé Publique, d’Épidémiologie et de Développement,
Université de Bordeaux, case 11, 146 rue Léo Saignat, 33076 Bordeaux, France (e-mail: [email protected]).
Initially submitted October 8, 2013; accepted for publication July 7, 2014.
Early detection of subjects at high risk of developing dementia is essential. By dealing with censoring and competing risk of death, we developed a score for predicting 10-year dementia risk by combining cognitive tests, and we
assessed whether inclusion of cognitive change over the previous year increased its discrimination. Data came from
the French prospective cohort study Personnes Agées QUID (PAQUID) and included 3,777 subjects aged 65 years
or older (1988–1998). The combined prediction score was estimated by means of an illness-death model handling
interval censoring and competing risk of death. Its predictive ability was measured using the receiver operating characteristic (ROC) curve, with 2 different definitions depending on the way subjects who died without a dementia
diagnosis were considered. To account for right-censoring and interval censoring, we estimated the ROC curves
by means of a weighting approach and a model-based imputation estimator. The combined score exhibited an
area under the ROC curve (AUROC) of 0.81 for discriminating future demented subjects from subjects alive and
nondemented 10 years later and an AUROC of 0.75 for discriminating future demented subjects from all other subjects (including deceased persons). Adjustment for cognitive change over the previous year did not improve
prediction.
competing risks; death; dementia; interval censoring; prediction; ROC curve
Abbreviations: AUROC, area under the ROC curve; BVRT, Benton Visual Retention Test; DSST, Digit Symbol Substitution Test;
IADL, Instrumental Activities of Daily Living; IPCW, inverse probability of censoring weighting; IST, Isaacs Set Test; MMSE, MiniMental State Examination; MMSE-EM, episodic memory subtest of the MMSE; PAQUID, Personnes Agées QUID; ROC, receiver
operating characteristic.
Prediction of dementia risk may be based on 3 types of markers: brain imaging, biomarkers, and neuropsychological tests.
Among them, the latter are clearly the least invasive and the
cheapest. Previous longitudinal studies have found that cognition is impaired long before the dementia diagnosis (3). Thus,
the main objective of this work was to develop and validate a
prognostic score based on cognitive evaluation for predicting a
person’s risk of developing dementia in the next 5 or 10 years.
Moreover, cognitive change between 2 assessments has been
shown to be predictive of dementia (4, 5), and an improvement
is often observed between the first 2 assessments in cohort studies (6). Thus, we first evaluated whether 2 cognitive assessments led to better prediction than a single one.
Dementia is a common disease in the elderly and is characterized by significant decline in several cognitive functions.
The cognitive decline is progressive and eventually leads to a
loss of autonomy. Early detection of subjects at high risk of
developing dementia in the future is essential, for 2 reasons.
First, current treatments given after dementia diagnosis have
demonstrated only modest efficiency, and research is now focused on the possibility of preventive treatment or nondrug
intervention that could reduce the rate of conversion to dementia among subjects at high risk (1). Thus, validated criteria for identifying this high-risk target population are required
(2). Secondly, early detection of subjects at the very beginning of the decline process may help in providing better care.
790
Am J Epidemiol. 2014;180(8):790–798
10-Year Dementia Prediction Accounting for Competing Death 791
The development and validation of prediction models for
dementia raises some methodological issues due to incomplete follow-up and competing risk of death that have not been
handled in previous work (1, 7–9). Studies of dementia predictors need cohorts of elderly subjects, which generally suffer
from right-censoring (loss to follow-up) and interval censoring
of age at dementia onset when cognitive functioning is assessed
only at the time of clinical visits with no possibility of identifying the exact date of disease onset. Moreover, death is a major competing risk in the prediction of dementia. Strictly
speaking, this is a semicompeting risk, since death may occur
before or after dementia, whereas dementia cannot occur after
death. In this context, the main problem of interval censoring
is that dementia status at death is unknown for subjects who
were dementia-free at the last clinical visit. Given that the risk
of death is much higher among demented subjects (10, 11),
the probability of developing dementia between the last clinical visit and death cannot be neglected when the planned interval between visits is 2 or 3 years and is frequently enlarged
because of missing visits. Moreover, many predictors of dementia are associated with the risk of death without dementia.
Thus, these issues must be accounted for in both the development and the validation of prediction scores.
In the development phase, a standard Cox model could be
used to estimate the cause-specific hazard of dementia only if
the data were not interval-censored. With interval censoring,
estimation of the cause-specific hazard requires fitting of the
illness-death model (shown in Figure 1) by maximum likelihood while accounting for interval censoring, as detailed
elsewhere (11, 12).
In the validation phase, definitions of sensitivity, specificity,
and the receiver operating characteristic (ROC) curve have
been extended for time-dependent outcomes with competing
risks (13, 14), and nonparametric estimators have been proposed for right-censored data according to an inverse probability of censoring weighting (IPCW) approach (15). Using a
crude imputation rule, these estimators may be applied to
interval-censored data with semicompeting risks. Alternatively, the predictive accuracy measures may be estimated by
means of an imputation approach based on the illness-death
model (16).
Our main objective in this paper is to propose a score combining cognitive tests, autonomy scales, and subjective memory
a01(t )
Health
Dementia
a02(t)
a12(t )
Death
Figure 1. The illness-death model of Joly et al. (11).
Am J Epidemiol. 2014;180(8):790–798
complaints to identify subjects at high risk of developing dementia in the next 5 or 10 years, accounting for the competing
risk of death and complex censoring schemes. We also aimed
to evaluate whether cognitive changes over the first year of
follow-up improved the score’s predictive ability after adjustment for current cognitive status.
METHODS
The PAQUID cohort
The Personnes Agées QUID (PAQUID) Study, a French
prospective cohort study, aims at studying cognitive aging
and loss of autonomy (17). The cohort was randomly selected
from electoral rolls and included 3,777 subjects aged 65 years
or older at baseline who were living at home in 2 departments
of southwestern France (Gironde and Dordogne). Subjects
were visited at home by a trained psychologist at baseline
in 1988/1989, and then again approximately 1 (T1), 3 (T3),
5 (T5), 8 (T8), and 10 (T10) years after the initial visit. Participants from Dordogne were not visited at T1. At each visit,
a questionnaire was administered that included information
about lifestyle and health characteristics, a battery of cognitive tests, and scales of disability. Dementia was assessed at
each visit using a 2-stage procedure: Subjects who met the
Diagnostic and Statistical Manual of Mental Disorders,
Third Edition, Revised, criteria for dementia (18) as assessed
by the psychologist underwent a clinical assessment by a neurologist, who ascertained the final diagnosis. Last, all cases
were reviewed by a group of experts. The PAQUID Study
protocol was approved by ethics committees, and all participants gave their informed consent.
Sample selection
A preliminary analysis was performed to evaluate whether
changes in cognitive function over the first year improved
prediction based on the cognitive and disability assessment
made at T1. For this analysis, the target sample (denoted sample A) included subjects visited at T1 who were free of dementia and not blind, deaf, or bedridden at T1 (n = 1,683).
Exclusion of subjects with missing data for the cognitive
and autonomy assessments at T1 or T0 led to a sample of
1,313 subjects. Among them, 121 subjects were diagnosed
with dementia by T10. Because the results showed that accounting for cognitive change between T0 and T1 did not improve prediction, the main analysis was performed using only
cognitive measures taken at T0.
The target sample for the main analysis (sample B) included
3,510 subjects who were dementia-free and not blind, deaf, or
bedridden at baseline (T0). After exclusion of persons with
missing data for the baseline cognitive/autonomy evaluation,
the final sample included 2,795 subjects. Up to the T10 visit
(between 1988 and 1998), 265 incident cases of dementia were
diagnosed. All subjects were censored after T10.
Prediction variables
Four cognitive tests from the neuropsychological battery of
the PAQUID Study—the Mini-Mental State Examination
792 Jacqmin-Gadda et al.
(MMSE), the Isaacs Set Test (IST), the Benton Visual Retention Test (BVRT), and the Digit Symbol Substitution Test
(DSST)—were selected as possible predictors for dementia
because scores on these tests were previously found to be decreased long before dementia diagnosis (3) or they were already
shown to be highly predictive of dementia (15) and because
scores were available at T0 and T1 with few missing data.
The MMSE is a global measure of cognitive abilities (19)
that evaluates various dimensions of cognition (memory, calculation, orientation in place and time, language and word
recognition), with a possible score ranging from 0 to 30.
The episodic memory subtest of the MMSE (MMSE-EM),
scored over a range of 8 points, includes the questions about
orientation to time and 3-word recall. Because this subtest
was previously found to be highly predictive of dementia,
it was included in the model in addition to the total MMSE
score (9, 20). The IST measures verbal fluency (21) and consists of generating a list of words (with a maximum of 10) belonging to 4 semantic categories (colors, animals, fruits, and
cities) in 15 seconds (range, 0–40). The multiple-choice form
of the BVRT measures short-term visual memory, with a
score ranging from 0 to 15 (22). The DSST measures attention and speed of information processing (23). Scores in the
PAQUID sample ranged from 0 to 76 points. For all of these
tests, a higher score indicates better performance.
Cognitive complaints were assessed by means of 4 questions requiring subjects to rate their current cognitive difficulties, such as forgetfulness in daily living, difficulty in retaining
new information, difficulty in calculation, and difficulty in spatial orientation. Each variable was coded 1 for “frequent difficulties” and 0 for “no or rare difficulties.”
Disability was assessed using the 4-item Instrumental Activities of Daily Living (4-IADL) Scale, which has been found
to have a high specificity for dementia diagnosis (24). This
score is the sum of 4 binary items from the IADL Scale (telephone, medication, budget, and transportation). Each item
was coded 0 if subjects were able to perform the task without
any help and 1 if they needed help or were unable to perform
the task.
Finally, standard predictors of dementia and death were
also considered: age (both as a continuous variable and as
a binary indicator for age ≥75 years), sex, and educational
level ( primary school diploma or higher level vs. no education or no diploma).
Statistical analysis
The illness-death model. Figure 1 displays the illnessdeath model. The transition intensities between 3 states—
healthy, demented, and deceased—are modeled by the
following proportional intensity models:
αijk ðtÞ ¼ α0jk ðtÞ expðβ jk Xijk Þ
for i ¼ 1; ; N; j ¼ 0; 1 and k ¼ 1; 2;
ð1Þ
where α0jk ðtÞ is the baseline transition intensity from state j to
state k and Xijk is the vector of predictors for subject i and transition jk associated with the regression parameters βjk. Assuming Weibull baseline transition intensities, the models
were fitted using the R package SmoothHazard (25) by
means of maximum likelihood, accounting for interval censoring (11). Use of semiparametric transition models fitted by
penalized likelihood (11) leads to very similar results.
Analysis strategy. For each sample (A and B), models
were chosen via step-by-step backward selection. The initial
models included all of the predictors listed in the “Prediction
variables” subsection above in the 3 transition intensity submodels. Then, for investigating the impact of learning in sample A, we successively tested in the selected model whether
the change in each cognitive measure between T0 and T1 was
significantly associated with the risk of dementia. In the main
analysis of sample B, we then tested the interactions between
educational level and all of the predictors retained after backward selection, because educational level was previously
found to be associated with differential prediagnosis decline
(26).
The proposed prediction score was the linear predictor
from the transition intensity model to dementia computed
with the estimated regression parameters:
Mi ¼ ^β01 Xi01 :
ð2Þ
Discrimination assessment. Sensitivity, specificity, the
ROC curve, and the area under the ROC curve (AUROC) were
estimated to evaluate the ability of the combined score M to
predict the risk of dementia in the next 10 years (t = 10) and
the next 5 years (t = 5). Denoting T as the time of the first
event and setting η = 1 if the first event is dementia and
η = 2 if it is death, the sensitivity (Se) for a cutpoint c and a
window of prediction t is defined as
Seðc; tÞ ¼ PðM > c j T t; η ¼ 1Þ:
We computed the specificity (Sp) using the 2 definitions previously proposed for the competing-risks setting (15):
Sp1ðc; tÞ ¼ PðM c j T > tÞ and
Sp2ðc; tÞ ¼ PðM c j fT > tg or fT t; η ¼ 2gÞ:
Thus, the ROC curve displaying Se(c,t) versus 1 − Sp2(c,t)
for all c (denoted ROC2) measures the discrimination between future demented subjects and all of the other subjects
(alive or dead without dementia), while the ROC curve displaying Se(c,t) versus 1 − Sp1(c,t) (denoted ROC1) evaluates
the discrimination between future demented subjects and
those who survive without dementia until time t.
Two estimators of the ROC curves were used. The nonparametric IPCW estimator, which accounts for right-censoring,
was computed using the R package timeROC (27). In this
approach, Se(c,t) and Sp(c,t) are computed in subjects uncensored at time t, but the contribution of each subject is
weighted by his/her probability of being uncensored. To
apply the IPCW estimator to interval-censored data, we used
the following simple imputation rules for subjects who died
without a dementia diagnosis: When the interval between the
last visit and death was less than 2 years, the subject was
Am J Epidemiol. 2014;180(8):790–798
10-Year Dementia Prediction Accounting for Competing Death 793
score, IST score, DSST score, and MMSE-EM score were
not associated with the risk of dementia (Table 2). After adjustment for total MMSE score and BVRT score, change in
MMSE score (P = 0.46) and change in BVRT score (P =
0.15) were not associated with the risk of dementia (not
shown). Associations were also nonsignificant when a binary
variable was used for the change (improvement in the score
between T0 and T1 vs. no improvement; results not shown).
Thus, after adjustment for current cognitive status, cognitive
change in the previous year did not appear to be associated
with the risk of dementia over the next 9 years.
considered deceased without dementia; otherwise, the subject was considered right-censored after the last visit. Even
if this strategy was shown to give good results with designs
similar to PAQUID, the ROC curves were also estimated by
using the illness-death model to impute the probability of
being a case or control for all of the subjects with unknown
status (16). Indeed, IPCW may be slightly biased for large
between-visit intervals and markers highly associated with
death risk, while model-based imputation may be slightly
biased in the case of a misspecified imputation model (16).
The 2 estimators are detailed in Web Appendix 1, available at
http://aje.oxfordjournals.org/.
The optimism due to the estimation of predictive accuracy
measures on the learning data set was corrected by means of a
pooled 10-fold cross-validation method (28).
Prediction score for 10-year risk of dementia
As a consequence, we performed the main analysis using
the whole data set and the cognitive assessment performed at
T0 to predict the risk of dementia up to T10. Table 1 gives the
characteristics and cognitive test scores of sample B. The mean
age at the initial cognitive assessment was 74 (standard deviation, 6.1) years, 55.8% of the subjects were women, and 71%
had a diploma from primary school or a higher educational
level. The mean MMSE score at baseline was 26.6 (standard
deviation, 2.7). Among the 2,795 subjects, 265 (9.5%) were
diagnosed as demented during the 10-year follow-up period,
1,015 (36.3%) died without a dementia diagnosis, and 120
(4.3%) died after a dementia diagnosis. The status of the
2,795 subjects at each visit is shown in Web Figure 1.
RESULTS
Impact of learning ability
The characteristics of sample A are provided in Table 1,
and estimates of the final illness-death model after backward
selection are displayed in Table 2. Five predictors of the risk
of dementia were retained: age as a continuous variable, complaints about forgetfulness in daily living, IST score, DSST
score, and MMSE-EM score. After adjustment for the above
variables, changes over the first year (T1 – T0) in 4-IADL
Table 1. Characteristics of 2 Participant Subsamples Chosen to Develop a Prediction Score for Identifying Subjects
at High Risk of Dementia, PAQUID Study, France, 1988–1998
Sample A (n = 1,313)
Characteristic
No.
Incident dementia
Death without dementia diagnosisa
a
Death after dementia diagnosis
%
High educational level
No.
121
265
424
1,015
59
%
Mean (SD)
120
Age, years
Female sex
Mean (SD)
Sample B (n = 2,795)
73.5 (5.9)
74.0 (6.1)
734
55.9
1,559
55.8
1,011
77.0
1,984
71.0
Cognitive test score
IST
28.3 (5.8)
27.5 (5.9)
DSST
30.2 (11.2)
27.5 (11.3)
BVRT
10.8 (2.4)
10.3 (2.5)
MMSE
26.9 (2.4)
26.6 (2.7)
6.5 (1.1)
6.3 (1.1)
MMSE-EM
Memory complaints
Forgetfulness in daily living
680
51.8
1,393
49.8
Retaining new information
406
30.9
913
32.7
Difficulty in calculation
261
19.9
640
22.9
Difficulty in orientation
50
3.8
119
4.3
Abbreviations: BVRT, Benton Visual Retention Test; DSST, Digit Symbol Substitution Test; IST, Isaacs Set Test;
MMSE, Mini-Mental State Examination; MMSE-EM, episodic memory subtest of the Mini-Mental State Examination;
PAQUID, Personnes Agées QUID; SD, standard deviation.
a
Because the 10-year follow-up visit could take place anytime between 9 and 11 years after baseline, models were
fitted using vital status at 11 years for everyone.
Am J Epidemiol. 2014;180(8):790–798
794 Jacqmin-Gadda et al.
Table 2. Adjusted Hazard Ratio for the 9-Year Risks of Dementia and Death in the Illness-Death Model According to
Cognitive Assessment at the 1-Year Visit (T1) and 1-Year Cognitive Changes (Sample A; n = 1,313), PAQUID Study,
France, 1988–1998
Risk of Dementia
Covariate
HR
95% CI
P Value
Risk of Death
HR
95% CI
Risk of Death After Dementia
P Value
HR
95% CI
P Value
Final model after backward
selection
Age, years
2.64 1.98, 3.54 <0.001 1.90 1.53, 2.37 <0.001 1.89 1.21, 2.94 0.005
Female sex
Forgetfulness in daily living
0.44 0.34, 0.56 <0.001
1.41 0.99, 2.01
0.060
4-IADL score
1.43 1.21, 1.68 <0.001
IST score
0.92 0.89, 0.96 <0.001
DSST score
0.97 0.95, 1.00
MMSE-EM score
0.72 0.62, 0.85 <0.001
0.017 0.99 0.97, 1.00
0.013
Association with 1-year cognitive
change (T1 minus T0)a
4-IADL
IST
1.01 0.97, 1.05
0.73
DSST
1.00 0.96, 1.04
0.94
MMSE-EM
1.13 0.96, 1.33
0.14
1.00 0.85, 1.18
0.99
0.99 0.97, 1.02
0.44
Abbreviations: CI, confidence interval; DSST, Digit Symbol Substitution Test; HR, hazard ratio; 4-IADL, 4-item
Instrumental Activities of Daily Living Scale; IST, Isaacs Set Test; MMSE-EM, episodic memory subtest of the
Mini-Mental State Examination; PAQUID, Personnes Agées QUID.
a
Adjusted for predictors from the final model.
Table 3 presents estimates from the final model after backward selection and inclusion of the significant interaction between sex and education in the risk of death. Seven predictors
remained associated with the 10-year risk of dementia. The 5
most significant predictors (P < 0.01) were the same as those
in the preliminary analysis: The risk of dementia was higher
for older subjects, those who complained about forgetfulness
in daily living, and those with poor performance on the IST,
the DSST, and the MMSE-EM. After adjustment for these
cognitive measures, a high educational level and a high global
MMSE score tended to be associated with a higher risk of dementia. Indeed, for a given current cognitive level, subjects
with high initial intellectual abilities had a greater risk of developing dementia. The risk of death among nondemented
Table 3. Adjusted Log Hazard Ratioa for the 10-Year Risks of Dementia and Death in the Final Illness-Death Model
(Sample B; n = 2,795) After Backward Selection, PAQUID Study, France, 1988–1998
Covariate
Age, years
Risk of Dementia
0.660 (0.075)
0.258 (0.145)
−0.142 (0.125)
0.333 (0.124)
−0.173 (0.084)
−1.07 (0.149)
Female sex
High educational level
Sex × education
Forgetfulness in daily living
0.735 (0.142)
−0.424 (0.153)
−0.210 (0.106)
4-IADL score
0.376 (0.051)
IST score
−0.060 (0.012)
DSST score
−0.044 (0.09)
MMSE-EM score
Risk of Death After Dementia
0.371 (0.174)
Difficulty in calculation
MMSE score
Risk of Death
0.912 (0.102)
−0.014 (0.005)
0.021 (0.008)
0.068 (0.029)
−0.318 (0.060)
Abbreviations: DSST, Digit Symbol Substitution Test; 4-IADL, 4-points score from the Instrumental Activities of Daily
Living; IST, Isaacs Set Test; MMSE, Mini-Mental State Examination; MMSE-EM, episodic memory subtest of the
Mini-Mental State Examination; PAQUID, Personnes Agées QUID.
a
Values are β coefficients with standard errors.
Am J Epidemiol. 2014;180(8):790–798
10-Year Dementia Prediction Accounting for Competing Death 795
Table 4. Area Under the Receiver Operating Characteristic Curve for the Proposed Dementia Prediction Score
Estimated in the Learning Data Set (Sample B; n = 2,795) and by Pooled 10-Fold Cross-Validation, Using an Inverse
Probability of Censoring Weighting Estimator and a Model-Based Imputation Estimator and 2 Different Definitions of
Specificity,a PAQUID Study, France, 1988–1998
Inverse Probability of Censoring Weighting
Model-Based Imputation
Parameter
Crude Estimate (SE)
10-Fold CV Estimate
Crude Estimate (SE)
10-Fold CV Estimate
AUROC1b
0.814 (0.015)
0.810
0.828 (0.014)
0.819
AUROC2c
0.750 (0.016)
0.746
0.770 (0.016)
0.761
AUROC1
0.844 (0.019)
0.838
0.851 (0.030)
0.843
AUROC2
0.817 (0.019)
0.811
0.830 (0.031)
0.822
10-year prediction
5-year prediction
Abbreviations: AUROC, area under the ROC curve; CV, cross-validation; PAQUID, Personnes Agées QUID; ROC,
receiver operating characteristic; SE, standard error.
a
For definitions, see text.
b
AUROC calculated using the first definition of specificity.
c
AUROC calculated using the second definition of specificity.
persons was lower for younger subjects, women, highly educated subjects, subjects without dependence on the IADL
Scale, and those with a high DSST score. After adjustment
for these factors, persons who complained about forgetfulness in daily living and about difficulties in calculation had
a lower risk of death without dementia. Finally, death risk
among demented persons was higher for older subjects,
men, and subjects with a high DSST score at baseline. Indeed, subjects who became demented despite a high initial
DSST score were declining more rapidly, a process that may
have led more quickly to death. Web Appendix 2 and Web
Table 1 show the good calibration of this model.
The dementia prediction score was computed by means of
equation 2, with the 7 predictors retained in the submodel for
the risk of dementia and the estimated parameters given in
Table 3. Table 4 gives the estimators of the AUROC at 5 and
10 years for this prediction score. Because of the large sample
size, the crude estimates and the 10-fold cross-validation estimates were very close. The nonparametric IPCW estimator and
the model-based imputation estimator led to similar conclusions. All of the AUROCs were significantly different from
0.5 (P < 0.001). The high values of AUROC1 meant that the
prediction score had a very good ability to discriminate between
subjects who would become demented in the next 5 or 10 years
and those who would still be free of dementia and alive at the
end of this period. The prediction score retained a very high
ability to distinguish subjects at high risk of developing dementia in the next 5 years from those who would remain dementiafree or die without dementia (AUROC2 > 0.80). For 10-year
prediction, the difference between AUROC1 and AUROC2
was larger because the number of deaths increased, and
AUROC2 dropped to about 0.75. To help users choose a cutpoint that is suitable for the objective of the screening, Table 5
displays estimated sensitivities and specificities for selected
prediction score cutpoints. The cutpoint −1.92 gives the highest value of the sum of sensitivity and specificity for both
10-year and 5-year predictions and the 2 definitions of specificity (similar to −1.55 for 10 years and Sp2) while maintaining
a good sensitivity; it is optimal when equal weights are given to
false-positive and false-negative subjects.
Figure 2 displays the ROC curves estimated by IPCW for
the proposed prediction score (cross-validation estimates)
and for the cognitive tests DSST, MMSE, and IST alone.
Whatever the definition of specificity, the predictive ability
of the combined score is better than that of each cognitive
test individually considered: AUROC2 values were 0.713
Table 5. Sensitivities and Specificities of Selected Dementia Prediction Score Cutpoints Estimated by Inverse
Probability of Censoring Weighting, Using 2 Different Definitions of Specificity,a PAQUID Study, France, 1988–1998
10-Year Prediction
Cutpoint
5-Year Prediction
Sensitivity, %
Specificity 1, %
Specificity 2, %
Sensitivity, %
Specificity 1, %
Specificity 2, %
−2.51
90.1
47.2
39.4
98.0
39.2
36.2
−1.92
79.9
67.5
58.8
92.9
59.4
55.6
−1.55
67.9
79.0
70.0
80.1
70.9
66.8
−1.15
53.0
88.7
80.0
70.4
81.6
77.7
−0.50
29.1
96.7
91.3
42.5
92.7
90.0
Abbreviation: PAQUID, Personnes Agées QUID.
a
For definitions, see text.
Am J Epidemiol. 2014;180(8):790–798
796 Jacqmin-Gadda et al.
B)
100
100
80
80
Sensitivity, %
Sensitivity, %
A)
60
40
60
40
20
20
0
0
0
20
40
60
80
100
1 − Specificity, %
0
20
40
60
80
100
1 − Specificity, %
Figure 2. Inverse probability of censoring weighting estimates of the receiver operating characteristic (ROC) curve for scores on the Digit Symbol
Substitution Test, the Isaacs Set Test, and the Mini-Mental State Examination and for the combined dementia prediction score M derived using
10-fold cross-validation, according to 2 different definitions of specificity, Personnes Agées QUID (PAQUID) Study, France, 1988–1998.
A) ROC curve among survivors (denoted ROC1 in the main text); B) ROC curve in the whole population (denoted ROC2 in the main text). Solid
line, combined score; dashed line, Digit Symbol Substitution Test; dotted line, Isaacs Set Test; dashed-dotted line, Mini-Mental State Examination.
for the DSST, 0.676 for the IST, and 0.656 for the MMSE as
compared with 0.746 for the combined score.
DISCUSSION
Using data from a large population-based cohort, this study
led to 2 main results. First, we found that 2 cognitive assessments administered 1 year apart did not lead to better prediction
of dementia risk than a single one. Especially, the improvement
in cognitive scores between the first 2 assessments was not predictive of a lower dementia risk when adjusted for the second
cognitive assessment. These results confirm those of 2 previous
studies showing that changes in MMSE score (5) and in
SIDAM [Structured Interview for the Diagnosis of Dementia
of the Alzheimer Type, Multi-Infarct Dementia and Dementias
of Other Aetiology] score (4) between 2 assessments given at
1- to 2-year intervals were predictive of future dementia only
when they were not adjusted for score at time 2. A possible explanation could be that this improvement reflects stress at T0
rather than true learning ability.
Secondly, we developed a simple prediction score based
on 3 cognitive tests (the IST, the DSST, and the MMSE) and
1 subjective memory complaint that exhibited a good predictive
ability to identify subjects at high risk of becoming demented in
the next 5 or 10 years. We emphasize that the subjective memory complaint remained a predictor of 10-year dementia after
adjustment for the objective cognitive assessment. Moreover,
this work confirms previous PAQUID results showing that
the MMSE-EM is more predictive than global MMSE score
(9, 20).
Important strengths of this study include the large sample
size and representativeness of the cohort and the quality of
the clinical diagnosis of dementia, including a clinical examination. Nevertheless, the main strengths of this work lie in the
careful handling of the semicompeting risk of death and interval censoring of dementia diagnosis in both the building and
the evaluation of the prediction score. Prediction scores previously proposed in the literature (2) have been built using a logistic model fitted in subjects with known dementia status at
the end of the prediction window (7, 9, 29–31) or using a
Cox model without dealing with interval censoring and
death (8). In both cases, the prediction scores were evaluated
in subjects with known dementia status at the end of the prediction window (excluding subjects who were censored or
who died without dementia). Although this strategy may be acceptable for short-term prediction, it presents several sources
of bias for long-term prediction. Because of attrition and
death, the selected sample is most often not representative of
the initial sample. Interval censoring is not handled, and thus
only demented subjects who survive long enough to be diagnosed are considered cases. Finally, it is generally not clearly
stated that the ROC curve estimated in such selected samples
evaluates the discrimination between future demented subjects
and subjects who are alive and nondemented (with possible
bias due to censoring). As illustrated here, the corresponding
AUROCs are often higher than the AUROCs measuring the
discrimination between demented subjects and all other subjects, because the predictors for dementia are also predictors
for death. In some studies, censored and deceased persons
were not excluded but the last observed disease status before
death or censoring was used (5), which can also lead to possible bias. With the above limitations, the previous studies led to
prediction scores with estimated AUROCs similar to or lower
than our estimated AUROC1: 0.77 for 10 years and 0.83 for
Am J Epidemiol. 2014;180(8):790–798
10-Year Dementia Prediction Accounting for Competing Death 797
5 years (28) or 0.79 for 4.5 years (8). A higher AUROC for
5-year prediction (0.88) was reached in the study by Nakata
et al. (31), but this estimate was not corrected for optimism,
whereas the sample size was small and highly selected by attrition (258 subjects followed among 465).
To our knowledge, this is the first proposed prediction
score for dementia that accounts for censoring and competing
risk of death. Even if external validation is found to be useful
in the future, this score, based on a simple cognitive assessment, exhibited good performance for prediction of 10-year
dementia risk. We emphasize that the method of considering
deceased subjects when evaluating the discriminatory ability
of a dementia prediction score must be clearly stated, since
discrimination among survivors may be largely better than
that in the whole population.
ACKNOWLEDGMENTS
Author affiliations: Institut National de la Santé et de la
Recherche Médicale (INSERM), Centre INSERM U897,
Bordeaux, France (Hélène Jacqmin-Gadda, Paul Blanche,
Emilie Chary, Lucie Loubère, Hélène Amieva, Jean-François
Dartigues); Centre INSERM U897, Institut de Santé Publique, d’Épidémiologie et de Développement, University of
Bordeaux, Bordeaux, France (Hélène Jacqmin-Gadda, Paul
Blanche, Emilie Chary, Lucie Loubère, Hélène Amieva,
Jean-François Dartigues); and Department of Neurology,
Bordeaux University Hospital, Bordeaux, France (JeanFrançois Dartigues).
This research was supported by a grant from France
Alzheimer awarded to H.J.-G. in 2009. The PAQUID
Study is supported by Ipsen (Paris, France), Novartis International AG (Basel, Switzerland), Caisse Nationale de Solidarité pour l’Autonomie (Paris, France), and Groupe AGRICA
(Paris, France).
Conflict of interest: none declared.
REFERENCES
1. Aisen PS, Andrieu S, Sampaio C, et al. Report of the task force
on designing clinical trials in early ( predementia) AD.
Neurology. 2011;76(3):280–286.
2. Stephan BC, Kurth T, Matthews FE, et al. Dementia risk
prediction in the population: Are screening models accurate?
Nat Rev Neurol. 2010;6(6):318–326.
3. Amieva H, Le Goff M, Millet X, et al. Prodromal Alzheimer’s
disease: successive emergence of the clinical symptoms. Ann
Neurol. 2008;64(5):492–498.
4. Hensel A, Angermeyer MC, Zaudig M, et al. Measuring
cognitive change in older adults: reliable change indices for the
SIDAM. J Neurol. 2007;254(1):91–98.
5. Hensel A, Luck T, Luppa M, et al. Does a reliable decline in
Mini Mental State Examination total score predict dementia?
Diagnostic accuracy of two reliable change indices. Dement
Geriatr Cogn Disord. 2009;27(1):50–58.
6. Jacqmin-Gadda H, Fabrigoule C, Commenges D, et al. A
5-year longitudinal study of the Mini-Mental State Examination
in normal aging. Am J Epidemiol. 1997;145(6):498–506.
Am J Epidemiol. 2014;180(8):790–798
7. Gomar JJ, Bobes-Bascaran MT, Conejero-Goldberg C, et al.
Utility of combinations of biomarkers, cognitive markers, and
risk factors to predict conversion from mild cognitive
impairment to Alzheimer disease in patients in the Alzheimer’s
Disease Neuroimaging Initiative. Arch Gen Psychiatry. 2011;
68(9):961–969.
8. Jessen F, Wiese B, Bickel H, et al. Prediction of dementia in
primary care patients. PLOS ONE. 2011;6(2):e16852.
9. Chary E, Amieva H, Pérès K, et al. Short- versus long-term
prediction of dementia among subjects with low and high
educational levels. Alzheimers Dement. 2013;9(5):562–571.
10. Ostbye T, Hill G, Steenhuis R. Mortality in elderly Canadians
with and without dementia: a 5-year follow-up. Neurology.
1999;53(3):521–526.
11. Joly P, Commenges D, Helmer C, et al. A penalized likelihood
approach for an illness-death model with interval-censored
data: application to age-specific incidence of dementia.
Biostatistics. 2002;3(3):433–443.
12. Leffondré K, Touraine C, Helmer C, et al. Interval-censored
time-to-event and competing risk with death: Is the
illness-death model more accurate than the Cox model? Int J
Epidemiol. 2013;42(4):1177–1186.
13. Saha P, Heagerty PJ. Time-dependent predictive accuracy in the
presence of competing risks. Biometrics. 2010;66(4):999–1011.
14. Zheng Y, Cai T, Jin Y, et al. Evaluating prognostic accuracy of
biomarkers under competing risk. Biometrics. 2012;68(2):
388–396.
15. Blanche P, Dartigues JF, Jacqmin-Gadda H. Estimating and
comparing time-dependent areas under receiver operating
characteristic curves for censored event times with competing
risks. Stat Med. 2013;32(30):5381–5397.
16. Jacqmin-Gadda H, Blanche P, Chary E, et al. Receiver
operating characteristic curve estimation for time to event
with semicompeting risks and interval censoring [ published
online ahead of print May 6, 2014]. Stat Methods Med Res.
(doi:10.1177/0962280214531691).
17. Dartigues JF, Gagnon M, Barberger-Gateau P, et al. The Paquid
epidemiological program on brain ageing. Neuroepidemiology.
1992;11(suppl 1):14–18.
18. American Psychiatric Association. Diagnostic and Statistical
Manual for Mental Disorders, Third Edition, Revised.
Washington, DC: American Psychiatric Association; 1987.
19. Folstein MF, Folstein SE, McHugh PR. “Mini-mental state”: a
practical method for grading the cognitive state of patients for
the clinician. J Psychiatr Res. 1975;12(3):189–198.
20. Carcaillon L, Amieva H, Auriacombe S, et al. A subtest of the
MMSE as a valid test of episodic memory? Comparison with
the Free and Cued Reminding Test. Dement Geriatr Cogn
Disord. 2009;27(5):429–438.
21. Isaacs B, Kennie AT. The Set Test as an aid to the detection of
dementia in old people. Br J Psychiatry. 1973;123(575):
467–470.
22. Benton A. Manuel pour l’Application du Test de Retention
Visuelle: Applications Cliniques et Experimentales. Paris,
France: Centre de Psychologie Appliquee; 1965.
23. Wechsler D. Wechsler Adult Intelligence Scale–Revised.
New York, NY: The Psychological Corporation; 1981.
24. Barberger-Gateau P, Commenges D, Gagnon M, et al.
Instrumental Activities of Daily Living as a screening tool for
cognitive impairment and dementia in elderly community
dwellers. J Am Geriatr Soc. 1992;40(11):1129–1134.
25. Touraine C, Joly P, Gerds TA. Package ‘SmoothHazard’: fitting
illness-death model for interval-censored data. Version 1.0.9.
Vienna, Austria: R Foundation for Statistical Computing; 2013.
http://cran.r-project.org/web/packages/SmoothHazard/
798 Jacqmin-Gadda et al.
SmoothHazard.pdf. Published March 8, 2013. Accessed
August 29, 2013.
26. Amieva H, Jacqmin-Gadda H, Orgogozo JM, et al. The 9 year
cognitive decline before dementia of the Alzheimer type: a
prospective population-based study. Brain. 2005;128(5):
1093–1101.
27. Blanche P. Package ‘timeROC’: time-dependent ROC curve
and AUC for censored survival data. Version 0.2. Vienna,
Austria: R Foundation for Statistical Computing; 2013. http://
cran.r-project.org/web/packages/timeROC/timeROC.pdf.
Published May 27, 2013. Accessed August 29, 2013.
28. Airola A, Pahikkala T, Waegeman W, et al. An experimental
comparison of cross-validation techniques for estimating the
area under the ROC curve. Comput Stat Data Anal. 2011;55(4):
1828–1844.
29. Tierney MC, Szalai JP, Snow WG, et al. Prediction of probable
Alzheimer’s disease in memory-impaired patients: a
prospective longitudinal study. Neurology. 1996;46(3):
661–665.
30. Tierney MC, Yao C, Kiss A, et al. Neuropsychological tests
accurately predict incident Alzheimer disease after 5 and 10
years. Neurology. 2005;64(11):1853–1859.
31. Nakata E, Kasai M, Kasuya M, et al. Combined memory and
executive function tests can screen mild cognitive impairment
and converters to dementia in a community: the Osaki-Tajiri
project. Neuroepidemiology. 2009;33(2):103–110.
Am J Epidemiol. 2014;180(8):790–798