Association of Serum Total Cholesterol with Coronary Disease and

American Journal of Epidemiology
Copyright O 1996 by The Johns Hopkins University School of Hygiene and Public Health
All rights reserved
Vol. 143, No. 5
Printed In U.SA.
Association of Serum Total Cholesterol with Coronary Disease and All-Cause
Mortality: Multivariate Correction for Bias Due to Measurement Error
Carlos Iribarren, 12 Dan Sharp,3 Cecil M. Burchfiel,3 Ping Sun,1 and James H. Dwyer1
Measurement error in the exposure under investigation is an important but often ignored source of bias in
observational studies. The authors examined the impact of measurement error in the association between total
serum cholesterol and 16-year coronary heart disease and all-cause mortality in a cohort of 6,137 middle-aged
men of Japanese descent in the Honolulu Heart Program (1973-1988). A Cox regression model that enables
modeling of survival time with correction for measurement errors in multiple covariates was employed. After
controlling for age, body mass index, systolic blood pressure, smoking status, alcohol consumption, dietary
cholesterol, and total calorie intake, a difference of one standard deviation (38 mg/dL) in total cholesterol was
associated with a significant increase in the risk of coronary disease death (uncorrected hazard ratio = 1.35).
After correction for measurement errors in total cholesterol and covariates (except smoking and age), the
estimated hazard ratio increased to 1.65 (a 22% increase). A U-shaped relation was observed between total
cholesterol levels and the risk of all-cause mortality. This association was then examined with a quadratic
model and with a two-slope or V-shaped regression model. In the quadratic fit, the magnitude of the quadratic
total cholesterol term increased threefold after the adjustment for measurement error. In the V fit, the hazard
ratio of all-cause death corresponding to a change in one standard deviation above 214 mg/dL (the nadir of
the V) was 1.15, and increased to 1.49 (by 29%) after the correction. The corresponding hazard ratio of a
change in one standard deviation below 214 mg/dL was 1.11, and increased to 1.37 (by 23%) after the
correction. The authors conclude that the impact of elevated total cholesterol on the risk of coronary disease
and all-cause mortality may be greater than previously estimated with standard methods of analysis. In
addition, the correction for measurement error in total cholesterol and covariates did not explain the excess
mortality associated with low total cholesterol. More research is needed to elucidate the fundamental issues
underlying the U-shaped association, i.e., confounding versus causal implications. Am J Epidemiol 1996;143:
463-71.
bias (epidemiology); cohort studies; coronary disease; mortality; regression analysis; risk factors
A common problem in observational studies is that
measured biologic or behavioral risk factors are only
surrogate indicators of the individual's true level of
exposure (1). A well-known consequence of the error
in measurement is that the slope of the simple regression of y (disease) on x (exposure) is attenuated (2, 3).
This phenomenon has been called "regression dilution
bias" by MacMahon et al. (4).
Because total serum cholesterol concentrations are
subject to intraindividual fluctuations and to laboratory error (5, 6), it follows that the univariate relation
of total cholesterol level with mortality will be subject
to the effect of regression dilution bias. In consequence, previous estimates of the relation of total
cholesterol with risk of coronary heart disease or allcause mortality may be attenuated relative to the true
extent of the association (7-9). However, when multivariate models are used and two or more variables
included in the model are measured with error, the
situation becomes more complex and the observed
regression coefficients may underestimate or overestimate the corresponding true regression coefficients
(3). Recently, a number of papers have appeared (3,
10-13) that describe methods to assess and correct for
measurement error.
Received for publication January 23,1995, and In final form May
30, 1995.
Abbreviations: Cl, confidence interval; ICD-8, International Classification of Diseases, 8th revision; SE, standard error.
1
Institute for Health Promotion and Disease Prevention Research, Department of Preventive Medicine, University of Southern
California School of Medicine, Los Angeles, CA.
2
Present address: Division of Epidemiology, School of Public
Health, University of Minnesota, Minneapolis, MN.
3
Honolulu Epidemiology Research Unit, Division of Epidemiology and Clinical Applications, National Heart, Lung and Blood Institute, Bethesda, MD.
Correspondence to Dr. James H. Dwyer, Institute for Prevention
Research, USC School of Medicine, 1540 Alcazar St., CHP 205, Los
Angeles, CA 90033.
463
464
Iribarren et al.
The aim of this report is to present two empirical
examples of a method for correcting for measurement
error bias using multiple Cox proportional hazards
regression. The method is an extension of a logistic
procedure described by Rosner et al. (13). In the first
example, and as previously shown in the Lipid Research Clinics Mortality Follow-up Study (14) and the
British United Provident Association Study (15), we
hypothesized that the crude (uncorrected) association
of elevated total cholesterol with the risk of coronary
disease death is an underestimation of the true extent
of the hazard of elevated total cholesterol. The second
example involved the application of the correction
procedure to the U-shaped relation of total cholesterol
with all-cause mortality. We hypothesized that the
quadratic relation and thus both arms of a V-shaped fit
will become steeper after the correction. It is anticipated that measurement error is not a likely explanation of the excess death rate in cohort members with
low total cholesterol.
mined by the same method, but sera specimens were
processed in a different laboratory (Kuakini Medical
Center, Honolulu, Hawaii). Deaths during the period
1973-1988 were then ascertained through hospital surveillance, state health department records, and newspaper obituaries. The underlying cause of death was
assigned by a mortality review panel, and coded following the eighth revision of the International Classification of Diseases (ICD-8) (21).
The principal endpoints for this analysis were deaths
due to coronary disease (rubrics 410-414 of the
ICD-8) and all-cause mortality. Less than 2 percent of
the cohort was lost to follow-up during the study
period. Of the total 8,006 men enrolled in the study,
231 had prevalent coronary disease, 68 had cerebrovascular disease, and 58 had cancer at the time of
baseline measurements, and were excluded from the
study. Of the remaining 7,515 men, 1,378 had missing
values in total cholesterol and/or study covariables, or
did not attend examination 3. Thus, the final sample
for analysis comprised 6,137 participants.
MATERIALS AND METHODS
The cohort
Statistical analysis
The Honolulu Heart Program was established in
1965-1968 with the examination of 8,006 JapaneseAmerican men born between 1900 and 1919 and who
lived on the island of Oahu, Hawaii. About 12 percent
of the cohort were migrant farmers from southern
Japan. The principal focus of the study has been the
identification of predictors of coronary disease and
stroke. Specifics of the design, study population, and
methodological procedures have been previously published (16, 17). On entry into the study (1965-1968)
and during the third examination 6 years later (1971—
1974), the following data were obtained from each
subject: sociodemographic factors, individual and
family medical history, weight, height, blood pressure,
non-fasting total cholesterol, as well as cigarette
smoking and alcohol consumption habits. At each
examination, blood pressure was measured once by a
physician and twice by a trained nurse, and the average of the three readings was used in the analysis.
Usual diet was assessed at the first examination with
the 24-hour dietary recall method (18). In 1970, a
random sample of 261 men were invited to participate
in a dietary intake validation study involving the completion of 7-day dietary records. The purpose of this
validation study was to assess within-person variability of nutrient intake (19). In 1965-1968, non-fasting
total cholesterol concentrations were measured by the
Autoanalyzer N-24A method (Technicon Instruments
Corp., Tarrytown, New York) at the US Public Health
Service Laboratory in San Francisco, California (20).
In 1971-1974, non-fasting total cholesterol was deter-
We first obtained descriptive statistics and Pearson
correlations among the study variables. Age-adjusted
mortality rates for coronary disease and all causes
were computed by quintiles of total cholesterol using
the direct method with the entire cohort as the standard
population.
To model survival time as a function of total cholesterol, we used the Cox proportional hazards model
(22, 23). Hazard ratios and 95 percent confidence
intervals are given for corresponding increases of one
standard deviation in each continuous predictor variable and for contrasted levels of categorical predictors.
To remove the effect of age, regression models were
stratified on age at baseline (in individual's years, and
assumed to contain no error).
Tests for linear or quadratic (U-shaped) associations
between total cholesterol and mortality outcomes were
performed with age-adjusted Cox regression on linear
total cholesterol only and with age-adjusted Cox regression of linear plus total cholesterol squared terms,
respectively. A U-shaped relation indicates that the
risk across the independent variable (in this case, total
cholesterol) is elevated at low and at high values of the
predictor and is minimum at the middle range.
One way to parametrize a quadratic risk relation is
to fit a "two-piece" or "V-shaped" regression line to
the data. This parametrization has been described in
the literature as "broken" or "piece-wise" regression
(24). The nadir or breakpoint of the "V" was estimated
as the minimum of the U-curve (25), and was equal to
214 mg/dL.
Am J Epidemiol
Vol. 143, No. 5, 1996
Measurement Error Bias in Cholesterol-Mortality Relations
To correct for the effect of measurement error in
total cholesterol and selected life-style (alcohol consumption, dietary cholesterol, and total calorie intake)
and physiologic characteristics (body mass index and
systolic blood pressure), we employed a multivariate
method described by Rosner et al. (13), and a matrixbased algorithm developed by Dwyer and Sun (Institute for Prevention Research, Department of Preventive Medicine, University of Southern California,
unpublished). The Dwyer-Sun program is a generalization of the Rosner method that applies to different
forms of regressions: linear, probit, logistic, and proportional hazards. In the current analysis, smoking
status (past, continuing smoking, and quitting, relative
to never smokers) was also included in the multivariate models, but no attempt was made to correct for
possible errors in the ascertainment of smoking status.
We entered a categorical variable for alcohol abstinence (presumed fully accurate), because abstainers
have been shown to be a group with compromised
health (26). The Dwyer-Sun algorithm yields measurement error corrected point and confidence limit estimates of proportional hazards regression coefficients
with covariates assumed to be measured with or without error. This analytic approach requires the availability of estimates of between- and withinperson components of variance. These are usually
determined via analysis of repeated measures or by
validation studies. In our analysis, we used two observations of total cholesterol and covariates (examinations 1 and 3, 6 years apart), and two observations of
dietary cholesterol and total calorie intake (one the
baseline 24-hour dietary recall, and the other the average of seven consecutive daily food records supplied
by a random subsample of 261 cohort members about
2 or 3 years later).
The method of Rosner et al. (13) for measurement
error correction involves four steps. First, uncorrected
or crude Cox regression coefficients are estimated for
each study outcome using the main study data (examination 1 and 24-hour dietary recall). Second, a multivariate linear model is performed in which the repeated and validation substudy data (examination 3
and average of 7-daily food records) are correlated
with individually paired data from the main study.
Third, the corrected regression estimates are obtained
by weighting the unadjusted coefficients by the
variance-covariance matrix relating the main study
variables with the repeated or validation study variables. Finally, the variance of the corrected estimate is
calculated using formulas based on the multivariate
delta method (3, 27) (also called the method of statistical differentials), and asymptotic 100(1 — a) percent
Am J Epidemiol
Vol. 143, No. 5, 1996
465
confidence limits for the corrected regression coefficients are derived as /3* ± Z\-an (var fi*)m. The
var O*) is the estimated variance and a is the type I
error (a = 0.05 in the current analysis). Var (0*)
includes a component attributable to the estimation of
the repeated correlations, and is obtained by expanding the risk function in a truncated Taylor series (27).
Because the linear model was adequate to describe
the relation of total cholesterol with coronary disease
mortality, example 1 is based on the correction for
measurement error in linear total cholesterol and covariates. The relation between total cholesterol and
all-cause mortality is U-shaped, and therefore example
2 comprises two different approaches: first, a multivariate quadratic model (linear total cholesterol, total
cholesterol squared, plus covariates); and second, a
segmented multivariate V-shaped regression model
(converging near the estimated total cholesterol value
of minimal mortality, plus covariates).
RESULTS
Table 1 presents descriptive statistics of all the predictor variables in the main sample of 6,137 men,
TABLE 1. Descriptive statistics of study variable* among a
cohort of 6,137 middle-aged men of Japanese descent In the
Honolulu Heart Program*
Malnsampk 3 (/l = 6,137)
Non-dietary
variables
Age (years), mean (SDt)
Serum cholesterol (mg/dL),
mean (SD)
Body mass index (kg/m*),
mean (SD)
Systolic blood pressure
(mmHg), mean (SD)
Current smokers (%)
Alcohol consumption
(oz/montht), mean (SD)
Alcohol abstainers (%)
At baseline
(1965-1968)
At examination 3
(1971-1974)
53.9 (5.4)
59.9
(5.4)
218 (36)
216
(36)
23.9 (3.0)
23.7 (3.0)
133 (20)
45.3
136 (20)
34.2
13.6 (23.6)
35.5
13.5 (24.4)
30.8
Dietary validation sample (n = 261)
Dietary
variables
Dietary cholesterol
(mg/day), mean (SD)
Total calorie Intake (Kcal/
day), mean (SD)
24-hour dietary
recall
(1965-1968)
7-day
dietary records
(1970)
542 (317)
528 (182)
2,318 (662)
2,300 (511)
* Men with prevalent coronary disease, stroke, or cancer at
baseline, or with missing values on study covariates at baseline or
examination 3 variables are excluded.
t SD, standard deviation.
$ 1 oz = 28.35 g.
466
Iribarren et al.
Example 1: total cholesterol and coronary
mortality
together with the intake of dietary cholesterol and total
calories in the validation sample of 261 men. There
were slight reductions in average levels of serum cholesterol and body mass index, and a small increase in
average systolic blood pressure from examination 1 to
examination 3. The standard deviations of dietary cholesterol and total calorie intake measured by 7-day
dietary records were appreciably smaller than the corresponding standard deviations of the same dietary
variables measured with a 24-hour dietary recall (18.2
vs. 31.2 and 727 vs. 511, respectively). Results in table
1 also indicate a smaller proportion of current smokers
and a tendency toward less alcohol abstinence in examination 3.
Total cholesterol at baseline showed a weak positive
correlation with body mass index (r = 0.12), systolic
blood pressure (r = 0.07) and dietary cholesterol (r =
0.06), and a weak negative correlation with alcohol
consumption (r = —0.05). Autocorrelations (i.e., correlation of a variable at examination 1 with the same
variable at examination 3) were high for body mass
index (r = 0.89) and alcohol consumption (r = 0.72),
and moderate for total cholesterol (r = 0.61) and
systolic blood pressure (r = 0.64). The correlation
between dietary cholesterol and total calories from the
24-hour dietary recall method and the same variables
measured by 7-day records were r = 0.33 and r =
0.46, respectively, in the 261 participants in the collection of 7-day dietary records.
There were 197 documented coronary disease
deaths, comprising about 13 percent of all deaths and
50 percent of all cardiovascular disease deaths. Ageadjusted coronary disease mortality rates per 100 men
increased consistently with the level of total cholesterol: 1.7 (<189 mg/dL), 2.7 (189-207 mg/dL), 3.2
(208-226 mg/dL), 3.1 (227-247 mg/dL), and 5.9
(>247 mg/dL), respectively (p linear trend = 0.0001).
Table 2 presents the uncorrected and measurement
error corrected multivariate hazard ratios of coronary
disease death associated with differences of one standard deviation in continuous variables, and the corresponding hazards of former smokers, continuing
smokers, and quitters, relative to never smokers. The
risk from alcohol abstinence is given in relation to any
alcohol consumption.
A difference of one standard deviation (38 mg/dL)
of total cholesterol was associated with a 1.35 "crude"
increase in coronary disease mortality. When the correction for measurement error was performed, the hazard increased to 1.65 (a 22 percent increase). The
association of systolic blood pressure with the risk of
death from coronary disease was also considerably
augmented after the correction for measurement error
(1.57 uncorrected vs. 1.98 corrected). Dietary cholesterol was a weak (and non-statistically significant)
TABLE 2. Coronary disease mortality among a cohort of 6,137 middle-aged men of Japanese descent,
the Honolulu Heart Program, 1973-1988: uncorrected and corrected multivariate hazard ratios (95%
confidence Intervals (Cl)) (n = 197 coronary disease deaths)*
Predictor
vanflCMas
(1 SD change)
Serum cholesterol (38 mg/dL)
Body mass index (3.1 kg/m*)
Systolic blood pressure (21
mmHg)
Alcohol consumption (24 oz/
month§)
Dietary cholesterol (300 mg/day)
Total calories (700 Kcal/day)
Past smoking vs. never smoking
Continuing smoking vs. never
smoking
Quitting smoking vs. never
smoking
Alcohol abstinence vs. any
alcohol
Hazard ratlot (95% Cl)
%
Uncorrected
Corrected
Changed
1.35(1.19-1.54)
1.20(1.04-1.38)
1.65(1.34-2.03)
1.19 (1.01-1.40)
+22.2
-0.8
1.57(1.38-1.78)
1.98(1.64-2.39)
+26.1
0.87 (0.72-1.06)
1.05(0.91-1.22)
0.96(0.82-1.12)
1.22(0.81-1.84)
0.81 (0.59-1.12)
1.26(0.56-2.80)
0.96(0.59-1.56)
1.25(0.82-1.90)
-6.8
+20.0
0.0
+2.4
2.22(1.54-3.22)
2.20(1.48-3.27)
-0.1
2.32(1.48-3.63)
2.08(1.31-3.31)
-10.3
1.39(1.01-1.90)
1.38(0.96-1.98)
-0.7
* Men with prevalent coronary disease, stroke, or cancer at baseline are excluded.
t Hazard ratios are estimated for 1 SD (standard deviation) Increase in continuous variables, and relative to
never smokers and any alcohol consumption, respectively, for smoking status and alcohol abstinence.
+. % change is estimated as ((corrected - uncorrected)/uncorrected) x 100.
§ 1 oz = 28.35 g.
Am J Epidemiol
Vol. 143, No. 5, 1996
Measurement Error Bias in Cholesterol-Mortality Relations
positive risk factor for fatal coronary disease. It is
noteworthy that the relative risk associated with an
increase of 300 mg/day in dietary cholesterol (holding
the total calorie intake constant) rose from 1.05 to 1.26
after the multivariate correction for measurement error
bias (about 20 percent increase). However, this deattenuation was accompanied by an extremely wide 95
percent confidence interval, including unity. Conversely, correction for errors of measurement did not
have an appreciable impact on the direct relation between body mass index and the inverse relation of
alcohol consumption with the risk of coronary death.
Among the variables assumed to contain no error, it
was found that smoking quitters, continuing smokers
(in relation to never smokers), and abstainers from
alcohol (in relation to any alcohol consumption) were
at increased risk of coronary disease mortality.
error (SE) = 0.0OOOO93; t = 3.6). To facilitate the
interpretation of the U-shaped relation, we estimated
the hazard ratios for changes of 1 standard deviation
(SD) (38 mg/L) above and below the nadir of the V
(that is, the intersection of the slopes at 214 mg/dL)
using a V-shaped regression model (table 3). An increase of 38 mg/dL above 214 mg/dL was associated
with a hazard ratio of 1.15. A decrease of 38 mg/dL
from the breaking point of the V predicted a hazard
ratio of 1.11. Corresponding corrected estimates were
about 29 percent higher for the hazard derived from
the right slope (total cholesterol change toward higher
level), and 23 percent higher for the hazard derived
from the left slope (total cholesterol change toward
lower level).
When the correction was applied to the multivariate
quadratic model, the coefficient for total cholesterol
squared became about three times larger (/3 =
0.000113; SE = 0.000044; t = 2.5).
Systolic blood pressure and alcohol consumption
were significant risk factors for all-cause mortality
among men of the Honolulu cohort. After the correction, the risk associated with these variables increased
by a moderate amount (11 percent for systolic blood
pressure and 6 percent for alcohol consumption). No
impact on all-cause mortality was observed for body
mass index, dietary cholesterol, and total calorie intake. On the other hand, all smoking categories (rela-
Example 2: total cholesterol and all-cause
mortality
There were a total of 1,490 deaths in the 16 years of
follow-up among the study subjects. Age-adjusted
mortality rates (per 100 persons) by quintiles of total
cholesterol at baseline were 25.5, 22.0, 22.1, 25.4, and
26.3, respectively, evidencing a quadratic relation of
all-cause mortality with the level of total cholesterol
(/3 for total cholesterol squared = 0.0000339; standard
TABLE 3. All-cause mortality among a cohort of 6,137 middle-aged men of Japanese descent, the
Honolulu Heart Program, 1973-1988: uncorrected and corrected multivariate hazard ratios (95%
confidence Intervals (Cl)) (n = 1,490 deaths)*
Predictor
vaiiiimoo
(1 SD change)
Serum cholesterol
+38 mg/dL above 214 mg/dL
-38 mg/dL below 214 mg/dL
Body mass Index (3.1 kg/m*)
Systolic blood pressure (21
mmHg)
Alcohol consumption (24 oz/
month§)
Dietary cholesterol (300 mg/day)
Total calories (700 calories)
Past smoking vs. never smoking
Continuing smoking vs. never
smoking
Quitting smoking vs. never
smoking
Alcohol abstinence vs. any
alcohol
Hazard ratlot (95% Cl)
%
Uncorrected
Corrected
Change^
1.15(1.07-1.24)
1.11 (1.00-1.22)
1.02(0.96-1.07)
1.49 (1.21-1.84)
1.37(1.06-1.76)
1.00(0.94-1.07)
+29.5
+23.4
-0.02
1.25(1.19-1.31)
1.39(1.28-1.49)
+11.2
1.10(1.04-1.16)
1.02(0.97-1.08)
0.92 (0.87-0.97)
1.18 (1.02-1.37)
1.17(1.06-1.29)
1.08(0.81-1.45)
0.84(0.70-1.01)
1.17(1.00-1.36)
-6.3
+5.8
-8.6
+2.4
2.10(1.84-2.40)
1.98(1.70-2.29)
-0.8
1.78(1.50-2.11)
1.70(1.42-2.04)
-4.5
1.22(1.08-1.36)
1.25(1.10-1.43)
+2.0
• Men with prevalent coronary disease, stroke, or cancer at baseline are excluded.
t Hazard ratios are estimated for 1 SD (standard deviation) change in continuous variables, and relative to
never smokers and any alcohol consumptton, respectively, for smoking status and alcohol abstinence.
+. % change is estimated as ((corrected - uncorrected)/uncorrected) x 100.
§ 1 oz = 28.35 g.
Am J Epidemiol
Vol. 143, No. 5, 1996
467
468
Iribarren et al.
tive to never smokers) and alcohol abstinence were
important determinants of death.
DISCUSSION
The aim of this analysis was to reduce the effect of
measurement error in the association between total
cholesterol and coronary disease mortality and between total cholesterol and all-cause mortality in the
Honolulu Heart Program. The corrected relative risk
estimates can be interpreted as the true measure of
association given that only random (unbiased) but not
systematic (biased) within-person errors were present
in total cholesterol and related confounders (body
mass index, systolic blood pressure, alcohol consumption, dietary cholesterol, and total calorie intake).
The sources of random within-person error in the
assessment of total cholesterol level are well documented. They consist of within-person seasonal (28),
day-to-day (29), or diurnal variation (30), as well as
possible technical errors in the laboratory (e.g., omissions in the protocol, variability of reagents, or faulty
instruments). The principal source of systematic error
in total cholesterol is actual physiologic change due to
life-style modification or disease.
Within-person random error and consequent regression dilution are closely related to the phenomenon of
regression toward the mean, a problem first described
by Galton (31) after he observed the stature of parents
and their offspring. Galton concluded that the stature
of the offspring tended toward values closer to the
mean population height. Regression toward the mean
in total cholesterol level implies that extreme readings
at baseline are more likely than not to shift toward
central values at a subsequent reexamination. This
phenomenon can be illustrated in the Honolulu cohort.
Men with total cholesterol values in the lowest quintile
at baseline had a mean total cholesterol concentration
of 170 mg/dL in 1965-1968, but the same men had a
mean value of 186 mg/dL at examination 3. Similarly,
the mean total cholesterol concentration in men in the
highest quintile at baseline was 272 mg/dL, and had
regressed 6 years later to 249 mg/dL.
In corroboration of the first hypothesis of the study,
we noted an increase in the hazard ratio of fatal coronary disease associated with a difference of 38 mg/dL
in total cholesterol after correction for measurement
error (from 1.35 to 1.65, a 22 percent increase). A
systolic blood pressure higher by 21 mmHg was associated with a crude 1.6 hazard ratio. However, when
measurement errors were accounted for, the hazard
ratio for the same difference in systolic blood pressure
was estimated to be about 2.0 (26 percent change in
the hazard ratio). The increased importance of blood
pressure as a coronary disease risk factor after correction for measurement error has been reported by
MacMahon et al. (4).
Total calorie intake had a weak protective effect on
coronary death risk, a circumstance that was unchanged after correction for measurement error. On
the other hand, the corrected risk associated with an
increase of 300 mg/day of dietary cholesterol was 20
percent greater than the uncorrected risk (1.26 vs.
1.05). Because the blood level of total cholesterol is
likely to be in the causal pathway of the dietary
cholesterol-coronary disease relation, we also estimated the risk of coronary disease death associated
with this nutrient without adjusting for total cholesterol in the multivariate model. The corresponding
crude estimate was 1.07 (95 percent CI 0.92-1.24) and
the corrected estimate was 1.32 (95 percent CI 0.592.93).
Connor and Connor (32) demonstrated a significant
correlation between cholesterol intake and death rates
from arteriosclerotic and degenerative heart disease in
24 countries. In the Western Electric Study (33), the
Ireland - Boston Diet-Heart Study (34), and the
Zutphen Study (35), dietary cholesterol was an independent risk factor of coronary disease mortality. In
the Honolulu Heart Program, dietary cholesterol intake was significantly higher among incident cases of
definite coronary disease (nonfatal myocardial infarction, coronary disease death, or sudden death within
one hour of coronary disease-type chest pain) after 10
years of follow-up than among non-cases (36). Other
studies, however, have failed to find associations between dietary lipids and the risk of coronary disease
within defined populations (37). Interestingly, our corrected results support the evidence a role of dietary
cholesterol in the development of coronary disease, as
previously reported in ecologic and population-based
data.
A controversy exists about the U-shaped association
between total cholesterol and all-cause mortality, arising from a (causal) positive association between high
total cholesterol and atherosclerotic disease, and a
negative association (of uncertain nature) between low
total cholesterol and nonatherosclerotic conditions
(apparently in men but not in women) (38). In this
paper, we focused on the potential impact of measurement error in this U-shaped association between total
cholesterol and mortality. Regression dilution bias has
been cited as an unlikely mechanism behind the Ushaped association between total cholesterol and allcause mortality (39) because, if present, it may increase rather than reduce the U-shape of the
association. The current analysis is an empirical demAm J Epidemiol
Vol. 143, No. 5, 1996
Measurement Error Bias in Cholesterol-Mortality Relations
onstration of this hypothesis. However, this analysis
does not address the other fundamental issues underlying the U-shaped relation, namely confounding by
other risk factors, reverse causality by undetected or
prevalent disease, potential interaction mechanisms, or
causal implications.
A reanalysis of the Honolulu data (40) and a recent
comprehensive review of the area (41) suggest that the
excess mortality linked to low total cholesterol may be
attributable to a failure to adequately control for diseases and behaviors (i.e., heavy smoking) leading to
reduced total cholesterol level and eventually to death.
On the other hand, Jacobs (42) has discussed potential
etiologic implications of low total cholesterol in intracranial hemorrhage, pulmonary disease, and infections.
A few methodological shortcomings of the current
analysis should be pointed out. First, within-person
variability in serum cholesterol and the covariates
was assumed to be solely due to random fluctuation.
In reality, however, it is likely that systematic together with random fluctuations in serum cholesterol
and related risk factors may have occurred. For example, since the repeated measures used in this analysis were taken an average of 6 years apart, it is
likely that some true biologic change may have
taken place. If this is the case, the correction procedure is anticonservative and may actually have led
to overcorrection of the risk estimates. Second, all
the exposures need to conform to the normal or
Gaussian distribution. All of the variables used in
the analysis were approximately normal, with the
exception of alcohol consumption. As a check of robustness in this instance, the continuous alcohol intake variable was logarithmically transformed and
the models reestimated. The risk estimates and confidence intervals were very similar. Finally, in the
case of the dietary variables, it was necessary to assume that the errors from the two dietary measures
(24-hour dietary recall and the average of the 7-day
records) were uncorrelated (e.g., that they fluctuate
randomly with respect to each other). This assumption implies that participants who completed both
the 24-hour and the 7-day dietary records were unlikely to either overreport or underreport their dietary intake in both occasions. The random error in
dietary measures includes actual variation over time,
imprecise recall of foods consumed, and errors in
portion size, as well as random recording and data
entry errors (43).
In the current analysis, we chose to include only
dietary cholesterol and total calories because of the
high intercorrelation (i.e., multicollinearity) of macronutrients, which makes it difficult to separate out
Am J Epidemiol
Vol. 143, No. 5, 1996
469
independent effects of fat, protein, carbohydrate, or
dietary cholesterol in multivariate models (19, 36).
Recently, several studies have been published that
employ methods to correct for exposure measurement
error in chronic disease epidemiology: colon cancer
and diet (44); breast cancer and dietary fat, calories,
and alcohol (13); blood pressure, stroke, and coronary
disease (4); and total cholesterol and risk of coronary
disease (14, 15, 45). As recognized by a recent review
on this topic (3), a much wider application of these
methods, particularly in the area of nutritional epidemiology, is desirable.
However, this analytical method is not without some
liabilities. It enables the estimation of deattenuated
risk relations, but at the expense of increased variance.
Furthermore, as seen in the examples presented here,
this effect is sensitive to the size of the validation
cohort.
Nevertheless, the question of measurement error
may have important consequences for conducting
epidemiologic research. First, it underscores the need
for reproducibility and validation studies, therefore
establishing some potential new priorities when planning and implementing studies. Second, from an etiologic point of view, measurement error correction may
help clarify causal mechanisms of disease, or may
support existing ones. Finally, correcting for regression dilution bias in associations between risk factors
and disease may have public health implications, because corrected estimates could substantially alter
population attributable risks. Therefore, there is a
growing consensus that the design, implementation,
and analytical phases of future studies should consider
the possibility and consequences of exposure measurement errors.
In conclusion, the example presented here illustrates
the feasibility and the precautions that one should bear
in mind when implementing procedures for measurement error correction. Our data suggest that the influence of elevated serum total cholesterol may have a
more important influence on the risk of coronary heart
disease and all-cause mortality than has been previously estimated with the use of standard methods of
analysis. In addition, correction for measurement error
in total cholesterol and study covariates did not explain the excess mortality associated with low total
cholesterol level. On the contrary, the hazard of low
total cholesterol became larger after the correction.
More experience and careful application of methods
for correcting for measurement error will allow researchers to balance the deattenuation of associations
with the increased uncertainty introduced by the correction procedure.
470
Iribarren et al.
ACKNOWLEDGMENTS
Supported by research contract nos. NO1-HC-02901 and
NO1-HV-02901 from the National Heart, Lung and Blood
Institute.
The authors thank Peter Hannan for his helpful comments
on the manuscript.
18.
19.
20.
REFERENCES
1. Cochran WG. Errors of measurement in statistics. Technometrics 1968;10:637-66.
2. Michalek JE, Tripathi RC. The effects of errors in diagnosis
and measurement on the estimation of the probability of an
event. JASA 1980;75:713-21.
3. Thomas D, Stram D, Dwyer JH. Exposure measurement error:
influence on exposure-disease relationships and methods of
correction. Ann Rev Public Health 1993;14:69-93.
4. MacMahon S, Peto R, Cutler J, et al. Blood pressure, stroke
and coronary heart disease. I. Prolonged differences in blood
pressure: prospective observational studies corrected for the
regression dilution bias. Lancet 1990;335:765-74.
5. Roeback JR, Cook JR, Guess HA, et al. Time-dependent
variability in repeated measurements of cholesterol levels:
clinical implications for risk misclassification and intervention
monitoring. J Clin Epidemiol 1993;46:1159-71.
6. Thompson SG, Pocock SJ. The variability of serum cholesterol measurements: implications for screening and monitoring. J Clin Epidemiol 1990;43:783-9.
7. Kagan A, McGee DL, Yano K, et al. Serum cholesterol and
mortality in a Japanese-American population. Am J Epidemiol
1981;114:11-20.
8. Stemmermann GN, Chyou PH, Kagan A, et al. Serum cholesterol and mortality among Japanese-American men: the
Honolulu (Hawaii) Heart Program. Arch Intern Med 1991;
151:969-72.
9. Frank JW, Reed DM, Grove JS, et al. Will lowering population levels of serum cholesterol affect total mortality? Expectations from the Honolulu Heart Program. J Clin Epidemiol
1992;45:333-46.
10. Prentice RL. Covariate measurement errors and parameter
estimation in a failure time regression model. Biometrika
1982;69:331-42.
11. Rosner B, Willett WC, Spiegelman D. Correction of logistic
regression relative risk estimates and confidence intervals for
systematic within-person measurement error. Stat Med 1989;
8:1051-71.
12. Clayton D. Models for the longitudinal analysis of cohort and
case-control studies with inaccurately measured exposures. In:
Dwyer JH, Feinleib M, Hoffmeister H, eds. Statistical models
for longitudinal studies of health. New York: Oxford University Press, 1992:301-31.
13. Rosner B, Spiegelman D, Willett WC. Correction of logistic
regression relative risk estimates and confidence intervals for
measurement error the case of multiple covariates measured
with error. Am J Epidemiol 1990; 132:734-45.
14. Davis CE, Rifkind BM, Brenner H, et al. A single cholesterol
measurement underestimates the risk of coronary heart disease. An empirical example from the Lipid Research Clinics
Mortality Follow-up Study. JAMA 1990;264:3044-6.
15. Law MR, Wald NJ, Wu T, et al. Systematic underestimation
of association between serum cholesterol concentration and
ischemic heart disease in observational studies: data from the
BUPA study. BMJ 1994;308:363-6.
16. Worth RM, Kagan A. Ascertainment of men of Japanese
ancestry in Hawaii through World War II selective service
registration. J Chronic Dis 1970;23:389-97.
17. Belsky JL, Kagan A, Syme DL. Epidemiologic studies of
coronary heart disease and stroke in Japanese men living in
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
Japan, Hawaii and California: research plan. Technical report
no. 12-71. Hiroshima, Japan: Atomic Bomb Casualty Commission, 1971.
Tillotson JT, Kato H, Nichaman MZ, et al. Epidemiology of
coronary heart disease and stroke in Japanese men living in
Japan, Hawaii, and California. Methodology for comparison
of diet. Am J Clin Nutr 1973;26:177-84.
McGee D, Rhoads G, Hankin J, et al. Within-person variability of nutrient intake in a group of Hawaiian men of Japanese
ancestry. Am J Clin Nutr 1982;36:657-63.
Technicon Instruments Corporation. Technicon AutoAnalyzer methodology. Tarrytown, NY: Technicon Corporation, 1965.
National Center for Health Statistics. International classification of diseases, adapted for use in the United States. 8th
revision. Washington, DC: US Department of Health, Education, and Welfare, 1967. (PHS publication no. 1693)
Cox DR. Regression models and life tables (with discussion).
J R Stat Soc 1972;34:187-220.
SAS Institute Inc. SAS/STAT software version 6.09. The
PHREG procedure. Cary, NC: SAS Institute Inc, 1994.
Neter J, Wasserman W, Kutner MH. Applied linear statistical
models. 3rd ed. Boston, MA: Irvan, 1990:370-3.
Aiken LS, West SG. Multiple regression: testing and interpreting interactions. 3rd ed. London: Sage Publications, 1991:
67-78.
Shaper AG. Alcohol and mortality: a review of prospective
studies. Br J Addict 1990;85:837-47.
Fuller WA. Measurement error models. New York: John
Wiley & Sons, 1987, Chapter 1, Appendix lA:85-8.
Bleiler RE, Yearick ES, Schnur SS, et al. Seasonal variation of
cholesterol in serum of men and women. Am J Clin Nutr
1963;12:12-16.
Elveback LR, Weidman WH, Ellefson RD. Day-to-day variability and analytical error in determinations of lipids in children. Mayo Clinic Proc 1980;55:267-9.
Peterson JE, Wilcox AA, Haley MI, et al. Hourly variation in
total serum cholesterol. Circulation 1960;22:247-53.
Galton F. Regression towards mediocrity in hereditary stature.
J Anthropol Inst 1886;15:246-63.
Connor WE, Connor SL. The key role of nutritional factors in
the prevention of coronary heart disease. Prev Med 1972;1:
49-83.
Stamler J, Dyer AR, Shekelle RB, et al. Relationship of
baseline major risk factors to coronary and all-cause mortality,
and to longevity: findings from long-term follow-up of Chicago cohorts. Cardiology 1993;82:191-222.
Kushi LH, Lew RA, Stare FJ, et al. Diet and 20-year mortality
from coronary heart disease: the Ireland-Boston Diet-Heart
Study. N Engl J Med 1985;312:811-18.
Kromhout D, Bosschieter EB, De Lezenne Coulander C. The
inverse relation between fish consumption and 20-year mortality from coronary heart disease. N Engl J Med 1985;312:
1205-9.
McGee DL, Reed DM, Yano K, et al. Ten-year incidence of
coronary heart disease in the Honolulu Heart Program: relationship to nutrient intake. Am J Epidemiol 1984; 119:667-76.
Gordon T, Kagan A, Garcia-Palmieri M, et al. Diet and its
relation to coronary heart disease in three populations. Circulation 1981;63:500-15.
Jacobs DR Jr, Blackburn H, Higgins M, et al. Report of the
Conference on Low Blood Cholesterol: mortality associations.
Circulation 1992;86:lt)46-60.
Hulley SB, Walsh JMB, Newman TB. Health policy on blood
cholesterol. Time to change directions. (Editorial). Circulation
1992;86:1026-9.
Iribarren C, Reed DM, Burchfiel CM, et al. Serum total
cholesterol and mortality: confounding factors and risk modification in Japanese-American men. JAMA 1995;273:
1926-32.
Meilahn EN, Ferrel RE. "Naturally occurring" low blood
cholesterol and excess mortality. Coron Artery Dis 1993;4:
Am J Epidemiol Vol. 143, No. 5, 1996
Measurement Error Bias in Cholesterol-Mortality Relations
843-53.
42. Jacobs DR Jr. Why is low blood cholesterol associated with
risk of nonatherosclerotic disease death? Annu Rev Publ
Health 1993;14:95-114.
43. Willett W. Nutritional epidemiology. New York: Oxford University Press, 1990:272-91.
Am J Epidemiol
Vol. 143, No. 5, 1996
471
44. Armstrong BG, Whittemore AS, Howe GR. Analysis of casecontrol data with covariate measurement error application to
diet and colon cancer. Stat Med 1989;8:1151-63.
45. Dwyer JH, Li L, Curtin R, et al. Correction for non-sampling
measurement error in risk factors for CHD in women from
Framingham. (Abstract). Circulation 1992;85:879.