The National Cancer Institute Diet History Questionnaire: Validation

American Journal of Epidemiology
Copyright ª 2005 by the Johns Hopkins Bloomberg School of Public Health
All rights reserved; printed in U.S.A.
Vol. 163, No. 3
DOI: 10.1093/aje/kwj031
Advance Access publication December 7, 2005
Practice of Epidemiology
The National Cancer Institute Diet History Questionnaire: Validation of Pyramid
Food Servings
Amy E. Millen1, Douglas Midthune2, Frances E. Thompson3, Victor Kipnis2, and Amy F. Subar3
1
Department of Social and Preventive Medicine, School of Public Health and Health Professions, State University of
New York at Buffalo, Buffalo, NY.
2
Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, MD.
3
Risk Factor Monitoring and Methods Branch, Applied Research Program, Division of Cancer Control and Population
Sciences, National Cancer Institute, Bethesda, MD.
Received for publication June 1, 2005; accepted for publication September 13, 2005.
The performance of the National Cancer Institute’s food frequency questionnaire, the Diet History Questionnaire
(DHQ), in estimating servings of 30 US Department of Agriculture Food Guide Pyramid food groups was evaluated
in the Eating at America’s Table Study (1997–1998), a nationally representative sample of men and women aged
20–79 years. Participants who completed four nonconsecutive, telephone-administered 24-hour dietary recalls
(n ¼ 1,301) were mailed a DHQ; 965 respondents completed both the 24-hour dietary recalls and the DHQ. The
US Department of Agriculture’s Pyramid Servings Database was used to estimate intakes of pyramid servings for
both diet assessment tools. The correlation (q) between DHQ-reported intake and true intake and the attenuation
factor (k) were estimated using a measurement error model with repeat 24-hour dietary recalls as the reference
instrument. Correlations for energy-adjusted pyramid servings of foods ranged from 0.43 (other starchy vegetables)
to 0.84 (milk) among women and from 0.42 (eggs) to 0.80 (total dairy food) among men. The mean q and k after
energy adjustment were 0.62 and 0.60 for women and 0.63 and 0.66 for men, respectively. This food frequency
questionnaire validation study of foods measured in pyramid servings allowed for a measure of food intake
consistent with national dietary guidance.
data collection; food; nutrition assessment; nutrition surveys; questionnaires; validation studies [publication type]
Abbreviations: CSFII, Continuing Survey of Food Intakes by Individuals; DHQ, Diet History Questionnaire; EATS, Eating at
America’s Table Study; FFQ, food frequency questionnaire; OPEN, Observing Protein and Energy Nutrition.
The food frequency questionnaire (FFQ) is a practical
approach to measuring dietary exposure in epidemiologic
studies investigating relations between diet and disease.
However, relative risk estimates derived from FFQs may
be biased or attenuated because of substantial FFQ measurement error (1, 2).
Researchers at the National Cancer Institute developed
a new cognitively based FFQ, the Diet History Questionnaire
(DHQ), which is designed to minimize measurement error
through improvements in questionnaire design, wording,
layout, and database development (3). One unique feature of
the DHQ database is the inclusion of the Pyramid Servings
Database (4). The Pyramid Servings Database allows for
the conversion of questionnaire responses into quantitative
estimates, called ‘‘pyramid servings,’’ of 30 different food
groups in the US Department of Agriculture’s original Food
Guide Pyramid, issued in 1992 (5). The latest Food Guide
Pyramid, introduced in April 2005, recommends intakes of
foods in common household measures which are directly
translated from the 1992 Pyramid Servings Database.
Correspondence to Dr. Amy E. Millen, Department of Social and Preventive Medicine, Farber Hall, Room 270, State University of New York
at Buffalo, 3435 Main Street (South Campus), Buffalo, NY 14214-8001 (e-mail: [email protected]).
279
Am J Epidemiol 2006;163:279–288
280 Millen et al.
Pyramid servings represent a standardized classification
scheme for estimating food intake consistent with dietary
guidance. Having a pyramid servings database for an FFQ is
an innovation in that component ingredients of disaggregated food mixtures are systematically assigned to the appropriate guidance-based food groups (6), thereby allowing
for assessment of the full, continuous range of reported intakes of foods. In addition, it allows for the quantitative assessment of added sugar and discretionary fat from all food
sources, previously unavailable from dietary databases. The
Pyramid Servings Database provides an additional, potentially more precise metric, different from previously used metrics of frequencies, servings (derived by different methods),
or grams, and thus warrants evaluation.
The DHQ’s ability to assess nutrient intake was previously
evaluated in the Eating at America’s Table Study (EATS) (3).
The purpose of this research was to validate the DHQ’s
ability to assess intake (in pyramid servings) of 30 US Department of Agriculture food groups and subgroups (4). We
used the EATS data to compare intakes of pyramid servings
between the DHQ and 24-hour dietary recalls (reference instrument). The standard measurement error model (7) for
nutrients is not directly applicable to foods, because of the
nonnegligible probability of zero intake on a given day for
most foods. In this paper, we present a measurement error
model for foods and show that under this model one can use
the standard estimation procedures to obtain consistent estimates of the correlation with true intake and the attenuation
factor, the multiplicative bias in the estimated log relative
risk of disease due to measurement error in the DHQ.
Evaluation of FFQ-reported food intake is important, because nutritional epidemiologists are increasingly interested
in the relations of foods and dietary patterns with disease
outcomes (8). Additionally, because dietary recommendations are phrased in terms of pyramid servings or equivalent
measures of food intake, directly estimating these from
FFQs is advantageous for epidemiologic studies, since it
will be possible to interpret findings in amounts and terms
consistent with public health food recommendations in the
new Food Guide Pyramid.
MATERIALS AND METHODS
Sample and study design
EATS, which began in August 1997, used random digit
dialing techniques to obtain a nationally representative sample of persons aged 20–79 years, balanced by gender. A detailed description of the sampling process and methods used
has been published elsewhere (3). There were 1,500 participants (n ¼ 738 women, n ¼ 762 men) who completed at least
one of four nonconsecutive, telephone-administered 24-hour
dietary recalls. Of these persons, 1,301 completed all four
24-hour dietary recalls and were eligible to receive the DHQ
by mail. Participants were also randomized into two groups
and were sent either the Block FFQ (9) or the Willett FFQ
(10) by mail, in addition to the DHQ. Participants were asked
to fill out the DHQ and the other mailed FFQ (either the
Block FFQ or the Willett FFQ) 1 month apart. Participants
were instructed to mail back the completed questionnaires.
Of the 1,301 participants who completed all four 24-hour
dietary recalls, 1,000 mailed back the DHQ.
Twenty-four-hour dietary recalls
The four 24-hour dietary recalls were scheduled to be
administered 3 months apart (one per season) and collected
between September 1997 and August 1998, using the
multiple-pass method developed for the 1994–1996 Continuing Survey of Food Intakes by Individuals (CSFII) (11).
The 24-hour dietary recall data were coded using the
Food Intake Analysis System (version 3.0), developed at
the University of Texas, which uses food codes from the
CSFII. For foods reported but not found in the CSFII database (n ¼ 14), a food which provided the best match with
regard to food description, total energy, and macronutrient
content was chosen from the CSFII data files released
through 1998 to approximate pyramid serving values.
Food frequency questionnaire
The DHQ, described previously (3), queries respondents
about their frequency of intake of 124 separate food items
and asks about portion sizes for most of these items by
providing a choice of three ranges. For 44 of the 124 foods,
1–7 additional nested questions are asked about related factors such as seasonal intake, food type (e.g., low-fat, lean,
diet, caffeine-free), fat use, or fat additions. The DHQ also
includes additional questions about the use of low-fat foods.
Of the 1,301 DHQs sent to participants, 1,000 were returned. Thirty-five were deemed incomplete and excluded
from further analysis because the respondents skipped two
or more adjacent pages of the 36-page DHQ booklet. This
left 519 women and 446 men for further analyses.
Pyramid Servings Database
We used the Pyramid Servings Database corresponding to
the 1994–1996 CSFII (11), which provides the number of
servings of each of the Food Guide Pyramid’s food groups
and the amounts of discretionary fat (in grams) and added
sugar (in teaspoons) contained in 100 g of every food included in the survey. It utilizes a recipe file to disaggregate
food mixtures into their component ingredients and assigns
them to food groups. Pyramid servings were added to the
DHQ database, in addition to the nutrient content of foods,
on the basis of methods previously described (6). Since
EATS’ 24-hour dietary recalls and the DHQ both utilized
the Pyramid Servings Database, pyramid servings of the 30
US Department of Agriculture major food groups and subgroups could be determined for both dietary measurement
tools. To our knowledge, pyramid servings have not been
added to either the Block or the Willett FFQ database,
thereby precluding similar analyses of those tools.
Statistical analyses
We evaluate the ability of the DHQ to measure food
group intakes in epidemiologic studies by estimating the
correlation coefficient (q) and the attenuation factor (k)
for each food group. q is the correlation between reported
Am J Epidemiol 2006;163:279–288
Validation of Pyramid Food Servings
intake (the DHQ) and estimated true usual intake (based on
four 24-hour dietary recalls) and is estimated in this case
from the deattenuated crude correlation between the DHQ
and the four 24-hour dietary recalls. k is the multiplicative
factor by which an estimated log relative risk of disease
would be biased because of measurement error in the DHQ.
It is equivalent to the slope of the linear regression of the 24hour dietary recall on the DHQ, and it usually falls between
0 and 1 in nutritional studies, thereby attenuating (biasing
toward the null) estimates of relative risk. Values of k close
to 0 indicate maximum attenuation, while values close to 1
indicate minimum attenuation.
We estimated correlation coefficients and attenuation factors using the following measurement error model. For individual i, let pi denote the probability of consuming an item
from a given food group on any given day. Let Ai be the
person’s usual intake on a consumption day, measured on
an appropriate scale. On the chosen scale, the person’s overall true usual intake, Ti, can be represented as the product
Ti ¼ pi Ai.
Error in the DHQ was allowed to include systematic bias
correlated with true usual intake and within-person random
variation as in the standard measurement error model (7).
The 24-hour dietary recall (our reference instrument) was
assumed to involve no misclassification of consumption
days and to report zero intake only on a nonconsumption
day. It was also assumed that, on an appropriate scale, the
24-hour dietary recall represents usual intake on a consumption day, plus additive within-person random error with
mean zero uncorrelated with true intake and error in the
DHQ. Because the repeat recalls were conducted approximately 3 months apart, it was assumed that random errors
corresponding to repeat measurements for the same person
were uncorrelated with each other.
Given the above assumptions, the measurement error
model is
Qi ¼ b0 þ b1 Ti þ ei ;
Fij ¼ 0 on a nonconsumption day j;
Fij ¼ Ai þ eij on a consumption day j;
where, for person i, Qi is the DHQ-reported intake, Ti is the
true usual intake, b0 is the intercept and b1 is the slope of the
linear regression of Qi on Ti, and ei is random within-person
error. Fij is the jth repeat 24-hour dietary recall-reported
intake, j ¼ 1, . . ., 4, and eij is the within-person random
error for repeat measurement j. Within-person errors ei
and eij are assumed to have means equal to zero and constant
variances and to be independent of each other and of Ti and
Ai. One can show that this measurement error model can be
rewritten as
Qi ¼ b0 þ b1 Ti þ ei ;
281
model is similar to the standard model (7), with the exception that neither true usual intake nor random within-person
error in the 24-hour dietary recall is normally distributed,
even if the distributions of the usual amount on consumption
days, Ai, and within-person error in the 24-hour dietary recall on consumption days, eij, are normal.
Prior to statistical analyses, we excluded observations
from the 24-hour dietary recall and the DHQ that were determined to be outliers with respect to reported total energy
intake measured on the log scale. For the remaining food
group observations, data were transformed to a scale on
which the 24-hour dietary recall had additive measurement
error on days on which the participant consumed an item
from the food group. Finally, we removed outliers among
nonzero values for each food group and instrument, measured on the transformed scale. Outliers were defined as
values outside the interval ranging from the 25th percentile
of the distribution minus two times the interquartile range to
the 75th percentile plus two times the interquartile range.
We used the functional transformation method of Eckert
et al. (12) to find the best power transformation for each
food group. So that outliers would not unduly influence
the choice of transformation, we first excluded outliers;
for this step only, we used a more conservative definition
of outliers, namely values outside the interval of the 25th
and 75th percentiles, plus or minus three times the interquartile range. Final outlier exclusions across all food
groups, instruments, and repeated applications of the 24hour dietary recall ranged from 2 to 14 for men (mean ¼ 7)
and from 9 to 24 for women (mean ¼ 13).
The measurement error model was applied to both unadjusted food intake and energy-adjusted food intake using
the density method to adjust for energy (13). Parameters
were estimated using maximum likelihood estimation, assuming that the random variables were normally distributed.
Although this normality assumption is likely to be violated,
at least with respect to such random variables as true usual
intake and within-person random error for the 24-hour dietary recall, the estimated parameters will still be consistent
(asymptotically unbiased). Since model-based estimates of
standard errors may be incorrect, we calculated bootstrap
standard errors, which do not depend on distributional assumptions (14). Since maximum likelihood estimation does
not require every person to have complete data (i.e., four 24hour dietary recalls and the DHQ), we included in the analyses all 1,500 study participants who completed at least one
24-hour dietary recall.
Because the adopted measurement error model assumes
that DHQ-reported intake follows a continuous distribution,
we excluded from the analysis three foods for which substantial percentages of respondents reported zero intake on
the DHQ: yogurt, organ meat, and soy. Alcoholic beverages,
for which rather large percentages of zero intake were also
reported on the DHQ, were further analyzed for the DHQ
consumers only.
Fij ¼ Ti þ nij ;
where nij, j ¼ 1, . . ., 4, have mean zero and constant variance
and are uncorrelated with true intake Ti, error in the DHQ,
and each other. Therefore, the adapted measurement error
Am J Epidemiol 2006;163:279–288
RESULTS
The demographic profile of the participants during the
progression of the study—from completion of at least one
282 Millen et al.
24-hour dietary recall to randomization after completion of
four 24-hour dietary recalls to completion of an FFQ—was
previously reported (3). Participants completing an FFQ, as
compared with those completing at least one 24-hour dietary
recall, were significantly more likely to be female, older, and
of White race/ethnicity. No educational differences were
seen.
Table 1 shows the mean values and standard errors for
daily intake of pyramid servings of the 30 food groups
among 497 women and 436 men who completed all four
24-hour dietary recalls and the DHQ, after exclusion of 32
participants whose values for reported total energy intake
were outliers. For women and men, reported intakes were
greater (>15 percent difference) on the DHQ than on the 24hour dietary recalls for other starchy vegetables, dark green
vegetables, total fruit, citrus fruit/melon/berries, other fruit,
and yogurt. For women, reported intakes were also greater
for total vegetables, deep yellow vegetables, legumes, other
vegetables, fish/other seafood, and milk. For men, reported
intakes were also greater for nuts and seeds and alcohol.
Reported intakes for men and women were lower (>15
percent difference) on the DHQ than on the 24-hour dietary
recalls for total grains, nonwhole grains, poultry, organ
meat, and eggs. For men, reported intakes were also lower
for red meat/poultry/fish, soy, and cheese.
Table 2 shows the estimated correlation (q) between the
DHQ and true usual intake and the attenuation factor (k) for
the transformed food group data with and without energy
adjustment, by gender. Adjusting for energy intake strengthened the correlation between truth and the DHQ for the
majority of the food groups. Among women, energyadjusted correlations ranged from 0.43 for servings of other
starchy vegetables to 0.84 for servings of milk. Among men,
energy-adjusted correlations ranged from 0.42 for servings
of tomatoes and eggs to 0.80 for servings of total dairy food.
The energy-adjusted correlation coefficients were greater
than 0.50 for all food groups except white potatoes (men
only), other starchy vegetables (women only), tomatoes,
poultry (women only), and eggs. Energy-adjusted correlation coefficients were highest (>0.65) for whole grains (men
only), total vegetables (women only), dark green vegetables,
legumes (men only), total fruit, other fruit (men only), beef/
pork/lamb, total dairy food, milk, discretionary fat (men
only), added sugar, and alcohol. The mean q across all
energy-adjusted food groups was 0.62 for women and 0.63
for men (table 3).
Energy adjustment primarily strengthened or left unchanged the attenuation coefficients. Among women, energyadjusted attenuation coefficients ranged from 0.20 for other
starchy vegetables to 0.92 for eggs (table 2). Among men,
energy-adjusted attenuation coefficients ranged from 0.41 for
other starchy vegetables to 1.32 for eggs. After energy adjustment, the attenuation coefficients were greater than 0.50
for all food groups except total grains (women only), nonwhole grains (women only), total vegetables (men only), white
potatoes (men only), other starchy vegetables, dark green
vegetables, tomatoes, legumes, poultry, fish/other seafood,
franks/luncheon meats (women only), and discretionary fat.
After energy adjustment, the mean k was 0.60 for women and
0.66 for men (table 3).
DISCUSSION
We estimated the ability of the DHQ to assess food group
intake as measured in pyramid servings for 30 US Department of Agriculture food groups. Validation of food intake
poses challenges different from those of nutrient intake
along several dimensions: 1) how to disentangle individual
foods from mixtures; 2) how to combine the nutrient contributions of solid foods and less dense liquid foods; and
3) how to relate food intake to dietary guidance. The availability of the US Department of Agriculture’s Pyramid Servings Database eloquently satisfies these challenges. To our
knowledge, this is the first FFQ validation study of food
groups measured in pyramid servings.
A further statistical challenge is the variability and nonnormal distribution of food intakes. Daily intakes of individual foods tend to be more variable than those for nutrients.
Pyramid servings may somewhat reduce this variability for
some foods, because the system disaggregates food mixtures into a smaller set of component ingredients. Therefore,
one implication of using pyramid servings in future studies
is a possible reduction in measurement error in assessing
food intake. The distributions of pyramid servings are
more similar to those of nutrients, both being represented
by continuous variables in the database. This is unlike other
methods of quantifying food intake, such as determining
frequencies or grams, the latter of which takes into account
frequency and portion size but generally does not disaggregate food mixtures.
The inclusion of a database with the ability to assess
pyramid servings is a scientific advance which may allow
researchers to more accurately assess intakes in different
food groups and may help facilitate the communication of
scientific findings to the public with reference to dietary
guidance. We have shown here that the use of pyramid
servings for FFQ databases is possible and that it has the
potential to provide a standardized measurement of food
intake from study to study.
The new Food Guide Pyramid, MyPyramid (http://www.
mypyramid.gov/), makes recommendations based on common household measurements (i.e., cups, ounces, and teaspoons). Translating between household measurements and
pyramid servings is relatively easy. This change is less a substantive one than an attempt to better communicate with
consumers. For example, in MyPyramid, 1/4 cup of raisins
is equivalent to 1/2 cup of fruit, and both are equivalent to
one pyramid serving. There are, however, a few substantive
changes, such as recommendations for the separation of oil
from discretionary fat and the inclusion of the term ‘‘discretionary calories’’ to convey the idea that after food intake
recommendations are met, people have a limited amount of
calories to consume within food groups or as sugar, alcohol,
or discretionary fat. At the time of this analysis, we used the
current Pyramid Servings Database available to us. In the
future, we will update the DHQ to the current system.
In our study, median intake estimates on the DHQ and the
24-hour dietary recalls differed for many pyramid food
groups. There are several possible reasons for this. There
may be a bias for people to report higher consumption
of more socially desirable foods (e.g., fruits, vegetables,
Am J Epidemiol 2006;163:279–288
Validation of Pyramid Food Servings
283
TABLE 1. Mean intakes and percentages of persons reporting no consumption in Food Guide Pyramid food groups among persons
who completed four 24-hour dietary recalls and the Diet History Questionnaire, Eating at America’s Table Study, 1997–1998
Women (n ¼ 497)
Food Guide Pyramid food group
24-hour dietary recalls
Mean (SE*)
% zeroy
Men (n ¼ 436)
Diet History
Questionnaire
Mean (SE)
24-hour dietary recalls
% zero
Mean (SE)
% zero
Diet History
Questionnaire
Mean (SE)
% zero
Grain group (servings/day)
Total grains
5.84 (0.10)
0
4.90 (0.12)z
0
8.54 (0.17)
0
6.76 (0.18)z
0
Whole grains
1.02 (0.04)
7
0.98 (0.03)
0
1.55 (0.07)
8
1.39 (0.05)
0
Nonwhole grains
4.82 (0.09)
0
3.92 (0.11)z
0
6.99 (0.15)
0
5.36 (0.15)z
0
Vegetable group (servings/day)
Total vegetables
3.10 (0.07)
0
3.97 (0.11)z
0
4.22 (0.10)
0
4.49 (0.14)
0
White potatoes
0.75 (0.03)
14
0.84 (0.03)
0
1.12 (0.05)
12
1.08 (0.05)
0
Other starchy vegetables
0.15 (0.01)
45
0.35 (0.01)z
0
0.23 (0.02)
46
0.38 (0.02)z
0
Dark green vegetables
0.26 (0.02)
42
0.41 (0.03)z
1
0.22 (0.02)
52
0.36 (0.03)z
1
Deep yellow vegetables
0.21 (0.01)
19
0.25 (0.01)z
0
0.27 (0.02)
23
0.25 (0.01)
0
Tomatoes
0.48 (0.02)
2
0.54 (0.02)
0
0.71 (0.03)
2
0.72 (0.03)
0
Legumes§
0.12 (0.01)
61
0.14 (0.01)z
4
0.21 (0.02)
57
0.19 (0.01)
3
Other vegetables
1.14 (0.04)
0
1.44 (0.05)z
0
1.47 (0.05)
0
1.51 (0.06)
0
Fruit group (servings/day)
Total fruit
1.48 (0.06)
3
2.40 (0.10)z
0
1.96 (0.10)
1
2.68 (0.14)z
0
Citrus fruit, melon, berries
0.75 (0.04)
6
1.16 (0.05)z
0
1.02 (0.07)
5
1.29 (0.07)z
0
Other fruit
0.73 (0.04)
11
1.25 (0.05)z
0
0.95 (0.05)
12
1.38 (0.08)z
0
Red meat, poultry, fish
3.57 (0.08)
1
3.30 (0.10)
0
5.93 (0.14)
2
5.03 (0.17)z
0
Beef, pork, lamb, etc.
1.59 (0.06)
7
1.46 (0.05)
0
2.65 (0.10)
6
2.42 (0.09)
0
Poultry
1.10 (0.05)
19
0.86 (0.04)z
0
1.60 (0.08)
18
1.06 (0.06)z
0
Fish, other seafood
0.43 (0.04)
52
0.51 (0.04)z
0
0.72 (0.06)
46
0.64 (0.04)
0
Franks, luncheon meats
0.43 (0.02)
35
0.46 (0.02)
2
0.94 (0.05)
21
0.90 (0.05)
1
Organ meat
0.03 (0.01)
96
0.01 (0.0)z
61
0.02 (0.01)
97
0.01 (0.0)z
58
0
Meat group (ounces{/day)
Lean meat
Lean meat equivalents
Eggs#
0.29 (0.02)
9
0.22 (0.01)z
0
0.43 (0.03)
5
0.26 (0.01)z
Nuts, seeds#
0.15 (0.01)
30
0.17 (0.01)
1
0.28 (0.03)
27
0.32 (0.03)z
0
Soy
0.06 (0.01)
81
0.05 (0.01)
66
0.06 (0.02)
82
0.04 (0.01)z
65
Total dairy food
1.26 (0.04)
0
1.41 (0.06)
0
1.75 (0.06)
0
1.76 (0.07)
0
Milk
0.76 (0.03)
2
0.92 (0.06)z
0
1.05 (0.05)
1
1.14 (0.06)
0
Cheese
0.44 (0.02)
7
0.39 (0.02)
0
0.66 (0.03)
6
0.54 (0.02)z
0
Yogurt
0.05 (0.01)
83
37
0.03 (0.01)
90
0.07 (0.01)z
52
Discretionary fat (g/day)
48.67 (0.93)
0
48.70 (1.24)
0
72.99 (1.45)
0
66.40 (1.98)
0
Added sugar (teaspoons**/day)
16.62 (0.50)
0
15.72 (0.66)
0
23.70 (0.76)
0
22.08 (0.88)
0
0.40 (0.04)
58
0.39 (0.05)
28
0.87 (0.07)
47
Dairy group (servings/day)
0.08 (0.01)z
Pyramid tip
Alcohol (no. of drinks/day)
1.10 (0.19)z
22
* SE, standard error.
y Percentage of persons reporting no consumption of items in this food group.
z Greater than 15% difference between the average of the four 24-hour dietary recalls and the Diet History Questionnaire sample means.
§ Legumes can be counted toward the meat group or the vegetable group, according to the Food Guide Pyramid. One serving of dry beans and
peas is equivalent to 1 ounce (28.4 g) of cooked lean meat.
{ 1 ounce ¼ 28.4 g.
# 1/3 cup of nuts, 1/4 cup of seeds, and one egg are equivalent to 1 ounce (28.4 g) of cooked lean meat.
** 1 teaspoon ¼ 5 g.
Am J Epidemiol 2006;163:279–288
284 Millen et al.
TABLE 2. Deattenuated correlation coefficients (r) and attenuation factors (l) for
correlation between the Diet History Questionnaire and true usual intake (based on four
24-hour dietary recalls) obtained using a measurement error model, unadjusted
and adjusted for total energy intake, Eating at America’s Table Study, 1997–1998
Women
Men
Food Guide Pyramid food group
q (SE*)
k (SE)
q (SE)
k (SE)
Grain group (servings/day)
Total grains
Unadjusted
0.48 (0.05)
0.32 (0.05)
0.53 (0.05)
0.41 (0.05)
Adjusted
0.59 (0.06)
0.44 (0.05)
0.63 (0.05)
0.53 (0.05)
Whole grains
Unadjusted
0.55 (0.05)
0.66 (0.07)
0.62 (0.05)
0.75 (0.08)
Adjusted
0.62 (0.05)
0.81 (0.07)
0.66 (0.04)
0.89 (0.08)
Nonwhole grains
Unadjusted
0.49 (0.06)
0.32 (0.04)
0.48 (0.05)
0.39 (0.04)
Adjusted
0.60 (0.06)
0.48 (0.05)
0.61 (0.05)
0.56 (0.05)
Vegetable group (servings/day)
Total vegetables
Unadjusted
0.49 (0.06)
0.31 (0.04)
0.46 (0.06)
0.31 (0.05)
Adjusted
0.66 (0.05)
0.53 (0.05)
0.63 (0.07)
0.45 (0.06)
White potatoes
Unadjusted
0.48 (0.06)
0.40 (0.06)
0.46 (0.08)
0.38 (0.07)
Adjusted
0.52 (0.06)
0.54 (0.08)
0.50 (0.08)
0.50 (0.09)
Other starchy vegetables
Unadjusted
0.38 (0.11)
0.16 (0.05)
0.56 (0.10)
0.36 (0.10)
Adjusted
0.43 (0.11)
0.20 (0.05)
0.53 (0.11)
0.41 (0.13)
Vegetable group (servings/day)
Dark green vegetables
Unadjusted
0.69 (0.07)
0.43 (0.05)
0.77 (0.08)
0.42 (0.06)
Adjusted
0.67 (0.06)
0.48 (0.06)
0.75 (0.10)
0.41 (0.06)
Deep yellow vegetables
Unadjusted
0.60 (0.08)
0.46 (0.07)
0.49 (0.07)
0.59 (0.16)
Adjusted
0.58 (0.09)
0.76 (0.23)
0.54 (0.08)
0.81 (0.30)
Tomatoes
Unadjusted
0.44 (0.07)
0.34 (0.06)
0.37 (0.08)
0.35 (0.07)
Adjusted
0.49 (0.08)
0.49 (0.09)
0.42 (0.08)
0.43 (0.08)
Legumesy
Unadjusted
0.59 (0.10)
0.33 (0.08)
0.53 (0.09)
0.34 (0.06)
Adjusted
0.62 (0.12)
0.33 (0.05)
0.66 (0.10)
0.44 (0.06)
Other vegetables
Unadjusted
0.52 (0.06)
0.47 (0.06)
0.49 (0.07)
0.40 (0.07)
Adjusted
0.63 (0.05)
0.67 (0.07)
0.57 (0.08)
0.52 (0.08)
Fruit group (servings/day)
Total fruit
Unadjusted
0.62 (0.04)
0.57 (0.04)
0.63 (0.05)
0.72 (0.06)
Adjusted
0.66 (0.04)
0.68 (0.05)
0.70 (0.03)
0.86 (0.05)
Citrus fruit, melon, berries
Unadjusted
0.60 (0.05)
0.61 (0.06)
0.58 (0.05)
0.83 (0.08)
Adjusted
0.63 (0.05)
0.74 (0.07)
0.61 (0.04)
0.94 (0.09)
Other fruit
Unadjusted
0.61 (0.04)
0.50 (0.05)
0.65 (0.04)
0.66 (0.07)
Adjusted
0.64 (0.04)
0.61 (0.05)
0.71 (0.04)
0.82 (0.07)
Table continues
Am J Epidemiol 2006;163:279–288
Validation of Pyramid Food Servings
TABLE 2. Continued
Women
Men
Food Guide Pyramid food group
q (SE*)
)
k (SE)
q (SE)
k (SE)
Meat group (ouncesz/day)
Lean meat
Red meat, poultry, fish
Unadjusted
0.48 (0.07)
0.39 (0.06)
0.53 (0.05)
0.51 (0.07)
Adjusted
0.58 (0.06)
0.56 (0.07)
0.61 (0.06)
0.72 (0.09)
Beef, pork, lamb, etc.
Unadjusted
0.71 (0.07)
0.58 (0.06)
0.61 (0.06)
0.55 (0.07)
Adjusted
0.79 (0.07)
0.70 (0.06)
0.72 (0.07)
0.73 (0.07)
Poultry
Unadjusted
0.45 (0.08)
0.32 (0.06)
0.60 (0.10)
0.47 (0.07)
Adjusted
0.50 (0.08)
0.41 (0.06)
0.56 (0.10)
0.46 (0.08)
Fish, other seafood
Unadjusted
0.53 (0.11)
0.37 (0.08)
0.60 (0.08)
0.46 (0.07)
Adjusted
0.53 (0.17)
0.39 (0.08)
0.60 (0.09)
0.47 (0.07)
Franks, luncheon meats
Unadjusted
0.56 (0.09)
0.33 (0.05)
0.60 (0.10)
0.43 (0.08)
Adjusted
0.54 (0.09)
0.38 (0.06)
0.64 (0.09)
0.56 (0.09)
Lean meat equivalents
Eggs§
Unadjusted
0.50 (0.08)
0.81 (0.21)
0.46 (0.07)
1.01 (0.30)
Adjusted
0.45 (0.08)
0.92 (0.25)
0.42 (0.08)
1.32 (0.52)
Nuts, seeds§
Unadjusted
0.53 (0.06)
0.51 (0.09)
0.57 (0.06)
0.62 (0.13)
Adjusted
0.51 (0.07)
0.65 (0.13)
0.56 (0.06)
0.72 (0.17)
Dairy group (servings/day)
Total dairy food
Unadjusted
0.76 (0.03)
0.70 (0.04)
0.73 (0.04)
0.63 (0.05)
Adjusted
0.78 (0.03)
0.72 (0.05)
0.80 (0.04)
0.65 (0.04)
Milk
Unadjusted
0.84 (0.03)
0.80 (0.05)
0.75 (0.03)
0.71 (0.04)
Adjusted
0.84 (0.04)
0.86 (0.05)
0.78 (0.03)
0.76 (0.05)
Cheese
Unadjusted
0.58 (0.06)
0.49 (0.05)
0.62 (0.05)
0.62 (0.06)
Adjusted
0.62 (0.08)
0.76 (0.22)
0.61 (0.06)
0.95 (0.17)
Pyramid tip
Discretionary fat (g/day)
Unadjusted
0.49 (0.06)
0.33 (0.05)
0.59 (0.05)
0.41 (0.04)
Adjusted
0.65 (0.04)
0.45 (0.03)
0.66 (0.04)
0.46 (0.04)
Added sugar (teaspoons{/day)
Unadjusted
0.72 (0.03)
0.65 (0.03)
0.66 (0.04)
0.60 (0.05)
Adjusted
0.79 (0.02)
0.74 (0.03)
0.68 (0.05)
0.67 (0.05)
Alcohol (no. of drinks/day)
Unadjusted
0.77 (0.05)
0.89 (0.12)
0.76 (0.04)
0.84 (0.07)
Adjusted
0.77 (0.04)
0.82 (0.11)
0.79 (0.03)
0.86 (0.10)
* SE, standard error.
y Legumes can be counted toward the meat group or the vegetable group, according to the Food Guide
Pyramid. One serving of dry beans and peas is equivalent to 1 ounce (28.4 g) of cooked lean meat.
z 1 ounce ¼ 28.4 g.
§ 1/3 cup of nuts, 1/4 cup of seeds, and one egg are equivalent to 1 ounce (28.4 g) of cooked lean meat.
{ 1 teaspoon ¼ 5 g.
Am J Epidemiol 2006;163:279–288
285
286 Millen et al.
TABLE 3. Mean correlation and attenuation coefficients for
correlation between the Diet History Questionnaire and four
24-hour dietary recalls, with and without adjustment for total
energy intake, Eating at America’s Table Study, 1997–1998
Mean correlation
coefficient (q)
Mean attenuation
factor (k)
Unadjusted
Adjusted
Unadjusted
Adjusted
Overall
0.58
0.62
0.52
0.63
Women
0.57
0.62
0.48
0.60
Men
0.58
0.63
0.55
0.66
fish/other seafood, yogurt, milk) and lower consumption of
less socially desirable foods (e.g., nonwhole grains, red
meat/poultry/fish, organ meat, eggs, cheese) on the DHQ
than on the 24-hour dietary recalls. In general, these findings
were consistent with findings from previous studies (15–20).
However, a few discrepancies did occur. For example, a few
studies reported added fat consumption at a higher frequency on FFQs than on 24-hour dietary recalls (16–18).
These inconsistencies may be partially explained by variation in the proportions of different types of fats consumed
by country.
Another reason that reported intakes for some food
groups differed between the DHQ and the 24-hour dietary
recalls may be that there are differences in the nature of the
two dietary assessment methods. Four random 24-hour dietary recalls are less likely to provide observations of intake
for foods that are rarely consumed or only episodically consumed. The recalls may indicate lower levels of intake in
comparison with an FFQ that queries about usual dietary
intake over the past year.
For most food groups, the deattenuated correlation coefficients and attenuation factors were greater than 0.50 for
both genders after energy adjustment; however, they were
slightly stronger, on average, among men. Unlike mean and
median values, correlation coefficients do not compare absolute intakes of foods between dietary assessment tools but
rather compare how the different tools classify individuals
with respect to food intake. Energy adjustment, on average,
improved the correlation and attenuation coefficients. This
suggests that adjusting for energy intake reduces measurement error in reporting of foods on the DHQ. Nearly all
items on the DHQ contribute calories, and therefore total
energy intake may serve as a good surrogate variable with
which to adjust for measurement error in FFQs (21).
Attenuation factors are used to estimate the degree to
which the log relative risk of disease would be attenuated
due to measurement error in the DHQ. For example, if a true
relative risk of 0.50 existed for a specific disease among
women with a high adjusted total vegetable intake as compared with a low intake, this relative risk would be attenuated by measurement error to 0.85.
The attenuation and correlation coefficients among men
were, on average, slightly higher than those among women.
This may be explained by differential measurement error by
gender. In the Observing Protein and Energy Nutrition
(OPEN) Study, which also used the National Cancer Insti-
tute DHQ, Subar et al. (21) found that women underreported
their total energy intake more than men did.
A number of FFQ-type food validation studies have been
conducted among adults in Europe (15–18, 22–28), Asia
(29–36), the United States (19, 20, 37, 38), Africa (39,
40), and South America (41). While it is recognized that
differences in instruments and study samples used among
these various studies may have caused differences in results,
another factor deserving of attention is differences in analytical methods.
To our knowledge, our study is the first food validation
study to use pyramid servings; no other studies reviewed in
this manuscript measured food intake with this unit. Thus, it
is possible that differences in study results could be explained by differences in the units used to express food intake: frequencies (the number of times a food is eaten within
a specified time period (day, week, etc.), without reference
to portion size) or servings (a count of the number of times
a specific portion size of a food is eaten within a specified
time period) (15, 19, 20, 29, 30, 37); grams (16–18, 22–28,
31–36, 38–40); percentage of total calories (41); or, as in
this case, pyramid servings. Different studies may accentuate different metrics when assigning foods to specific food
groups. For example, Flagg et al. (38) analyzed the sum of
dairy food intake (milk þ yogurt þ cheese), combining
liquid and solid dairy products, whereas Feskanich et al.
(19) differentiated between the different types of dairy foods
(i.e., milk, yogurt, cheese, cream cheese, etc.).
It appears that investigators in past FFQ validation studies
considered added sugar to be the addition of raw sugar to
prepared foods. In this study, pyramid servings of added
sugar were based on all sources, including sugar added by
food manufacturers to cereals, baked goods, etc. Similarly,
pyramid servings of discretionary fats included fats added to
foods during cooking and at the table, as well as fats exceeding those present in lean cuts of meat and poultry. The
food groups ‘‘discretionary fats’’ and ‘‘added sugars’’ are
unique additions to a foods database, because they express
fat and sugar intake behavior as measures, within the dietary
guidance concept. Individuals have choices in how to
‘‘spend’’ their fat and sugar allowances.
Finally, the search for consistency between studies is
hampered by the presentation of Spearman correlation coefficients or unadjusted Pearson correlation coefficients
(16–18, 22, 23, 26, 27, 29, 31–34, 39, 40), which are not
directly comparable to deattenuated Pearson correlation coefficients, which correct for within-person variation in the
reference instrument.
The results of this validation study may not be completely
generalizable to other populations. Our sample consisted
primarily of White, well-educated persons who were willing
to complete four 24-hour dietary recalls followed by two
FFQs. This is a limitation of many validation studies.
In this study, we assumed that error in the 24-hour dietary
recalls included no misclassification of nonconsumption
days and, for nonzero reported amounts, was unbiased and
contained only within-person random error uncorrelated
with errors in the DHQ. These assumptions may be unwarranted for self-reported reference instruments such as
24-hour dietary recalls and dietary records. The OPEN
Am J Epidemiol 2006;163:279–288
Validation of Pyramid Food Servings
Study showed that there is reporting bias for energy and
protein intake in both the DHQ and 24-hour dietary recalls,
and that people systematically differ in their reporting accuracy (21). Therefore, all dietary reference instruments
could involve systematic error at the individual level, and
this error could be correlated with its counterpart in the
DHQ. Thus, our results may reflect overestimation of the
correlations with true intake and underestimation of true
attenuation (42).
With respect to absolute energy and protein intakes, as the
OPEN biomarker study showed, the DHQ has significant
measurement error, especially in the direction of underreporting (21). This is probably true for many FFQs. However,
after adjustment for total energy intake, protein intake
showed far less measurement error in comparison with unadjusted values. This observation suggests that energy adjustment may minimize the effects of measurement error in
FFQs (42), although the extent to which the results for protein can be extended to individual foods or food groups is
unknown.
In summary, the results of this study demonstrate for the
first time the use of a pyramid servings database for FFQs.
The validity of the US Department of Agriculture Pyramid
Servings Database was comparable or superior to that of
previous FFQ food group validation studies using less precise food grouping methods. The use of a pyramid servings
database in nutritional epidemiologic research provides substantial advantages in measuring consumption in individual
food groups, as well as consumption of sugar and discretional fat. The inclusion of a pyramid servings database in
the DHQ provides an innovative and advanced method of
assessing food intake behavior for use in nutrition surveillance, assessment, or analysis of diet-disease associations.
The DHQ is publicly available online at a National Cancer
Institute website (http://riskfactor.cancer.gov/DHQ/).
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
ACKNOWLEDGMENTS
Conflict of interest: none declared.
18.
19.
REFERENCES
1. Freedman LS, Schatzkin A, Wax Y. The impact of dietary measurement error on planning sample size required in a cohort
study. Am J Epidemiol 1990;132:1185–95.
2. Freudenheim JL, Marshall JR. The problem of profound mismeasurement and the power of epidemiological studies of diet
and cancer. Nutr Cancer 1988;11:243–50.
3. Subar AF, Thompson FE, Kipnis V, et al. Comparative validation of the Block, Willett, and National Cancer Institute food
frequency questionnaires: The Eating at America’s Table
Study. Am J Epidemiol 2001;154:1089–99.
4. Cook A, Friday JE. Pyramid Servings Database for USDA
survey food codes, version 2.0. Washington, DC: Agricultural
Research Service, US Department of Agriculture, 2004.
(http://www.barc.usda.gov/bhnrc/cnrg/section1.htm).
5. Shaw A, Fulton L, Davis C, et al. Using the Food Guide Pyramid: a resource for nutrition educators. Washington, DC:
Am J Epidemiol 2006;163:279–288
20.
21.
22.
23.
287
Center for Nutrition Policy and Promotion, US Department
of Agriculture, 2005. (http://www.nal.usda.gov/fnic/Fpyr/
guide.pdf).
Cleveland LE, Cook DA, Krebs-Smith SM, et al. Method for
assessing food intakes in terms of servings based on food
guidance. Am J Clin Nutr 1997;65(suppl):1254S–63S.
Freedman LS, Carroll RJ, Wax Y. Estimating the relation between dietary intake obtained from a food frequency questionnaire and true average intake. Am J Epidemiol 1991;134:
310–20.
Hu FB. Dietary pattern analysis: a new direction in nutritional
epidemiology. Curr Opin Lipidol 2002;13:3–9.
Block G, Woods M, Potosky A, et al. Validation of a selfadministered diet history questionnaire using multiple diet
records. J Clin Epidemiol 1990;43:1327–35.
Willett WC, Sampson L, Stampfer MJ, et al. Reproducibility
and validity of a semiquantitative food frequency questionnaire. Am J Epidemiol 1985;122:51–65.
Tippett KS, Cypel YS. Design and operation: the Continuing
Survey of Food Intakes by Individuals and the Diet and Health
Knowledge Survey, 1994–96. Washington, DC: Agricultural
Research Service, US Department of Agriculture, 1998.
(Document no. SuDoc A 77.516:96-1).
Eckert RS, Carroll RJ, Wang N. Transformations to additivity
in measurement error models. Biometrics 1997;53:262–72.
Willett W, Stampfer M. Implications of total energy intake for
epidemiologic analyses. In: Kelsey JL, Marmot MG, Stolley
PD, et al, eds. Nutritional epidemiology. New York, NY:
Oxford University Press, 1998:273–301.
Efron B, Tibshirani RJ. An introduction to the bootstrap.
London, United Kingdom: Chapman and Hall Ltd, 1993.
Johansson I, Hallmans G, Wikman A, et al. Validation and
calibration of food-frequency questionnaire measurements in
the Northern Sweden Health and Disease cohort. Public Health
Nutr 2002;5:487–96.
Pisani P, Faggiano F, Krogh V, et al. Relative validity and reproducibility of a food frequency dietary questionnaire for use
in the Italian EPIC centres. Int J Epidemiol 1997;26(suppl 1):
S152–60.
Ocke MC, Bueno-de-Mesquita HB, Goddijn HE, et al. The
Dutch EPIC food frequency questionnaire. I. Description of
the questionnaire, and relative validity and reproducibility for
food groups. Int J Epidemiol 1997;26(suppl 1):S37–48.
EPIC Group of Spain. Relative validity and reproducibility of
a diet history questionnaire in Spain. I. Foods. Int J Epidemiol
1997;26(suppl 1):S91–9.
Feskanich D, Rimm EB, Giovannucci EL, et al. Reproducibility and validity of food intake measurements from a semiquantitative food frequency questionnaire. J Am Diet Assoc
1993;93:790–6.
Salvini S, Hunter DJ, Sampson L, et al. Food-based validation
of a dietary questionnaire: the effects of week-to-week variation in food consumption. Int J Epidemiol 1989;18:858–67.
Subar AF, Kipnis V, Troiano RP, et al. Using intake biomarkers to evaluate the extent of dietary misreporting in a large
sample of adults: The OPEN Study. Am J Epidemiol 2003;
158:1–13.
Bohlscheid-Thomas S, Hoting I, Boeing H, et al. Reproducibility and relative validity of food group intake in a food frequency questionnaire developed for the German part of the
EPIC project. European Prospective Investigation into Cancer
and Nutrition. Int J Epidemiol 1997;26(suppl 1):S59–70.
van Liere MJ, Lucas F, Clavel F, et al. Relative validity and
reproducibility of a French dietary history questionnaire. Int J
Epidemiol 1997;26(suppl 1):S128–36.
288 Millen et al.
24. Bonifacj C, Gerber M, Scali J, et al. Comparison of dietary
assessment methods in a southern French population: use of
weighed records, estimated-diet records and a food-frequency
questionnaire. Eur J Clin Nutr 1997;51:217–31.
25. Mannisto S, Virtanen M, Mikkonen T, et al. Reproducibility
and validity of a food frequency questionnaire in a casecontrol study on breast cancer. J Clin Epidemiol 1996;49:
401–9.
26. Elmstahl S, Riboli E, Lindgarde F, et al. The Malmö Food
Study: the relative validity of a modified diet history method
and an extensive food frequency questionnaire for measuring
food intake. Eur J Clin Nutr 1996;50:143–51.
27. Goldbohm RA, van den Brandt PA, Brants HA, et al. Validation of a dietary questionnaire used in a large-scale prospective
cohort study on diet and cancer. Eur J Clin Nutr 1994;48:
253–65.
28. Pietinen P, Hartman AM, Haapa E, et al. Reproducibility and
validity of dietary assessment instruments. II. A qualitative
food frequency questionnaire. Am J Epidemiol 1988;128:
667–76.
29. Sauvaget C, Allen N, Hayashi M, et al. Validation of a food
frequency questionnaire in the Hiroshima/Nagasaki Life Span
Study. J Epidemiol 2002;12:394–401.
30. Forman MR, Zhang J, Nebeling L, et al. Relative validity of
a food frequency questionnaire among tin miners in China:
1992/93 and 1995/96 diet validation studies. Public Health
Nutr 1999;2:301–15.
31. Ogawa K, Tsubono Y, Nishino Y, et al. Validation of a foodfrequency questionnaire for cohort studies in rural Japan.
Public Health Nutr 2003;6:147–57.
32. Ishihara J, Sobue T, Yamamoto S, et al. Validity and reproducibility of a self-administered food frequency questionnaire
in the JPHC Study Cohort II: study design, participant profile
and results in comparison with Cohort I. J Epidemiol 2003;
13(suppl):S134–47.
33. Sasaki S, Ishihara J, Tsugane S. Reproducibility of a selfadministered food frequency questionnaire used in the 5-year
follow-up survey of the JPHC Study Cohort I to assess food
and nutrient intake. J Epidemiol 2003;13(suppl):S115–24.
34. Lee KY, Uchida K, Shirota T, et al. Validity of a self-administered food frequency questionnaire against 7-day dietary
records in four seasons. J Nutr Sci Vitaminol (Tokyo) 2002;
48:467–76.
35. Tokudome S, Imaeda N, Tokudome Y, et al. Relative validity
of a semi-quantitative food frequency questionnaire versus 28
day weighed diet records in Japanese female dietitians. Eur
J Clin Nutr 2001;55:735–42.
36. Wakai K, Egami I, Kato K, et al. A simple food frequency
questionnaire for Japanese diet—Part I. Development of
the questionnaire, and reproducibility and validity for food
groups. J Epidemiol 1999;9:216–26.
37. Smith-Warner SA, Elmer PJ, Fosdick L, et al. Reliability and
comparability of three dietary assessment methods for estimating fruit and vegetable intakes. Epidemiology 1997;8:196–201.
38. Flagg EW, Coates RJ, Calle EE, et al. Validation of the
American Cancer Society Cancer Prevention Study II Nutrition Survey cohort food frequency questionnaire. Epidemiology 2000;11:462–8.
39. Parr CL, Barikmo I, Torheim LE, et al. Validation of the second version of a quantitative food-frequency questionnaire for
use in western Mali. Public Health Nutr 2002;5:769–81.
40. Torheim LE, Barikmo I, Hatloy A, et al. Validation of a quantitative food-frequency questionnaire for use in western Mali.
Public Health Nutr 2001;4:1267–77.
41. Rodriguez MM, Mendez H, Torun B, et al. Validation of a
semi-quantitative food-frequency questionnaire for use among
adults in Guatemala. Public Health Nutr 2002;5:691–9.
42. Kipnis V, Subar AF, Midthune D, et al. Structure of dietary
measurement error: results of the OPEN biomarker study. Am
J Epidemiol 2003;158:14–21.
Am J Epidemiol 2006;163:279–288