The Consequences of Measurement Error when Estimating the

The Consequences of Measurement Error when Estimating the
Impact of BMI on Labour Market Outcomes *
Donal O’Neill and Olive Sweetman **
National University of Ireland Maynooth
Revised 13/11/2012
Abstract. This paper uses data on both self-reported and true measures of individual
Body Mass Index (BMI) to examine the nature of measurement error in self-reported
BMI and to look at the consequences of using self-reported measures when estimating
the effect of BMI on economic outcomes. In keeping with previous studies we find
that self-reported BMI is subject to significant measurement error and this error is
negatively correlated with the true measure of BMI. In our analysis this non-classical
measurement error causes the traditional approach to overestimate the relationship
between BMI and both income and education. Furthermore we show that popular
alternatives estimators that have been adopted to address problems of measurement
error in BMI, such as the conditional expectation approach and the instrumental
variables approach, also exhibit significant biases.
JEL Classification: L14, C13
Key Words: Obesity, Non-Classical Measurement Error, Auxiliary Data, Instrumental
Variables
*
We would like to thank Aedin Doris and Dave Madden for helpful comments on an earlier draft of
this paper.
**
Corresponding author: e-mail: [email protected], tel:353-1-7083555.
2
1. Introduction
Obesity is a medical condition described as excess body weight in the form of fat. The
International Obesity Task Force (2010) estimates that approximately 1.0 billion
adults are currently overweight and a further 475 million are obese. In the European
Union 27 member states, approximately 60% of adults and over 20% of school-age
children are overweight or obese. Obesity is an important cause of morbidity,
disability and premature death (WHO, 2004) and increases the risk for a wide range
of chronic diseases. As result there are substantial direct and indirect costs associated
with obesity that put a strain on healthcare and social resources.
As well as the studies examining the cost of obesity to the state, in recent years
there have also been a number of studies that examine the impact of obesity on
individual outcomes such as wages (Cawley 2004, Brunello and d’Hombres 2007),
labour force participation and employment (de Sousa 2012) and educational
achievement (Kaestner and Grossman 2009, von Hinke et al. 2012). Many of these
studies find a significant negative association between body weight and individual
economic success. The costs of obesity are therefore borne at the individual as well as
the national level.
The most widely-used method of measuring and identifying obesity is Body
Mass Index (BMI), where BMI = weight in kg/height in m2. 1 In some cases
researchers use field experiments to examine the impact of obesity on labour market
outcomes (Rooth 2009), however, the majority of these studies rely on self-reported
measures of BMI from survey data sets such as the National Longitudinal Study of
Youth, the European Community Household Panel and the National Child
Development Survey. In this case researchers have to address the possibility that selfreported BMI is measured with error. There is a large body of evidence that suggests
that self-reported BMI tends to underestimate true BMI; this occurs both because
people underreport their weight and because people overstate their height. Most
1
Recently Burkhauser and Cawley (2008) compared multiple measures of fatness and found that many
important patterns, such as who is classified as obese, group rates of obesity, and correlations of
obesity with social science outcomes, are all sensitive to the measure of fatness and obesity used (see
also Johansson et al. 2009 and Wada and Tekin 2010). We do not address this issue in our paper.
3
authors recognize this and as result typically adopt a range of approaches to deal with
the problem of mismeasured obesity.
In this paper we use a unique data set that contains both self-reported and
recorded measures of height and weight for a sample of 7,522 females. The
availability of both the self-reported and recorded measures allows us to examine in
detail the nature of measurement error in BMI. In keeping with previous work we
show that measurement error in BMI is non-classical. This means that the approaches
adopted in earlier papers to correct for measurement error, which rely on the
assumption of classical measurement error, are unlikely to provide consistent
estimates. As well as measures of obesity our data also contain detailed information
on individual characteristics, their labour market experiences and their family
circumstances. This allows us to determine the direction of the bias that arises in
simple regressions of outcomes on obesity when measurement error is non-classical
and also to assess the performance of many of the alternative approaches that have
been used to deal with the problem of measurement error.
Section 2 summarises the statistical literature on measurement error. We focus
on the biases that arise when the assumptions of classical measurement error are
relaxed. Section 3, discusses our data and examines the nature of measurement error
in our self-reported measures of BMI. Section 4 considers the implications of this
measurement error when examining the impact of obesity on individual outcomes. We
focus on two outcomes previously examined in the literature, a continuous measure
namely income, for which we use traditional linear regression analysis and a binary
variable measuring educational attainment for which we rely on a probit model.
2. Measurement Error in Economic Analysis
Econometric analysis involves examining the outcomes of random experiments in
order to make inferences about the distribution function underlying the true data
generating process. Measurement error in the observed data, however, may lead
researchers to draw incorrect inferences. The impact of measurement error on the
mean of a random variable has been studied extensively (e.g. Fuller (1987), Carroll et
al (1994) and Bound et al (2001)). For the most part, studies of measurement error
focus on the typical textbook model of classical measurement error. However in their
4
study of measurement error in labour market data, Bound et al. (1994) argued that in
most analyses of labour markets the assumption of classical measurement reflected
convenience rather than conviction. Bound et al (2001, p. 3709) conclude their survey
by calling for researchers to pay greater attention to the possibility of non-classical
measurement error, both in assessing the likely biases in the analyses that take no
account of measurement error and in devising procedures that correct for such error.
Non-classical measurement error can arise in many circumstances. For
example if the variable in question is binary (say a 0/1 indicator variable for example)
then any measurement error must be correlated with the truth (Aigner 1973). The
reason for this is that with a dummy variable, an observation can only be misclassified
on one of two ways; a true 0 can be misclassified as a 1 or a true 1 can be
misclassified as a zero. Clearly the value of the error is determined by the true value
of the binary variable, thereby violating the classical assumption. Hyslop and Imbens
(2001) consider a situation in which the individual is fully aware of his/her ignorance
and actively seeks to provide an optimal response given his/her information set. This
leads to measurement error that is uncorrelated with variables in the information set
and therefore, by necessity, correlated with the true value of interest.
In recent years a number of papers have examined the consequence of nonclassical measurement error in labour economics. Pischke (1995), O’Neill et al.
(2007) and Gottschalk and Huynh (2010) all show that non-classical measurement
error of the type typically found in income data attenuates the role of white noise
measurement error in models of earnings dynamics, while Kim and Solon (2005)
suggest that real-wages may be even more procyclical than recent studies suggest
once one accounts for mean-reverting measurement error.
Several approaches have been suggested for dealing with measurement error,
some of which are summarised in Bound et al. (2001). One popular approach relies on
the availability of auxiliary data which can be used to correct estimates for
measurement error (Lee and Sepanski 1995). While the auxiliary data allow
researchers to examine the nature of measurement error, these data typically do not
contain information on the dependent variable of interest. As a result the information
gained from the auxiliary data must be “transported” into the main survey data. For
instance Cawley (2004) corrects for measurement error in reported BMI by predicting
5
true height and weight in the NLSY using information on the relationship between
true and reported values in the Third National Health and Nutrition Examination
survey (NHANES III). In particular he uses the NHANES III to run separate
regressions by race and gender of actual weight on reported weight. The process is
repeated for height. Then self-reported height and weight in the NLSY are adjusted by
the coefficients on the reported values in the NHANES III. These fitted values are
then treated as, error free measures, and used throughout the remainder of the paper.
This approach is known as the “conditional expectation approach” (Lyles and Kupper
1997).
A second favoured approach uses the instrumental variables method to obtain
consistent estimates. This approach is popular among econometricians (for an
overview see Angrist and Krueger 2001) and requires a valid instrument. When
looking at the effect of obesity on labour market outcomes, this would require finding
a variable that is correlated with the true unobserved measure of obesity but
uncorrelated with either the measurement error or the stochastic component of the
outcome equation. The variation in the outcome variable induced by the instrument
can then be used to identify the true relationship between say BMI and the outcome of
interest. In BMI studies researchers often use the BMI of a sibling or other relative
such as a child as an instrument for reported or measured BMI (Cawley 2000, Cawley
et al. 2004, , Brunello and D’Hombres 2007, Kline and Tobias 2008, Kortt and Leigh
2010, Lindeboom et al. 2010, Cawley and Meyerhoefer 2012). 2
With classical measurement error, both of these approaches provide consistent
estimates but it is easy to show that neither of these standard approaches will work if
measurement error is non-classical. In order to focus attention on the problems of
measurement error, we consider estimating a conditional mean function, E[Y|X*],
where Y is the outcome of interest (assumed measured without error) and X* is the
true value of the variable of interest 3. Assuming E[Y|X*]=βX* we write our regression
equation as :
2
In many studies the instrumental variables approach was used to overcome endogeneity of BMI.
However, our findings will have implications for these studies as well in so far as the measure of BMI
being instrumented is also measured with error. We discuss this in more detail later in the paper.
3
This approach is in keeping with Bound et al. (1994). However, unlike that study we do not require
that the measurement error in BMI be uncorrelated with the stochastic component of the income
generating function.
6
.4
(1)
By assumption E[ X*]=0 which allows us to ignore potential problems associated
with endogeneity of X*, although we will return to this issue later in the paper. In
addition the observed value of X*, which we denote by X, is given by
(2)
where u is the measurement error. Classical measurement error refers to the case
where the error for any individual, ui, is unrelated to the true value Xi*; this in turn
implies E(X*u)=0. Non-classical measurement error can arise in two cases; firstly
there may be a relationship between the reported measurement error and the true
value of the variable so that E(X*u)≠0; secondly there may be a relationship between
the reported measurement error and the residual in equation (1) so that E(єu)≠0. The
latter situation is sometimes referred to as differential measurement error; in this case
X contains information about Y even after we condition on X*. 5
It is easy to show that the simple OLS estimator from a regression of Y on the
observed X is inconsistent in the presence of measurement error of this type. Allowing
for correlation between the measurement error and the true value of X* and for
correlation between є and u we can show that:
4
Although the impact of measurement error on the estimated coefficients in generalized linear models
such as logistic and probit models depends on depends on the derivative of the regression function with
respect to the mismeasured and the curvature of the likelihood function, in many cases the bias is
similar to that in the linear regression model (see for example Stefanski and Carroll (1985) and Buzas
et al. 2005).
5
Measurement error is said to be non-differential when the conditional distribution of y given X and X*
is the same as that of y given X*. In this case X is said to be a surrogate for X*. Black et al (2000) derive
bounds for the parameter of a univariate regression for the case of non-differential, nonclassical
measurement error.
7
If Cov(u,X*)= Cov(u,ε)=0 this simplifies to the textbook attenuation bias
associated with classical measurement error. However, violation of either of these
conditions will alter the probability limit of the OLS estimator such that the direction
of the inconsistency cannot be established a-priori.
The problems posed by non-classical measurement error for both the
conditional expectation approach and the IV approach to measurement error are also
immediate. The conditional expectation approach regresses true earnings on observed
earnings using the auxiliary data and uses these coefficients to predict true BMI in the
survey data. To see how this approach works consider taking conditional expectations
in equation (1):
Clearly if
However, if
then a regression of Y on
then a second regression of Y on
will consistently estimate β.
alone may result in
biased and inconsistent estimates of β.
Likewise the standard IV approach requires an instrument Z such that
E(ZX*)≠0 but E(Zu)=0. However, the correlation between X* and u will mean that
instruments that are strongly correlated with X* are also likely to exhibit a correlation
with the measurement error thus violating the condition for consistency. 6 Formally the
IV estimator converges to
6
Cawley (2004) acknowledges this point and for this reason argues that it is important to correct for
measurement error prior to using IV estimation (to control for endogeneity). However, most papers in
the literature do not adopt this approach and simply instrument the self-reported measure. Furthermore,
as shown above, the approach used by Cawley (2000) to correct for measurement does not overcome
the problem of differential measurement error so that the problems with IV we discuss still apply to his
“corrected” measure.
8
Clearly the second term in the denominator of this expression will lead to
inconsistencies in the IV estimates, and if this term is negative then the IV estimator
may overestimate the true coefficient of interest, β.
3. Data
The Growing up in Ireland (GUI) survey tracks the development of a cohort of Irish
children born between November 1997 and October 1998. The data used for our
analysis are from the first wave of interviews, which were carried out between August
2007 and May 2008. 7. Information was collected from the children, their parents, their
teachers, the school principals, and their childminders (where relevant). Dor et al.
(2010) find that in the U.S the incremental costs of obesity are significantly higher for
obese women than for obese men. Consequently to illustrate our results we focus on the
impact of obesity on economic outcomes for the mothers in our sample. At the time of
the survey the average age of the mothers was 39 years of age.
The mother provides information on variables such as household composition,
income, occupation, parental education, and the child’s health, lifestyle and education.
In addition she was asked to report her measured height in cms and her weight in kgs.
The key feature of the data for our purposes is the fact that in addition to these selfreported measures of height and weight the interviewer also measured the
respondent’s height and weight. We refer to the latter as recorded measures and treat
them as the true height and weight of the respondents. We compare these to the selfreported measures to determine the extent and the nature of measurement error in
BMI for all the mothers in our sample.
Summary statistics for the self-reported and recorded measures of BMI are
given in Table 1. Figures 1-2 show the distributions of the BMI index using both the
self-reported and the true measure. The lines on each graph correspond to a BMI of 25
and 30 respectively. Within this range individuals are deemed overweight, beyond
this individuals are deemed obese. Using these data we find that 42.65% (13.87%) of
the mothers in our sample are overweight (obese) on the basis of self-reported data.
However, the true numbers are 49.85% and 17.28%. The tendency for respondents in
7
Only the first wave is available to date.
9
our sample to underestimate their BMI in self-reported data is consistent with
previous findings. 8
To examine the extent of measurement error in the self-reported measures of
BMI in the GUI survey, we calculate the error in the self-reported data by subtracting
the true BMI measure from self-reported measure. The density of the measurement
error is given in Figure 3 and summary statistics for the measurement error are given
in Table 2. The results in Table 2 show that the mean error is negative. In addition the
magnitude of the measurement error is sizeable, accounting for 34.7% the standard
deviation of the true BMI variable. Figure 3 shows that the error is not normally
distributed. The distribution is more concentrated around the mean than would be the
case with a normal distribution. The remainder of the paper explores the discrepancies
between true and reported BMI in more detail and considers the implications of these
differences for the relationship between BMI and economic outcomes.
4. Analysis and Results
In the previous section we showed that mothers in the Growing up in Ireland survey
are more likely to underreport their BMI. In the notation of section 2, this implies
E[u]<0. However, as noted in section 2, the implications of measurement error for
economic analysis will differ depending on whether the error is classical or nonclassical in nature. To determine this we examine the relationship between the error
and the true measure of BMI. Figure 4 graphs the relationship between the
measurement error and the true measure of BMI. From this we see a negative
relationship between the level of measurement error and the true value of BMI.
People with higher BMI’s are more likely to underreport their BMI. 9 The correlation
is -.34. This negative correlation illustrates the non-classical nature of measurement
error in BMI in our data. As noted in section 2 this has implications for the direction
of the bias when using self-reported BMI data to examine the relationship between
BMI and economic outcomes, as well as for the consistency of proposed alternative
estimators. We consider these issues in the remainder of this section.
8
See for example Morgan et al (2008) and Shiely et al. (2010) for Ireland, Elgar and Steward (2008)
for Canada, Villanueva (2001) for the United States and Spencer et al. (2002) for the U.K.
9
The negative correlation between measurement error and the true value of BMI is consistent with
international findings (see for example Shiely et al. (2010), Elgar et al. (2005), Villanueva (2001),
Spencer et al. (2002)).
10
To examine the consequences of non-classical measurement error in BMI we
consider the relationship between BMI and two outcomes measures, income and
education. Although ideally we would like to look at the relationship between
individual income data and a mother’s BMI, this is not available in our data. Instead
we are restricted to examining the relationship between mother’s BMI and total
household income. While this question is somewhat different to that examined in
previous work, where the focus was individual wages, the use of household income as
the dependent variable nevertheless provides a valid framework for illustrating the
consequences of non-classical measurement error in the linear regression framework.
In addition to the income results we also illustrate the consequence of measurement
error when examining the relationship between a mother’s BMI and her highest level
of educational attainment. The relationship between individual BMI and education
has been studied previously in the literature and so is of independent interest. In
addition this analysis illustrates the consequences of non-classical differential
measurement error in non-linear estimation models.
4a. Mother’s BMI and Household Income
To begin, we estimate the relationship between true BMI and household income.
Apart from sampling error this coefficient gives the “true” relationship between BMI
and income. This establishes the benchmark for the true parameter of interest. The
results are given in the first column of Table 3. The estimated coefficient on true BMI
shows a significant negative relationship between BMI and income, which is
consistent with previous work in the literature. The true parameter estimate is -.85,
which indicates that a 5 point increase in BMI reduces income by €4250.
To examine the impact of measurement error we estimate the same regression
only this time using self-reported BMI. The results are given in the second column of
Table 3. In contrast to what we would expect with classical measurement error, we
see that the observed measurement error in BMI overstates rather than attenuates the
relationship between BMI and income. Using self-reported BMI we estimate that a 5
point increase in BMI would result in a €4728 reduction in income. As a result the use
of self-reported BMI overstates the loss of income by approximately 11%. We now
11
examine whether popular approaches to tackling measurement error can overcome
this bias.
We first consider the use of auxiliary data. Following Cawley (2004) and
others 10 we regress true BMI on self-reported BMI. The coefficients from this first
stage regression are then used to adjust the self-reported data for measurement error.
Similar to Cawley (2004), we use the square of self-reported BMI in addition to the
level of BMI. The predicted measure is then used in place of the self-reported
measure in the regression analysis. The results from the second stage are presented in
Table 4. For comparison the true estimated coefficient is given in the first column and
the corrected estimate in the second column. The results show that in this case the
auxiliary regression approach remains biased with a bias that is almost identical to
that with the original self-reported data. This finding warns against using the
similarity of corrected and uncorrected estimates to infer the absence of measurement
error bias.
As noted earlier the conditional expectation approach fails when the
measurement error is differential. The availability of internal validation in our study
allows us to examine this in more detail. We obtain consistent estimates of
in
equation (1) by regressing income on true recorded BMI, and then generating the
residuals. 11 Under the assumption of non-differential measurement error the residuals
should be uncorrelated with the observed measurement error u. To examine this we
regress the predicted residuals on the measurement error. The coefficient on the
measurement error is statistically significantly negative, with a point estimate of .754. the null-hypothesis of no relationship is rejected with a p-value=.004. The
differential nature of the error in our BMI data is thus clearly evident in these data. It
is this failure of classical measurement error that biases the estimates obtained using
the conditional expectation approach. 12,13
10
Lindeboom et al. (2011) also report using this procedure to correct for measurement error in selfreported BMI in their study of the impact of obesity on labor market outcomes.
11
This requires that we have correctly specified the functional form of the conditional mean function.
However we have repeated the analysis above for the range of polynomials in income up to and
including degree 5. The results we report are robust to these alternative specifications of the conditional
mean.
12
Clearly whether measurement error is differential or not can depend on the variables included as
controls in the regression of interest. To consider this we re-estimated our models using a set of
additional controls typically used when estimating income equations in labour economics. These
12
A second approach typically used to correct for measurement error in these
data is to use an instrumental variable approach. A common instrument in this type of
analysis is to use the BMI of a sibling or other relative to instrument for the
respondents BMI on the assumption that this should pick up genetic and
environmental factors but may not be related to the measurement error. We follow
Cawley (2000) and use the weight of the study child to instrument for their parent’s
BMI. 14 The F-statistic from our first stage regression of mother’s BMI on child’s
weight is 400, well in excess of the value 10 suggested by Bound et al. (1995); as
such our analysis is unlikely to be affected by problems associated with weak
instruments. The estimates from using child’s weight as an instrument for selfreported mother’s BMI are given in Table 5, where again the first column shows the
true OLS estimate for comparison. In this case we that the IV results predict that a 5
point increase in BMI reduces income by €6950, compared to a reduction of €4250
based on the true conditional mean.
In the comparisons so far we have assumed away the problem of endogeneity
by imposing E[ X*]=0, in which case the slope of the conditional mean corresponds
to the causal effect of BMI. However, in the event that this assumption is false the
OLS estimate and the IV estimate in Table 5 are not strictly comparable since they are
likely to be estimating different objects; the first a conditional mean and the second a
causal effect. A more appropriate comparison would compare the OLS estimate with
an IV estimate of the conditional mean. To do this we consider the use of an
instrument aimed at removing only the bias associated with measurement error. In the
econometrics literature repeated measures are often used as instruments for
measurement error. Unfortunately such instruments are not available in our data.
However we can nevertheless illustrate the consequences of using IV in the presence
include parent’s age, education, health status, martial status and a control for English language
proficiency. Our results indicate the error in self-reported BMI is still correlated with the residual from
the income equation even when this extended set of controls is used.
13
Bound et al. (2001) page 3738, discuss an extended correction that uses the internal validation data to
correct for differential measurement error. This approach uses internal validation data to control for the
omitted variable that arises when the standard conditional expectation approach is used with
differential measurement error. We have verified that this approach works in our sample but do not
focus on this adjustment here. If one has all the data required to implement this extended approach in
practice then adjusting for measurement error is a matter of efficiency and not bias. We focus on the
latter case which is more serious and seems to arise more often in practice.
14
Other studies that use child’s weight an instrument include Davey Smith et al 2009 and Cawley and
Meyerhoefer 2012). Kline and Tobias 2008 and Lindeboom et al (2010) use the BMI of a parent to
instrument for that of the child, while Brunello and D’Hombres (2007) and Kortt and Leigh (2010) use
the BMI of other biological family members as instruments.
13
of measurement error by artificially creating appropriate instruments. To do this we
create three new variables from our data set, all of which are mismeasured values of
the true BMI and which we call BMI1-BMI3. When simulating the measurement
error for each of these three additional measures we take samples of errors that have
mean zero and are independent of both the true value and income. This error is then
added to the true BMI to create a new measure. In this way we have three new
mismeasured values of BMI that are all subject to classical measurement error. We
will treat one of these measures (BMI1) as a new reported measure of BMI, while the
other two will be used as instruments. These latter two variables should be “valid”
instruments for the measurement error problem but will not adjust for any endogenity
problems. Because of this we argue that a comparison of the OLS estimate with the
IV estimates using these instruments will provide a better indicator of the bias in IV
estimates of the conditional mean when measurement error is nonclassical.
With these additional simulated variables we run three new regressions.
Firstly, we regress income on BMI1 to illustrate the standard effects of classical
measurement error. We then use our two additional measures (BMI2 and BMI3) as
instruments for BMI1; by construction these repeated measures are valid instruments
for measurement error and should return the conditional mean. We then use these
same additional measures as instruments for our real self-reported BMI measure that
is subject to non-classical measurement error and compare the findings. The results
are given in Table 9. The first panel repeats the results for the true specification for
convenience. The results in the second panel illustrate the textbook attenuation bias
associated with classical measurement error. In our example the bias is of the order of
8%. The third panel shows how the availability of “valid” instruments eliminates the
error bias when the measurement error is classical. In addition a Hansen test of
instrument validity fails to reject the over identifying restrictions that result from
having two instruments. However, the results in the fourth panel show how the use of
these “valid” instruments do not result in consistent estimates when used to
instrument the self-reported measure of BMI found in our data. The IV estimate
overestimates the size of the true effect by 13.7%, which is slightly larger than the
bias in the raw OLS estimate. Furthermore the results of the Hansen test in this case
show that we cannot rely on the over identifying test to detect the problems with the
14
instruments. This reflects the low power of this test when none of the instruments are
valid. 15
4b. Mother’s BMI and Mother’s Education
As part of the Growing up in Ireland Survey mother’s were asked to report their
highest level of education in 5 ranges from at most a primary education at the lowest
end to a postgraduate education at the highest level. Using this we create an indicator
variable equal to 1 if the mother has a third level degree or higher and zero otherwise.
We then estimate the relationship between this measure of education and BMI, using
both self-reported and true BMI. Since the dependent variable is binary in this case
we use a probit model rather than a linear regression model. In addition we consider
both the conditional expectation approach and the instrumental variables approach in
the context of this probit specification of educational attainment.
The results are presented in Table 7. Looking at the first column we find a
statically significant negative relationship between true BMI and education. The point
estimates imply that a 5 point increase in BMI reduces the probability of having a
third level degree by 4.5% percentage points. The second column shows the results
when the self-reported measure is used. As was the case when income was used as the
dependent variable we see that using the self-reported measure of BMI overstates the
relationship between BMI and education. The results based on self-reported BMI
imply that a 5 point increase in BMI would reduce the probability of receiving a
college degree by 5.45 percentage points (a bias of over 20% relative to the true
effect). As before the results from the conditional expectation approach presented in
the third column are very similar to those obtained from using the self-reported
measures. Finally the fourth column reports the IV estimate when our artificial
15
An alternative approach would be to use children’s weight as an instrument and compare the IV
estimate of the causal effect when true recorded BMI is instrumented with the IV estimate obtained
from instrumenting the self-reported BMI. Again a comparison of these estimates (which we might
now think of as causal effects) should also provide an indication of the consequences of measurement
error for IV estimation. When we do this we obtain an IV estimate using the recoded BMI equal to 1.22 compared to an estimate of -1.40 using the self-reported BMI. This produces a bias of 13.9% in
the causal effects which is very similar to the bias of 13.7% reported above for the conditional means.
These findings also suggest that much of the increase in the point estimates that result from the use of
instrumental variables should be attributed to issues associated with endogeneity and not classical
measurement error as has been previously suggested.
15
repeated measure is used to instrument self-reported BMI. 16 As before we find that
the order of the bias from the IV approach is similar to that from the raw probit model
using self-reported BMI.
It is clear from this that the results from our probit analysis of educational
attainment are in line with our least squares analysis of income. Firstly, using selfreported BMI overstates the negative relationship between BMI and the outcome
variable; secondly using the conditional expectation approach results in a bias that is
very similar in magnitude to that obtained with the self-reported measure itself; finally
the instrumental variable approach leads to a greater bias than that obtained using the
self-reported measure alone.
5. Conclusion
Obesity imposes very large costs on both governments and individuals. As a result
there is growing concern over measured levels of obesity throughout the world.
However, studies examining the individual costs of obesity typically rely on selfreported data to measure BMI. The use of self-reported BMI gives rise to potential
problems of measurement error which could bias any estimated relationships. This
paper uses a unique data set that contains both self-reported and true measures of an
individual’s Body Mass Index (BMI) for a large sample of adults to examine the
nature of measurement error in BMI and the consequences of this error for estimating
the relationship between BMI and economic outcomes. We find that self-reported
BMI is subject to significant measurement error and importantly this error deviates
from classical measurement error in two important ways. Firstly the error exhibits a
pronounced negative correlation with the true measure of BMI; secondly self-reported
BMI contains information about outcomes even after conditioning on true BMI. In
our analysis we show that these departures from classical measurement error cause the
traditional estimators to overstate the relationship between BMI and outcomes. We
illustrate this using a linear regression model relating BMI and income and a probit
model relating BMI and education. Furthermore we show that popular alternatives
estimators that have been adopted to address problems of measurement error in BMI,
16
The IV results reported here are estimated using maximum likelihood assuming bivariate normality
between the errors in the BMI equation and the latent education equation. We have also estimated the
model using Newey’s (1987) minimum chi-squared estimator and obtained similar results.
16
such as the conditional expectation approach and the instrumental variables approach,
also exhibit significant biases. The estimated biases are of the order of 10% to 15%
depending on the procedure used.
17
References
Aigner, D. (1973), Regression with a Binary Independent Variable Subject to Errors
of Observation,” Journal of Econometrics, Vol. 1, pp. 49-60.
Angrist, J. and A. Krueger (1999), “ Empirical Strategies in Labor Economics,” in
Handbook of Labor Economics, Vol. 3A, O. Ashenfelter and D.Card (eds),
Amsterdam and New York, Elsevier Science.
Angrist, j. and A. Krueger (2001), “Instrumental Variables and the Search for
Identification: From Supply and Demand to Natural ~Experiments,” The Journal of
Economic Perspectives, Vol. 15, no. 4, pp. 69-86.
Black, D., M. Berger and F.Scott (2000), “Bounding Parameter Estimates with
Nonclassical Measurement Error,” Journal of the American Statistical Association,
Vol. 95, No. 451, pp. 739-748.
Bolin, K. and J. Cawley (2007), The Economics of Obesity, Advances in Health
Economics and Health Services Research, Vol. 17.
Bound, J. and A. Krueger (1991), "The Extent of Measurement Error in Longitudinal
Earnings Data: Do Two Wrongs Make a Right?" Journal of Labor Economics 9, 1-24.
Bound, J., C. Brown, and G. Duncan (1994), “Evidence on the Validity of CrossSectional and Longitudinal Labor Market Data,” Journal of Labor Economics, 12,
345-368.
Bound, J., D.A. Jaeger, and R.M. Baker (1995): “Problems with Instrumental
Variables Estimation when the Correlation between the Instruments and the
Endogenous Explanatory Variable is Weak,” Journal of the American Statistical
Association, 90, 443 – 450.
Bound, J, C. Brown, and N. Mathiowetz (2001), “Measurement Error in Survey
Data,” in: J.J. Heckman and E.E. Leamer, eds. Handbook of Econometrics Vol. 5
(North-Holland, Amsterdam) pp. 3707-3843.
Brunello, G. and B. d’Hombres (2007), “Does Body Weight Affect Wages? Evidence
from Europe,” Economics and Human Biology, Elsevier, vol. 5(1), pages 1-19.
Burkhauser, R. and J. Cawley (2008), “Beyond BMI: The Value of More Accurate
Measures of Fatness and Obesity in Social Science Research,” Journal of Health
Economics, Vol. 27(2), pp. 519-529.
Buzas, J. T. Tosteson and L. Stefanski (2003), “Measurement Error,” Handbook of
Epidemiology, pp 729-765.
Carroll, R.J. , D. Ruppert, D. and L. Stefanski. (1994), Measurement Error in
Nonlinear Models (Chapman and Hall, London).
Cawley J. (2000), “An Instrumental Variables Approach to Measuring the Effect of
Body Weight on Employment Disability,” Health Services Research 35(5): 1159–
1179.
18
Cawley, J., Markowitz, S., Tauras, J., (2004), “Lighting up and Slimming Down: the
Effects of Body Weight and Cigarette Prices on Adolescent Smoking Initiation,”
Journal of Health Economics 23 (2), 293–311.
Cawley, J. (2004): “The Impact of Obesity on Wages,” The Journal of Human
Resources, Vol. 39, no. 2, pp. 451-474.
Cawley, J. and C. Meyerhoefer (2012), “The Medical Care Costs of Obesity: An
instrumental Variables Approach,” Journal of Health Economics, 31, pp. 219-230.
Davey Smith, G. J. Sterne, A. Fraser, P. Tynelius, D. Lawlor, (2009), “The
association between BMI and mortality using offspring BMI as an indicator of own
BMI: large intergenerational mortality study,” British Medical Journal 339, b5043.
De Sousa, S. (2012), “ Does Size Matter? A Propensity Score Approach to the Effect
of BMI on Labour Market Outcomes,” EUI Florence.
Dor, A. C. Ferguson, C. Langwith, E. Tan, (2010) “A Heavy Burden: The Individual
Costs of Being Overweight and Obese in the United States,” The George Washington
University School of Public Health and Health Services Department of Health Policy
Elgar F.J. and J.M. Stewart (2008). “Validity of Self-Report Screening for
Overweight and Obesity. Evidence from the Canadian Community Health Survey,”
Canadian Journal of Public Health, .Sep-Oct;99(5):423-7.
Elgar F.J, C. Roberts, C. Tudor-Smith and L. Moore (2005), “Validity of SelfReported Height and Weight and Predictors of Bias in Adolescents,” Journal of
Adolescent Health, 2005 Nov;37(5):371-5.
Fuller, W (1987), Measurement Error Models (Wiley and sons, New York).
Gottschalk, P. and M. Huynh (2010), "Are Earnings Inequality and Mobility
Overstated? The Impact of Nonclassical Measurement Error," The Review of
Economics and Statistics, MIT Press, vol. 92(2), pages 302-315, May.
Haider, S. and G. Solon (2006), Life-Cycle Variation in the Association between
Current and Lifetime Earnings. American Economic Review, 96(4): 1308–1320
Hyslop, D. and G. Imbens (2001), “Bias from Classical and Other Forms of
Measurement Error,” Journal of Business And Economic Statistics, 19(4), pp. 475481.
Johansson, E., P. Bockerman, U. Kiiskinen and M. Heliovaara (2009), “Obesity and
Labour Market success in Finland: The difference between having a high BMI and
being Fat,” Economics and Human Biology, 7, pp. 36-45.
Kaestner, R. and M. Grossman (2009), “Effects of Weight on Children’s Educational
Achievement,” Economics of Education Review, 28, pp. 651-661.
Kim, B. and G. Solon (2005), “Implications of Mean-Reverting Measurement Error
for Longitudinal Studies of Wages and Employment,” Review of Economics and
Statistics. 87, 193-196.
19
Kline, B. and J. tobias (2008), “The wages of BMI: Bayesian analysis of a Skewed
Treatment Response model with nonparametric Endogeneity,” Journal of Applied
Econometrics, 23, 767-793.
Kortt and A. Leigh (2010), “Does size Matter in Australia?,” The Economic Record,
vol. 86, no. 272, 71–83
Lee, L. and J. Sepanski (1995), “Estimation of Linear and Nonlinear Errors-inVariables Models using Validation Data,” Journal of the American Statistical
Association, 90(429), 130-140.
Lindeboom, M., P. Lundborg and B. van der Klaauw (2010) “Assessing the Impact
of Obesity on Labor Market Outcomes,” Economics and Human Biology 8 (2010)
309–319
Lyles, R. and L. Kupper (1997), “A Detailed Evaluation of Adjustment Methods for
Multiplicative Measurement Error in Linear Regression with Applications in
Occupational Epidemiology,” Biometrics, Vol. 53, pp. 1008-1025.
Madden, D (2012), “A Profile of Obesity in Ireland,” Journal of the Royal Statistical
Society A, 175, 893-914.
Morgan K, H. McGee, D. Watson, I. Perry, M. Barry, E. Shelley, J. Harrington, M.
Molcho, R. Layte, N. Tully, E. van Lente, M. Ward, J. Lutomski, R. Conroy, R.
Brugha (2008), SLAN 2007: Survey of Lifestyle, Attitudes & Nutrition in Ireland:
Main Report. Dublin: Department of Health and Children.
Musingarimi, P. (2008), “Obesity in the UK: A Review and Comparative Analysis of
Policies within the Devolved Regions,” International Longevity Centre.
Newey, W. K. (1987) “Efficient Estimation of Limited Dependent Variable Models
with Endogenous Explanatory Variables,” Journal of Econometrics 36: 231–250.
O’Neill, D., O.Sweetman and D. Van de gaer (2007) “The Effects of Measurement
Error and Omitted Variables when using Transition Matrices to Measure
Intergenerational Mobility,” The Journal of Economic Inequality, Vol. 5, no. 2, pp.
159-178.
Pischke, J (1995), “Measurement Error and Earnings Dynamics: Some Estimates from
the PSID Validation Study,” Journal of Business and Economic Statistics, 13(3), pp.
305Rooth, D. (2009) “Obesity, Attractiveness, and Differential Treatment in Hiring: A
Field Experiment,” Journal of Human Resources, 44 (3), pp. 710–735
Scholtens, S., B. Brunekreef, T. Visscher, H. Smit, M. Kerkhof, J. de Jongste, J.
Gerritsen, A.Wijga (2007) “Reported versus Measured Body Weight and Height of 4year-old Children and the Prevalence of Overweight,” European Journal of Public
Health, Vol. 17, No. 4, 369–374
Serneels, P. (2002), Explaining Non-Negative Duration Dependence Among the
Unemployed, The Centre for the Study of African Economies Working Paper Series.
Working Paper 172.
20
Shiely F., I. Perry, J. Lutomski, J. Harrington, C. Kelleher, H. McGee, K. Hayes
(2010), “Temporal trends in Misclassification Patterns of Measured and Self-Report
Based Body Mass Index Categories - Findings from Three Population Surveys in
Ireland,” BMC Public Health, 10:560
Solon, G (1992), “Intergenerational Income Mobility in the United States,” American
Economic Review, 82(3), 398-408.
Spencer E.A, P.N Appleby, G.K Davey and T.J Key (2002), “Validity of SelfReported Height and Weight in 4808 EPIC-Oxford participants,” Public Health
Nutrition Aug; 5(4):561-5.
Stefanski, L. and R. Carroll (1985), “Covariate Measurement Error in Logistic
Regression,” The Annals of Statistics, 13(4), 1335-1351.
Villanueva, E. (2001), “The Validity of Self-reported Weight in US adults: a
Population Based Cross-Sectional Study,” BMC Public Health, 1:11
von Hinke, S. G. Smith, D. Lawlor, C. Propper and F. Windemeijer (2012), “The
Effect of Fat Mass on Educational Attainment: Examining the sensitivity to different
identification Strategies,” Economics and Human Biology, 10, pp. 415-418.
Wada, R. and E. Tekin (2010) “Body Composition and Wages,” Economics and
Human Biology, 8, 242-254.
21
Table 1.
Summary Statistics on Recorded and Self-Reported BMI
Variable
Observations
Mean
Recorded
BMI
SelfReported
BMI
7522
25.97
Standard
Deviation
4.89
7522
25.30
4.59
Minimum Maximum
15.05
64.15
12.7
69.43
Note the maximum srBMI corresponds to someone who was 177Kg(27.85 stone) and 1.5m
(5.25 feet)
Table 2.
Summary Statistics on Self-Reported Measurement Error in BMI
Variable
Observations
Mean
SelfReported
BMI Error
7522
-.6671
Standard
Deviation
1.697
Minimum Maximum
-12.33
9.5
Table 3
Estimated Coefficients on BMI in OLS regressions (Standard Error in brackets)
Dependent
Variable
Household income
Recorded BMI
OLS
-.844
(.088)
Self-Reported BMI
N
OLS
7021
-.9456
(.094)
7021
Table 4
Estimated Coefficients on BMI obtained using Conditional Expectation Approach
(Standard Error in brackets)
Dependent
Variable
OLS
Conditional
Household income
Expectation
approach
Recorded BMI
-.844
(.088)
Corrected BMI
-.9467
(.094)
22
N
7021
7021
Table 5
Estimated Coefficients on BMI obtained using IV Approach
(Standard Error in brackets)
Dependent
Variable
OLS
IV approach
Household income
(using child’s
weight as
instrument)
Recorded BMI
-.839
(.089)
IV BMI
-1.397
(.391)
N
6963
6963
Table 6
Comparison of IV with classical and non-classical measurement error using simulated
data (Standard Error in brackets)
Dependent
Variable
OLS
OLS with
IV with
IV with nonHousehold
classical
classical
classical
income
Measurement
Measurement Measurement
Error
Error (using
Error (using
BMI2 and
BMI2 and
BMI3 as
BMI3 as
instruments) instruments)
Recorded BMI
-.844
(.088)
BMI1
-.779
(.083)
BMI1 (IV using
-.835
BM2 and BMI3)
(.082)
self-reported
-.96
BMI (IV using
(.094)
BMI2 and
BMI3)
Hansen Overp-value=.204 p-value=.176
identification
test
N
7021
7021
7021
7021
23
Table 7
Impact of Measurement Error when examining relationship between BMI and
Education using Probit Model (Reported results are Point Estimates and Standard
Error in brackets)
Dependent
Variable
Probit
Probit
Probit with
IV Probit
Conditional
(using BMI2
Mother’s
Education
Expectation
and BMI3 as
Approach
instruments)
Recorded BMI
-.0299
(.003)
Self-Reported
-.034
BMI
(.0036)
Corrected BMI
-.034
using CE
(.0036)
Self-reported
-.0337
BMI (IV using
(.0039)
child’s weight)
N
7462
7462
7462
7462
0
.05
Density
.1
.15
24
60
80
20
40
0
Body Mass Index (BMI) of the primary caregiver - derived from self-reported data
Figure 1: BMI derived from self-reported data (Firstline corresponds to BMI=25, second line
0
Density
.05
.1
to BMI=30) .
10
50
20
30
40
Primary Caregiver's BMI - derived from measured data
60
Figure 2: BMI derived from recorded data (Firstline corresponds to BMI=25, second line to
BMI=30)
0
.1
.2
Density
.3
.4
.5
25
-15
-10
0
-5
10
5
BMI_error
-15
-10
BMI_error
-5
0
5
10
Figure 3: Distribution of Measurement error in reported BMI
10
20
30
40
50
Primary Caregiver's BMI - derived from measured data
60
Figure 4: Scatterplot of Measurement Error in BMI against the true measure of BMI