“Rational Fatalism”: Non-Monotonic Choices in Response to Risk

“Rational Fatalism”:
Non-Monotonic Choices in Response to Risk
Jason T. Kerwin1
University of Michigan Department of Economics and Population Studies Center
March 2012
Many health behaviors, from chemical exposure to unprotected sex, involve weighing definite
benefits against uncertain costs. Previous empirical research has assumed monotonic, negative
responses to such risks, where people always decrease the number of risks they choose to take
when the per-act chance of a bad outcome rises. This paper shows that this need not be true if
rational agents have a history of risk-taking with a still-unrealized outcome. In that case there
exists a tipping point above which agents switch from negative (self-protective) to positive
(fatalistic) responses to risks, even if they are unsophisticated in thinking about probabilities. I
also demonstrate that the typical test for violations of monotonic responses can yield misleading
results, and develop an alternative approach. I then apply my framework to decisions about risky
sex in Malawi, finding suggestive evidence of non-monotonic behavior, and present preliminary
findings from an RCT designed to test the model.
1
I am grateful to Yusufcan Masatlioglu, Raj Arunachalam, Joe Golden, Emily Nicklett, and seminar participants in
the IGPI conference and the University of Michigan Informal Development Seminar for their invaluable feedback.
Rebecca Thornton, Ajay Shenoy, Shaun McGirr, Isaac Sorkin, and participants at the University of Michigan
Extremely Informal Development Seminar provided helpful comments on an earlier version of this paper. Research
on this project was supported by a Student Global Health Engagement Grant from Center for Global Health, the
Population Studies Center Initiatives Fund, and the Weinberg International Travel Fund. The Situational Analysis of
Sexual Behaviors and Alternative Safer Sex Strategies was funded by a grant from the Eva L. Mueller New
Directions in Economics and Demography Fund. Mr. Kerwin acknowledges fellowship support from the Population
Studies Center and the Rackham Graduate School.
1. Introduction
Life is full of decisions that involve weighing the definite benefits of an act against
uncertain, but potentially large consequences. When people consider everything from smoking
and drinking to exposure to chemicals to risky sex to merely driving to work, they face a
probabilistic chance of a bad outcome, and decide whether it’s worth it to take a chance.
Typically research has assumed “self-protective” responses to risks – that when the per-act risk
goes up, people take fewer chances.2 In this paper, I show that rational actors may also respond
“fatalistically” – by increasing their risk-taking when per-act risks rise, and describe a general
model of “rational fatalism” in which the choice of the number of risky acts is a non-monotonic
function of the per-act risk, so that behavior has a both a self-protective and a fatalistic region.
This leads to a fundamental non-monotonicity in people’s behavior – their responses switch from
downward-sloping to upward-sloping depending on the value of the per-act risk. This nonmonotonicity is driven by a history of past risks taken for which outcome is still unrealized.
Logically, this might apply to HIV infection, contracting cancer from smoking, or lung damage
from asbestos exposure. The logic is fairly straightforward: people who are convinced that the
bad outcome is sure to happen perceive no benefit from reducing their risk-taking. In contrast
with previous research I show that this result holds for any reasonable benefit function for the
risky acts, and for a wide range of risk aggregation functions including those that might be
employed by people with a limited background in mathematics. I also demonstrate that fatalistic
behavior can exist for interior solutions, rather than solely in situations where agents choose the
maximal number of risks allowed in the model.
2
Economists commonly use “risk” or “riskiness” to refer to the volatility of an outcome or asset that has some
upside or downside risk, and focus on the curvature of the utility function as a summary of aversion to this sort of
risk. In this paper I use “risk” in the more colloquial sense of “taking a risk”, in which doing something has a
known (beneficial) outcome and carries some chance of a bad result.
Although theoretical work has acknowledged that fatalism can be rational, an extensive
empirical literature on risky choices has almost uniformly imposed a linear functional form
assumption, therefore ruling out potentially non-monotonic relationships. The portion of this
literature that has studied the relationship between unprotected sex and HIV in Africa has found
perplexingly small responses. Many explanations have been advanced for this limited response,
but rational fatalism suggests another possibility that has yet to be studied. If a negative, selfprotective response by some is balanced by a positive, fatalistic response by others, the average
magnitude of the response will be lower and potentially near zero unless the researcher explicitly
allows for non-monotonic responses. This is rare – only a few studies have allowed for nonmonotonicity, commonly by including a quadratic term in the per-act risk. A conventional
approach is to looking for non-monotonicity is to examine the statistical significance of this
squared term, but this can be misleading. Using data generated by the rational fatalism model, I
show that a parametric test of the presence of a non-monotone relationship also performs poorly.
I develop a method to identify non-monotonic responses that is effective for this model, and then
apply it to preliminary data from Malawi. While the data has is purely observational, and reverse
causality between risk-taking and per-act risks is likely, I do find suggestive evidence of nonmonotone choices, especially among people in rural areas. This implies that research on sexual
behavior and HIV transmission risks should consider the potential of rational fatalism, and look
for possible non-monotonicity in responses.
I begin in Section 2 by looking at the existing body of research on choices in the face of
risk, with a particular emphasis on HIV as well as on models of fatalistic behavior. Section 3 lays
out a general theoretical framework that captures the possibility of rational fatalism, and
discusses the conditions under which we expect fatalistic (as opposed to self-protective)
behavior. In Section 4, I develop an empirical strategy for identifying potential fatalistic behavior
from data on risk-taking and perceived per-act risks. I emphasize that linear models will find
attenuated responses in the face of underlying non-monotonicity, and that standard approaches to
testing for a non-monotonic relationship have important limitations when applied to the rational
fatalism model. Section 5 applies that strategy to preliminary observational data from Malawi’s
Zomba District, and shows that there is suggestive evidence of rational fatalism in that region,
and Section 6 concludes.
2. Background
There is an extensive empirical literature in economics that considers decisionmaking in a
context where actions have a known benefit but carry some risk of a bad outcome (a failure in
the probability terminology). This section gives an overview of that literature, with an emphasis
on what the present work contributes and how it differs from related research. I begin with a
discussion of research on responses to risks in general (Section 2.1), and then turn to a discussion
of the possibility of fatalism as a rational choice (Section 2.2). I then discuss the performance of
standard empirical approaches when used on data that may be non-monotonic due to rational
fatalism (Section 2.3).
2.1.
Measuring responses to risks
Among the earliest studies to examine choices in the face of some per-act risk is Viscusi
(1990), which shows that people who think the risk of acquiring lung cancer from smoking is
higher are much less likely to smoke. Research on responses to other risks has found similar
responses. Using a fixed-effects approach that exploits repeated home sales to reduce omittedvariable bias, Gayer et al. (2002) show that the release of information that cancer risks are lower
than expected leads to increases in home prices, implying a rise in demand for housing. A
positive response of wages to job-related mortality risks is one of the central predictions of the
theory of compensating differentials. Also working with home prices, Linden and Rockoff
(2008) use data from Megan’s Law to show that the arrival of a sex offender in a neighborhood
decreases home prices by about 4 percent, which they attribute to a decline in demand due to the
increased perceived risk of crime. Viscusi (2004) uses variations in pay and risk across
occupations to estimate the statistical value of a human life. He finds a significant and positive
response of wages to occupational mortality risks, consistent with a ceteris paribus reduction in
labor supply in response to the risk of death.
Unprotected sex in the face of potential HIV infection is arguably the most important
risky choice from a public policy perspective. Accordingly, the response of people’s sexual
behavior to risks has received extensive attention from economists, beginning with Philipson and
Posner (1993), who develop a model of “rational epidemics,” wherein infectious diseases are
spread principally by voluntary behavior. This approach led to later work that attempted to
measure the responsiveness of sexual behavior to the risk of HIV. Empirical studies of HIV and
sexual behavior in the United States consistently find strong negative responses. Geoffard and
Philipson (1996) estimate the parameters of a rational epidemic model of HIV using data on
homosexual men from 1980s San Francisco, finding significant differences from a traditional
epidemiological predictions. Based on the same dataset, Auld (2006) uses a structural model of
sexual behavior to show that the rate of sexual partner change dropped rapidly in response to
higher HIV prevalence in the homosexual population of that city, with a 10% rise in the
prevalence decreasing the rate of partner change by 5%. Ahituv et al. (1996) use the National
Longitudinal Survey of Youth to study behavioral responses across the United States as a whole,
and find significant increases in condom use as HIV prevalence rises.
In contrast, there is decidedly less evidence of people responding to the risk of HIV by
curtailing sexual risk-taking in sub-Saharan Africa, where the epidemic is at its worst. A study
of 14 countries in the region using data on HIV prevalence and sexual risk-taking from the
Demographic and Health Surveys (DHS) finds that higher HIV prevalence does decrease risky
sex at several margins, after instrumenting for HIV prevalence using the distance to the origin of
the virus (Oster 2012). However, the reductions are small in magnitude: a doubling of the HIV
infection rate decreases the probability of unprotected sex by just 2 percentage points.
Other research shows similarly limited responses. Also using data from the DHS and
instrumenting for the distance to the origin of the virus, Juhn et al. (2009) estimate separately
both the direct biological effect of HIV infection on fecundity, and the indirect behavioral effect
of higher HIV prevalence through reduced unprotected sex. They find evidence of significant
biological reductions in fecundity but no meaningful change in the fertility rate. Stoneburner and
Low-Beer (2004) argue that with the exception of Uganda, no African country has exhibited
substantial behavioral changes in response to the HIV epidemic. Consistent with this pattern of
limited behavioral change in response to the HIV epidemic, Padian et al. (2010) conduct a
systematic review of RCTs that attempt to reduce HIV transmission, finding that only one in
seven show any impact, either positive or negative. “In fact, the overwhelming majority of
completed RCTs are ‘flat’ – unable to demonstrate either a positive or adverse effect.”
Some recent research in Africa has found more encouraging results. Delavande and
Kohler (2011) use panel data on probabilistic expectations in rural Malawi to study the impact of
beliefs about the HIV transmission rate. Relying on optional HIV testing as a shock to people's
beliefs, they find a significant negative relationship between the perceived risk of HIV
transmission and the decision to have multiple sex partners. Godlonton et al. (2012) run an
experiment in rural Malawi that demonstrates that when people are told that circumcised men
have a relatively lower risk of HIV transmission, circumcised men do not change their sexual
risk-taking but uncircumcised men have significantly less risky sex. A study of Kenyan teenagers
also explored responses to relative risks: Dupas (2011) finds that a program that provided
information on the relative risks of HIV infection by the age of one’s partner prompted large
changes in sexual behavior in that group. Despite these encouraging results, however, the
estimated response of risky sex to per-act risks in Africa is still strikingly smaller than in the
United States. A variety of explanations have been advanced for this difference. . Oster (2012)
argues that the limited response she finds is due in part to lower life expectancy in Africa; if
people expect to live short lives anyway (due to say, malaria), HIV infection may not be as
salient a threat. An alternate view is that condoms and other forms of safer sex are culturally
unacceptable in an African context, thus impeding uptake; for example, in Malawi condom use
in marriage is seen as an accusation or admission of infidelity and is commonly compared to
eating candy with the wrapper still on (Chimbiri 2007). In this paper I argue that a third factor
may also contribute to a limited average response: fatalism. If some people are responding
fatalistically to risks – by increasing their risk-taking when risks rise – then this will cancel out
some of the self-protective responses in the data.
2.2.
Fatalistic responses to risks
Empirical research on responses to risks, whether in Africa or elsewhere, has almost
uniformly assumed that these responses conform to the “self-protection” model in which choices
are a declining linear function of riskiness. In contrast with this is the possibility of the opposite
pattern, in which response are actually positive: as the per-act risk goes up, people take more
risks. This kind of behavior is commonly thought of as an irrational, and referred to as
“fatalism”. The connotation is that it is unreasonable and lies soundly in the realm of behavioral
theories that break with rationality. In the context of HIV, for example, “fatalism” is usually used
to refer to the seemingly irrational pattern of just giving up on avoiding infection due to
hopelessness or a lack of regard for personal safety. This behavior was observed as long ago as
the late 1980s in Uganda: Barnett and Blaikie (1992) discuss men who were aware of the risk of
contracting HIV and simply appeared not to care, asking “Who is never going to die?”
Economists have also focused on non-rational explanations for fatalistic behavior. Leon and
Miguel (2011), for example, demonstrate that travelers in Sierra Leone reveal a lower
willingness to pay for reductions in mortality risk than Americans, despite the fact that they have
comparable incomes and remaining life expectancies; they argue that this may be explained by
the perceived role of fate in determining life outcomes in West African societies.
However, it is not necessarily the case that fatalism must arise from people behaving
irrationally. Fatalistic behavior can actually be perfectly rational: in their early treatise on the
economics of HIV, Philipson and Posner (1993) point out that selfishly rational actors will tend
to demand more risky sex as their probability of already having HIV goes up. In the limit, the
logic is simple: if I already have HIV, I gain no benefit from using a condom and doing so
carries some cost. More generally, O’Donoghue and Rabin (2001) develop a simple model of the
expected cost of risk-taking, showing that for some parameter values the marginal cost of
another risky act actually decreases when the per-act risk rises. While their analysis focuses on
people being unwilling to go below some minimum level of risky choices, in Section 3 I show
that the same logic will apply when people have already engaged in risk-taking in the past and do
not yet know the outcome of that risk-taking. Using a special stepwise functional form for the
benefit of the risky behavior, they show that fatalistic responses can be rational, and note that
along with other potential applications such as drug use, their model may have relevance for the
risk of HIV infection as well. More recently, Sterck (2011) develops a theoretical framework that
uses the same cost function as O’Donoghue and Rabin (2001), but in a dynamic setting. Using
parameter values derived from data on Burundi, he argues that believing the risk of HIV
transmission is high can lead to rises in risky sex.
The idea of rational fatalism can be seen as a specific form of the Mickey Mantle effect,
in which people invest less in their health when their life expectancy is lower for reasons
unrelated to the investment in question (Fang et al. 2007). Fang et al. find that this decline in
health investments is heterogeneous across health behaviors, with smoking being more
responsive than heavy drinking. In a rationally fatalistic model, the specific shifter of life
expectancy is past choices of the very same risk under consideration.
Is rationally fatalistic behavior relevant to the study of responses to risks? It may wall be,
at least in the context of the HIV epidemic in Southern Africa. Qualitative evidence from highprevalence areas in the region suggests that indeed rationally fatalistic behavior is potentially
common there. In research on how rural Malawian men discuss HIV and risky sex, Kaler (2003)
documents many cases in which single “freelancer” men use fatalistic reasoning when thinking
about the disease. One informal conversation on the topic recorded in the Kaler (2003) study
proceeded as follows:
Friend: I don't fear AIDS because I know that I have it already.
Diston: How do you know that you have got AIDS?
Friend: I have malaria and some coughs so I know that I have it.
Diston: Do you use condoms when [sleeping with] these bargirls?
Friend: What for, since I know that I am already infected? (Kaler, 2003)
Kaler describes many cases in which high perceived risks are leading sexually experienced
individuals to fatalism: even though they have never tested positive for HIV, they give up on
safer sex, having decided that they cannot avoid developing AIDS.
2.3.
Measuring the response of risk-taking to per-act risks
The possibility of fatalism as a rational response to increased risks has commonly been
discounted in the literature on estimating risk responses empirically. For example, Viscusi (1990)
uses the one-tailed version of the t-test for his statistical significance calculations, thereby
assuming that responses can only be either negative or zero. In the case of HIV, researchers
typically follow Philipson and Posner (1993) in assuming that the overall prevalence of the virus
is sufficiently low that few if any people believe they are HIV-positive. This justifies the
assumption of a linear risk-response relationship, since for low values of both the number of
risks taken and the per-act risk, the probability of infection is approximately linear in the number
of risks. Given the low prevalence in most US populations, ignoring potential fatalism may be
justifiable in this context. However the prevalence of HIV in Africa is much higher, making this
assumption harder to justify, and as noted above at least some people in Malawi use rationally
fatalistic reasoning to explain their own behavior.
This implies that any empirical study of the response of sexual behavior in Africa to the
risk of HIV prevalence must allow for responses to be potentially non-monotonic. Almost no
existing research on the effect of risks on behavior allows for non-monotone effects. Some
studies have considered heterogeneity in responses, for example by gender and marital status
(Oster 2012) and by level of risk-taking (Auld 2006).3 In an experiment studying the effect of
providing HIV test results to people in Malawi, Thornton (2008) explicitly considers combating
fatalism as one of the mechanisms through which HIV testing could potentially affect sexual
behavior. Selfishly rational people who believe they are HIV positive, and find out they are not
due to a test result, are likely to reduce the amount of risky sex they have. She therefore
3
The Auld (2006) approach comes close to allowing us to directly examine the rational fatalism model laid out in
Section 3, but he explores heterogeneous responses by current level of risk-taking, whereas rational fatalism is
driven by variation in past risks taken.
explicitly looks for heterogeneous effects by HIV status, but finds that only HIV-positive
individuals respond significantly to the test results, and that the effect is small in magnitude. One
study related to HIV and sexual behavior that does not impose linear responses is de Paula et al.
(2009). However, they are examining the impact of beliefs about one’s own current HIV status
on sexual behavior, and so are not directly comparable with the other literature discussed above.
Since almost none of literature on the response of sexual behavior to risks allows for that
response to be non-monotonic, this suggests an additional explanation for the lower estimated
responses to risks in Africa than in the US. If people in Africa are more likely to be fatalistic,
then their positive responses to risks will cancel out some of the negative response on average.
Since a linear regression of risk-taking on per-act risks measures the average slope across the
population, this would tend to attenuate the estimated response.
How can one tell if such a linear regression is likely to be misleading? One common
approach to testing for non-monotonic relationships is to run some regression specifications that
include a quadratic term, which allows for the common technique of examining the statistical
significance of the second-order coefficient to determine whether a relationship is nonmonotonic. However, even this approach may fail to reveal the non-monotone responses typified
by rational fatalism.4 Recent work by Lind and Mehlum (2010) shows that a significant quadratic
term may arise even if the relationship is monotonically negative, and propose a formal
parametric test for a U-shaped relationship. Their technique also has its drawbacks, because it
requires an approximately quadratic functional form for the data. As I discuss in detail in Section
4, if rational fatalism is potentially at work in generating data on responses to risks, careful
analysis is necessary to determine whether the relationship is monotonic.
4
De Paula et al. (2009) also look at heterogeneous effects by quantile of perceived risk, which would perform far
better than the use of a quadratic term in a regression.
3. Theoretical Framework: The Rational Fatalism Model
How can existing approaches to studying behavioral responses to risks be modified in
order to accommodate the possibility of rational fatalism? In this section I will develop a model
of rational fatalism that extends the logic employed in the previous literature in a simple but
powerful way. This model replaces the linear probability function used in most research on risktaking with a risk aggregation function that represents an agent’s belief about the probability of a
failure given the number of risky acts chosen and the perceived riskiness of each act. Using this
model, I will show that a tipping point into fatalistic behavior emerges naturally from a wide
range of possible risk aggregation functions, including the true total probability derived from the
binomial distribution.
I begin this section by laying out the basic form of the model, and the conditions that
must be satisfied for an agent’s chosen number of risky acts to be optimal (Section 3.1). Using a
minimal set of assumptions about the optimization problem, I show the conditions under which
an interior optimum will and will not exist. I demonstrate that a non-zero fixed price for each
risky act (which could also be thought of as a time cost or an emotional cost) will exclude the
possibility that agents just choose as many risky acts as possible. (Section 3.2) I then explore the
model’s general comparative statics, focusing in particular on how the optimal number of risks
taken changes in response to variation in the perceived per-act risk. To explore these
comparative statics, I prove that any risk-aggregation function that satisfies basic conditions will
have a tipping point, above which additional increases in riskiness decrease the marginal cost of
risk-taking. I use this finding to show that there are tipping points in individuals’ behavior as
well: above a certain point, additional risks lead to more risk-taking rather than less (Section
3.3). After showing these tipping points exist, I then discuss two fairly simple heuristics that
people might use to think about how risks add up, and show that despite not being differentiable
they also exhibit the tipping point that is central to my result, implying that the behavior I model
could hold even for unsophisticated agents (Section 3.4). In Section 3.5, I show that the rational
fatalism model implies that risk responses are qualitatively different depending on the domain
being considered: the standard negative response occurs in when individuals face low per-act
risks or have a small amount of previous risk-taking, while individuals facing a combination of
both high perceived risks as well as substantial past risk-taking will tend toward fatalistic
behavior. Finally, I discuss the ways in which this model differs from previous theoretical work,
in particular the fact that it holds for any valid risk aggregation function and that it shows that
fatalistic risk responses can occur for interior solutions, and not just in situations where agents
take the maximum possible number of risks (Section 3.6).
3.1.
Model Basics
In this model, I assume that people weigh the benefits of choosing a level of risk-taking,
, against both any monetary costs as well as the expected cost of a stochastic bad outcome
occurring due to their choice.5 Each risky act carries some perceived per-act risk r (its
“riskiness”) that it will cause the bad outcome to occur.6 The benefit of
by a continuously differentiable benefit function,
, with
risky acts is described
and
so the
marginal benefit of risk-taking is positive with diminishing returns. For simplicity I normalize
. There is some non-probabilistic cost p>0 for each act that might be thought of as a
monetary cost, the cost of the time devoted to the act, or even an emotional cost or guilt, so n
acts cost
5
. The expected cost of the bad outcome is the perceived probability of it occurring,
For the sake of simplicity I will assume that n is continuous, rather than a discrete number of acts. This follows
O’Donoghue and Rabin (2001). The simulations in Section 4 will demonstrate that the qualitative results derived in
this section for continuous n will also hold for discrete-valued n.
6
This perceived risk does not have to equal the true risk; most people overestimate per-act risks across a wide range
of activities from smoking (cf. Viscusi 1990) to unprotected sex (see Section 5 of this paper).
, times its perceived cost, . Given a number of acts and a per-act risk, the true probability can
be computed using the binomial distribution. I reserve this for later discussion, however, and
allow people’s perceived probability of the bad outcome (a “failure”, to use terminology from
probability theory) to be some general (continuously differentiable) risk-aggregation function
. Here r, the perceived per-act risk, is assumed to lie between 0 and 1.
is the
number of previous risks taken for which the outcome has not year been realized, and is weakly
positive. In other words, this model explicitly considers agents for whom some of their risktaking history is still unresolved; for example, people in areas with no accessible HIV testing
who are still in the window period between risky sex they might have had and the point at which
they would develop AIDS symptoms. In order to ensure that
corresponds to well-
formed probabilities, I impose the following basic assumption:
Assumption 1
A.
, , and
B.
if
C.
iff
.
, or both
or
and
.
.
and
iff
.
Probabilities must never be negative or greater than one. The probability of the bad
outcome is zero if either the activity is risk-free or if the individual does not engage in the
activity at all. Taking any risks will lead to the bad outcome occurring with certainty if
the per-act risk is 100%. Likewise, choosing an unbounded number of risky acts will
eventually lead to the bad outcome happening for sure, as long as the act has some nonzero risk associated with it.
This simply imposes that the probabilities produced by the risk-aggregation function start from
zero and rise to one as we increase either the riskiness of an individual act or total risk-taking. In
addition, I assume that P is never decreasing in any of its arguments, and strictly increasing to
begin with. I also impose that additional acts do not raise the total probability of a failure if the
acts are riskless, and that raising the per-act risk does not increase the total probability if no risks
are taken.
Assumption 2
, with
if
and
if
and
;
, with
.
Increasing the per-act risk will never decrease the overall probability of a failure;
likewise raising the number of risky acts chosen or the previous stock of risky acts
always (weakly) increases the total probability of a failure. An initial increase in riskiness
or risk-taking strictly raises the total probability of a failure (as long as total risk taking or
riskiness, respectively, are non-zero). Conversely, increasing the number of riskless acts
chosen, or the riskiness of a risky act that is not chosen, does not affect the total
probability of the bad outcome occurring.
Taken together, these assumptions simply state that people must have a general understanding of
risks, so that they understand that probabilities never fall outside of a 0-100% range and that
additional risk taking is bad, up to the limit imposed by maximal probability of 1. They
immediately lead to an initial lemma:
Lemma 1
A.
B.
The effect of increasing the per-act risk on the total probability of the bad outcome is zero
if the per-act risk is one. The effect of the number of risks chosen or the existing stock of
risks approaches zero as the sum of those variables approaches infinity.
Lemma 1A holds trivially if n+m = 0, and likewise for Lemma 1B if r = 0. To see why they must
hold in the non-trivial case, assume they do not hold. Then P is unbounded. But by assumption P
is bounded above at 1, so we have a contradiction. Therefore Lemma 1 must hold in general.
Note that because P is continuously differentiable, Lemma 1A also implies that
. Conceptually, Lemma 1 says that increasing the riskiness of the act high
enough, or taking a sufficient number of risks, pushes the likelihood of the bad outcome to
100%. Once it has reached that point, additional risk-taking does not increase the probability any
further.
The agent’s optimization problem is therefore the following:
By the assumption that n is continuous, the maximand U(n; m; p; c; r) is the sum of continuously
differentiable functions and therefore continuously differentiable itself.
3.2.
Conditions for a Non-trivial Optimum
Since the object of interest in this analysis is the response of n to changes in r, one concern is
whether the solutions to the problem are purely trivial, with fatalism representing jumps to some
maximal level of risk taking. In this section I show that a non-stochastic price for each risky act
guarantees that we will find interior solutions unless risk-taking is not beneficial at all at the
agent chooses to take zero risky acts.
The above optimization problem admits many conceivable forms for the benefit function
B(n), including some that make little intuitive sense. To restrict the discussion to reasonable
benefit functions, I assume that at some point taking additional risks yields no utility gains.
Assumption 3
. As the number of risky acts chosen approaches infinity, the
marginal benefit from an additional risky act approaches zero.7
Under Assumptions 1-4 the problem still admits trivial corner solutions where n*=0. In order to
discuss interior solutions, I impose one additional assumption.
Assumption 4
. Risk-taking is desirable: given the stochastic and nonstochastic costs of risky acts, agents will choose a non-zero level of risk-taking.
Empirically, Assumption 4 seems reasonable in many applications: for example, a large
proportion of people have had unprotected sex at some point in their lives. If the converse of
Assumption 4 holds, agents will (weakly) prefer to set n=0, and the problem becomes trivial.
Given Assumption 4, however, the model allows a fairly powerful statement to be made:
Proposition 1
if p>0.
An interior solution to the optimization problem described in (1) is guaranteed whenever
the non-stochastic cost (e.g. the price) of a risky act is not zero.
Proposition 1 follows because
Assumption 3 and Assumption 4, and because
7
by
. This, along with
This assumption is substantively identical to the sixth Inada condition used to guarantee the stability of
neoclassical growth models.
the continuity of U, allows me to use the extreme value theorem to state that U has at least one
optimum where
, as long as p>0.
This eliminates the possibility of trivial corner solutions, in which the optimal response
to an increase in risk is always to either choose
or
(where
is some upper
bound on n that prevents it from reaching infinity). Conversely, if p=0, then given the other
conditions the optimal n* will be arbitrarily large: U is initially upward-sloping and its slope
never becomes negative, so additional risk-taking is always weakly beneficial. The other
analyses of optimal risk-taking that admit fatalistic responses (Sterck 2011; O’Donoghue and
Rabin 2001) have shown fatalism only as a corner case, in which the individual pursues the
maximum feasible level of risk-taking. While corner solutions are a fairly intuitive response –
they align with the reasoning that once one is doomed, one might as well indulge as much as
possible – they are not empirically relevant: there is little evidence that individuals ever truly
seek out the maximal level of available risk-taking. Moreover, the reason for this is exactly that
given above – taking additional risky acts, whether that means smoking more or seeking out sex
partners, carries pecuniary costs so that there are tradeoffs with out goods an individual might
desire.
Proposition 1 guarantees that the optimum will be non-trivial if the price of risk-taking is
positive. It does not rule out interior optima in other cases; O’Donoghue and Rabin do have an
interior optimum in their model’s non-fatalistic case, for example. However, it is a fairly
intuitive economic result: people are constrained by resources from pursuing the high extreme in
risk-taking. The results that follow will hold for the commonly-seen case in which people pursue
some intermediate level of risk-taking irrespective of their perception of the per-act risk r. In the
following section I will show that fatalism can occur even for these interior solutions.
3.3.
Comparative Statics
Given that an interior solution exists, the optimal choice of n,
, must satisfy the first-order
condition:
where
and
are the derivatives with respect to
of
and
.
also needs to satisfy the second-order condition that the utility function be concave at that point.
It is not possible to solve for
and
without additional assumptions about the functional form of
, and there will be no closed-form solution for
for most possible
functional forms of the benefit and risk aggregation functions.8
Despite the intractability, in general, of the precise optimum
the response of
, it is possible to explore
to changes in other variables without solving for the optimum analytically by
employing the implicit function theorem (IFT). In particular, there is a function
, and therefore the IFT allows us to compute the
comparative static for changes in
in response to changes in , as well as the way that response
changes when varies.9
For comparative static I the denominator is
. This is precisely the left-
hand side of the second-order condition (1.4), and is therefore weakly negative. It is not possible
8
In particular, if I impose that P be the true risk-aggregation function derived from the binomial distribution and
that B have a logarithmic form, then no closed-form solutions for n* are possible.
9
In Appendix A I derive additional comparative statics with respect to p and c.
to rule out the possibility that the second-order condition is exactly zero in general, so the
optimum occurs at a flat region of the utility function. This would mean that neither comparative
static would exist, and the model would predict neither self-protective nor fatalistic responses.
Since such flat regions of the utility function seem unlikely, I will assume the second-order
condition holds strictly.10 The denominator of comparative static II is just the square of the same
expression and therefore strictly positive. Its numerator is the product of
;
is negative as long as
. As a result, comparative statics I and II
will have the same sign, which is opposite to the sign of
Before deriving the sign of
and
.
in general, I first consider the specific case
where agents use the true risk-aggregation function
. This function comes from the
binomial distribution, and can be constructed as follows. Given a stock of past risk-taking
,a
choice of the number of additional risks to take , and the per-act risk , the probability of a
failure after one act is ; after two acts,
; and after three,
.
The probability of avoiding a failure after three acts is
Likewise the probability of avoiding a failure after
chance of a failure having occurred by after
.
acts is
. Therefore the
acts is:
The relevant second derivative from Comparative Statics I and II for this functional form is then
Interestingly, the expression for
, is weakly positive, but
10
may be positive or negative: the first factor,
is either greater or less than zero
Note that even in the limit as the denominator approaches zero all the results in this section will go through.
depending on the values of ,
, and . It is possible to solve the value of that determines the
sign of the second term.
This result holds even if we assume the agent is at
. Then the threshold value of becomes
. This threshold depends on the level of past and current risk-taking. Using the true riskaggregation function
, as
approaches zero, the value of the per-act risk
which leads to fatalism approaches . As
becomes sufficiently large, the critical value of
can become arbitrarily close to zero. Imagine a decisionmaker who has a long history of taking
chances. If he believes the per-act risk is low, and he finds out it is quite high, then he will be
nearly-certain he is already doomed. Thus the cost of taking one more chance goes down
substantially.
The above proof of the existence of a tipping point for the true risk-aggregation function
is due to O’Donoghue and Rabin (2000). However, whereas they consider only the
true risk-aggregation function
, it is possible to show that the same qualitative result
holds for any credible risk-aggregation function
. To prove this I first show that the
cross-partial is initially positive:
Lemma 2
and
. The cross-partial derivative of the total probability
of a failure with respect to riskiness and number of risky acts chosen is positive when the
number of risky acts
This follows straightforwardly from Assumption 2:
is zero if n and m are both zero and
positive if at least one of n or m is positive, so the initial cross-partial is positive; a symmetric
analysis holds for
.
Given Lemma 2, we can therefore prove that this cross-partial changes sign in general,
for all functions P that meet the conditions laid out above.
Proposition 2
with
if
s.t.
if
and
. For sufficiently high values of the per-act risk, increasing the per-act risk
actually diminishes the marginal impact of additional risk-taking.
To prove this, I consider two functions
with
and
. By Lemma 2,
Assumption 1 also gives us
and
Then these two continuous functions begin at the same value and converge to the same value, but
the slope of
is initially higher than that of
which the slope of
that of
exceeds that of
. This implies that there must be some point at
. If not then the value of
can never catch up with
.
Formally, consider a point
sufficiently close to zero that
, which
must be possible because the second function’s slope is initially higher. Then the average slopes
of the two functions between
and some higher point
are
and
,
so the ratio of the two slopes is
ratio approaches
. Taking the limit as
approaches infinity, this
, which is greater than one. This implies that there is a point above
which the average slope of
exceeds that of
; therefore, by the extreme value theorem
there must be a point where the instantaneous slope
. Figure 1 illustrates why this
must be the case. The solid blue line gives the known initial shape of
red line for
and likewise the solid
. Above the breakpoint at infinity, the two-colored line shows their common
value of 1. The dashed lines show the implied average slopes in the intermediate region; because
is initially shallower, it must be steeper on average over this range.
This ensures that a tipping point must exist in any valid risk-aggregation function
. It does not rule out multiple tipping points, which could conceivably arise from
sophisticated curvature of the risk-aggregation function, but the number of such tipping points
must be odd. I will ignore the possibility of multiple tipping points, motivated by the fact that for
the true risk-aggregation function
the cross-partial derivative changes sign only once.
Because the impact of riskiness on the marginal cost of risk-taking has a tipping point,
responses to riskiness will have tipping points as well. Formally we have the following two
comparative statics:
Proposition 3
if
, and
if
(Comparative Static I)
There exists a threshold value for the per-act risk, , at which rational behavior switches
from self-protection (negative responses to risks) to fatalism (positive responses).
Proposition 4
if
, and
if
(Comparative Static II)
Increasing the cost of a failure will increase the magnitude of the responsiveness of risktaking to per-act risks, making it more negative when agents are self-protective and more
positive when they are fatalistic.
Comparative Static I is my central result, which is that there is a tipping point not just in the
marginal cost of risk-taking (as O’Donoghue and Rabin (2001) show for the true risk aggregation
function
) but also in the optimal choice of n. Comparative Static II is an extension
of the finding of Oster (2012), who shows develops a model of rational responses to HIV risks to
show that agents will respond more to the per-act risk of HIV infection when the costs are
higher, e.g. when non-HIV mortality in their area is lower. In this model, a similar result holds,
but only for self-protective agents - those who face risks below . Above , higher costs will tend
to encourage more fatalism.11
Thus the model implies that under fairly broad and plausible assumptions, rationally
fatalistic responses will occur for sufficiently high combinations of the per-act risk r and the past
stock of risk-taking
. While somewhat surprising, this result is consistent with the intuitive
notion, expressed by the men quoted in Kaler (2003), that having made enough mistakes in the
past can doom you to HIV infection, no matter what you do to protect yourself now. If HIV is
unavoidable, attempting to mitigate your own risk of contracting it is useless.
This applies to any situation where there is a risk from each act chosen, and the outcome
goes unrealized for an extended period of time. Consider an individual who knows he has
engaged in some risk-taking in the past in the past. If his accumulated stock of risky acts, and his
perceived per-act risk, are sufficiently high, then learning that the per-act risk is lower than he
had thought can actually lead him to take fewer risks.
11
Another result from Oster (2012) is that all else equal higher costs will lead to less risk-taking. This also holds for
my theoretical framework. See Appendix A for a derivation.
3.4.
Other Risk Aggregation Functions
The results above hold for a broad range of possible risk-aggregation functions that
satisfy a minimal set of conditions, including the true function
. However, the central
point – that behavior will swing from self-protection to fatalism for sufficiently high values of
– is driven by a tipping point in impact of riskiness on the marginal cost of riskiness. This kind of
tipping point may exist even for far simpler heuristic risk aggregation functions that agents might
employ, in particular ones that are not differentiable and therefore not amenable to the
techniques employed in Sections 3.1-3.3. I therefore cannot prove that an interior optimum exists
for such functions, or that optimal risk-taking will switch from self-protective to fatalistic.
Instead, I demonstrate that two very simple heuristic risk aggregation functions exhibit this
tipping point phenomenon.
It might seem that this sort of tipping point is an esoteric mathematical feature of how
probabilities add up that people cannot be expected to understand, but in fact such tipping points
arise naturally and in a comprehensible way from some fairly basic heuristic risk aggregation
functions. Consider the simple linear function used in much of the literature, where the
assumption is made that levels of risk-taking and per-act risks are sufficiently low that the
probability never approaches 1. Agents might use a similar rule, but also assume that if the
probability does reach 1 then it stays there forever:
This function might appear to lack a tipping point as defined in Proposition 2, but the same basic
behavior actually obtains. Consider two agents, one who believes r = 0 and one who believes
. If both agents increase their risk belief by
, the marginal cost of
increasing n rises for the first agent and falls for the second. Any shift in r that increases its value
to at least
will induce fatalism, with further increases having no additional effect on
behavior.
An even simpler alternative is the "exposed enough" heuristic discussed in MacGregor et
al. (1999), wherein people think they are totally safe as long as they stay below some level of
activity, and then doomed with certainty if they take too many risks:
In this case only the act that shifts an agent over the threshold,
, has a direct
marginal cost – all other acts carry no cost at all. Increasing r will in general push agents closer
to the margin of being “sufficiently exposed” to suffer harm, thus carrying an indirect marginal
cost. But if r reaches or crosses
, the agent believes he or she is already sure to suffer
the bad outcome and hence this decreases the marginal cost of an additional act to zero.
Despite not being amenable to analysis through standard optimization techniques, these
functions both exhibit the crucial tipping-point phenomenon, implying that the results of Section
3.3 could hold even if agents handle the addition of risks in a very simple and heuristic way.
3.5.
Domains of Self-Protection and Fatalism
Comparative Statics I and II identify a threshold value of (that depends on
+
)
above which agents will become fatalistic, and increased per-act risks will lead to more risktaking rather than less. That is, the rational fatalism model implies that risk responses will not be
monotonic, but will shift from negative to positive when reaches . This differs sharply from
much of the empirical literature on risks, in which the expected cost of infection, is linear in the
per-act risk:
. Under the linear model, risk responses will be
monotonic and negative; there is no tipping point. In other words, the rational fatalism model and
the linear model give qualitatively identical predictions for any behavior that occurs below the
tipping point .
As a result, the rational fatalism model implies that we should expect to see three distinct
patterns depending on the domain in which it is applied. In settings that are either low-risk or
low-activity, it predicts negative responses consistent with the conventional linear model. In
settings that are both high-risk and high-activity, it predicts positive, fatalistic responses. Finally,
in heterogeneous settings, we expect to find a roughly U-shaped response, with some amount of
self-protection as well as some fatalism.
The first domain includes the Kenyan teenagers studied by Dupas (2011) - they are
sexually inexperienced, and therefore even fairly high perceived per-act risks still cause them to
behave in a self-protective fashion. The second domain is exactly that discussed in Kaler (2003) men with extensive sexual experience who perceive high per-act HIV risks end up rationally
fatalistic; they are doomed no matter what, so why bother using condoms. The third domain is
the most interesting, and most comprehensive: most populations include both people who think
they have some past exposure and those who think they have never taken a risk. In the majority
of cases I expect observed patterns of risk responses to include both self-protective and fatalistic
behavior.
3.6.
Comparison with Existing Theoretical Work
As discussed in Section 2.2 above, previous theoretical work has studied fatalism as a
potentially rational response to risks, with both O’Donoghue and Rabin (2001) and Sterck (2011)
developing formal mathematical models that predict fatalistic responses at some margins. The
rational fatalism model outlined in the above sections improves on those models in several
valuable ways.
First, previous research has only been able to demonstrate fatalistic behavior as a corner
solution, where agents either pursue a low, self-protective level of risk-taking or jump to a point
where they fatalistically take as many risks as possible. Sterck (2011) shows that without random
mistake or a condom failure, people will choose either no risk-taking or maximal risk-taking and
stay there forever, while O’Donoghue and Rabin use a contrived utility function to show that
agents will sometimes react to increased per-act risks by taking as many risks as they possibly
can. While O’Donoghue and Rabin do discuss the implications of their results for interior
solutions, they do not demonstrate that interior solutions actually exist when responses are
fatalistic, as I did in Section 3.2. In contrast the results of Section 3.3 are explicitly conditioned
on interior solutions; the predictions hold specifically for values of
that are not at a maximum
or minimum allowed value. This is guaranteed to be possible because individual behavior is
constrained by an implicit budget constraint; every risky act carries a cost p. In Section 4 I will
show that a wide range of interior solutions exists for plausible parameter values.
Second, both Sterck (2011) and O’Donoghue and Rabin (2001) combine the number of
risky acts being chosen and the unavoidable minimum number of acts into a single variable, and
neither separates out potential past mistakes from current decisionmaking (although Sterck
portends this line of reasoning when discussing mistakes). The rational fatalism model draws an
explicit separation between the unavoidable number of risky acts,
, and the current number of
acts being chosen, n. It also focuses on the case where the m unavoidable acts probably occurred
in the past, but that the outcome (malaria infection, lung cancer, HIV transmission) has not yet
been revealed.
Third, both previous papers rely on specific, simple benefit functions for the risky acts
that potentially raise questions about the robustness of the results to other functional forms,
whereas the model laid out in this section is robust to any concave, increasing benefit function
with sufficiently strongly diminishing returns.
Finally, the previous theoretical work has used the true risk aggregation function to show
interesting effects of varying per-act risks on choices of risky behavior. This is a valuable line of
inquiry, but extensive research has demonstrated that in addition to overestimating per-act risks,
individuals often do not understand how those risks compound into the total probability of a
failure. O’Donoghue and Rabin (2001) note that evidence on HIV transmission beliefs indicates
that the risk aggregation functions individuals employ tend to be far more concave than the true
function. The rational fatalism model shows that similar non-monotonic behavior can be found
for any valid risk aggregation function that satisfies very simple conditions. In addition, I
illustrate similar tipping point behavior even for very simple, non-differentiable risk aggregation
functions that people might realistically use even if they know very little math.
4. Empirical Approach
Based Comparative Static I above, any analysis of data on risk-taking must consider the
possibility of a non-monotonic response to the per-act risk. In this section I lay out the regression
specifications commonly employed in the literature on risk-taking (Section 4.1). I then use the
model from Section 3 generate simulated data (Section 4.2) in order to examine the effectiveness
of the basic approach. I show that under plausible parameter values, a squared term in per-act
risks can be statistically insignificant even when the underlying model has a tipping point. I also
find that in addition to missing the fact that the relationship has a U shape, running simple linear
regressions will tend to attenuate the magnitude of the measured risk response (Section 4.3). I
then describe two methods that may be more successful – the U-shape test of Lind and Mehlum
(2010) and a semi-parametric technique that lets the user look for non-monotonicity directly
(Section 4.4). I find that the Lind and Mehlum method is not particularly successful unless the
functional form of the relationship is close to quadratic, but that the semi-parametric approach
has promise. I propose a combined method, which uses the Lind and Mehlum test but checks that
the relationship is close to quadratic semi-parametrically. This substantially out-performs the
conventional approaches from the literature: a first-order relationship between risk-taking and
per-act risks can show a significant relationship even when the underlying behavior is nonmonotone, and a second-order term in risks can be insignificant even when non-monotonicity
exists. Finally, in Section 4.5, I move to a discussion of several threats to any attempt to use
observational data to identify the relationship perceived per-act risks and risk-taking, and
solutions to those problems.
4.1.
Regression specification
The standard approach used in the literature is to run regressions of the form
where
is a vector of controls. This is fine so long at the relationship between and
is
monotonic, but as discussed in Section 3 above there is reason to believe it may not be. While
relatively little research has considered the possibility of a non-linear relationship between and
, a typical strategy for doing so is to add a squared term to the above regression (cf. the De
Paula et al. 2009 study of the relationship between people’s own perceived HIV status and risky
sex). This yields the following specification.
One method for determining whether an estimated relationship is non-monotone is to
examine the statistical significance of the quadratic term, r2.
4.2.
Simulated data
To explore the effectiveness of these regression specifications, I construct simulated data
using the model described in Section. Specifically, I draw a pseudorandom sample of 3000
individual-level observations, comparable to the size of many field surveys on sexual behavior in
Africa, with the following parameter value distributions. I set the past level of risk-taking m to 2
acts for all individuals. I then set the mean of t to the threshold value for declining marginal
, and SD(r) arbitrarily to 0.2.12 I use a logarithmic utility function
costs,
, setting
for all observations, and impose the true risk-aggregation function,
. I set the cost of a failure to 1 for all individuals. I
choose the mean of p to be 0.15 and its standard deviation to be 0.03 arbitrarily. Using these
values, I engage in a grid search over possible integer values of n ranging from 0 to some upper
12
Note that this is only the tipping point if agents choose zero risky acts (n=0). This is tautologically never observed
since if n=0 no relationship is present. Since agents will in general choose some level of risk-taking greater than
zero, the observed tipping point in the data will differ for each individual, and will be smaller than 0.3935.
limit (which was selected randomly to lie from 15 to 21 for each individual) in order to find the
highest total utility value. The values picked above generate almost entirely interior solutions:
none of the simulated data points had n* = 0, and only three were at their individual-specific limit
for n.
4.3.
Parametric regression results
The first two columns of Table 1 show the simple linear model and a model with both a
linear and a quadratic term respectively, both without any regression controls. Both
specifications find a significant positive first-order relationship between risk-taking and per-act
risks. The estimated quadratic term in specification 2 is positive but statistically insignificant.
Together with the first-order term this would imply a convex parabola with a minimum below
zero. The only relevant potential control is the cost per risky act, p. Controlling for the per-act
cost does not substantially affect our coefficient estimates, but it does improve the precision with
which they are estimated. While the estimated sign of the first-order term is uniformly positive,
implying a positive average relationship between risks and costs, this is simply the result of the
specific parameter values chosen. Other simulations (not shown) give a negative first-order term
but results that are qualitatively similar to what follows.
The results of this analysis are revealing – although this data was simulated using a
model with a built-in tipping point, an approach that looks at the significance of the second-order
risk term would conclude that the relationship is probably monotonic.
Another notable aspect of these regressions can be seen by comparing the coefficient on
the first-order term in Columns 1-4 to that in Column 5; as I shall discuss in more detail later,
Column 5 does a better job of matching the shape of the curved part of the function. Irrespective
of whether a squared term is included, these regressions estimate first-order coefficients that are
far less than 20% of the magnitude of the true coefficient, and of the opposite sign. This implies
that standard regression specifications may yield misleading results when applied to data where
rational fatalism is possible.
4.4.
Identifying U-shaped relationships
To address this shortcoming, I first explore the formal parametric test for U-shaped
relationships proposed by Lind and Mehlum (2010). They argue that a significant quadratic term
is necessary, but not sufficient, for a non-monotone relationship. The logic is simple: if a
relationship is monotonic, but concave, the best fit to the data will in general involve a
significant second-order coefficient. But the turning point in the predicted parabola might be far
outside the data range. Thus it is necessary to formally test whether the fitted values involve both
a statistically significant upward-sloping and downward-sloping component. Table 1 includes the
p-values for this test in Columns 2 and 4, which in both cases are extremely close to 1.13 For this
dataset, the Lind-Mehlum test concurs with the simpler method of looking at the coefficient on
the squared term, and indicates that there is almost surely no non-monotonicity in the
relationship between n and r. This is unsurprising – the squared coefficient in one case does not
meet the necessary condition of being statistically significant, and in the other just barely crosses
that threshold.
Since the Lind-Mehlum method still relies on the estimated coefficients from fitting a
second-order polynomial to the data, one potential concern is that the model fit is sufficiently
poor that we are drawing false inferences. In many cases the solution would be to derive a
superior regression model based on what is known about the data generating process. However,
as discussed in Section 3, the utility maximization that generated this data has no closed-form
13
Tests conducted using the utest function in Stata, written by Lind and Mehlum. Technically both rejections are
trivial, since the estimated minimum point of the parabola would have been outside the data range regardless.
solution so we would be left with a fairly complex challenge of numerical optimization, that
might yield a wide variety of solutions depending on the parameter values chosen. A natural
approach to this issue is to estimate the relationship non-parametrically, in order to avoid any
issues of model selection. There are two general methods for attempting this.
The first is to simple do a scatterplot of the values of the two variables, and possible fit a
non-parametric curve estimate to that data. Figure 2 presents a simple scatterplot (with no curve
fit to it) of the number of risky acts against the per-act risk. While a possible U-shape is evident,
the bivariate relationship is fairly noisy.
An alternative is to use a semi-parametric approach that relies on parametric regression to
reduce the noise and omitted variable bias induced by other variables. I do this by using the
partially linear model estimator as developed by Yatchew (1997) and implemented by Lokshin
(2006) to plot the conditional relationship of n and r, holding p constant. The results are shown
in Figure 3. The plot also includes a plot of the LOWESS-estimated non-parametric curve, but
the non-monotone relationship is clear without it.14
These results imply that the Lind and Mehlum approach was indeed hampered by poor
model fit; the relationship, while non-monotonic, is also clearly not a parabola. As an additional
check, I repeat the same regression as in Column 4 of Table 1, but apply it to just the portion of
the data between r = 0 and r = 0.4. The results are shown in Column 5 of the same table. Now
the linear and quadratic terms are large and statistically significant, and the sign of the linear
term is negative. The Lind and Mehlum test rejects monotonicity at beyond the 0.001 level.
Given the limitations of the Lind and Mehlum test for analyzing this model, it is sensible
to look for a semi-parametric equivalent. For bivariate non-parametric analysis, options do exist.
14
The lowest observed value of n is 3, implying a tipping point of 0.1813 or less. This is roughly consistent with the
observed values.
One example is Bowman et al. (1998), who develop a formal test for the monotonicity of a nonparametric locally linear regression function. They rely on the use of a critical bandwidth that
flips the estimated relationship from monotonic to non-monotonic, and use it to construct This
has substantial promise for the analysis of risk-response relationships, but unfortunately there is
currently no equivalent for partially linear regressions. That limits the direct applicability of their
method, since confounding omitted variables are likely to substantially bias the observed
relationship between n and r. Adapting the Bowman et al. approach to partially linear regressions
is beyond the scope of this paper, but will be a focus of future work.
Lacking a formal semi-parametric test for non-monotonicity, I will rely on a strategy that
combines the Lind and Mehlum test and partially linear regressions. My formal test for nonmonotonicity will use the Lind and Mehlum approach, but I will confirm that the relationship is
approximately quadratic using semi-parametric regressions. If it is not, but possible nonmonotonicity is evident, I will use the data truncation method described above, applying the Lind
and Mehlum test to the portion of the data that is potentially non-monotone.
The preceding analysis demonstrates that the econometric methods commonly used in the
literature perform poorly when data is generated in a manner consistent with the rational fatalism
model of Section 3. The standard regression specifications for measuring risk-taking as a
function of per-act risks will tend to identify a slope that is too small in magnitude and
potentially of the wrong sign. Moreover, if a regression includes both linear and quadratic terms
in risks, a statistically significant squared term is neither necessary nor sufficient for nonmonotonicity. Correspondingly, the formal parametric test for non-monotonicity from Lind and
Mehlum (2010) can fail to identify the fact that the data has a tipping point if the model is
sufficiently mis-specified. In contrast, the approach outlined above – using the Lind-Mehlum test
along with a semi-parametric approach to check for the model fit – appears to perform much
better. A semi-parametric approach is also advisable for confirming that the estimated regression
coefficients are sensible.
4.5.
Threats to identification
Even if an appropriate approach to measuring potential non-monotonicity is adopted,
using observational data on the relationship between per-act risks and risk-taking can lead to
false inferences for three reasons: the data is affected by two different kinds of reverse causality,
which may bias the results in opposite directions, and also subject to substantial measurement
error, which will attenuate measured effects toward zero.
The first reverse causality issue has to do with cases where the per-act risk is actually the
result of previous risk-taking, which may be correlated with current levels of risky choices. This
is most notably the case in the literature on HIV and sexual risk-taking: the per-act risk depends
on the prevalence of the virus, which in turn depends on how much risky sex people have. Other
infectious diseases share this property - malaria, for example, is likely to be more common in
places where mosquito net use is lower. For this sort of risk, simple cross-sectional comparisons
would find high risk-taking in high-prevalence places, and draw a false inference. This issue will
tend to bias estimates toward positive infinity.
Another sort of reverse causality occurs when individuals form their beliefs about the
risks of various acts through experience with risk-taking. To remain with the example of HIV,
suppose a decisionmaker believes that she will quickly show symptoms after contracting the
virus. After engaging in a sufficient quantity of risky sex over time, and not developing
symptoms, her perceived per-act risk would decline, generating a mechanical, negative
relationship between risk-taking and per-act risks. This reverse causality problem will have the
opposite effect from the first one, tending to bias estimates toward negative infinity. It is possible
that some belief-formation processes might generate positive biases as well.
Separate from reverse causality is the problem of poorly-measured data on sexual risktaking and risk perceptions. Data on sexual behavior may be even more susceptible to recall
errors and biases than other behaviors studied using survey data, and also carries the risk of
“social acceptability bias”, where respondents say what they think the enumerator wants to hear.
Active research in survey methodology has focused on the development of life-history and diarybased methods, in order to reduce these sorts of measurement errors (Luke et al. 2009). Data on
risk perceptions present even more severe measurement problems. People with limited math
backgrounds may have trouble understanding questions about probabilistic expectations, and this
issue is likely to be exacerbated in developing-country settings (Attanasio 2009). Encouragingly,
recent work conducted in Malawi by Delavande and Kohler (2009) has shown that, through
carefully designed questions, it is possible to elicit meaningful beliefs about probabilities even
when individuals have very limited formal education.
Errors in both reported sexual behaviors and risk beliefs are likely to take many forms,
and will often be effectively random. Randomly mismeasured sexual behavior will tend to
decrease the precision of the estimated relationship between n and X, while random measurement
errors in beliefs will attenuate any estimated relationship, biasing estimates toward zero. There is
one particular kind of measurement error in risks that may not be random and is worth observing,
however. A certain fraction of individuals in all contexts will answer any probabilistic question
with “50%”, indicating not that they think it is a 50-50 chance but that they simply do not know
(Lillard and Willis 2001). This is true even if questions are posed deterministically, as in “what
share of your friends owns an iPod”. One crude way of estimating the measurement error in a
probabilistic belief variable is to look at the extent to which there is an anomalously high “50%”
response rate.
One factor that is likely to increase measurement error in probabilistic beliefs is the use
of true probabilities as proxies for beliefs. This has been employed in research on HIV and risktaking in Africa (Juhn et al. 2009, Oster 2012) as well as in the United States (Ahituv et al. 1996,
Auld 2006). Since many individuals will be misinformed, this issue will tend to exacerbate the
attenuation bias problem described above. This kind of measurement error also raises the
question of what structural relationship is being estimated. Individuals are less likely to be aware
of the actual probabilities than policymakers, who may intervene to promote reductions in risktaking. Thus a significant estimated response may not reflect actual behavioral responses by
individual people.
The first two identification threats can best be resolved by instrumental variables
approaches. An exogenous shock to beliefs will allow the causal effect of changes in those
beliefs on risk-taking to be measured. For example, Oster (2012) and Juhn et al. (2009) use the
distance from the origin of the human immunodeficiency virus as an instrument for its
prevalence. To study individual risk beliefs, the optimal instrument would actually be some kind
of quasi-random information campaign, or an outright experiment that provides information
about risks.
Resolving the measurement error issue is somewhat harder. A simple first step is to rely
on individual probabilistic beliefs rather than the true per-act risks, since the two may differ
substantially. Much recent work in HIV and risk beliefs in Africa has begun to do this.
(Godlonton and Thornton 2011, Delavande and Kohler 2011, de Paula et al. 2008). Beyond that,
careful questionnaire design, and variables that can serve as cross-checks within a survey, are
useful tools. Question design should focus on minimizing the “50%” response rate (within
reasonable bounds), for example by asking a followup question about whether respondents are
just unsure (Lillard and Willis 2001). Another approach is to test the sensitivity of the results to
excluding respondents who answer “50%” to the relevant question.
5. Preliminary Results
In this section I apply the empirical strategy laid out in Section 4 to a preliminary
observational dataset on sexual behavior and perceived HIV risks in Malawi. Section 5.1
describes the data used, detailing the construction of variables to capture the number of risks
taken, n, and the perceived per-act risk, r. In Section 5.2 I conduct a basic regression analysis of
the form described in Section 4.1, and show that the standard approach of examining the sign of
a squared term in risks implies at best fairly fragile evidence for a non-monotonic relationship.
Section 5.3 applies the method for testing for a U-shaped relationship described in Section 4.4,
finding that the evidence for a U shape is actually fairly robust. Based on this I argue that the
evidence is broadly consistent with possible fatalism in rural areas on Malawi’s Southern
District, which is in line with the findings of previous qualitative research. In Section 5.4 I
examine the plausibility of fatalism in this population in two different ways: first, I compute the
number of people who would be expected to tip into fatalism based on the true risk-aggregation
function ; second, I explore just the subset of individuals who think their sex partners are HIVpositive. Both explorations suggest that fatalism is plausible for this population. Section 5.5
discusses some important limitations of this analysis and argues that my results are likely to be
smaller in magnitude than the true relationship, and may actually understate the extent of
fatalism in this population.
5.1.
Data
I use data collected in the Zomba District of Malawi’s Southern Region. Malawi is in the
midst of a severe HIV epidemic, with an infection rate of nearly 12%. Its experience has been
typical of countries in Southern Africa in that its high infection rate has been sustained for a long
period, and has declined only slightly from its peak (National AIDS Commission, 2003). HIV is
the leading cause of non-infant mortality in the country, and is responsible for one in every three
adult deaths (PEPFAR 2008).
The data form the peri-urban and rural survey components of the Situational Analysis of
Sexual Behaviors and Alternative Safer Sex Strategies which I helped run as part of a
collaboration with PIs Professor Rebecca Thornton of the University of Michigan and Dr. Jobiba
Chinkhumba of the University of Malawi College of Medicine, and co-investigators Sallie Foley,
LMSW, of the University of Michigan and Alinafe Chibwana, MA, of Catholic Relief Services
(Kerwin et al. 2011). We conducted the surveys in July of 2011 in areas adjoining a major
trading post in Zomba District. The survey enumerators were hired near the survey sites, and all
surveys were conducted in Chichewa, the national language, with female enumerators
conducting all interviews with female respondents and likewise for males.
We drew a geographic representative sample of 447 sexually active adults (age 18-49) in
the area based on the locations of households, with selection rules for picking participants within
each household. Our sampling strategy oversampled urban areas and unmarried individuals, so
we constructed sample weights to match the population proportions by locale and marital status
to the 1998 Malawi census. All reported statistics use these sample weights unless otherwise
noted.
Summary statistics for the sample’s demographics are given in Table 2, below. The
sample is 5% peri-urban, meaning people that live around the major trading post, and about 60%
literate. The average age is 29 and people have completed five and half years of school on
average. Nearly 90% of the weighted sample is married. The sample is also almost 60% female;
anecdotally, people in the region reported that men were more likely to migrate away for work.
Respondents have 3 children on average and want just over one more. The sample is 80%
Christian and 20% Muslim, and is dominated by the Lomwe ethnic group at 55%. Other large
ethnic groups include the Yao (22%) and Chewa (16%).
Table 3 summarizes the data on the perceived risks and costs of HIV for our respondents,
as well as their self-reported sexual behavior. The data contains both a measure of perceived past
exposure to HIV (the number of a respondent’s past partners that, looking back, they now think
were HIV-positive at the time) and of their potential future exposure (the share of attractive
people in the region who the respondent thinks have HIV). The former measure appears to be
more or less uncorrelated with current sexual behavior, so I rely on the latter. Respondents report
that they think that 49% of attractive people, on average, are HIV-positive. The survey also
asked respondents how many out of 100 people who had sex with an HIV-positive person last
night would contract the virus. Over 50% of respondents answered that 100% would become
infected, and the mean response was 85%. I multiply these two variables to construct a measure
of the perceived per-act risk from unprotected sex with a possible sex partner, or the variable X
from the framework described in Section 3. The average value of r in this sample is 43%.
The survey also asks about people’s beliefs about their life expectancy, including without
HIV, if they contracted HIV today but did not have ARVs, and if they contracted HIV today and
did not have ARVs. In addition, it asks them how many out of 100 people from their area they
would expect to have access to ARVs. I use these to construct a measure of the expected cost of
an HIV infection in terms of years of life lost. On average, respondents believe HIV infection
will cost them 18.9 years of life – they expect to live almost 30 years if HIV-free and 11.4 years
if they contract the virus.15 While probably lower than the expected cost of HIV in a developed
country, this is hard to square with the idea that limited responses to the threat of HIV infection
15
These numbers do not add up properly because some data is missing for each variable.
are the result of low expected costs in terms of foregone years of life. I will use this variable as
the equivalent of C from the Section 3 theoretical framework.
Finally, the dataset contains a detailed sex diary that captures detailed information on all
sex acts in the past week. I use this to construct a measure of the total number of sex acts and the
number in which no condom was used. The average respondent had unprotected sex 1.9 times in
the preceding week. In the analyses that follow, I will rely on this variable as the measure of
sexual risk-taking from – n from the theoretical framework in Section 3.
5.2.
Regression Analysis
The focus of this analysis is to explore the relationship between n (Unprotected sex acts
in past week) and r (Average Prob. of HIV Xmission per act, attractive people) for my
representative sample of individuals from Malawi’s Zomba District. As a first pass, I construct a
simple, unweighted scatterplot of the two variables (Figure 4). No obvious bivariate relationship
is evident. A simple bivariate regression does not find a significant coefficient on n either (not
shown).
However, the oversampling of unmarried individuals and people near the trading post
may mask an underlying relationship between n and r. I therefore run a set of weighted
regressions, using the specifications described in Section 4.1. Table 4 presents the results.
Column 1 is a bivariate regression of n on r, but using the sample weights. I estimate a
statistically significant positive first-order relationship between perceived per-act risks and the
number of risks taken. Taken at face value, this would imply that the population is fatalistic on
average. However, this result is not robust to the inclusion of a second-order risk term (Column
2), which yields a negative estimated slope. The coefficient on the quadratic term in risks is
positive but imprecisely measured.
Columns 3 and 4 repeat the specifications from Columns 1 and 2 but include a broad set
of regression controls (suppressed from the tables for space). Again the bivariate relationship
(Column 3) is positive and significant. Including a quadratic term in risks (Column 4) reverses
the sign of the first-order term and decreases the accuracy with which it is estimated. The
quadratic term is statistically significant at the 5% level.
In Columns 5 and 6 I extend the model to capture Comparative Static IV (from Appendix
A), which states that the number of risk acts is declining in the cost of a failure (captured by the
expected years of life lost if the respondent contracts HIV) and Comparative Static II, which
states that the higher the cost of a failure, the stronger the relationship between risk-taking and
per-act risks is likely to be. My results contrast with Oster (2012) in that I do not see a
particularly strong effect of the cost of HIV on the risk-response relationship. Increasing the
number of life years lost to the disease by 10 will decrease the slope of the n-r relationship by
just 0.1 in the quadratic specification of Column 6, just 2% of its overall magnitude. However it
is evident that cost of an HIV infection was an important omitted variable in the other
regressions. The first-order term becomes more negative in both specifications and in the secondorder specification the squared term becomes more positive, and in both specifications the
coefficients are more precisely estimated.
Comparative Static V also states that there should be a tipping point in the effect of the
cost of a failure on the relationship between n and r, and that the two tipping points should be
equal. I therefore include an interaction between the cost of a failure at the square of the per-act
risk in Column 7. This still does not lead to statistically significant results for Comparative
Static V, but it decreases the precision of the estimates for the linear and quadratic terms in risk.
Because Column 7 captures both of the relevant comparative statics that can be measured in this
dataset, I will rely on it as my preferred specification for further analysis.
5.3.
Testing for a non-monotone relationship
Looking just at the squared coefficients in Table 4 would suggest that there is some
evidence of a non-monotone relationship between per-act risks and risk-taking but that it is not
robust to changes in the specification used. However, as shown in Section 4, this method is likely
to be misleading if the underlying data-generating process is based on the rational fatalism model
of Section 3. Instead, I employ the two-step test described in Section 4.4. First, I run the LindMehlum parametric test for a U-shaped relationship. The p-values for this test are given in the
third-to-last row of Table 4. While the unconditional relationship (Column 2) does not display
significant non-monotonicity, the test rejects monotonicity for the other three specifications at
the 0.1 level. In the preferred specification of Column 7, the p-value is 0.096.
To confirm that the Lind-Mehlum test is not misleading due to poor model fit, I run a
partially linear regression of risk-taking on per-act risks, including all the covariates from
Column 7 (with the exception of the squared term in risks, which will be captured by the
LOWESS curve). This regression is plotted in Figure 5. Although the LOWESS curve has a
flattened U shape, no obvious relationship is visually evident from the plotted datapoints.
However, the partially linear estimator employed does not accept sampling weights, meaning
that these results are potentially biased due to the oversampling of certain groups – unmarried
individuals and especially people from rural areas. Because of this, Figures 6 and 7 show the
results split out by peri-urban and rural residents. Since rural residents are 95% of the weighted
sample, I focus on Figure 7 as a basic approximation of the weighted partially linear regression.
This plot shows a more noticeable U-shaped relationship, implying that the weighted relationship
we applied the Lind-Mehlum test to has a reasonably parabolic shape and the test results are
probably trustworthy.
In summary, there is suggestive evidence for a non-monotone relationship between risktaking and per-act risks in this dataset. Moreover, the first-order relationship is consistently
positive, which is the opposite of the prediction of the simple linear self-protection model.
Strikingly, these results are based on a weighted sample that is 95% rural. This is consistent with
the finding of Kaler (2003) that some men in rural areas employ rationally fatalistic reasoning.
People near the peri-urban trading center may have more access to HIV testing, which would
tend to resolve the uncertainty about their HIV status and eliminate the fatalistic effect.
5.4.
Plausibility of Fatalism
While these results imply that risk responses may be fatalistic for some people in Zomba
District, it is not clear how plausible this is – could people conceivably think their risk is really
high enough that they cross the tipping point value for per-act risks, ? As a first pass at this
question, I compare people’s beliefs to a plausible tipping-point: I assume that people use the
true risk-aggregation function
. Then the tipping point value of r is
. If people
think they have just a single unprotected exposure to a partner who is HIV-positive, their tipping
point is about 63%; higher levels of exposure imply a lower tipping point. Looking at the
distribution of beliefs in this population, (which is highly-skewed toward 100%), I find that
78.9% of people have beliefs that would be above the tipping point by this standard: if they have
any past exposure to someone they are sure is HIV-positive at all, their beliefs about per-act risks
are high enough to lead to fatalism.
A more conservative approach is to use people’s beliefs about their past partner’s HIV
status (at the time they had sex) multiplied by their perceived per-unprotected-act risk of
contracting HIV from an infected partner. By this standard, 9.7% of people have risk beliefs that
are consistent with fatalism even if they only had unprotected sex with their past partners a
single time – far lower than the actual figure, since condom use is very rare in Malawi. Thus
even by a very conservative standard, one would expect a non-trivial number of people from
Zomba District to react fatalistically to HIV transmission risks.
An alternative method of examining the plausibility of fatalism in this sample is to isolate
how people behave when they think a sex partner is infected with HIV. For this I rely on a set of
questions on the SASB survey that asked respondents about their primary sex partner, which was
defined to be the person they had sex with the most times in the past week, or, if they had not
had sex in the past week, then the person with whom they had most recently had sex. Although a
large share of respondents are married, we did not restrict this partner to be the respondent’s
spouse (and intentionally did not ask if they were). Respondents were asked about the likelihood
that this partner was HIV-positive; 52 of the 447 respondents in the dataset either reported a high
likelihood or said that they were sure their partner was infected. Figure 10 presents a scatterplot
for these 52 individuals, with their perceived per-act risk of infection from sex with an infected
partner on the x-axis and the number of times in the past month that they had unprotected sex
with this partner on the y-axis. Immediately notable is the large cluster of beliefs at 100%. Many
of these respondents also report positive levels of unprotected sex with their partner. A best-fit
line confirms a positive relationship between risk-taking and perceived riskiness. Due to the
small sample size, this relationship is not statistically significant. It is nevertheless suggestive of
potential fatalism among people who think they are being exposed to HIV.
5.5.
Limitations
In Section 4.5, I lay out three potential identification issues with using observational data
to analyze risk responses. With the present dataset I am unable to address either of the two
sources of reverse causality that are likely to be bias these results. I expect that the first reverse
causality issue is of fairly limited importance. The HIV epidemic in Malawi is fairly mature and
all individuals face very similar true prevalences. Moreover, people’s perceived risks have little
relationship with the actual risk they face. The actual per-act risk of HIV transmission is about
0.1% (Wawer et al. 2005), far below the average belief of 85%. And the true prevalence of HIV
in Malawi’s Southern Region is 17.6%, also substantially less that the mean belief for this
variable (49%). Since people do not actually understand the true risk, the causal positive effect of
behavior on the true risk is unlikely to cause a large bias in my results.
The second issue, in contrast, is more important for these data than for data that employ
the true HIV prevalence as a proxy for risks. Since people develop their beliefs in part through
experience I expect this to introduce a net negative bias in these results. Without an exogenous
source of variation in risk beliefs it is not possible to determine the exact nature of the bias in
these results, but the negative bias from the second issue is probably more important than the
positive bias due to the first one. That would imply that the true relationship would be initially
flatter than what is implied by the partial linear regressions in Figures 5, 6, and 7, and then
steeper after the tipping point. In other words, this observational analysis is likely to
undermeasure the true extent of fatalism in this population. This would not be the case if the
belief formation process worked in the opposite way, with more experience leading to higher risk
beliefs.
The measurement error issue is more tractable within the current dataset. As discussed in
Section 4.5, one simple metric for the measurement error in my risk belief variable is the excess
frequency of “50%” responses. Figure 8 shows a histogram of the per-act risk beliefs of my
respondents. There is substantial heaping at exactly 50%. Unfortunately, no viable alternative
variables are available.16 The survey questionnaire did not ask a followup question about whether
people are simply unsure, so it is not possible to exclude only those people or including a dummy
for being unsure. As a crude approximation to that method, however, I replicate the regressions
from Section 5.3 and 5.4 using only the subset of people who are not “uncertain”, meaning they
did not answer 50% to the per-act risk variable. The results are presented in Table 5 and Figure
9. I find uniformly stronger responses to HIV risks across all specifications, and lower p-values
for the Lind-Mehlum test. The preferred specification in Column 7 finds large and statistically
significant first- and second-order terms, and the Lind-Mehlum test rejects montonicity at the
0.05 level. The semi-parametric plot shows a reasonably clear-cut U shape. While this approach
is not perfect, it implies that measurement error is substantially attenuating these results, so the
true response will be larger in magnitude and more statistically significant than what is seen in
Table 4 and Figures 5 through 7.
16
There is a question about the total prevalence of HIV in the population, but a printing error meant that it was
collected only for males. As mentioned above, the data does have the prevalence among past partners, but this has
no appreciable relationship with risk-taking, either positive or negative.
6. Preliminary experimental results
In the summer of 2012, I conducted a field randomized controlled trial (RCT) in to test the
implications of this model. The experiment took place in Traditional Authority (TA) Mwambo,
in the Zomba District of Malawi’s Southern Region. I sampled roughly 2100 sexually active
adults aged 18-49 chosen randomly from 70 villages selected at random from the . Each
participant was interviewed twice: once for a baseline survey, and again for a followup
conducted 1-3 months later. All participants were provided with basic information about the
sexual transmission of HIV and the benefits of condoms.17 Participants from half of the villages,
chosen at random, were also read an information script that presented the actual annual risk of
HIV transmission in serodiscordant couples that have unprotected sex, based on the Wawer et al.
(2005) estimates and also figures from the Malawi National AIDS Commission.
Because existing evidence indicates that fatalistic individuals often frequent rural trading
centers in order to drink and seek out sex partners, the sample of villages was stratified based on
their distance to the closest major trading center.18 One third of the sample was villages within 2
km of a trading center, which is generally agreed to be the maximal distance people will walk for
nightlife; one third was villages between 2 and 5 km from a trading center; and one third was
17
Knowledge of the basics of HIV transmission and prevention is already high in this population. In the 2010 DHS,
nearly 100% of individuals said that HIV was sexually transmitted and over four fifths knew that condoms were
effective prevention. The latter figure may be an underestimate: in survey questions about the risks of unprotected
and protected sex in the Situational Analysis of Sexual Behaviors data, virtually all respondents stated that condoms
provided at least some risk benefit.
18
Trading centers were identified based on the 2008 Malawi Population and Housing Census, which codes periurban areas outside the main cities with enumeration area numbers from 800 to 899. I included trading centers both
inside the TA as well as in other nearby parts of the Southern Region. Since TA Mwambo adjoins the city of Zomba,
I also included the main markets in that city as trading center equivalents. In addition, based on conversations with
local public transit workers, I added three more trading centers (Govala, Kachulu, and Mpyupyu) that do not have
enumeration area codes between 800 and 899 but that are nonetheless major centers for trade and nightlife. The
informants I spoke to stated that besides the EA800-899 sites I had already identified, there were no other places
people went for trading or nightlife outside of TA Mwambo.
more than 5 km away from the closest center. This compares with overall proportions of 10%,
40% and 50% of all villages in TA Mwambo.
I rely on self-reported sexual behavior at the followup interview as my outcome measure. I also
use purchases of subsidized condoms at the followup as an objective measure of risk avoidance:
all participants were given six coins worth five Malawi Kwacha apiece (30 kwacha total, or just
over ten cents), and allowed to purchase up to six packets of Chishango (local-brand) condoms
for five kwacha apiece. This empirical strategy only allows me to estimate linear local average
treatment effects, because I only have two experimental arms. In lieu of fitting a semi-parametric
model to the experimental data, I explore heterogeneous responses to the information treatment
based on the distance to the nearest TC, gender, sexual experience, and baseline risk beliefs, as
well as interactions between these factors.
[Flesh this out more as data comes in]
Conclusion
I develop a theoretical model that generalizes the linear risk-response relationship
assumed in the literature to the case where responses may not be linear. I do this by allowing
people to employ a subjective risk aggregation function that satisfies several broad conditions
about its shape, and show there is a tipping point above which increases in the per-act risk lead to
more risk-taking rather than less. This result holds for any valid risk-aggregation function that
satisfies a set of simple conditions. Even very simple heuristic risk aggregation functions that
require no sophisticated understanding of probability theory also exhibit the tipping point that is
central to my results. The rational fatalism model implies that responses to risks will have both a
downward-sloping (self-protective) and upward-sloping (fatalistic) region. It advances the
previous literature by showing this effect not just for specific simple benefit functions and the
true risk aggregation function, but for a wide range of plausible choices. The rational fatalism
model also shows that fatalistic responses can occur for interior solutions and not just in
situations where people choose to take as many risks as possible. Based on this model, and
imposing some assumptions about the benefit from risky acts and the risk aggregation function, I
generate simulated data and use it to test the effectiveness of standard econometric approaches to
data on risk-taking a per-act risks. I find that the typical specifications can generate misleading
inferences: they may fail to identify non-monotonic relationships and will generate estimates of
the average response that are attenuated relative to the true value and may also be of the incorrect
sign. I develop my own approach based on the Lind and Mehlum (2010) parametric test for nonmonotonicity and the Yatchew (1996) partially linear regression technique. I apply these
methods to preliminary observational data from Malawi’s Zomba District, and find suggestive
evidence of non-monotonicity in that region. I also show that depending on how we think about
past exposures, between 9.7% and 84.4% of people in the region have beliefs that are above the
tipping point into fatalistic behavior implied by the true risk aggregation function. Looking just
at the subset of 52 individuals who think their primary sex partner is HIV-positive, I find that
risk-taking increases with perceptions about the per-act risk, although the sample is not large
enough to rule out a slope of zero. These data are limited by potential measurement error as well
as reverse causality. A first pass at the measurement error issue confirms that it is probably
attenuating the measured relationship between risk-taking and per-act risks, and that the true Ushape is even more pronounced than observed in this data. I also present preliminary results from
a randomized field experiment conducted in Southern Malawi that is designed to test the
implications of this model.
The results of this paper are subject to several important limitations. First, while the
rational fatalism model substantially extends the usual linear risk response relationship used in
the literature, it abstracts from the possibility of multiple periods. In particular, agents may be
aware of the effect of their current behavior on their future decisions about risky acts. This model
is appropriate for individuals with short planning horizons, or as an abstract model of risky
behavior among adults where the cost of a failure is simply dying before old age (when no risks
will be taken).
Second, while it demonstrates that data generated by the rational fatalism model can
display non-monotone responses, and that conventional methods of looking for that nonmonotonicity may not perform well, this is done only for a single simulated dataset. A superior
approach would be to generate a large number of such datasets under plausible assumptions
about parameter values, and explore the general effectiveness of various methods. This would in
turn necessitate developing a formal and automated way of testing for non-monotonicity that
works for this model, probably by adapting Bowman et al. (1998). An additional difficulty with
doing this is that many combinations of seemingly-plausible parameter values generate only
corner solutions, which are not amenable to any kind of test and hence uninformative.
Third, I cannot resolve the reverse-causality issue in my preliminary data, since it lacks
an exogenous source of variation in risk beliefs. I argue that correcting it is likely to decrease the
extent of self-protection in measured in the data while increasing the extent of fatalism, so that
this preliminary data is probably underestimating the extent to which Malawians from that
district are fatalistic and not overestimating it. However, it is impossible to know what the true
relationship is without an instrument or an experiment. Future work on this topic should focus on
conducting an experiment which provides information about HIV transmission risks to people in
Malawi and in other places where people substantially overestimate the per-act risk of
contracting HIV.
References
Ahituv, A., Hotz, V. J., & Philipson, T. (1996). The Responsiveness of the Demand for Condoms
to the Local Prevalence of AIDS. The Journal of Human Resources, 31(4), 869–897.
Arunachalam, R., & Shah, M. (2009). Compensated for Life: Sex Work and Disease Risk in
Ecuador (Working Paper). Population Association of America Annual Meeting. Retrieved
from http://paa2009.princeton.edu/abstractViewer.aspx?submissionId=91998
Attanasio, O. (2009). Expectations and Perceptions in Developing Countries: Their Measurement
and Their Use. American Economic Review, 99(2), 87–92.
Auld, M. C. (2006). Estimating behavioral response to the AIDS epidemic. Contributions to
Economic Analysis & Policy, 5(1), 12.
Barnett, T., & Blaikie, P. (1992). AIDS in Africa: Its present and future impact. London: The
Guilford Press.
Bowman, A. W., Jones, M. C., & Gijbels, I. (1998). Testing Monotonicity of Regression.
Journal of Computational and Graphical Statistics, 7(4), 489–500. doi:10.2307/1390678
Chimbiri, A. M. (2007). The condom is an “intruder” in marriage: Evidence from rural Malawi.
Social Science & Medicine, 64(5), 1102–1115.
De Paula, A., Shapira, G., & Todd, P. (2009). How Beliefs about HIV Status Affect Risky
Behaviors: Evidence from Malawi, Second Version.
Delavande, A., & Kohler, H. P. (2009). Subjective expectations in the context of HIV/AIDS in
Malawi. Demographic Research, 20(31), 817–875.
Delavande, A., & Kohler, H. P. (2011). HIV/AIDS-related Expectations and Risky Sexual
Behavior in Malawi (Working Paper).
Dupas, P. (2011). Do Teenagers Respond to HIV Risk Information? Evidence from a Field
Experiment in Kenya. American Economic Journal: Applied Economics, 3(1), 1–34.
Fang, H., Keane, M., Khwaja, A., Salm, M., & Silverman, D. (2007). Testing the mechanisms of
structural models: The case of the Mickey Mantle effect. The American economic review,
97(2), 53–59.
Geoffard, P.-Y., & Philipson, T. (1996). Rational Epidemics and Their Public Control.
International Economic Review, 37(3), 603–624. doi:10.2307/2527443
Godlonton, S., & Thornton, R. (2011). Circumcision, Information, and HIV Prevention.
Godlonton, Susan, Chinkhumba, J., Hayes, B., & Thornton, R. (2010). Circumcision,
Information, and HIV Prevention Study - Urban Sample Dataset. University of Michigan.
Godlonton, Susan, Thornton, R., & Munthali, A. (2009). Circumcision, Information, and HIV
Prevention Study - Rural Sample Dataset. University of Michigan.
Juhn, C., Kalemli-Ozcan, S., & Turan, B. (2009). HIV and fertility in Africa: First evidence from
population based surveys (Working Paper No. Discussion Paper No. 4473). IZA Discussion
Paper Series. Institute for the Study of Labor.
Kaler, A. (2003). “My girlfriends could fill a yanu-yanu bus”: Rural Malawian men’s claims
about their own serostatus.
Leon, G., & Miguel, E. (2011). Transportation Choices, Fatalism and the Value of Life in Africa
(Working Paper). Working Group in African Political Economy. Berkeley: University of
California.
Lillard, L., & Willis, R. (2001). Cognition and wealth: The importance of probabilistic thinking
(Working Paper No. WP 2001-007) (p. 48109–1220). University of Michigan Retirement
Research Center.
Lind, J. T., & Mehlum, H. (2010). With or Without U? The Appropriate Test for a U-Shaped
Relationship. Oxford Bulletin of Economics and Statistics, 72(1), 109–118.
Luke, N., Clark, S., Zulu, E., Westlands, N., & Eliya, K. (2009). The Relationship History
Calendar: Improving the Scope and Quality of Data on Youth Sexual Behavior. Brown
University typescript.
Malawi National Statistical Office, & ORC MACRO. (2010). Malawi Demographic and Health
Survey 2010. Calverton, MD: MEASURE DHS.
NAC. (2009). National HIV/AIDS Prevention Strategy: 2009 to 2013. Lilongwe: Malawi
National AIDS Commission (NAC).
O’Donoghue, T., & Rabin, M. (2001). Risky Behavior among Youths: Some Issues from
Behavioral Economics. NBER Chapters, 29–68.
Oster, E. (2012). HIV and sexual behavior change: Why not Africa? Journal of Health
Economics, 31(1), 35–49. doi:10.1016/j.jhealeco.2011.12.006
Padian, N. S., McCoy, S. I., Balkus, J. E., & Wasserheit, J. N. (2010). Weighing the gold in the
gold standard: challenges in HIV prevention research. AIDS, 24(5), 621–635.
doi:10.1097/QAD.0b013e328337798a
Sterck, O. (2011). Should prevention campaigns disclose the transmission rate of HIV/AIDS?
Theory and evidence from Burundi. Université catholique de Louvain, Institut de
Recherches Economiques et Sociales (IRES). Retrieved from
http://ideas.repec.org/p/ctl/louvir/2010042.html
Stoneburner, R. L., & Low-Beer, D. (2004). Population-level HIV declines and behavioral risk
avoidance in Uganda. Science, 304(5671), 714.
Viscusi, W. K. (1990). Do smokers underestimate risks? Journal of Political Economy, 98(6),
1253–1269.
Wawer, M. J., Gray, R. H., Sewankambo, N. K., Serwadda, D., Li, X., Laeyendecker, O.,
Kiwanuka, N., et al. (2005). Rates of HIV-1 Transmission per Coital Act, by Stage of HIV1 Infection, in Rakai, Uganda. The Journal of Infectious Diseases, 191, 1403–1409.
Yatchew, A. (1997). An elementary estimator of the partial linear model. Economics Letters,
57(2), 135–143. doi:10.1016/S0165-1765(97)00218-8
Table 1: Results of optimal risk-taking on per-act risk, simulated data
Per-Act Risk
(1)
(2)
(3)
(4)
(5)
2.911***
2.573***
2.870***
2.150***
-16.49***
(0.171)
(0.648)
(0.105)
(0.432)
(0.906)
0.425
0.905*
43.84***
(0.785)
(0.488)
(1.954)
-50.07***
-50.10***
-47.72***
(1.012)
(1.012)
(1.561)
(Per-Act Risk)^2
Cost per act
Constant
p-value(Lind-Mehlum U-Shape Test for n and X)
Adjusted R-squared
n
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
4.114***
4.165***
11.63***
11.74***
12.71***
(0.0734)
(0.122)
(0.169)
(0.197)
(0.289)
-
1.000
-
1.000
<0.001
0.0979
0.0978
0.789
0.789
0.797
3000
3000
3000
3000
1547
Table 2: Summary Statistics for Demographics
Mean
Demographics
Peri-Urban
Female
Married
Age
Years of education
Std.
Dev.
Min.
Max.
N
0.05
0.59
0.88
29.46
5.52
0.22
0.49
0.32
8.25
3.03
0
0
0
17
0
1
1
1
49
13
447
447
447
447
444
Literate
0.60
0.49
0
1
447
# of female children
1.47
1.31
0
7
438
# of male children
1.55
1.21
0
6
445
1.16
1.45
0
20
447
3.44
0.90
1
5
442
Household spending in past month, PPP USD
142
232
0
3852
447
Household income in past month, PPP USD**
247
561
0
19369
447
Catholic
0.23
0.42
0
1
447
CCAP
0.13
0.34
0
1
447
Pentecostal
0.13
0.33
0
1
447
Church of Christ
0.11
0.31
0
1
447
Anglican
0.06
0.24
0
1
447
Other Christian†
0.14
0.35
0
1
447
Muslim
0.20
0.40
0
1
447
Lomwe
0.55
0.50
0
1
440
Yao
0.22
0.41
0
1
440
Chewa
0.16
0.37
0
1
440
Other‡
0.07
0.26
0
1
440
Desired # of additional children
Attractiveness [1-5]
*
**
Religion
Ethnic Group
Data taken from Kerwin et al. (2011). Means and standard deviations constructed using sample weights. Summary statistics for
other regression controls available from the author upon request.
* Enumerators (of the same sex as the respondents) rated respondents on how attractive they are, from 1 to 5.
** Constructed by dividing self-reported values in Kwacha by the World Bank ICP PPP exchange rate for 2011, which is 39.46.
† Other Christian includes indegenous Christian churches (2.0%), Baptist (1.6%), 7th-Day Adventist (0.2%), and miscellaneous
(10.3%).
‡ Other includes Mang'angja (2.7%), Nyanja (2.6%), Ngoni (1.3%), Sena (0.4%), and trace numbers of Tumbuka and Shona.
Table 3: Summary Statistics for Subjective Beliefs and Sexual Behaviors
Mean
Subjective Beliefs
Transmission Risks
Share of people resp. finds attractive who are HIV-positive
Prob. of HIV Xmission per unprotected sex act w/infected partner)
Average Prob. of HIV Xmission per act, attractive people
Cost of HIV Infection
Years of life remaining
Years from HIV infection to death, without ARVs
Std.
Dev.
Min.
Max.
N
0.49
0.85
0.43
0.21
0.26
0.24
0
0.01
0
1
1
1
414
423
413
29.96
4.36
17.48
4.09
0
1
82
80
436
423
15.85
12.19
2
100
423
0.62
0.29
0
1
421
11.43
8.50
1.12
100
421
18.86
16.40
0
78
410
Total sex acts in past week
2.11
2.68
0
13
447
Unprotected sex acts in past week
1.93
2.67
0
13
447
Years from HIV infection to death, with ARVs
Share of HIV-positive people who would receive ARVs
Expected years from HIV infection to death
Expected life years lost if resp. gets HIV
Sexual Behaviors
Data taken from Kerwin et al. (2011). Means and standard deviations constructed using sample weights.
Table 4: Regressions of risk-taking on perceived per-act risks, full sample
Average Prob. of HIV Xmission per act, attractive people
(1)
(2)
(3)
(4)
(5)
(6)
(7)
2.833**
-1.046
1.520**
-3.732
1.524
-5.800**
-4.277
(1.304)
(3.508)
(0.759)
(2.588)
(1.133)
(2.494)
(3.274)
(Average Prob. of HIV Xmission per act, attractive people)^2
4.185
5.598**
7.442***
5.818
(4.222)
(2.638)
(2.592)
(3.577)
-0.00804
-0.0156
-0.00419
(0.0221)
(0.0203)
(0.0260)
-0.0267
-0.0129
-0.0841
(0.0415)
(0.0374)
(0.127)
Expected life years lost if resp. gets HIV
(Cost of HIV)*(Prob. HIV per Act)
(Cost of HIV)*(Prob. HIV per Act)^2
0.0793
(0.128)
Constant
0.753
1.401***
-1.850
-0.551
-1.054
0.506
0.349
(0.547)
(0.453)
(3.255)
(3.337)
(3.896)
(3.912)
(3.942)
X
X
X
X
X
†
Controls Used
Std. Errors Clustered by Area
X
X
p-value(Lind-Mehlum U-Shape Test for n and X)
-
0.385
-
0.075
-
0.010
0.096
0.0633
0.0780
0.524
0.535
0.528
0.546
0.545
413
413
380
380
367
367
367
2
Adjusted R
N
Heteroskedasticity-robust standard errors in parentheses. All regressions run using the sampling weights as pweights.
† Controls include gender, age, age squared, education, education squared, number of male and female children, desired future children, marital status, literacy, attractiveness,
attractiveness squared, and peri-urban location, and fixed effects for area, ethnicity, religion, media exposure (TV/radio/newspaper) and enumerator. Coefficients on controls
suppressed due to space considerations but available from author upon request.
*** p<0.01, ** p<0.05, * p<0.1
Table 5: Regressions of risk-taking on perceived per-act risks, excluding “uncertain” respondents
Average Prob. of HIV Xmission per act, attractive people
(1)
(2)
(3)
(4)
(5)
(6)
(7)
2.685*
-3.134
0.910
-6.182**
1.673*
-7.589***
-6.788*
(1.355)
(3.888)
(0.588)
(2.709)
(0.887)
(2.723)
(3.450)
(Average Prob. of HIV Xmission per act, attractive
people)^2
6.126
7.516***
9.511***
8.672**
(4.524)
(2.846)
(2.758)
(3.610)
0.0109
0.00537
0.0107
(0.0157)
(0.0134)
(0.0197)
-0.0175
-0.00118
-0.0369
(0.0336)
(0.0276)
(0.110)
Expected life years lost if resp. gets HIV
(Cost of HIV)*(Prob. HIV per Act)
(Cost of HIV)*(Prob. HIV per Act)^2
0.0385
(0.112)
Constant
0.698
1.596***
-6.952*
-4.511
-7.416*
-4.654
-4.921
(0.544)
(0.456)
(3.560)
(3.341)
(3.881)
(3.459)
(3.568)
X
X
X
X
X
†
Controls Used
Std. Errors Clustered by Area
X
X
p-value(Lind-Mehlum U-Shape Test for n and X)
-
0.217
-
0.0117
-
0.0029
0.0252
0.0783
0.119
0.661
0.681
0.676
0.706
0.705
312
312
293
293
281
281
281
2
Adjusted R
N
Heteroskedasticity-robust standard errors in parentheses. All regressions run using the sampling weights as pweights.
† Controls include gender, age, age squared, education, education squared, number of male and female children, desired future children, marital status, literacy, attractiveness,
attractiveness squared, and peri-urban location, and fixed effects for area, ethnicity, religion, media exposure (TV/radio/newspaper) and enumerator. Coefficients on controls
suppressed due to space considerations but available from author upon request.
*** p<0.01, ** p<0.05, * p<0.1
Figure 1: Shapes of Risk-Aggregation Functions for Low and High Values of Per-Act Risk
Figure 2: Scatterplot of Optimal Risk-Taking by Per-Act Risk, Simulated Data
Figure 3: Partially Linear Regression of Optimal Risk-Taking on Per-Act Risk, Simulated
Data
Figure 4: Scatterplot of Unprotected Sex by Perceived HIV Transmission Risk from
Attractive People, Unweighted
Figure 5: Partially Linear Regression of Unprotected Sex on Perceived HIV Transmission
Risk from Attractive People, All Respondents, Unweighted
Figure 6: Partially Linear Regression of Unprotected Sex on Perceived HIV Transmission
Risk from Attractive People, Peri-Urban Respondents, Unweighted
Figure 7: Partially Linear Regression of Unprotected Sex on Perceived HIV Transmission
Risk from Attractive People, Rural Respondents, Unweighted
Figure 8: Histogram of Perceived HIV Transmission Risk from Attractive People
Figure 9: Partially Linear Regression of Unprotected Sex on Perceived HIV Transmission
Risk from Attractive People, Excluding “Uncertain” Individuals, Unweighted
Figure 10: Risk-Taking with Primary Partner for Respondents who Think Partner is
Probably HIV+
Appendix A: Additional Comparative Statics
This appendix derives two additional comparative statics based on similar methods to those used
in Section 3.3: the response of the optimal value of n to the per-act price p and the cost of the bad
outcome c. These two comparative statics are both negative in sign. The second is directly
analogous to the Oster (2012) result for a linear risk aggregation function P.
Note that the denominator of each expression is simply the second-order condition for an internal
optimum and is therefore strictly negative, and that
is negative by assumption.. This
immediately yields the following intuitive results:
Proposition B1
(Comparative Static I)
The higher the price of a risky act, the less of it people will do. Conversely, the more
negative the price - that is, the more that one is paid to take risks - the more one is willing
to do.
Proposition B2
(Comparative Static II)
Raising the cost of a failure weakly decreases the number of risks taken, and strictly decreases
the number of risks chosen as long as
.
Corruption and the E↵ectiveness of Imported
Antiretroviral Drugs in Averting HIV Deaths
Willa Friedman
Department of Economics, University of California at Berkeley
PRELIMINARY
December 6, 2012
Abstract
This paper looks at the impact of corruption on the e↵ectiveness of antiretroviral drugs
in preventing deaths due to HIV and the potential channels that generate this relationship.
This is based on a unique panel dataset of countries in sub-Saharan Africa, which combines information on all imported antiretroviral drugs from the World Health Organization’s
Global Price Reporting Mechanism with measures of corruption and estimates of the HIV
prevalence and the number of deaths in each year and in each country. Countries with higher
levels of corruption experience a significantly smaller drop in HIV deaths as a result of the
same quantity of ARVs imported. This is followed up with a single case-study from Kenya
to illustrate one potential mechanism for the observed e↵ect, demonstrating that disproportionately more clinics begin distributing ARVs in areas that are predominantly represented
by the new leader’s ethnic group.
1
Introduction
Today antiretroviral drugs are widely available in sub-Saharan Africa, with 8 million peo-
ple receiving treatment in 2011 according to the World Health Organization. Until the last
decade, this level of provision was considered inconceivable as the drugs were prohibitively
expensive, and this enormous expansion in access has been credited with extending the lives
of millions of people across the continent. At the same time, corruption in governments is
associated with inefficient distribution of public goods, and this could limit the e↵ectiveness
1
of imported drugs in saving lives if the drugs do not reach the intended clinics or individual
recipients, or if they are distributed with insufficient guidance.
This paper addresses the role of corruption in determining the e↵ectiveness of antiretroviral drugs in reducing HIV mortality in sub-Saharan Africa. This is first done using a
cross-country analysis comparing the impact of imported drugs on HIV deaths across countries with di↵erent levels of corruption. This is done using an original panel dataset of
countries in sub-Saharan Africa from 2000-2007. This dataset combines standard measures
of corruption used in economics and political science, information about HIV prevalence and
deaths, and records of the quantities of antiretroviral drugs imported into each country. Using year and country fixed e↵ects, this data provides evidence that HIV deaths are reduced
less in corrupt countries given the same quantity of medicine, and the e↵ect is even larger if
the relevant quantity of drugs is measured in dollars spent.
There are many channels through which corruption could influence the e↵ectiveness of
health investments. For example, drugs can be purchased and then diverted either outside
the country or within the country. The supply chain can fail if governments with higher levels
of corruption are generally less capable of delivering public goods. Additionally, corruption
within a government can facilitate targeting of public goods, not based on need, but based
on political or other motivations.
Diversion of drugs could happen if drugs purchased by governments are resold. This could
be particularly lucrative in sub-Saharan Africa for two reasons. First, in nearly all countries
of sub-Saharan Africa, supply is not nearly sufficient to meet demand and so treatment is
rationed. This makes resale valuable because some of those excluded are likely to be willing
to pay for the treatment. Second, because of international agreements with pharmaceutical
companies, ARVs are sold at an enormous discount to governments and NGOs working in
many countries in sub-Saharan Africa. This variation in price between di↵erent countries
creates a substantial opportunity for arbitrage.
Such diversions prevent the drugs from reaching those who need them most, and they
2
may take them out of the country entirely. It should be noted that if these drugs are sold
to others within the same country, then a change in allocation may not reduce the overall
reduction in mortality. However, if they are diverted to those who need them less - perhaps to
those for whom the disease has not progressed as far and their risk of opportunistic infection
is reduced or to those who have another source and want the security of accumulating a
bu↵er stock - then this will prevent the drugs from having the same national impact on
HIV-related mortality.
Studying the impacts of corruption in the context of ARV provision is particularly appropriate for a few di↵erent reasons. First, many important outcomes may be only indirectly
linked to welfare, whereas the relevant outcome in this study of deaths averted is clearly
of direct importance. Second, during this time period there was virtually no domestic production of Antiretroviral Drugs and there is no substitute for these treatments. The next
best alternative (good nutrition and treatment and prevention of opportunistic infections
through antibiotics) does not have nearly the impact on morbidity and mortality that these
drugs do. Therefore, while other studies that look at corruption and goods provision will be
unable to measure the entire supply of those goods, this is possible in this case.
Corruption in government could also limit the e↵ectiveness of local supply chains in a
number of ways. If promotion within the public sector is not based on performance, there is
less incentive for employees to manage transport or work hard at health facilities. Thus the
drugs may remain in the country, but sit unused. Similarly, if health facilities are plagued by
high absenteeism, drugs may either sit idly or be prescribed with insufficient guidance so that
clients are less likely to adhere. Because of its quick rate of mutation HIV is particularly
susceptible to the development of drug resistance due to low adherence to the prescribed
regimen.1 .
An additional channel through which corruption could influence the impact of imported
drugs is through changing allocations within a country. Guaranteeing treatment to those
1
It should be noted that adherence to HIV treatment regimens is generally measured to be quite high in
developing countries (Mills et al, 2006)
3
who have low CD4 counts and therefore have the most compromised immune system is the
most efficient way to immediately avert deaths using ARVs. However, there is also a benefit
to an individual of treatment before the CD4 count is extremely low, and the World Health
Organization recently increased the recommended CD4 count threshold of eligibility from
200 to 350. With the higher threshold, demand for the drugs increases and without sufficient
supply, other systems of allocation besides targeting those with the lowest CD4 count arise.
One notable alternative system of allocation of any health expenditure is political favoritism,
including, for example, targeting core supporters or co-ethnics.
Using data from Kenya about ARV provision before and after an election, I test for
politically motivated targeting of new ARV clinics in one country with high corruption levels
and high HIV rates. This is done using an original dataset containing all health facilities in
Kenya that provide antiretroviral drugs, along with the year in which they began distribution
and their GPS locations. This information is linked with ethnicity records to look for evidence
of targeting of the placement of ARV clinics in the home area of a newly elected political
leader. I find that there are disproportionate clinics opened in areas of the leader’s ethnic
group. This suggests one mechanism through which corruption reduces the impact of health
inputs. Namely, in a country with high corruption, the assignment of ARV clinics follows a
political criterion rather than a public health criterion. Further, this pattern of allocation
appears to reflect additional clinics added to areas that were already served rather than
expanding access to districts that were underserved previously.
This paper is organized as follows. Section 2 discusses the relevant literature and explains
how this paper contributes to it. Section 3 outlines the data to be used for the main
specification and for the analysis of the case study of Kenya. Section 4 presents the empirical
strategy and the results of the cross-country analysis and section 5 discusses the methodology
and results of the case study. Section 6 concludes.
4
2
Literature review
The consequences of corruption are an important area of study in both political science
and economics. Corruption is associated with a number of poor outcomes including weakened
democracy and reduced economic growth (e.g.: Mauro, 1995; Bardhan, 1997). Another crosscountry literature specifically links corruption with public goods and health outcomes (e.g.:
Rajkumar and Swaroop, 2008; Lewis, 2006). There are many mechanisms through which
corruption can hurt health outcomes, and the popular press provides a number of examples
of diversions in public health systems.
A report in Zambia found that an enormous fraction of the money that was provided to
the country by the Global Fund could not be accounted for (“Zambia”, 2011). This money
could have been used to build clinics and distribution networks to facilitate the distribution
of ARVs, but without it, the ARVs would need to be distributed using fewer resources,
possibly causing some drugs to go unused, preventing interventions to increase adherence,
or hurting the ability of the government to target those who most needed treatment.
A Ugandan newspaper reported that in many areas of Uganda in 2011, ARV clinics had
run out of stocks of drugs (Basudde, 2011). If corruption prevents drugs from being restocked
in a timely matter, this can have disastrous impacts on HIV mortality, even if the stocks
return. First, a lack of consistent adherence to ARVs allows the virus within an individual to
develop immunity to the drugs. When treatment is restarted, it is likely to be less e↵ective
at preventing opportunistic infections and keeping the individual alive. HIV is known to
mutate rapidly, facilitating the development of drug resistance. Second, if this individual is
sexually active, this resistance can be transmitted to others. Both factors will reduce the
e↵ectiveness of future ARVs, because the lack of consistent supply allows the virus to develop
drug resistance.
In another article in the same newspaper, alleged corruption prevented a bill to allocate
28.4 billion Ugandan shillings to purchase CD4 count machines. These machines are used to
monitor the progress of HIV in an individual and the e↵ectiveness of treatment (“Corruption
5
feared”, 2011). With the machines, ARVs could be more efficiently administered. These
machines help doctors and clinical officers to determine whether a person has developed
drug resistance and ought to be switched to second line treatment, which would improve the
e↵ectiveness of the treatment and the likelihood of its prolonging life.
A report from the Ministry of Finance in Swaziland illustrated the scale of money lost
due to corruption by showing that it was nearly double the country’s yearly budget for social
services (“Swaziland”, 2011).
One channel through which corruption could reduce the impact of imported drugs is by
preventing targeting based on need in favor of other motives. In order to maximize the
benefit of the drugs, they would need to be distributed in such a way that those who can
use them have sufficient access that they can begin and successfully adhere to treatment. If
corruption allows targeting based on other criteria, this targeting will be weakened, and the
drugs may not lead to those most likely to be helped.
In Zimbabwe, PlusNews reports that HIV positive patients are asked to pay providers
in order to access drugs which are officially distributed free of charge (“Zimbabwe”, 2010).
With this type of targeting, many who could use the drugs may be denied access in favor of
those who either are less likely to be adherent or in less urgent need of the drugs, resulting
in higher rates of mortality due to HIV, even with the same quantity of drugs distributed.
In order to keep prices high, providers may also restrict access, letting some drugs go unused
in order to maximize profits, again resulting in increased mortality.
3
Data
3.1
Cross-country impacts of corruption and ARVs
The data in this paper comes from many sources. For the first section of the paper, all
data is collapsed to a single observation per year in each country in sub-Saharan Africa. The
sample is restricted to one region of the world in order to avoid some - but not all - of the
6
standard concerns with cross-country analysis, and to focus on the region that is the hardest
hit by the HIV epidemic.
The first datasource is used to measure the quantity of drugs entering each country.
This information comes from the WHO Global Price Reporting Mechanism. This is an
online database of all international purchases of drugs associated with HIV/AIDS, malaria,
and tuberculosis going into developing and middle income countries. For each purchase, the
database reports the date of purchase, the country and company of manufacture, the country
of the purchaser, and the price and quantity of each type of drug. This contains records for
approximately 30,000 purchases of antiretroviral drugs across three categories - antiretroviral
drugs, HIV Diagnostics, and HIV prevention. The same drugs with the same dosages are
listed in each category and combining all three reduces threats from misclassification at
the international level of how drugs will be used at the local level.2 The records includes
purchases on the part of governments, NGOs, and researchers.3
The analysis uses two measures of the quantity of drugs entering each country in each
year. The first measure uses standard doses to calculate the quantity of drugs in terms
of person-years. Because some drugs are used in combination with others, this measure
is imperfect and may be higher in countries that use fewer combination pills. The second
measure is the quantity of money spent on all imported ARVs. This is simply the sum of
the costs of all purchases.
HIV statistics come from the UNAIDS/WHO 2008 Report on the Global AIDS Epidemic,
which for each country in each year reports an estimate of the prevalence, the number of
people living with HIV, and the number of deaths due to HIV. While this information may
be flawed, there is no better source of information about the prevalence in all countries.
2
For example, antiretroviral drugs purchased as HIV prevention may be used for Prevention of Mother
to Child Transmission, and as HIV positive babies typically are overcome by the disease quickly, this should
also show up in preventing future deaths.
3
Agreements between drug companies and developing countries set maximum prices that are low if drugs
are purchased by governments or NGOs, but the prices are higher for the private sector. Partly because
of this, the private sector does not import large quantities of ARVs in these countries, but it minimally
participates in distribution of drugs once they are in the country.
7
Governance indicators for Control of Corruption, Government Efficacy, and Rule of Law
are taken from Kaufmann, Kraay, and Mastruzzi (2010). In each year, each country is given
a score for each of these indexes. In order to not rely on small di↵erences, the analysis uses
binary measures of each of these representing an indicator for a score above the mean in
sub-Saharan Africa. GNI per capita is taken from the World Bank’s Human Development
Indicators.
3.2
Case study
The second section of the analysis uses more detailed data obtained from government and
private sources in Kenya, combined with population data from MeasureDHS. Information on
the placement of ARV clinics is from Kenyapharma, a procurement agency, and the National
AIDS and STI Control Program of the Ministry of Health. These reports were provided
directly to the author in the Fall of 2011. The information about ethnic backgrounds of
populations comes from the 2008/2009 Measure DHS data, in which respondents are asked
to report their own ethnicity. The GPS data used to link the two is from the Kenya Open
Data Initiative.
4
4
Empirical Strategy and Results
4.1
Cross-country impacts of corruption and ARVs
This paper will follow previous analysis of cross-country panel-data, including country
and year fixed e↵ects and estimating the coefficient on the interaction of corruption and
quantities of imported drugs. To ensure some reliability of the data, I will first estimate
the following equation to verify that the quantity of drugs is associated with a reduction in
deaths due to HIV:
4
(opendata.go.ke)
8
deathsjt = ↵j +
t
+
1
⇤ ARV sjt +
2
⇤ prevjt +
3
⇤ P LW Hjt + ej
where deathsj t is the number of deaths in year t in country j due to HIV as reported
by the WHO, and ARV sjt is the total quantity of drugs imported in that year according
to the WHO Global Price Reporting Mechanism. In the first set of specifications, this is
included measured in doses (person-years), and in the second set of estimates, the quantity
is reported in dollars spent. Controls are included for the prevalence of HIV (↵j )and the
number of people living with HIV (P LW Hjt ) as well as country and year fixed e↵ects (↵j
and
t ).
The coefficient
1
shows the association between ARVs entering a country and
deaths due to HIV reported in that year.
As reported in columns 1 and 4 of Table 1, this is large and negative and statistically
significant at all standard levels. Column 1 reports the estimates using the number of person
days of drugs as the quantity of ARVs and Column 4 presents the same with the cost of all
imported drugs as the measure of quantity. In both cases, the coefficient is negative and
significant.
To investigate the role of corruption in changing this e↵ect, I focus on the interaction
between the quantity of ARVs and the level of corruption. To do this, I estimate the following
equation:
deathsjt = ↵t +
+
3
j
+
1
⇤ ARV sjt +
2
⇤ corruptionjt
⇤ ARV s ⇤ corruptionjt + ⌃bij ⇤ Xij + ej
where: ARV sjt is the person*years purchased by country j in year t. Corruptionjt is an
indicator for being below the mean (in Africa) on the Kaufmann, Kraay, and Mastruzzi
Control of Corruption Index. To ease interpretation, the quantity of ARVs are demeaned so
9
that the mean is zero. This way the coefficients on the un-interacted terms are meaningful
and can be interpreted as the impact at the mean.
If corruption does limit the reduction in deaths generated by purchased drugs, then
3
should be positive (reflecting a dampened reduction in deaths).
The estimated parameters from this equation are reported in columns 2 and 4 of table
1. In this table, the coefficients on quantities of ARVs are still large and negative and
significant, showing that in less corrupt countries, ARVs reduce deaths due to HIV. The
coefficient on the interaction term in column 2 is positive, but not significant. A positive
coefficient reflects that the impact of ARVs in corrupt countries is lower, but the fact that it
is not significant means that this is inconclusive. In column 4, using spending as the measure
of quantity, the interaction term is positive and significant, and large enough to nearly wipe
out the impact of ARVs on deaths averted. This suggests that corruption does mitigate the
impact of imported ARVs.
Perhaps the variable for corruption is picking up other measures of good governance that
have an e↵ect through di↵erent channels on the impacts of ARVs on deaths. Columns 3
and 6 of table 1 show the same estimates including the country’s GNI and the interaction
between that and the quantity of ARVs. With this included, the coefficient on the interaction
between ARV quantity and corruption is of a similar magnitude, but significant using both
measures of quantity.
Interestingly, the positive coefficients on the interaction between ARV quantity and GNI
imply that wealthier countries may see fewer deaths averted as a result of the same quantity of
ARVs. One possible explanation is that richer countries have met more of the demand within
their countries and the marginal (and average) return is lower as those with less advanced
infection are treated. Although not estimated in this paper, these countries may be treating
those who would not have died immediately otherwise, but the e↵ects on mortality may show
up after a few years.
Table 2 further investigates whether the variable for corruption is a proxy for other types
10
of governance. This table adds a number of measures of government quality alone and
interacted with ARV quantities. These variables are binary measures constructed in the
same way as the measure of corruption coding above average values as 1 and below average
as 0.
The first column uses person days of treatment as the quantity and the second uses
the price. In column 1, the coefficient on the interaction term of ARVs and Corrupt is
still positive and significant, although Good Rule of Law and E↵ective Governance also
have significant relationships when interacted with ARVs. In the second column, only the
interaction with corruption is significant.
Are corrupt countries di↵erent in other ways? Table 3 shows which countries fall in
which categories. Table 4 compares the countries that are more or less corrupt on a variety
of measures. As seen in table 4, more corrupt countries have lower HIV prevalence rates.
They also spend more on ARVs, but for a lower quantity. One possibility is that more
corrupt countries are buying more expensive drugs that may be less likely to be first or
second line treatments, and therefore more valuable since third line and beyond treatments
are rarely widely available.5 Tables 5 and 6 show the breakdown of types of antiretroviral
drugs purchased by more and less corrupt countries. This is measured as the percentage of
all drugs purchased in each category these quantities by the specific type of drug.
Based on these comparisons, the quantity of ARVs imported is clearly not exogenous and
it is possible that the association shown in this cross-country analysis is not causal. The
inclusion of country and year fixed e↵ects deals with many potential threats to endogeneity,
but it cannot handle all of them.
5
ARV treatment becomes ine↵ective for an individual once the HIV in their system develops resistance
to the treatment they are given. Once this happens, a person is given a di↵erent treatment, referred to as
the second line. In developed countries, this process can repeat many times with those who live with HIV
for many years progressing to third, fourth, etc. line treatments.
11
5
Case study in Kenya
If the relationship measured in the previous section is causal, then looking at the mech-
anisms through which corruption changes e↵ectiveness of imported HIV treatment is a pertinent next step. At the same time, identifying specific mechanisms provides additional
support that the relationship is causal.
The fact that controlling for the role of government efficacy does not eliminate the e↵ect
is suggestive that the channel through which corruption has an influence is not in simply
making government programs less efficient generally with poor incentives for performance or
high absenteeism. Instead, this suggests other channels through which drugs are diverted or
allocated inefficiently.
One channel through which corruption could reduce the impact of imported drugs is by
preventing targeting based on need in favor of other motives. In order to maximize the
benefit of the drugs, they would need to be distributed in such a way that those who can
use them have sufficient access that they can begin and successfully adhere to treatment. If
corruption allows targeting based on other criteria, adequate targeting will be weakened, and
the drugs may not reach those most likely to be helped. In this section, I test whether one
type of targeting exists in Kenya, a country consistently listed as in the top half of corrupt
countries in sub-Saharan Africa.
In particular, I look for evidence of selective placement of ARV clinics in Luo areas after
Raila Odinga became Prime Minister in 2008. Previous research has demonstrated that the
match between the ethnicity of leaders and constituents is a strong predictor of the provision
of public goods (Burgess et al, 2009; Kramon and Posner, 2012). In 2008, after a fiercely
contested election for president, followed by allegations of electoral fraud and eventually
by violence, the opposition leader, Raila Odinga, became prime minister. Jablonski (2012)
looks at government spending in areas populated by Odinga’s core supporters after the same
election. This paper uses a similar method, focusing exclusively on ARV clinics.
One concern that has been raised in the literature about targeting of core supporters after
12
a successful election is that in many settings this could be confounded by an inspiration e↵ect.
For example, Marx, Ko, and Friedman (2009) find that African American children in the
US have higher test scores after Obama was elected, and argue that this reflects - not a
channeling of resources - but greater motivation on the part of children who see a role-model
in such an important position. In the case study in this paper, I will test whether political
changes generate changes in investments in health. In particular, I will look at whether there
are more ARV clinics opening in areas with core supporters of Raila Odinga after he became
Prime Minister in Kenya in 2008.
By using a measure of health inputs rather than outcomes, I am able to isolate the impact
through expenditures and channeling of resources, and avoid contamination from inspiration.
This is similar to Burgess et al (2009) who look at road construction in Kenya as a function
of the ethnic match between the constituents and the government.
If there is targeting based on shared ethnicity, then we would expect to see a relative
increase in ARV clinics in Luo areas after the election. To test this, I construct a dataset
in which each observation represents one division in one year.6 For each year between 2004
and 2010 and each of the 225 divisions covered in the 2003 or 2008/2009 DHS survey, this
dataset contains the number of clinics which disburse antiretroviral drugs and an estimate
of the proportion of the population that self-defines as Luo.
This data is used to look for evidence that Luo areas disproportionately received new
clinics after the election, by regressing the number of clinics on the proportion of the population that is Luo, an indicator variable for whether the observation is after the election,
and the interaction of the two. I also include controls for the local HIV prevalence at both
the district and division levels and year and division fixed e↵ects. The coefficient of interest
is the coefficient on the interaction term. Formally, the equation to estimate is:
6
Kenya has provinces subdivided into districts, further subdivided into divisions.
13
N umClinicsjdt = ↵t +
jd
+
1
⇤ P ercentLuojdt +
+ 3 P ostt ⇤ HIV ratejd +
2
⇤ P ercentLuo ⇤ P ostt
4 P ostt
⇤ HIV rated + ✏j
where N umClinicsjdt is the number of health facilities distributing ARVs in division j
of district d in year t. P ostt is a binary variable that is 1 if the observation is from 2008 or
later and 0 if it is earlier. If there exists ethnically-based targeting, one would expect that
2
would be positive and significant.
Columns 1 and 2 of Table 7 shows the estimates of the parameters from the equation
above. The coefficient of interest is the coefficient on the interaction between being a year
after the election and the percent of the population that is Luo. These are reported in
the first row. The first column includes the basic specification without any controls. The
second column HIV rates interacted with P ostt , the indicator for 2008 and later. In each
specification, the coefficient on the interaction term is large, positive and significant. This
provides evidence that Luo areas saw a disproportionate increase in the number of HIV
clinics after the 2007 election.
To better understand the relationship, I replace the outcome with a binary indicator for
whether the division has any ARV clinics, estimating the following equation:
IC linicjdt = ↵t +
+
3
⇤ P ercentLuo ⇤ P ostjdt +
jd
4
+
1
⇤ P ercentLuojdt +
⇤ HIV ratejdt +
5
2
⇤ P ostt
⇤ HIV ratedt + ✏j
where IC linicjdt is an binary variable which takes on a value of 1 in divisions with an
ARV-distributing facility in a given year and 0 otherwise.
Columns 3 and 4 of Table 7 shows the estimates from this equation. Unlike in the
previous table, the coefficients on these interaction terms are consistently insignificant. The
estimates are imprecise enough that it is not possible to conclusively rule out some impact
14
on this margin, but the di↵erence between the two tables is suggestive of an increase in
intensity rather than an expansion to new areas.
The lack of impact on the extensive margin demonstrated in the last columns of 7 is
suggestive of a reduction in welfare as a result of this misallocation. Arbitrary distribution
of scarce resources may not reduce welfare, but this suggests an increase in distribution
without a corresponding increase in access. The degree to which this is true depends on
the degree to which the previously existing ARV-distributing facilities were able to meet the
local demand.
The response of targeting to the ethnic composition is not likely to be linear as specified
above. The analysis from Table 7 is repeated, replacing the dependent variable percent luo
with an indicator for whether the majority of the population is Luo.7 The results, reported
in Tables 8, are qualitatively unchanged.
One explanation for the estimated result is that before the election, Luo areas may have
been disproportionately underserved and the increase was bringing them to where they would
have been otherwise. Limiting the analysis to the years before the election, Table 9 does not
provide evidence that Luo areas were previously underserved.
6
Conclusion
This paper identifies two interesting patterns. First, using a cross-country panel from
sub-Saharan Africa, it shows that corruption is associated with a reduction in efficiency of
imported antiretroviral drugs. The lack of reduction in mortality from the same expenditure
on treatment in relatively more corrupt countries points to a very dangerous consequence of
corruption. Second, using data from within Kenya, I find evidence of political targeting of
HIV treatment, again suggestive of a reduction in efficiency of important health expenditures.
This is extremely preliminary research, and the future direction depends on the credibility
7
The divisions with Luo majorities are in Homa Bay(Kendul Bay, Lake Victoria, Mbita, Ndhiwa, Oyugis,
and Rangwe), Kisumu (Lower Nyakach, Muhoroni, Nyando, Upper Nyakach, and Winam), Migori (Migori
and Nyatike), and Siaya(Bondo, Boro, Rarieda, Ugunja, Ukwala, and Yala)
15
of each piece and whether they ought to be combined or separated. The cross-country
component fits neatly into a literature on the national impacts of corruption on various
outcomes and adds a new outcome with important consequences. The case-study speaks to
a separate literature on political favoritism and targeting of public goods.
There are also a number of ways in which this work could be extended. For the crosscountry analysis, it would be helpful to use additional measures of corruption for robustness.
Further inclusion of the elements that are incorporated into a corruption index could also
help to identify more specific methods. A better understanding of the di↵erences between
di↵erent antiretroviral drugs could illuminate di↵erences in purchases between di↵erent types
of countries to look for di↵erences between types of countries before the drugs are even
received in the country.
The case study could be extended by looking for evidence of di↵erences in outcomes. Using census data, I could obtain better measures of population densities, which combined with
HIV data from MeasureDHS could demonstrate the degree to which - if it all - this targeting
misses out on increasing access to ARVs. Additionally, information about the quantity of
drugs distributed in each facility is available in the data, but still needs substantial cleaning,
could help flesh out the story. Election data from other countries could be combined with
data from MeasureDHS for a coarser measure of targeting - without the detailed information
about ARV-distributing clinics - to get a sense of how generalizable the pattern is.
At this point, this paper presents evidence of large costs of corruption in reducing the
efficiency of health services that are able to save lives. This is combined with suggestive
evidence of one mechanism which may contribute to this relationship. Future work will
expand the analysis of mechanisms through which corruption limits the e↵ectiveness of health
spending in producing outcomes.
16
References
Bardhan, Pranab (1997), “Corruption and Development: A Review of Issues,” Journal of
Economic Literature, 35(3).
Basudde, Elvis (2011), “Where have all the ARVs gone? World Aids Day Supplement” The
New Vision, Kampala, Uganda, December 2, 2011.
Burgess, Robin, Rémi Jedwab, Edward Miguel, and Ameet Morjaria (2009), “Our Turn to
Eat: The Political Economy of Roads in Kenya,” unpublished working paper.
“Corruption feared in sh28b HIV deal” The New Vision, Kampala, Uganda, December 4,
2011.
Jablonski, Ryan (2012), “How Aid Targets Votes: The Impact of Electoral Incentives on
Foreign Aid Distribution,” unpublished working paper.
Kaufmann, Daniel, Aart Kraay, Massimo Mastruzzi, “The Worldwide Governance Indicators:
Methodology and Analytical Issues,” World Bank Policy Research Working Paper No. 5430.
Kramon, Eric and Daniel Posner (2012), “Ethnic Favoritism in Primary Education in
Kenya,” unpublished working paper.
Lewis, Maureen (2006) “Governance and Corruption in Public Health Care Systems,” CGD
Working Paper 78.
Marx, David, Sei Jin Ko, Ray Friedman (2009) “The ‘Obama E↵ect’: How a salient role
model reduces race-based performance di↵erences,” Journal of Experimental Social Psychology 45(4).
Maura, Paolo (1995) “Corruption and Growth,” The Quarterly Journal of Economics,
110(3).
Mills, Edward, Jean Nachega, Iain Buchan, James Orbinski, Amir Attaran, Sonal Singh,
Beth Rachlis, Ping Wu, Curtis Cooper, Lehana Thabane, Kumanan Wilson, Gordon Guyatt,
David Bangsbert (2006), “Adherence to antiretroviral Therapy in Sub-Saharan Africa and
North America: A Meta-analysis,” JAMA, 296(6).
Rajkumar, Andrew Sunil, and Vinaya Swaroop (2008), “Public Spending and Outcomes:
Does Governance Matter?” Journal of Development Economics, 86(1).
“SWAZILAND: Corruption exceeds social services budget,” IRIN, Mbabane, Swaziland, October 12, 2011.
“Zambia: Corruption scandal rocks ARV programme,” PlusNews, Johannesburg, South
Africa, March 14, 2011.
“ZIMBABWE: HIV patients forced to pay up or go without,” PlusNews, Harare, Zimbabwe,
October 5, 2010.
17
7
Figures
Figure 1: ARV clinics in Kenya
8
Tables
18
Table 1: Impact of corruption on e↵ectiveness of ARVs
VARIABLES
ARVs (person years)
(1)
deaths
(2)
deaths
(3)
deaths
-0.0821***
-0.0915**
-0.0832**
(0.0253)
(0.0392)
(0.0409)
0.0356
0.0346*
(0.0254)
(0.0202)
ARVs*Corrupt
ARVs*High GNI
(4)
deaths
(5)
deaths
(6)
deaths
-0.384***
-0.716***
-0.601***
(0.140)
(0.0876)
(0.108)
0.434***
0.396***
(0.120)
(0.107)
0.168***
(0.0440)
Spending on ARVs (1000s)
Spending*Corrupt
Spending*High GNI
1.456*
(0.834)
Corrupt
4,318**
3,538**
4,705***
3,626***
(1,825)
(1,487)
(1,173)
(1,189)
High GNI
HIV prevalence
PLWH
Observations
R2
4,629***
6,249***
(1,414)
(1,574)
1,076
1,113
2,718**
941.1
636.5
2,416**
(1,805)
(1,875)
(1,291)
(1,710)
(1,501)
(946.5)
0.0766**
0.0801***
0.0540***
0.0653**
0.0807***
0.0555***
(0.0298)
(0.0295)
(0.0170)
(0.0259)
(0.0220)
(0.0113)
142
0.998
142
0.998
142
0.998
142
0.998
142
0.998
142
0.999
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
All estimates include both country and year FEs,
and all SEs are clustered at the country level.
19
Table 2: Impact of corruption and other government indicators on e↵ectiveness of ARVs
VARIABLES
(1)
deaths
ARVs (person years)
-0.0621
(2)
deaths
(0.0982)
Spending on ARVs (1000s)
-0.656***
(0.225)
ARVs*Corrupt
0.0148
(0.0918)
Spending*Corrupt
0.410***
(0.128)
ARVs*High GNI
0.177***
(0.0504)
Spending*High GNI
1.427
(0.855)
ARVs*Good Rule of Law
0.0396
(0.168)
Spending*Good Rule of Law
-0.0832
(0.329)
ARVs*E↵ective Governance
-0.0593
(0.112)
Spending*E↵ective Governance
0.136
(0.302)
Corrupt
High GNI
Good Rule of Law
E↵ective Governance
HIV prevalence
PLWH
Observations
R2
3,064
3,452**
(2,389)
(1,472)
4,659***
6,588***
(1,259)
(1,411)
376.8
-664.0
(3,633)
(2,368)
-1,730
-760.8
(2,568)
(1,636)
2,756**
2,172**
(1,341)
(1,032)
0.0526***
0.0571***
(0.0175)
(0.0118)
142
0.998
142
0.999
Robust standard errors in parentheses
*** p<0.01, **20
p<0.05, * p<0.1
All estimates include both country and year FEs,
and all SEs are clustered at the country level
Never Rich
Sometimes Rich
Table 3: Countries by corruption status
Never Corrupt
Sometimes Corrupt
Always Corrupt
Burkina Faso
Benin
Burundi
Eritrea
Comoros
Cameroon
Ghana
Djibouti
Central African Republic
Lesotho
Ethiopia
Chad
Madagascar
Gambia
Cote d’Ivoire
Mali
Guinea
Guinea-Bissau
Mauritania
Liberia
Kenya
Mozambique
Malawi
Niger
Rwanda
Nigeria
Togo
Sierra Leone
Zambia
Sudan
Uganda
Senegal
Sao Tome and Principe
Always Rich
Angola
Zimbabwe
Botswana
Gabon
Republic of the Congo
Cape Verde
Tanzania
Democratic Republic of the Congo
Namibia
Equatorial Guinea
Seychelles
Somalia
South Africa
Swaziland
Countries categorized as rich if GNI is higher than mean and as more corrupt if the control
of corruption score is below the mean.
21
Table 4: Comparison of more and less corrupt countries
Variable
Less Corrupt More Corrupt
HIV prevalence
7.442
4.456
(8.768)
(4.467)
PLWH
607633.33
444457.65
(1320124.3)
(624440.62)
ARVs (person years)
26416.174
24679.665
(60638.485)
(60317.787)
Money for ARVs (1000s)
5481.701
7041.231
(11247.696)
(15658.17)
Table 5: Drugs purchased by country, by dose
Variable
Less Corrupt More Corrupt
Abacavir (ABC)
1.42
1.77
Combination
32.22
23.17
Didanosine (ddI)
4.05
2.96
Efavirenz (EFV)
2.73
21.77
Indinavir (IDV)
.03
.04
Lamivudine (3TC)
2.14
11.33
Nelfinavir (NFV)
.07
.32
Nevirapine (NVP)
21.52
3.99
Ritonavir (RTV)
.1
1.39
Saquinavir (SQV)
.07
.
Stavudine (d4T)
8.04
24.33
Tenofovir (TDF)
0
0
Zidovudine (ZDV)
27.6
8.94
Table 6: Drugs purchased by country, by money spent
Variable
Less Corrupt More Corrupt
Abacavir (ABC)
2.32
5.36
Combination
66.03
60.29
Didanosine (ddI)
5.71
2.2
Efavirenz (EFV)
7.17
16.14
Indinavir (IDV)
.17
1.65
Lamivudine (3TC)
2.78
1.71
Nelfinavir (NFV)
.55
1.11
Nevirapine (NVP)
8.05
2.29
Ritonavir (RTV)
.15
.43
Saquinavir (SQV)
1.05
.
Stavudine (d4T)
.68
4.17
Tenofovir (TDF)
0
1.26
Zidovudine (ZDV)
5.33
3.38
22
Table 7: Targeting of introduction of ARVs in health facilities in Kenya
(1)
(2)
(3)
(4)
VARIABLES
Num. ARV clinics Num. ARV clinics Any ARV clinics Any ARV clinics
Post*PercLuo
2.079***
(0.528)
1.753**
(0.874)
1.553
(2.565)
0.727
(4.841)
Post*HIVdivision
Post*HIVdistrict
0.0442
(0.0760)
-0.0681
(0.161)
-0.132
(0.306)
0.774
(0.685)
Observations
1,568
1,260
1,568
1,260
R-squared
0.721
0.724
0.747
0.748
Clusters
224
180
224
180
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
Standard errors clustered at the division level. All estimates include division and year FEs.
Table 8: Targeting of introduction of ARVs in health facilities in Kenya
(1)
(2)
(3)
(4)
VARIABLES
Num. ARV clinics Num. ARV clinics Any ARV clinics Any ARV clinics
Post*LuoMajority
1.906***
(0.463)
Post*HIVdivision
Post*HIVdistrict
Observations
1,568
R-squared
0.721
Clusters
224
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
Standard errors clustered at the division level.
23
1.631**
(0.714)
1.635
(2.588)
0.792
(4.496)
0.0540
(0.0705)
-0.0388
(0.131)
-0.144
(0.296)
0.684
(0.638)
1,260
0.725
180
1,568
0.747
224
1,260
0.748
180
All estimates include division and year FEs.
VARIABLES
LuoMajority
HIVdivision
HIVdistrict
Table 9: Previously underserved?
(1)
(2)
(3)
Num. ARV Clinics Any ARV clinics Num. ARV clinics
0.237
(0.380)
0.130
(0.858)
2.501
(1.630)
-0.0392
(0.163)
-0.0992
(0.383)
1.148
(0.849)
PercLuo
Observations
720
720
R-squared
0.212
0.266
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
Standard errors clustered at the division level.
All estimates include division and year FEs.
24
(4)
Any ARV clinics
0.0423
(0.829)
1.847
(1.617)
0.448
(0.404)
-0.125
(0.374)
0.914
(0.896)
0.0288
(0.193)
720
0.215
720
0.266
Parental Health, Child Labor and Schooling Outcomes: Evidence from
Malaria and HIV/AIDS Inflicted Region in Tanzania
Shamma Adeeb Alam
University of Washington - Seattle
Abstract
This study examines the impact of parents’ health on child labor and schooling outcomes in
agricultural societies. Despite the effect of numerous fatal diseases on households, there has been
no prior research that focuses on the effect of illness of adult household members on child
education or child labor. Employing longitudinal data from the Kagera region in Tanzania, a
region severely affected by malaria and HIV/AIDS, this study demonstrates that parental
sickness lead to increased child labor among these agricultural households. Consequently, the
increased child labor leads to lower school attendance and enrollment. Subsequently, it
culminates in children failing their grade and in some cases dropping out of school altogether.
This paper explains how the effect on child labor and school enrollment is especially higher
when single-mothers or both parents are sick. Furthermore, the provision of greater household
wealth acts as a buffer against heightened child labor and lower school enrollment when adults
are ill. In other words, households with greater assets are better able to cope with health shocks,
and hence do not need to increase child labor to weather the shock. Additionally, fathers’ illness
lead to greater hours of outside (including farm) employment, while sickness of mothers or both
the parents lead to greater hours in household chores. There is also a gender-difference in hours
worked as girls work more than boys because of parental illness. This gender difference is
mainly present for household chores but not for outside employment. Lastly, older children are
likely to work more in the farm. The paper concludes with policy implications.
1
Introduction
Child labor continues to be a serious problem in developing countries. It is estimated that
there are still over 200 million children laborers worldwide (ILO, 2010). An important
consequence of increased child labor is reduction in children’s schooling. Edmonds (2005)
summarizes many factors that affect child labor and educational outcomes. Although prior
research examines many factors that affect child labor and educational outcomes (Edmonds,
2005), the effect of parental illness have received very little empirical attention. Diseases like
HIV/AIDS and malaria have devastating impacts on adult health with high levels of morbidity
and mortality. Hence, these diseases are likely to affect parent’s decisions on whether to send a
child to school or whether to use them as a child labor to replace a sick adult labor.
Understanding these family decisions can help us better target policies to reducing child labor
and improve education.
This study examines the influence of parental health on children’s labor and schooling
outcomes. Employing longitudinal data from Tanzania, focusing on children aged 7 to 15, this is
the first study to show that parental illness has a direct impact on child labor and educational
outcomes. Illness of fathers and mothers lead to increased hours of child labor, which
consequently leads to reduced school attendance and enrollment. These effects are especially
higher when single-mothers are ill or if both parents are ill. However we find no other consistent
impact on child labor or school enrollment when other adults, such as siblings, grandparents,
uncles and aunts of the child, are ill. This study further finds that greater household wealth acts
as a buffer against greater child labor. In other words, households with greater assets are better
able to cope with adult health shocks and hence do not need to increase child labor to deal with
the shock.
2
Furthermore, this study shows evidence that parental illness leads to greater gender
difference in housework as girls work more than boys when parents are ill. However, there is no
gender gap in the increase in outside employment. Additionally, hours of outside employment of
children increases when only fathers are ill and hours of household chore increases when only
mothers or both the parents are ill. Lastly, I find that older children are more likely work
increased hours and less likely to enroll in school because of adult illness. We use a child-level
fixed effects model to control for household preferences or birth order effects.
This paper makes four contributions to the literature. First, it contributes to the child labor
literature as it shows that parental illness is a significant determinant of child labor. Although
prior studies have shown that numerous factors affect child labor (Edmonds, 2005), there is only
limited evidence on the effect of parental health on child labor. Dillon (2008) finds that only
illness of mothers in Mali increases child labor. However, as his study is based on cross-sectional
data, it does not control for time-invariant child or household level heterogeneity that can bias
results.1
Second, this study contributes to the education literature as it demonstrates that parental
illness lead to lesser school enrollment, and hence fewer children advancing to the next grade.
The only prior study having examined this relationship is Sun and Yao (2010). They find that
illness of parents causes reduced enrollment and completion of middle school. However, their
study is based on retrospective survey that asks households to recall illness and health
expenditure in the prior 15 years, which may lead to reporting errors arising from such long
1
Certain inherent characteristics can cause households are likely to have more sick people and also more child labor.
If we do not control for unobservable household characteristics, it will lead to a biased result.
3
recall periods.2 Sun and Yao (2010) also does not address potential endogeneity issues as it does
not control for household and child’s unobserved characteristics. I employ child fixed effects
model to address the unobserved heterogeneity and endogeneity issues.
Third, my findings contribute to the consumption smoothing literature and permanent
income hypothesis. This literature states that households use savings and borrowing to adjust
their consumption during economic booms and busts. Several studies (Beegle et al, 2006; Duryea
et al, 2007; Guarcello et al, 2010; Janvry et al., 2006) demonstrate that households increase child
labor to smoothen their consumption during income shocks (i.e. sudden loss of income). Child
labor may also be used to smoothen household consumption during adult illness. Adult illness
affects household budgets in two ways: greater health care expenditure and reduced income
through forgone labor hours. Households can cope with these shocks (i.e. illness) using savings,
credits or buffer stock of assets. However, if households lack buffer stock or are credit
constrained, they are likely to resort to increased child labor to cope with adult illness.
Lastly, it contributes to the health policy literature. The health literature documents the
various consequences of illness, including fatal diseases like HIV/AIDS and malaria. This study
demonstrates the effect of these and other illnesses on children’s labor and educational outcomes,
which did not receive prior attention in the literature.
Data
This study uses a panel data survey, named Kagera Health and Development Survey
(KHDS), from the Kagera region in Tanzania. The survey is conducted by the World Bank and
2
Errors may occur as respondents may forget some illnesses in the last 15 years or may not recall the precise time of
the sickness. The study focuses only on major health shocks. However, smaller health shocks can also have a
significant effect on education as shown by my study.
4
the University of Dar es Salaam in four rounds from 1991 through 1994. It surveyed over 800
households, drawn from 51 communities (49 villages) in the six districts of Kagera. The average
interval between each of the survey rounds was between six and seven months. The sample
selection was based on a variable probability sampling procedure (a two-stage, randomized
stratified procedure) based on expected mortality.3 The Kagera region has suffered the
devastation of both HIV/AIDS and malaria. In some parts of Kagera, the HIV/AIDS rate was as
high as 20 percent of adults during the time of the study. Similarly, for malaria, more than 10
percent of children have an adult member of the family who have been diagnosed with malaria.
And an additional 20 percent think that they have malaria as they have malaria type symptoms,
although they did not approach a medical practitioner. As this area has suffered from these fatal
diseases, this region is an ideal area to study of the effects of adult health on children’s
outcomes.
The data contain detailed information on individual and household level demographic
and socioeconomic characteristics, which makes it suitable for this study. It contains data on
child’s age, education and enrollment status. It also provides detailed data on value of household
asset holdings, which include business equipment, durable goods, land, livestock and personal
savings. Additionally, the survey provides detailed time use data in the past seven days of all
household members aged 7 and above, hence allowing us to find the number of hours worked by
children. Furthermore, household members were asked to report any illness that they have
suffered in the past 4 weeks. They were further asked about the number of days of work, if any,
that they have missed because of their illness. As there can be a huge variation in the severity of
illness, I only consider an individual to be ill, if they have missed at least a day’s worth of work.
3
For further details on the sample selection, please refer to World Bank (2004).
5
Using this data on illness of all adult household members aged 18 and above, I find its effect on
child labor and education of children between the age of 7 and 15. Summary statistics of the data
employed are provided in Table-1.
Results
Table 2 demonstrates the impact of parental illness on child labor. I find that parental
illness causes an increase in children’s hours worked by 2 to 4 hours each week, which
corresponds to an approximately 15 to 25 percent increase. The maximum increase of about 25
percent is when single mothers are ill. To further understand the nature of the child labor, I
disaggregate the total hours worked by household chore hours and time spent at external
employment (such as, in the farm) in table 3. The results show that illness of mothers or both the
parents cause an increase in household chore hours, but father’s illness cause an increase in time
spent working outside home.
Although there is an increase in hours worked on average, it is important to study if
parental illness cause children to enter the labor force, who previously were not working.
Therefore, I employ a dichotomous binary variable which indicates if a child is working (1
represents working and 0 represents not working). The results are presented in Table 4. My
estimates indicate that only in dire conditions, which is when both parents are ill or a single
mother is ill, a child is forced to start working, typically in housework rather than farm-work.
Next, we find the impact of parental illness on children’s schooling outcomes. Table 5
shows that parental illness, i.e. if father, mother or both parents being ill, significant reduces
school attendance among children across different age groups. However, in addition to parents, I
also find that illness of grandparents also cause a reduction in children’s school attendance. Next,
6
in table 6, we find the effect of parental illness on children’s school enrollment. The estimates
shows that if the father or mother is ill, children are significantly less likely to be enrolled in
school. Although, both parents being ill has the correct sign, it is not statistically significant,
probably because only a small sample of children have both parents who are ill.
In the last section, we find the parental illness’ differential impact through gender, age
and household wealth. First I find the impact of parental illness on child labor for three age
groups: 7-9, 10-12, and 13-15. I find that older children are more likely to work when their
parents are ill, which is especially true for the middle age group, 10-12. The increase in hours
worked for children aged 10-12 is greater than the age group 13-15 is probably because the
higher age group already works near their full potential and hence are unable to increase their
hours worked as much as the 10-12 age group. Disaggregating the hours worked by farm work
and household chores, I find that older children are more likely to work in the farm when the
father is ill, and more likely to work for household chores when both parents are ill. Lastly,
school attendance also declines for older children because of parental illness.
I also disaggregate the increase in child labor by gender (Table 8), and find that girls are
likely to work more hours compared to boys, when the mother is ill or both her parents are ill.
Girls work more than boys by as much as approximately 4.5 hours, which is about 30 percent
increase in weekly hours worked. The increase in hours worked is predominantly for household
chores. However, somewhat surprisingly, there is no significant difference between boys and
girls in increase in work at the farm when any of the parents are ill.
Lastly, I examine if household wealth mitigate the effect of health shocks (i.e. parental
illness) on child labor. I control for the wealth in the previous period before the health shock has
occurred, i.e. controlling for wealth 6 months prior to the current survey round. I also include an
7
interaction term between household wealth and parental illness, which could capture the
mitigating effect of wealth when there is parental illness in the household. The results are
presented in Table 9. I find that parental illness continue to have a significant effect on the
number of hours worked, not only at the aggregate level, but also at the household and the farm
level. Furthermore, the interaction terms between illness and household assets have a negative
and significant effect when the father or the mother is ill. It indicates that given that a parent is
ill, greater assets would lead to fewer hours of work. This demonstrates that assets act as a buffer
to increased child labor as wealthier households are better able to weather the shocks.
References
Beegle, K., Dehajia, R., & Gatti, R. 2006. “Child labor and agricultural shocks.” Journal of
Development Economics 81: 80-96.
Dillon, A. 2008. Child labor and schooling responses to production and health shocks in
Northern Mali. IFPRI Working Paper.
Dureya, S., Lam, D., & Levison, D. 2007. “Effects of economic shocks on children's
employment and schooling in Brazil.” Journal of Development Economics 84(1): 188-214.
Edmonds, Eric V., 2008. "Child Labor," Handbook of Development Economics, Elsevier.
Guarcello, L., Mealli, F., & Rosati, F.C. 2010. Household vulnerability and child labor: the effect
of shocks, credit rationing, and insurance. Journal of Population Economics 23: 169-198.
Janvry, A., Finan, F., & Vakis, E.S.R. 2006. “Can conditional cash transfer programs serve as
safety nets in keeping children at school and from working when exposed to shocks?” Journal of
Development Economics 79: 349-373.
Sun, A., & Yao, Y., 2010. “Health shocks and children's school attainments in rural China.”
Economics of Education Review 29: 375-382.
8
Table 1: Summary Statistics:
Variable
Mean
Std. Dev.
Hours worked
Hours worked for children with hours>0
18.3
20.1
14.96
14.5
Only father ill
Only mother ill
- single mother ill
- non-single mother ill
Both parents ill
13%
22%
9%
13%
8%
0.34
0.41
0.28
0.33
0.27
Adult sibling ill
Grand parents ill
Uncle/aunts ill
Other household members ill
14%
3%
3%
4%
0.34
0.18
0.16
0.20
Only father ill x Child ill
Only single mother ill x Child ill
Only non-single mother ill x Child ill
Both parents ill x Child ill
Only mother ill x Child ill
5%
4%
5%
3%
8%
0.22
0.19
0.21
0.18
0.27
165583
39%
11.1
1.6
34%
2016109
0.49
2.6
2.0
0.47
7.8
3.7
Per capita asset owned (in Tanzanian shillings)
Number of child illness
Age
Education
Percentage of crop loss
Number of household members
N
5495
Table 2: Effect of parental illness on child labor hours
(1)
Parents ill
(2)
(3)
(4)
3.05**
2.91**
2.71*
(1.52)
(1.50)
(1.47)
2.55**
2.45**
2.29**
(1.16)
(1.17)
(1.15)
4.53***
4.01***
(1.44)
(1.44)
2.32**
2.17*
(1.15)
(1.13)
1.51
3.65
(2.30)
(2.52)
-0.69
-0.71
(1.04)
(1.00)
0.31
0.37
(2.33)
(2.60)
0.70
-0.79
(1.54)
(2.04)
2.96***
(0.73)
Both parents ill
Only father ill
Only mother ill
3.12***
(0.92)
- Single mother ill
- Non-single mother ill
Grand parents ill
Adult sibling ill
Uncle/aunts ill
Other household members ill
Number of household members
-0.52**
(0.24)
Number of observations
5495
5495
5495
5495
Table 3: Effect of parental illness on hours worked in farm and home
Household chore hours
Only father ill
Only mother ill
0.40
0.39
1.84**
1.76**
(0.71)
(0.71)
(0.75)
(0.75)
2.01***
0.81
(0.56)
(0.61)
- single mother ill
- non-single mother ill
Both parents ill
Grand parents ill
Adult sibling ill
Uncle/aunts ill
Other household members ill
Number of HH members
Number of observations
N
Farming hours
2.19**
1.72*
(0.88)
(0.96)
1.85***
0.26
(0.70)
(0.76)
2.66***
2.64***
0.15
0.01
(0.92)
(0.92)
(1.01)
(1.01)
0.87
0.87
2.00
2.02
(1.39)
(1.39)
(1.83)
(1.82)
-0.30
-0.30
-0.37
-0.38
(0.57)
(0.57)
(0.69)
(0.69)
1.06
1.06
-0.67
-0.71
(1.12)
(1.12)
(2.07)
(2.07)
-1.05
(1.10)
-0.37**
(0.14)
-1.05
(1.10)
-0.37***
(0.14)
0.47
(1.41)
-0.14
(0.16)
0.46
(1.40)
-0.14
(0.16)
5495
5495
5495
5495
Table 4: Effect of parental illness on likelihood of child labor (Child previously not working now coming
into child labor)
Only father ill
Only single mother ill
Only non-single mother ill
Both parents ill
Only father ill x Child ill
Only single mother ill x Child ill
Only non-single mother ill x Child ill
Both parents ill x Child ill
Number of observations
Aggregate
Houseehold chores
Employment work
0.012
0.001
0.035
(0.021)
(0.023)
(0.032)
0.061***
0.042*
0.054
(0.022)
(0.026)
(0.039)
0.013
0.014
-0.021
(0.018)
(0.020)
(0.037)
0.083***
0.071**
0.043
(0.027)
(0.032)
(0.051)
-0.004
-0.020
-0.029
(0.026)
(0.030)
(0.044)
-0.028
-0.031
0.013
(0.027)
(0.029)
(0.052)
0.038
0.014
0.081
(0.026)
(0.031)
(0.046)
-0.093***
-0.059*
-0.17***
(0.035)
(0.036)
(0.054)
5495
5495
5495
Table 5: Effect of parental illness on school attendance
Only school going children
Age group 7-15
Both parents ill
Only father ill
Only mother ill
Uncle/aunts ill
Other household members ill
Number of observations
Age group 7-15
Age group 8-15
-0.059*
-0.061*
-0.060*
-0.074*
-0.073*
-0.022
-0.024
(0.04)
(0.036)
(0.037)
(0.037)
(0.040)
(0.040)
(0.032)
(0.034)
-0.120***
-0.12***
-0.116***
-0.115***
-0.116***
-0.115***
-0.067**
-0.076**
(0.04)
(0.038)
(0.038)
(0.038)
(0.039)
(0.039)
(0.030)
(0.033)
-0.057**
-0.063**
-0.068***
(0.02)
(0.025)
(0.025)
Only non-single mother ill
Adult sibling ill
Age group 9-15
-0.059*
Only single mother ill
Grand parents ill
Age group 8-15
All children
-0.067*
-0.081**
-0.081**
-0.038
-0.058
(0.041)
(0.042)
(0.040)
(0.037)
(0.039)
-0.051*
-0.051*
-0.059*
-0.06**
-0.054*
(0.029)
(0.030)
(0.031)
(0.029)
(0.030)
-0.10*
-0.098*
-0.097*
-0.097*
-0.107*
-0.107*
-0.021
-0.035
(0.05)
(0.054)
(0.055)
(0.055)
(0.061)
(0.061)
(0.038)
(0.044)
0.011
0.011
0.022
0.023
0.018
0.019
-0.023
-0.011
(0.03)
(0.028)
(0.028)
(0.028)
(0.029)
(0.029)
(0.028)
(0.029)
0.152
0.152
0.152
0.152
0.163
0.163
0.098
0.137
(0.10)
(0.096)
(0.098)
(0.098)
(0.106)
(0.106)
(0.096)
(0.116)
0.091
0.091
0.084
0.084
0.085
0.086
0.035
0.044
(0.05)
(0.052)
(0.052)
(0.052)
(0.057)
(0.057)
(0.057)
(0.053)
2987
2987
2886
2886
2704
2704
4590
4046
Table 6: Effect of parental illness on school enrollment
Only father ill
Only single mother ill
Only non-single mother ill
Both parents ill
Adult sibling ill
Grand parents ill
Number of observations
Age group 7-15
Age group 8-15
Age group 9-15
-0.039*
-0.048*
-0.058**
(0.025)
(0.028)
(0.027)
-0.015
-0.031
-0.034
(0.029)
(0.031)
(0.029)
-0.044*
-0.042*
-0.032
(0.026)
(0.027)
(0.027)
0.002
0.008
0.004
(0.026)
(0.026)
(0.024)
-0.018
-0.017
-0.014
(0.020)
(0.023)
(0.023)
-0.019
-0.026
-0.084
(0.029)
(0.042)
(0.053)
4590
4046
3533
Table 7: Effect of age group on child labor and school attendance
Aggregate
Both Parents ill x Age 10-12
x Age 13-15
Only father ill x Age 10-12
x Age 13-15
Only mother ill x Age 10-12
x Age 13-15
House hours
School Attendance
0.27
0.64
-0.30
-0.01
0.55
0.60
-0.072
-0.069
(1.92)
(1.92)
(1.19)
(1.19)
(1.20)
(1.19)
(0.05)
(0.052)
2.28
2.78
-0.15
0.09
2.29**
2.51**
-0.094*
-0.086*
(2.00)
(2.00)
(1.36)
(1.37)
(1.20)
(1.20)
(0.05)
(0.054)
4.15***
4.37***
3.64***
3.84***
0.39
0.43
-0.028
-0.026
(1.42)
(1.42)
(0.94)
(0.95)
(0.95)
(0.96)
(0.05)
(0.046)
2.36*
2.65*
2.60**
2.76**
-0.29
-0.15
-0.087*
-0.081*
(1.62)
(1.62)
(1.09)
(1.10)
(0.98)
(0.98)
(0.05)
(0.051)
1.61
0.75
0.92
0.002
(1.35)
(0.92)
(0.87)
(0.05)
-0.36
-0.12
-0.07
-0.056
(1.33)
(0.94)
(0.82)
(0.05)
Only non-single mother ill x Age 10-12
x Age 13-15
Only single mother ill x Age 10-12
x Age 13-15
Number of observations
Farm hours
5495
3.44*
2.68**
0.76
0.013
(1.96)
(1.29)
(1.18)
(0.058)
1.74
1.00
0.85
-0.008
(1.74)
(1.20)
(1.13)
(0.059)
-0.80
-1.92*
1.17
-0.022
(1.55)
(1.07)
(1.21)
(0.089)
-3.22
-1.75
-1.50
-0.135
(2.01)
(1.46)
(1.13)
(0.095)
5495
5495
5495
5495
5495
2886
2886
Table 8: Effect on girl's hours worked
Aggregate
Both parents ill x Daughter
Only father ill x Daughter
Only mother ill x Daughter
2.45
2.67**
2.41**
0.19
-0.11
(1.73)
(1.76)
(1.13)
(1.14)
(1.11)
(1.13)
-0.44
-0.84
-0.01
-0.19
-0.46
-0.68
(1.34)
(1.34)
(0.85)
(0.85)
(0.89)
(0.90)
1.76*
1.57**
0.19
(1.14)
(0.70)
(0.81)
Only non-single mother ill x Daughter
Only father ill
Only mother ill
4.38**
2.80***
1.56
(1.75)
(1.05)
(1.32)
-0.11
0.70
-0.81
(1.48)
(0.91)
(1.05)
-1.56
-0.90
-1.27
-0.90
-0.14
0.16
(2.75)
(2.79)
(1.74)
(1.78)
(1.93)
(1.96)
3.03
3.53
0.41
0.68
2.52
2.76
(2.41)
(2.42)
(1.42)
(1.42)
(1.71)
(1.72)
0.16
-0.43
0.53
(1.94)
(1.16)
(1.38)
Only single mother ill
Only non-single mother ill
Number of observations
Farm hours
3.0*
Only single mother ill x Daughter
Both parents ill
House hours
5495
-2.75
-2.14
-0.65
(3.19)
(1.64)
(2.48)
2.27
0.76
1.45
(2.43)
(1.56)
(1.65)
5495
5495
5495
5495
5495
Table 9: Breakdown of effects by wealth
Only father ill
Only single mother ill
Only non-single mother ill
Both parents ill
Assets x Only father ill
Assets x Only single mother ill
Assets x Only non-single mother ill
Assets x Both parents ill
Number of observations
Total Hours
Household chores
Farm hours
3.99***
(1.41)
6.35***
(2.30)
4.49**
(1.76)
0.68
(2.15)
0.7081
(0.823)
2.28*
(1.370)
2.68**
(1.045)
1.86*
(1.157)
3.17***
(0.97)
3.94**
(1.60)
1.73
(1.18)
-1.22
(1.40)
-0.03***
(0.00)
-4.25
(4.72)
-0.82**
(0.42)
-0.09
(0.35)
-0.035***
(0.003)
0.364
(2.52)
-0.347
(0.318)
0.229
(0.177)
0.003
(0.003)
-4.54*
(2.570)
-0.458
(0.375)
-0.322*
(0.203)
3568
3568
3568