Learning about the Enforcement of Conditional Welfare Programs: Evidence from Brazil∗ Fernanda Brollo Katja Maria Kaufmann Eliana La Ferrara This version: March 2017 Abstract We study the implementation of Bolsa Familia, a program that conditions cash transfers to poor families on children’s school attendance. Using unique administrative data, we analyze how beneficiaries respond to the enforcement of conditionality. Making use of random variation in the day on which punishments are received, we find that school attendance increases after families are punished for past noncompliance. Families also respond to penalties experienced by peers: a child’s attendance increases if her own classmates, but also her siblings’ classmates (in other grades or schools), experience enforcement. As the severity of penalties increases with repeated noncompliance, households’ response is larger when peers receive a penalty that the family has not (yet) received. We thus find evidence of spillover effects and learning about enforcement. ∗ Fernanda Brollo, University of Warwick, CAGE and CEPR, [email protected]; Katja Maria Kaufmann, Mannheim University, CESifo and IZA, [email protected]; Eliana La Ferrara, Bocconi University, IGIER and LEAP, [email protected]. We thank Manuela Angelucci, Josh Angrist, Felipe Barrera-Osorio, Esther Duflo, Caroline Hoxby, Robert Jensen, Leigh Linden, Marco Manacorda, Karthik Muralidharan, Johannes Rincke, Jeff Smith, Michele Tertilt, Eric Weese, conference and seminar participants at Dartmouth, EUDN, Harvard, Michigan, MIT, NBER, Namur, Paris School of Economics, Toulouse School of Economics, Uppsala, Universitat Pompeu Fabra and Yale for helpful comments. Giulia Zane, Simone Lenzu and Emanuele Colonnelli provided excellent research assistance. 1 Introduction Governments around the world increasingly rely on ‘conditional’ welfare programs in many different areas, including unemployment and social assistance benefits, maternity grants, child support and support for asylum seekers. Existing evaluations of conditional programs provide evidence on the combined effect of formal rules and the enforcement of these rules, but not on the role of enforcement itself. Yet understanding such role is indispensable for offering policy advice on program implementation, given that monitoring and enforcement are costly. Furthermore, political or administrative constraints may limit the extent to which governments are able (or willing) to enforce conditionalities. This paper investigates how the enforcement of program conditions affects beneficiaries’ behavior in the context of conditional cash transfer (CCT) programs. CCT’s provide a stipend to poor families as long as they meet certain conditions and have become a widely used tool to fight poverty in low and middle-income countries. We study the implementation of the “Bolsa Familia Program” (BFP) in Brazil, a large-scale CCT that provides monthly subsidies to poor families conditional on school attendance of all school-aged children being at least 85 percent of school days every month. Failure to comply with this condition implies the receipt of up to five warnings and penalties which increase in severity with the number of past violations. The goal of our paper is to analyze whether people respond to the enforcement of BFP conditions by adjusting school attendance and whether they learn about the quality of enforcement from experiences of their peers. From a theoretical point of view, it is not obvious that one should observe an increase in school attendance in response to the receipt of a warning. First, the family may not be able to (or want to) adjust on this margin, e.g., if the fall in attendance was due to a particularly severe shock which persists. Second, even if there is room for adjustment, the timing of it may not necessarily coincide with the receipt of a warning. If families expect that the government will detect and punish noncompliance with probability one, they should react already at the time when they fail to meet the program conditions. If, on the other hand, families believed that this probability were smaller than one, then they should respond after they 1 receive the warning. Apart from responding to their own warnings, families might also respond to their peers’ experiences: learning that someone else was punished for not attending school should lead to updating one’s beliefs about the strictness of enforcement and to increasing school attendance in response. To test these hypotheses we make use of a unique dataset that we compiled from administrative sources. It covers the universe of BFP beneficiaries in the Northeast of Brazil and includes information on child and family characteristics, monthly records on school attendance of each child, warnings received and benefits disbursed for the years 2008-2009. By merging these data with the Brazilian school census, we are also able to identify children of other BFP beneficiaries who are in the same class, grade or school as the family’s own children. This allows us to construct different sets of peers. We present our results in two parts. In both, our outcome of interest is the likelihood that individuals fail to comply with the school attendance condition in a given month (in short, “noncompliance”). In the first part we analyze the extent to which people respond to their own experiences of enforcement. We find that the likelihood of noncompliance decreases after receiving a warning or penalty. The estimated size of the effect is a decrease of 4 percentage points, which represents a 30 percent decrease over the average monthly noncompliance rate. To provide evidence that we identify a causal effect we exploit random variation in the timing of arrival of warnings within each month. The date at which households can withdraw their monthly transfers –and receive the warning message, if applicable– depends on the last digit of the head’s social security number, which is random. We can thus compare households who receive warnings earlier in the month with households who receive them later: the former should display a stronger contemporaneous reaction because they have more time to adjust their children’s school attendance in that month, which is what we find. In the second and main part of the paper we study whether beneficiaries react to warnings received by other households. The goal is to shed light on the importance of information transmission and learning in the context of program enforcement and to quantify potential spillovers that matter for estimating the benefits of stricter enforcement. We start by estimating if the likelihood that a child does not com2 ply responds to the share of “peers” who received warnings in that month or in the month before, controlling for the household’s own warnings. We use different definitions of peers, including BFP beneficiaries who are (i) a child’s own classmates; (ii) classmates of the child’s siblings (typically of different age); and (iii) classmates of the child’s siblings who attend a different school. In all cases we find that warnings received by peers induce a decrease in noncompliance in the order of 1.1 (2.3) percentage points in the same (subsequent) month when all of one’s peers receive a warning. This effect does not seem to be driven by correlated shocks: effects are very similar whether we focus on a child’s own classmates (who are likely to experience similar shocks) or on the classmates of a child’s sibling who attends a different school. Also, as a falsification test we show that there is no evidence of an anticipated response. To investigate the learning process more in depth, we make use of the fact that families progress through different “warning stages” and receive increasingly severe penalties if they repeatedly fail to comply. When we distinguish between peers who get penalized at a lower warning stage than the family’s own, and peers who receive warnings for higher stages, the latter induce a larger decrease in individual noncompliance. This is consistent with our learning interpretation because those warnings convey new information on the likelihood that the government implements higher order punishments. Finally, to further corroborate our learning interpretation, we analyze a different set of peers, that is, BFP beneficiaries who live in the same zip code as the family. People living in the same area are likely to pick up the transfer from the same local service point, so we exploit the quasi-random variation in the day of the month on which beneficiaries can withdraw the money to construct an exogenous set of peers with whom recipients may interact. We then test whether warnings to beneficiaries who live in the same zip code and can cash in the benefit on the same day as family i affect the likelihood of noncompliance of i’s children, after controlling for warnings to BFP families in i’s zip code.1 We find significant effects in the same and in the following month, consistent with the existence of information spillovers among families living in the same area. Also in this case, warnings received by 1 The latter is included to control for possible geographically correlated shocks. 3 others for higher stages than one’s own have stronger effects. The fact that we find evidence of information transmission using two sets of peers that are based on two entirely different identification strategies increases our confidence in the validity of our results. Our paper contributes to several strands of literature. First, there is a relatively small literature on estimating behavioral responses to imperfect program enforcement. Banerjee, Glennerster and Duflo (2008) discuss the implications of imperfect enforcement in the Indian health sector; Rincke and Traxler (2011) and Drago, Mengel and Traxler (2015) analyze spillovers from enforcement of compliance with TV license fees; Lochner (2007) investigates the effect of learning about arrest probabilities on criminal behavior. Earlier work by Black, Smith, Berger and Noel (2003), Van den Berg, van der Klaauw and van Ours (2004) and Lalive, van Ours and Zweimüller (2005) studies the effects of unemployment insurance programs that monitored job search effort. Aside from the difference in context, our paper contributes to the above literature by focusing on the process through which beneficiaries learn about the quality of enforcement, such as the importance of information transmission through different types of signals and via different types of peers.2 Our paper is also related to a large literature on CCT’s. Among others, Schultz (2004), Todd and Wolpin (2006) and Attanasio, Meghir and Santiago (2012) estimate the impact of Progresa/Oportunidades in Mexico; Bourguignon, Ferreira and Leite (2003), De Janvry, Finan and Sadoulet (2011) and Bursztyn and Coffman (2012) study Bolsa Escola, the predecessor of Bolsa Familia in Brazil, while Bastagli (2008), De Brauw et al (2015a,b) and Chioda, de Mello and Soarez (2016) focus on Bolsa Familia. In contrast to the above literature, we focus on the enforcement aspect of CCT’s. There are a few papers which analyze the enforcement of eligibility criteria, i.e. errors of type one and two in terms of families included in or excluded from the program (e.g., Cameron and Shah, 2014), but there is very little evidence on the enforcement aspect of conditionalities. One notable exception is DeBrauw and Hoddinott (2010) who test the direct effect of program conditions in 2 Also, since we do not have direct measures of beliefs about enforcement, we infer them from individuals’ behavior, akin to the approach in Chetty, Friedman and Saez (2013). 4 the context of Progresa by exploiting the fact that some beneficiaries who received transfers did not receive the forms needed to monitor school attendance. They use matching and fixed effect methods to show that the absence of these forms significantly reduced compliance. Within the CCT literature Schady and Araujo (2006), Baird, McIntosh and Özler (2011) and Benhassine et al. (2015) compare the effectiveness of conditional versus unconditional cash transfers (UCT). The first two papers find that conditionality (or the belief of it) improves school attendance, while the latter paper concludes that an unconditional transfer which is “labeled” as education support is equally effective. Our contribution differs from the CCT versus UCT comparison. In a world where conditionality nominally exists but program recipients anticipate that it will not be enforced, one may fail to find a difference in the impact of the two types of programs.3 But this does not imply that conditionality, if enforced, would be uneffective. Our paper is precisely an attempt to understand how people update their beliefs about enforcement after experiencing it (directly or through peers) and change their behavior in response. Finally, several papers analyze peer effects and spillovers in the context of conditional cash transfer programs, e.g., Barrera-Osorio, Bertrand, Linden and PerezCalle (2011) for Colombia; Angelucci et al (2010), Angelucci and De Giorgi (2009) and Bobonis and Finan (2009) for the Mexican program Progresa/Oportunidades. Compared to this literature, our focus is not to identify conventional or “direct” peer effects (such as the effect of peers’ school attendance on own attendance), but on the effect of peer warnings (a signal about the quality of enforcement) on individual attendance. This has the advantage of mitigating some of the identification problems that the conventional peer effect literature has to face.4 The remainder of the paper is organized as follows. In section 2 we provide background information on Bolsa Familia, while in section 3 we present the data and descriptive statistics. In sections 4 and 5 we discuss our empirical strategy and 3 For example, Schady and Araujo (2006) conclude that cash transfers can be effective without monitoring but hypothesize that the effect of unenforced conditions might dissipate once households realize that they will not be punished. 4 Our interest in how people learn about program features from their peers is shared, for example, by Duflo and Saez (2003). 5 results on whether individuals respond to experiencing enforcement and whether they learn from peers, respectively. Section 6 concludes. 2 Background Information on Bolsa Familia The Bolsa Familia Program (BFP) reaches around 14 million Brazilian families, that is 60 million poor people (equivalent to about 30 percent of the Brazilian population) with an annual budget of over 24 billion reais (USD 11 billion, about 0.5 percent of GDP). Thus BFP reaches nearly three times as many people and is about three times as large in terms of budget as the well-known conditional cash transfer program Progresa/Oportunidades. BFP was launched by the Brazilian president Inácio Lula da Silva in 2003 to consolidate four different programs (Federal Bolsa Escola Program, Auxilio Gas, Bolsa Alimentação, Fome Zero) into a single program.5 The implementation of the program has seen a gradual evolution over the years. The election-free year 2005 was used to strengthen the core architecture of the program and to improve the registry of families. In 2006, the Ministry of Social Development (MDS) embarked on initiatives to promote further vertical integration with sub-national CCTs and integrated the conditional transfers paid under the Child Labor Eradication Program (PETI). The agenda for 2007 and beyond was to reinforce the monitoring and verification of conditionalities, to strengthen oversight and control mechanisms and to continue improving the program’s targeting system. Notably, monitoring and enforcement of conditionalities had been relatively weak in the first years of the program, and were strengthened after 2006. The targeting of the program was conducted in two steps. First, there was geographic targeting at the municipal level: the federal government allocated BFP quotas to municipalities according to estimates of poverty. Within municipalities, spatial maps of poverty were used to identify and target geographic concentrations of the poor. The second step was to determine eligibility at the household level. Eligibility was determined centrally by MDS based on household registry data that was collected locally and transmitted into a central database known as the Cadastro 5 For a detailed description of the features of BFP, see Lindert et al. (2007). 6 Unico. Benefits. BFP provides two types of benefits, a “base” and a “variable” transfer that depend on family composition and income. Families with a monthly per capita family income of up to R$60 (US$30) are classified as “extremely poor”, while families with between R$60 and R$120 are classified as “moderately poor”. The base benefit is provided only to families in extreme poverty, regardless of their demographic composition. Both extremely poor and moderately poor families receive a “variable” benefit which depends on the number of children in the family (capped at three to avoid promoting fertility) and on whether the mother is pregnant or breast-feeding. Benefit amounts are as follows. The base benefit amounts to R$ 60 (approximately US$ 30, which is also the per capita income threshold for the extremely poor), and the variable benefit to R$ 20. Benefit amounts and eligibility thresholds are periodically adjusted for inflation. To illustrate the magnitude of the program transfers, think of a family with three children that is right at the threshold to be classified as extremely poor, i.e., they have a monthly per capita income of R$60 and a family income of R$ 300. This family would receive monthly transfers of around R$120, which amounts to 40 percent of their total family income. Conditionality. BFP cash transfers are conditional on all age-relevant family members complying with requirements in terms of school attendance. Each schoolaged child between 6 and 15 has to attend at least 85 percent of school days each month (absence due to health reasons is justified and does not count towards the number of days absent). If a single child fails to meet this requirement in a given month, the family is affected for the whole amount of the transfer, i.e. also for quotas that pertain to other children. This element of “joint responsibility” for children in the same family is a unique feature of BFP, e.g., compared to other well-known CCTs such as Progresa/Oportunidades.6 BFP also has conditions related to health behavior, such as health check-ups for pregnant women and vaccinations for children below age 5. Based on our data, 6 In 2008 a new subprogram, Benefı́cio Variável Jovem (BVJ), was added to BFP to increase school attendance of teenagers aged 16 and 17. BVJ has its own set of conditions and warning system and does not affect the family’s warning stage in the main program (i.e. for the children aged 6 to 15). We only have data for a few months on BVJ, hence we focus on children aged 6 to 15. 7 those conditions are rarely enforced and we do not observe individual responses to enforcement of those conditions (in contrast to having monthly data on children’s school attendance). Therefore we focus on children aged 6 to 15 and analyze responses to enforcement of school attendance conditions. Penalties. The consequences of noncompliance vary depending on the historical record of compliance of each family. In the first case of noncompliance the family receives a warning without any financial repercussion. With the second instance of noncompliance, the family receives a second warning and benefits are blocked for 30 days, after which the family receives the accumulated benefit of the previous and the current month. The third and fourth warnings lead to a loss of benefits for 60 days each time and these benefits are never recovered. Finally, after the fifth warning the benefit is canceled and the family loses eligibility. According to the general rules, the family can re-apply to the program 18 months later. The 18 months rule is also used to “reset” a family warning history, e.g., if a family is in warning stage 2 but then complies with the rules for 18 consecutive months, their warning history goes back to stage zero. Timing of penalties. Each month families can withdraw their transfer money at a local service point with a Bolsa Familia “electronic benefit card”.7 In case of noncompliance, at the time of withdrawal the family receives a warning message which specifies the penalty that the family incurs. To avoid bottlenecks at the local service point, when families pick up their BFP transfers, each family is assigned a pre-specified date each month starting from which the family can withdraw the transfer money (and contextually receive the warning). The exact date in each month is determined by the last digit of the social security number of the legally responsible adult of the family, which is basically random, as we will show. In particular, the ten digits (0,1,...,9) correspond to the ten weekdays in the last two weeks of the month.8 We will exploit this randomness to identify the causal impact 7 Local service points are either Caixa branches or Caixa “correspondents”. Caixa Econômica Federal, a savings and credit union, is the government agency responsible for transferring BFP benefits to the beneficiary families. Caixa has a large network of banking correspondents, which are commercial establishments with a different business focus, used to expand access to remote and particularly poor areas (Kumar et al., 2006). 8 Every family receives a benefit calendar with the information on when they can pick up the 8 of warnings received by a family on school attendance of its children. Quality of information about the BFP program. Families are well informed about transfer amounts, conditionalities and penalties: these aspects are widely and regularly publicized on TV, radio and newspapers in Brazil, and are spelled out in a booklet issued to each beneficiary family (Agenda de Compromissos). In case of noncompliance, at the time of withdrawal of the transfer money, the family receives a message that reports the family’s warning stage, the month of noncompliance to which the warning refers, the names of the child(ren) who failed and which type of warning they might receive in the next instances of noncompliance. An example of a warning message for a family receiving their first warning is: “The family has not complied with the conditionalities of Bolsa Familia for the first time. At this moment the payment will not be blocked. But if you fail to comply again your benefit payment may be blocked, suspended or even cancelled.”9 Note that the text does not say that the payment will be suspended with probability one, hence there is a degree of uncertainty about the actual enforcement of conditionality which is embedded in the system. This uncertainty is further reinforced by the experience of lax enforcement in the early years of the program. The implementation of this conditionality scheme involves different actors. First of all, children’s attendance is recorded by school teachers. The school sends the attendance lists of students to the municipality, reporting the exact fraction of school days attended in case attendance was below 85 percent, otherwise only reporting that the student complied. Each municipality collects the lists and sends them to the Ministry of Education (MEC), which determines whether the family as a whole complied or not in a given month, i.e. whether all children between 6 and 15 attended at least 85 percent of school days. MEC sends a detailed report to the Ministry of Social Development (MDS), which establishes which warning the family should receive in case of noncompliance and whether the family is entitled to the transfer in the different months. 9 In addition to this short paragraph, the warning message repeats the general rules of BFP in a clear and salient manner. It briefly mentions again the program conditions (i.e. of at least 85% of school attendance of all school aged children) and which instance of noncompliance may lead to which type of penalty. 9 transfer for that month (based on the warning stage reached). The MDS sends this information to the Caixa Econômica Federal, a savings and credit union that transfers the benefit amount to the bank account of the family if the family is entitled to receiving the transfer for that month. [Insert Figure 1] These different steps of the process involve time and lead to a delay in terms of the month in which the warning is received compared to the month of noncompliance. Figure 1 illustrates an example where the family failed to comply in the month of February, right after the beginning of the school year, and received a warning in May. We denote the difference between the month of warning and the month of noncompliance as “delay”. There is substantial variation in the extent of delay, which we make use of in our graphical motivation for analyzing the effect of warnings on families’ behavior and in some robustnes checks which we conduct to support our main findings. During our sample period this variable ranges from 2 to 6, with a mean of 3.8 months and a median of 3 months. Appendix Figure A.1 shows the frequency of different delays. As we will show, the variation in delay is orthogonal to household and child characteristics and is entirely driven by time and area effects. 3 Data We make use of a unique dataset that we assembled combining several sources of administrative data. The first source is the household registry (Cadastro Unico) held by the Brazilian Ministry of Social Development (MDS), which contains socioeconomic characteristics of all BFP beneficiary households.10 We also know the eight digit zip code (Código de Endereçamento Postal - CEP) of the neighborhood where the household lives, which we will exploit in our analysis of learning. We have information on the universe of households enrolled in the program during 10 For each household member the Cadastro Unico comprises information on age, gender, race, marital status, education and employment status and for the household as a whole it reports expenditures, house property, garbage collection, connection to running water, electricity etc. 10 2008-2009 in the Northeast of Brazil, one of the poorest and largest regions in the country, comprising 30 percent of the Brazilian population, and with more than half of its inhabitants living in poverty. To conduct our analysis we extracted a 10 percent random sample, yielding a total of 478,511 households in the program.11 A second dataset from MDS contains monthly records of school attendance during 2008-09 for each child monitored for the attendance conditionality, i.e., each child aged 6 to 15. We know whether the child complied with the attendance condition in a given month, and in case of noncompliance we know the exact fraction of days attended below the threshold of 85 percent. This dataset also includes two categories that qualify the absence: one is justifiable absences such as sickness, for which the household does not incur any penalty; the other category is for unjustifiable absences (e.g., child labor, teen pregnancy, etc.) which instead count towards the warnings. A third dataset from MDS contains monthly information on warnings, which allows us to create the complete warning history of each family. In particular, we know in which month they received a warning, the month in which attendance failed to meet the threshold and that gave rise to that particular warning, and the warning stage in which the family is at any given point in time. The fourth administrative dataset we use is payroll data, which contains monthly information on the benefits that the family received and whether the benefit was blocked or suspended in a given month. All four of these datasets can be linked through the social security number (Número de Identificação Social - NIS) of the household member legally responsible for the child or of the child him/herself. We complement the above administrative records on BFP with two datasets on schools. The first is the School Census compiled by the Ministry of Education, which contains information on all children who are enrolled in a given school, grade and class. We merge the School Census with the BFP administrative data using the NIS code of the child when available in the School Census and otherwise based on 11 To improve our precision in estimating the effects of different warning stages, we over-sampled households that were warned at least once during our sample period, and use regression weights to correct for this. Since we include household or individual fixed effects in all our regressions, households that are never warned do not contribute to estimating the effect of the main variable of interest, namely the receipt of a warning. 11 area code, school code, grade, full name and date of birth of the child.12 This allows us to identify the peers who are in the same school, grade and class for each child and who are also recipients of BFP. This is important for our analysis to identify the role of learning from classmates. The final dataset we employ is Prova Brasil, collected by the Instituto Nacional de Estudos e Pesquisas Educacionais Anı́sio Teixeira (INEP) to evaluate the quality of education across Brazilian schools. These data include a questionnaire administered to principals of schools, containing, among other things, information on the policies that the school adopts to inform parents about children’s unexcused absences. We exploit this information in one of the robustness checks for our analysis. [Insert Table 1] Table 1 presents summary statistics of the main variables of interest. A full set of summary statistics for all variables is provided in Appendix Table A.1. The outcome variable we use in our analysis is whether a family fails to comply with BFP rules, i.e., at least one child attends less than 85 percent of school days in a given month. We refer to this variable as “noncompliance”. Panel A of Table 1 shows that on average the likelihood of noncompliance for the household is 8.6 percent in any given month, and the likelihood of receiving a warning is 3.5 percent. In the average month 29 percent of our households are in warning stage 1, which means they have received a message about noncompliance but not lost money; 8.5 percent are in warning stage 2, where the money is blocked and then returned; 3 and 1 percent are in warning stages 3 and 4, respectively, where the transfer is lost for two months, and 0.2 percent reach warning stage 5, when they are expelled from the program. Panel B of Table 1 shows that the average child who is a BFP recipient in our sample has 16 other BFP recipients in her class, 52 in the same grade and 279 in the same school. About 1 percent of one’s classmates who are BFP recipients (‘Peers’) fail to meet the attendance conditionality in the average month, and 1.4 percent receive a warning message (for any of the five warning stages).13 12 13 We are able to match 68% of individuals, 41% based on their NIS. Since warnings are received at the household level, while the fraction of peers not complying 12 4 Own Experience of Enforcement Before analyzing whether individuals learn about enforcement from peers, we investigate how individuals respond to experiencing enforcement themselves. In particular, we test whether households react to a warning by increasing the school attendance of their children. We conduct this analysis at the household level because warnings are applied to the entire family and not to individual children. In Appendix Table A.2 we show that results are very similar if we use the child as the unit of observation. From a theoretical point of view, it is not obvious that one should observe an increase in school attendance in response to the receipt of a warning. The first reason is that the family may not be able to (or not want to) adjust on this margin, e.g., if the fall in attendance was due to a particularly severe and persistent shock. The second reason has to do with the priors held by the household and its updating process. When deciding whether to fulfill program conditions, families compare the costs and benefits of compliance. An essential part of this calculation is the belief they hold about the probability that the government will detect noncompliance and punish them for it. If the family believed this probability to be one, then it should already anticipate the consequences of noncompliance at the time when the child fails to attend school, and should respond accordingly. The increase in attendance –if any– would not necessarily coincide with the receipt of a warning several months later, but plausibly come earlier. If, on the other hand, the household believed that the probability of punishment were smaller than one, then on top of any earlier response it should adjust its behavior upon receipt of a warning, in order to avoid further penalties. Identifying the causal effect of warnings on school attendance is empirically challenging because the receipt of a warning is a function of past school attendance. We therefore resort to a number of alternative strategies to establish a causal link between receipt of a warning and attendance response. are calculated at the individual child level, the fraction of peers in noncompliance is lower than the fraction of peers warned, since not all children in a household fail to comply at the same time. 13 4.1 Graphical analysis We start by visually exploring the pattern of noncompliance in the months before and after the warning. [Insert Figures 2 and 3] Figure 2 pools data from the different types of warnings received by different households and plots the probability that at least one child in the household fails to meet the 85 percent attendance threshold against the number of months since the warning was received. The value 0 on the horizontal axis indicates the month of the warning, −1 is the month before, and 1, 2, ...5 indicate the number of months after the warning was received. As can be seen from the figure, there is a clear discrete downward jump in noncompliance immediately after the receipt of a warning: households respond by increasing attendance of their children, hence the likelihood of noncompliance goes down. The probability of noncompliance remains relatively constant in the months after the warning. Appendix figure A.2 presents the same data disaggregated by warning stage, i.e. separately for households that received their first warning (Warning Stage 1), their second warning (Warning Stage 2), etc. until the fifth and last warning. The pattern displayed in figure 2 is found across all warning stages except for the last one, where the decrease in noncompliance is less discontinuous (note, however, that very few people reach Warning Stage 5). In figure 3 we repeat the exercise conditioning on the time elapsed between noncompliance and the receipt of the warning, i.e. on “delay”. In this figure, we indicate with 0 the month when the family failed to comply with the attendance requirement and with a vertical line the month in which the warning is received. Moving from left to right and from top to bottom, figure 3 shows what happens to rates of noncompliance when the delay is 2, 3, ... up to 6 months.14 In t = 0 the probability of noncompliance is one by construction; from t = 1 onwards this probability decreases to a ‘normal’ level, but once the warning is received, it jumps down and stays at this lower level given the new (higher) warning stage. 14 See Appendix Figure A.1 for the frequency of different delays. 14 In all cases except for the delay of 6 months, we see a discrete jump in the rate of noncompliance immediately after the receipt of a warning: if the delay is 2 months, then immediately after the arrival of the warning in month 2 the likelihood of noncompliance jumps down; if the delay is 3 months, it jumps down in month 3, etc. This is important because it suggests that the adjustment was caused by the receipt of the warning. Other possible reasons (e.g., mean reversion of the shock) would generate a pattern where the adjustment would depend on the distance from the month of noncompliance (month 0) and not necessarily correspond to the month of the warning, i.e., the (shifting) vertical line. In other words, there is no reason why mean reversion should cause a discrete jump downwards which happens with the exact same delay as the warning.15 Also, the fact that we see a response of the household upon arrival of the warning (i.e. a further reduction in the likelihood of noncompliance) suggests that the warning was not fully anticipated. 4.2 Multivariate analysis We next estimate the effects of warnings on noncompliance using multivariate regressions. As a first step, we want to regress noncompliance on the warning stage that a household is in, since the warning stage determines the cost of not complying with conditionality. One challenge in identifying this effect is that some households have a much higher propensity to miss attendance than others, and these are the households who reach higher warning stages. If we simply used cross-sectional variation in the data, this would bias the (negative) coefficient on warnings towards zero or even induce a positive correlation between receiving warnings and failure to comply. We therefore use household fixed effects throughout our analysis to control for time-invariant differences in the propensity to attend school. This implies that we exploit variation within family over time in the warning stage reached, i.e. in the receipt of new warnings. 15 Moreover, as we show in the next section, delay is orthogonal to household and child characteristics (see Appendix Table A.3). 15 Our baseline specification for estimating the effects of own warnings is: Yht = 5 X k αk WS ht + γXht + Dt + Dh + ht (1) k=1 where h denotes the household, t the month, Y is a dummy equal to 1 when at least one child in the household attends less than 85 percent of the school days in a month (“noncompliance”); WS k denotes a dummy equal to one if the household is in warning stage k (with k = 1, ..., 5); X is a vector of household level controls including fraction of male children in the household, number of boys and girls in different age brackets (6-10, 11-15, 16-18); Dt denotes month and year dummies to control for seasonality and time effects; Dh denotes household fixed effects and is the error term. Given the panel nature of our data, we estimate equation (1) and all other regressions in this section using a linear probability model and clustering the standard errors at the household level. [Insert Table 2] Table 2 shows our results. The probability of noncompliance decreases after the household receives a warning and advances to a higher warning stage, confirming the findings of our graphical analysis. The negative coefficients are of increasing magnitude the higher the warning stage, but one should be careful when comparing the effects of different warning stages because those coefficients are estimated by averaging over different subsets of families. For example, the coefficient on warning stage 5 is only estimated for those families who ever get to warning stage 5 in our two-year period of observation, while the coefficient on warning stage 1 is estimated over the set of families who receive at least the first warning during our sample period. These households are likely to differ in unobservable ways, hence we cannot attribute differences in the magnitude of the coefficients solely to the incentive effects of larger penalties. Since our focus here is on whether and how people respond to and learn about enforcement and not on the effects of different warning stages, we leave a more detailed analysis of this issue for future analysis.16 16 To explore the effect of larger penalties, in Appendix Table A.4 we estimate the same regression 16 Appendix Table A.2 shows that the results are qualitatively similar if we conduct the analysis at the individual (child) level, including child level fixed effects and time varying controls.17 Causality To show that we identify a causal effect of warnings on attendance behavior, we exploit random variation in the timing of warning arrival. In particular, we make use of the fact that the exact day of the receipt of warnings depends on the last digit of the social security number of the legally responsible adult in the family, as described in section 2. We conjecture that families who receive warnings earlier should have more time to react to the new information, hence we expect a larger behavioral response for these families compared to those who receive the warning later in the month. In the limit, someone who receives a warning the last day of the month could only adjust if her child’s attendance was just one day below the threshold. To test this conjecture, we split the sample into “early” and “late” warning receivers, where “early” are the households that fall in the first half of the period during which money can be withdrawn within a month and “late” are those that fall in the second half. Appendix Table A.5 shows that observable characteristics are almost perfectly balanced across the two groups (in the few cases where there are significant differences, these are extremely small, at the third decimal digit). We estimate the following modified version of Equation (1): Yht = 5 X k=1 αkE WS htE,k + 5 X k=1 αkL WS htL,k + 5 X E,k βkE WS h,t−1 + k=1 5 X L,k βkL WS h,t−1 +γXht +Dt +Dh + ht k=1 (2) holding the set of households constant by conditioning on the initial and last warning stage that the households reach during our sample period. For example, for households that reach warning stage 3, we can compare the magnitude of the effects of warning stages 1 and 2 (the effect of warning stage 3 cannot be interpreted for this subset of households, since by construction warning stage 3 is the highest warning stage they will reach, which means that their noncompliance rate mechanically decreases thereafter). Appendix Table A.4 shows that higher warning stages have stronger effects in terms of reducing noncompliance for all subgroups. 17 The magnitude of the coefficients is smaller when estimated with individual level data because not all children in a given household fail to comply at the same time. 17 where the superscripts E and L stand for “early” and “late”, respectively. The variable WS htE,k (WS htL,k ) is the interaction between the warning stage and a dummy taking value one if household h is an “early” (“late”) warning receiver. We also include first lags of these variables to estimate delayed household responses. Our prior is that αkE > αkL because during the month when the warning is received households who are warned earlier have more time to adjust the attendance of their children. On the other hand, since “early” and “late” households on average fail to comply at the same rate, we expect “late” households to catch up during the following month, hence we expect βkE < βkL . [Insert Table 3] E Table 3 reports our estimates.18 Coefficients b αkE , b βk for “early” households are L displayed in column 1; coefficients b αkL , b βk for “late” ones in column 2, while they are both estimated in the same equation. Column 3 reports the differences b αkE − E L b αkL and b βk − b βk , with p-values in square brackets. The pattern of the estimated coefficients is clear: households that receive warnings early respond more strongly in the month of warning receipt, while households that receive warnings late catch up in the following month and respond more strongly then. Given the randomness of the “early” vs. “late” classification, it is difficult to rationalize this pattern with explanations other than the fact that the household is responding to the actual receipt of the warning and to its informational content. To provide further support for this interpretation we show in Online Appendix A that the same conclusion holds when using variation in the arrival of warnings across months. In particular, we make use of the fact that the delay of warnings varies over time in a non-monotonic way due to bureaucratic complexities and is orthogonal to household characteristics.19 In Appendix Table A.6 we thus conduct a difference-in-difference analysis comparing households who received warnings 18 The smaller number of observations in Table 3 compared to Table 2 is due to the fact that, by introducing a lag in warning stage, we lose the first month of our sample period. 19 In Appendix Table A.3 we regress delay on household and child level controls, month and year dummies and municipality dummies. We find that time and area fixed effects explain more than 98 percent of the variation and only one out of 23 household and child controls is significant at the 10 percent level. The p-value for the joint test that the coefficients on all household and child characteristics are zero is 0.91. 18 with different lags and show that there is a significant decrease in the probability of noncompliance precisely after the receipt of the warning, in that the receipt of a warning decreases noncompliance by 3.8 percentage points (equivalent to a 30 percent decrease). 4.2.1 Interpretation A possible interpretation of our results so far is that when households receive a warning, they learn not so much that the program is being enforced, but that their children have missed school – something they may not have been aware of. While this is certainly not a concern in our main analysis about learning from peers’ warnings (as discussed in the next section), we test the importance of this concern in Appendix Table A.7 by exploiting variation in school policies regarding communications with families. Specifically, we merge our administrative data with a dataset on schools called PROVA Brasil. PROVA contains some school-level variables that capture whether parents receive information about non-attendance of their children from the school, independently of BFP warnings. We compare schools that inform parents with schools that do not. Under the hypothesis that warnings have an effect because parents learn about non-attendance of their children, the effect of warnings should be smaller in schools where parents are already informed about non-attendance. The results show, however, that there is basically no difference in the effect of warnings between schools which inform parents and schools that do not. This suggests that families’ response to warnings is unlikely to be entirely due to learning about children’s non-attendance.20 Behavioral response vs teachers’ reporting One remaining interpretation issue is the extent to which the behavioral responses we identify correspond to actual 20 Bursztyn and Coffman (2012) present evidence that the conditionality embedded in Brazil’s CCT is a valuable source of information for parents and empowers them in the negotiation with their children. The effect of warnings that we estimate could then be interpreted as resulting from the improved ability of parents to commit to punishing their children when the parents themselves are penalized by the government. While this interpretation would not be inconsistent with the results in Appendix Table A.7, it cannot explain the learning from peers results that we present in the next section. 19 increases in attendance, and not simply to more lenient reporting by teachers, e.g., because parents convince teachers to be less strict in registering non-attendance after they receive a warning.21 A first reaction to this is that even in this case our hypothesis that families learn about the strictness of enforcement would be valid: it would be the action taken by the family that would differ (e.g., persuading teachers instead of sending children to school). Secondly, our results on warnings received by peers –as discussed in the next section– help address this point to some extent. In particular, we find that child i’s noncompliance goes down when the classmates of i’s siblings get warned or when i’s neighbors receive warnings. Since the teacher of child i would not know about those children’s warnings (in particular if the siblings and neighbors’ children attend different schools, for example because they are of a different age), this result is more likely explained in terms of actual child’s attendance as opposed to misreporting. In fact, it may be difficult for families to ask teachers to falsify attendance records simply because someone else in another school has been warned. Third, children have several different teachers who have to register attendance: bribing or convincing all of them may not be easy in the presence of generally high stakes.22 In fact, municipalities face important incentives for proper monitoring and reporting of school attendance: since 2005 the Ministry of Social Development (MDS) has exerted massive efforts to strengthen the monitoring of conditionalities, introducing mechanisms to promote incentives for quality management and rewarding innovations in the decentralized management of BFP (Lindert et al., 2007). Taken all the evidence together, our results suggest that families respond to the enforcement of program conditions and that at least an important part of their behavioral response corresponds to an actual increase in terms of school attendance. To summarize our results so far, we have shown that the receipt of warnings reduces the likelihood that households fail to comply with the 85 percent school attendance requirement. We provided evidence that this effect can be interpreted as 21 Unfortunately, we cannot directly test for misreporting, differently from Linden and Shastri (2012) who rely on external monitors’ verification in a sample of Indian schools. 22 Teachers earn a relatively high salary and it is unclear that parents from poor households could offer a sufficiently large amount of money to induce teachers to falsify records and risk complaints by other students or teachers, and/or risk punishment by the school principal. 20 causal, i.e. that the increase in school attendance is a response to the warning. We now move to the main focus of this paper, namely the question of whether people not only respond to experiencing enforcement themselves, but also learn about the strictness of enforcement from warnings of their peers. 5 Peers’ Experience of Enforcement In this section we analyze whether BFP beneficiaries learn about enforcement from the experiences of peers who receive warnings. In particular, children who are in the same class as one’s own children and their parents are likely to be key sources of information regarding the implementation of the program. To explore these effects we rely on individual children as the unit of observation and construct different sets of peers, comprising the child’s own classmates and the classmates of a child’s siblings. 5.1 Identification Identifying the impact of the warnings received by classmates is a non-trivial issue. To discuss our empirical strategy it is useful to start from a simple specification, which is not the one we estimate, but which helps highlight identification challenges. Consider the following model: Yiht = 5 X k αk WS ht + β PEERWARNiht + γXiht + Dt + Di + iht (3) k=1 where i denotes the child, h the household, t the month; Y is an indicator for whether child i failed to comply in month t, WS k is a set of dummies to denote if the household is in warning stage k; X is a vector of child level controls including age and number of brothers and sisters in different age brackets (6-10, 11-15, 16-18); Dt denotes month and year fixed effects; Di denotes child fixed effects and is the error term. The key regressor of interest is PEERWARNiht , which is the fraction of i’s classmates who receive a warning in month t. Suppose we found –as we do– a negative correlation between a child’s failure to 21 attend in a given month and the fraction of peers who are warned (b β < 0). Before we can interpret this correlation as learning we need to address several identification challenges. Correlated shocks. The first threat to identification are correlated shocks that may directly affect a student and her peers, thus inducing a correlation between PEERWARNiht and iht in equation (3). Consider for example an economic shock leading to an increase in the opportunity cost of schooling in the area where individual i lives. In response to such a shock, both individual i and her peers would be more likely not to comply and thus more likely to receive a warning.23 If the shock was persistent, it would induce individual i and her peers to still not comply several months later, when the warning is received. However, this type of mechanism would generate a positive, not negative, correlation between a child’s noncompliance and her peers’ warnings (b β > 0), while what we find in the data is a negative correlation (b β < 0). If, on the other hand, the shock was mean-reverting, this would lead to a negative correlation between a child’s noncompliance and her peers’ warnings and could thus be confounded with the learning interpretation we offer. To address this concern, we include the lead of peers’ warnings as a falsification test, i.e., we add PEERWARNih,t+1 among the controls in equation (3). Our reasoning exploits the time lag from the moment in which noncompliance occurs and the moment in which the warning arrives (as discussed in section 2, this delay has a median value of 3 months and a mean of 3.8 months). If a child’s attendance increases because of mean-reversion after the initial instance of noncompliance, there is no reason why the attendance would start reverting to the mean with the exact same delay as the warnings of her peers, which vary between 2 and 6 months. In the case of meanreversion, one would typically expect changes in attendance to start occurring before the arrival of the warnings, which implies that we should find a negative and significant coefficient on PEERWARNih,t+1 . Failing to find an effect of the lead variable would be hard to reconcile with the interpretation of mean-reverted shocks: it 23 To address the concern that the arrival of an individual’s own warning might be correlated with the arrival of her peers’ warnings, we always control for own warnings and analyze if peers’ warnings have an additional effect. 22 would mean that the initial shock, which led to noncompliance, starts reverting exactly, say, four months afterwards (when peers happen to receive the warning) but not three months afterwards. An additional strategy we employ to deal with grade or school-specific correlated shocks is to exploit warnings received not only by individual i’s own classmates, but by the classmates of i’s siblings. Since siblings typically have different ages, the effect of warnings received by siblings’ peers should not reflect class or grade-specific shocks. This should be particularly true for a second specification we use, in which we focus on peers of siblings in other schools (being in different schools is mostly due to siblings’ age difference). The warnings received by students of those schools should not be correlated with shocks experienced by child i’s own school, conditional on the warnings of the child’s family and of her own peers. On the other hand, warnings received by siblings’ peers do contain information about the quality of BFP enforcement. We include the maximum fraction of peers who got warned across child i’s siblings, as the strongest signal of enforcement.24 Conventional peer effects. A separate concern relates to direct or ‘conventional’ peer effects. As we have shown in the first part of the paper, once an individual receives a warning, she reacts by reducing the likelihood of noncompliance (i.e. increasing attendance). This means that when a child’s classmates receive warnings, they will attend school more in response to their own warnings. The child may then start attending more because she observes her peers doing so. This response would imply a spillover effect of enforcement on peers’ school attendance, but cannot necessarily be interpreted as learning about enforcement: it might be due to learning about the benefits of schooling or to a preference to attend school with more peers. To identify “learning about enforcement” we pursue two approaches. First, we directly control for the fraction of child i’s classmates who fail to comply with conditionality, to analyze if warnings of i’s peers have an independent effect on i’s likelihood to fail.25 Second, we analyze whether the probability that child i fails 24 The more families in a class receive a warning, the more likely it is that this becomes a topic of discussion among children and parents of beneficiary households. 25 This may lead to an underestimate of the “true” learning effect, if part of the learning about 23 to comply decreases when her siblings’ peers (possibly at a different school) get warned. Since those peers are not child i’s classmates (or not even in child i’s school), the direct effect of their school attendance should not be important. On the other hand, warnings received by siblings’ peers contain relevant information about the strictness of enforcement. To sum up, the two specifications we estimate are: Yiht = 5 X 0 X k αk WS ht + n=−1 k=1 0 X βn PEERWARNih,t+n + ζ n PEERFAILih,t+n +γXiht +Dt +Di + iht n=−1 (4) Yiht = 5 X k αk WS ht k=1 + 0 X βn PEERWARNih,t+n + n=−1 0 X δn MaxPEERWARNih,t+n + γXiht + Dt n=−1 + Di + iht (5) where PEERFAILih,t+n is the fraction of i’s classmates who are BFP recipients and fail to comply in month t; MaxPEERWARNih,t is the maximum fraction of classmates who got warned in month t among i’s siblings. This variable is constructed alternatively from all of i’s siblings or from siblings who attend a different school than i. For both peers’ warnings and peers’ noncompliance we augment the specification with a one-month lag, because it is ex ante difficult to say how persistent the effects may be. We estimate equations (4) and (5) using a linear probability model and clustering the standard errors at the household level (or, as a robustness check, at the school level). 5.2 Main results Table 4 reports our main results on classmates’ warnings. The coefficient on the fraction of classmates warned in column 1 is −0.011 for the contemporaneous variable and −0.023 for the lagged one, both significant at the 1 percent level. This indicates that after more classmates receive a warning, the child is less likely to fail enforcement happens through observing one’s peers’ higher attendance. 24 to comply with the attendance requirement. To assess the magnitude of the effect in relation to the mean: if all of i’s classmates who are BFP recipients received a warning in a given month, child i’s noncompliance would decrease by 13 percent in the same month and 27.5 percent in the following month. 26 [Insert Table 4] In column 2 we include the fraction of classmates warned as lead variable to conduct a falsification test and rule out the importance of correlated shocks and mean reversion. As explained above, if mean reversion were driving our results, we would expect a negative coefficient on the lead of classmates’ warnings. We find instead a precisely estimated zero, lending support to our interpretation. In column 3 we control for the fraction of classmates who fail to meet the attendance threshold and find that both the contemporaneous and the lagged variable are positively correlated with own noncompliance. At the same time, we still find a negative effect of classmates’ warnings that is equally significant and comparable in size to that of the previous specifications. Again, this is consistent with our learning interpretation and suggests that ‘conventional’ peer effects operating through peers’ attendance are not driving our results. In the remaining part of table 4 we focus on warnings received by siblings’ peers. For this analysis we naturally need to restrict the sample to children who have at least one sibling in the age range of BFP conditionality. It turns out that a large fraction of our original sample satisfies this condition, but in column 4 we report our benchmark estimates for the reduced sample to show that the magnitude of the coefficients is virtually the same as in column 1. Column 5 shows that not only an individual’s own peers but also her siblings’ peers matter for her attendance decision: both the contemporaneous and the lagged fraction of siblings’ classmates warned significantly reduces a child’s likelihood not to comply, while controlling for the warnings of the child’s own classmates. The results remain significant and comparable in size for the lag when we exclusively focus on siblings going to a different school (column 6), while the coefficient on 26 Appendix table A.8 presents the same results as table 4, but displays in addition the coefficients on the family’s own warning stage, which we control for in all specifications. 25 the contemporaneous variable is also negative but insignificant. The findings in columns 5 and 6 lend further support to our learning interpretation, as the peers who receive warnings in columns 5 and 6 are attending different grades or schools and are exposed to different shocks. In Appendix table A.9 we present results clustering standard errors at the school level. Standard errors remain virtually unchanged. 5.2.1 Interpretation: informational content of different warnings To gain further insights into the role played by the new information contained in warnings, we analyze how the effect of classmates’ warnings varies depending on the peers’ warning stage relative to the family’s own warning stage. We expect that warnings received by peers who are in a lower warning stage than the family’s own should carry relatively less information, because the family has already experienced first hand that the government enforces the program up to that level. On the other hand, warnings of a level higher than one’s own carry new information, because the family could still hold a prior that the government punishes up to some point, but will not go through with more severe and costly punishments. Peers’ warnings of the same stage as the family’s could still have an effect, for example, if other families receive them earlier in the month or because they improve the precision of the signal. [Insert Table 5] In Table 5 we distinguish between classmates who receive warning 1, 2, ...5 (the variables listed by row) and estimate four different regression equations, conditional on whether a family is currently in warning stage 1, 2, 3 or 4 (the different columns). Clearly, all stages higher than one’s own should be relevant, but the next stage might be particularly important for at least two reasons. First, in our data, the higher the warning stage, the fewer the households that reach it. This implies that more households receive a warning for the next stage, compared to subsequent warnings. Second, if people are present-oriented, the closer in the future is the punishment, the more they care about it. The estimates in Table 5 lend support to our hypothesis that warnings received by peers in higher warning stages have a stronger impact on school attendance 26 than warnings for the same or lower warning stage. For example, in column 1 the coefficient on peers’ warnings for stage 2 is twice as large as that for stage 1. In column 2 the coefficient on peers’ warnings for stage 3 is about one and a half times as large as for stage 2 (similarly for column 3). Only in column 4 we find that while the coefficient of peers’ warnings for stage 4 is significant, the one for stage 5 is not, which is likely due to the fact that extremely few families reach warning stage 5 at all. Overall, these results corroborate our interpretation of learning about enforcement, as it would be difficult to find alternative explanations that produce the asymmetric pattern that we uncover. 5.2.2 Interpretation: different peers While classmates are an important source of information about the strictness of enforcement, another potential source are adults who are BFP beneficiaries and who live in the same area. We hypothesize that families who receive warnings may interact directly, and not only through their children in schools, and focus on a dimension of interaction for which we can find exogenous variation. Consider the interaction among adults living in the same neighborhood. These adults and their families may experience common shocks and hence their children’s attendance behavior may be correlated. However, as we explained in section 2, the day of the month in which households receive warnings is determined by the last digit of the social security number (NIS) of the designated household member. Since this number is as good as random, the set of families living in a given zip code who receive the warning on the same day is also as good as random. Why would this be relevant from an information point of view? Households receive warnings when they pick up the transfer from the local Caixa service point.27 Households that visit the same Caixa point on the same day are more likely to meet and communicate about warnings on the day when any information about enforcement is most salient. We thus test whether compliance of a household is affected by warnings received by other households that live in the same zip code and cash the transfer on the same 27 Kumar et al (2006) report that in the early 2000’s Caixa started expanding their banking correspondent network to reach even remote locations, and that by 2008 (the period studied in this paper) there was roughly one Caixa correspondent per zip code. 27 day, conditional on warnings received by all households in the neighborhood. We estimate the following specification: Yiht = 5 X k=1 k αk WS ht + 0 X βn BANKWARNih,t+n + 0 X ζ n NEIGHBORWARNih,t+n n=−1 n=−1 + δNEIGHBORS iht + γXiht + Dt + Di + iht (6) where all variables are defined as in equation (3), except for the following. We denote the number of BFP beneficiaries in i’s zip code in month t as NEIGHBORS iht , while NEIGHBORWARNiht denotes the fraction of BFP beneficiaries living in i’s zip code who receive a warning in month t. The key regressor of interest is BANKWARNiht , which is the fraction of BFP beneficiaries living in i’s zip code who receive a warning in month t among those whose families can pick up the transfers at the local Caixa point on the same day. By construction, this variable is not based on a peer group which is chosen endogenously. [Insert Table 6] Table 6 reports the results. Column 1 shows that there is indeed a strongly significant negative relation between the fraction of BFP beneficiaries from the same zip code who can pick up their transfers on the same day and individuals’ decision of noncompliance. Based on our estimates, if 10 percent of the BFP recipients who live in the same zip code and can pick up their transfers on the same day received a warning, a child’s probability of noncompliance would decrease by 3.2 percent in the same month and 6.3 percent in the following month. In column 2 we include BANKWARN as lead variable to conduct a falsification test and rule out the importance of correlated shocks and mean reversion. The coefficient on the lead variable is a precisely estimated zero (with a coefficient of −0.0007), lending support to our learning interpretation. In column 3 we control for the fraction of BFP beneficiaries from the same neighborhood who are warned in a given month (NEIGHBORWARN). The fraction of neighbors who are warned on the same day (BANKWARN) is exogenous 28 when controlling for the overall fraction of neighbors warned within that month, as discussed above. We find that NEIGHBORWARN is positively correlated with an individual’s noncompliance, while the lag of this variable is negatively correlated, which might be due to correlated shocks that have some persistence but meanrevert at some point. More importantly, controlling for NEIGHBORWARN does not change our key result, which is that a larger fraction warned on the same day significantly decreases an individual’s likelihood of noncompliance. The point estimates imply that if 10 percent of the BFP recipients who live in the same zip code and can pick up their transfers on the same day received a warning, a child’s probability of noncompliance would decrease by 4.1 percent in the same month and 3.2 percent in the following month.28 We also analyzed how the effect of other households’ warnings varies depending on their warning stage relative to the family’s own warning stage. Analogous to our analysis in section 5.2.1, we find that warnings received by other households who live in the same zip code, can pick up their transfer on the same day and are in higher warning stages have a stronger impact on compliance than warnings for the same or lower warning stage. This is consistent with our learning interpretation: warnings for lower stages than one’s own carry relatively less information, because the family has already experienced them. The results are reported in Appendix Table A.11.29 Overall, the fact that we find consistent effects for two different types of peers that are based on two entirely different identification strategies (partially overlapping networks in the case of classmates and quasi-random variation in the case of 28 In Appendix table A.10 we present results clustering standard errors at the zip-code level. Standard errors are somewhat larger, but all conclusions are unchanged. In particular, the coefficients on the main variable of interest, i.e. on the ”fraction of neighbors warned on the same day”, and on its lag are all significant at the one percent level. 29 In Appendix Table A.11, for households that are in stage 1 (column 1) the coefficients on other households’ warnings for stages 2 and 3 are three and five times larger, respectively, than that for stage 1 (the difference is significant at the 2 and 1 percent level, respectively). In column 2 the coefficient for stage 3 is similar as for stage 2, but the one for stage 4 is more than four times larger (the difference is significant at the 10 percent level). Only in columns 3 and 4 we do not find significant differences, possibly due to the fact that the punishment in the case of the fourth warning is in fact the same as in the case of the third warning, while extremely few families reach warning stage 5 at all. 29 households living in the same zip code), strengthens our confidence in the validity of our approach and interpretation. 6 Conclusions In this paper we study the implementation of the large-scale conditional cash transfer program “Bolsa Familia” in Brazil. This program conditions transfers to poor families on children’s school attendance: when families fail to comply with this requirement, they receive a series of warnings and financial penalties. We analyze how people learn about and respond to the enforcement of program conditions. We find that warnings not only have a direct effect on the families warned, but also important spillover effects on other families, who learn from the experiences of their children’s classmates and from other households living nearby who are warned on the same day. Our finding that people adjust their behavior to the strictness of enforcement implies that not only formal rules but actual enforcement of program conditions is crucial for program effectiveness. This aspect seems particularly important for developing countries, as they might lack administrative capacity or political will to strictly enforce the rules.30 Thus the design of conditional welfare programs should take into account the important dimensions of monitoring and enforcement. References [1] Angelucci, M. and G. D. Giorgi (2009), “Indirect effects of an aid program: How do cash transfers affect ineligibles’ consumption?”, American Economic Review, 99(1), 486–508. [2] Angelucci, M., G. D. Giorgi, M. Rangel, and I. Rasul (2010). “Family networks and school enrollment: Evidence from a randomized social experiment”, Journal of Public Economics, 94(3-4), 197–221. 30 Brollo, Kaufmann and La Ferrara (2015) study the electoral costs of enforcement and the political incentives for manipulating the implementation of BFP conditionality in Brazil. 30 [3] Attanasio, O. P., C. Meghir, and A. Santiago (2012), “Education choices in Mexico: Using a structural model and a randomized experiment to evaluate Progresa”, Review of Economic Studies, 79(1), 37-66. [4] Baird, S., C. McIntosh and B. Özler (2011), “Cash or condition? Evidence from a cash transfer experiment”, The Quarterly Journal of Economics, 126(4), 1709–1753. [5] Banerjee, A. V., R. Glennerster, and E. Duflo (2008), “Putting a band-aid on a corpse: Incentives for nurses in the Indian public health care system”, Journal of the European Economic Association, 6(2-3), 487–500. [6] Barrera-Osorio, F., M. Bertrand, L. L. Linden, and F. Perez-Calle (2011), “Improving the design of conditional transfer programs: Evidence from a randomized education experiment in Colombia”, American Economic Journal: Applied Economics, 3(2), 167–195. [7] Bastagli, F. (2008). “Conditionality in public policy targeted to the poor: Promoting resilience?”, Social Policy & Society, 8 (1). [8] Benhassine, N., F. Devoto, E. Duflo, P. Dupas and V. Pouliquen (2015) “Turning a Shove into a Nudge? A ”Labeled Cash Transfer” for Education”, AEJ: Economic Policy. [9] Black, D., J. Smith, M. Berger and B. Noel (2003) “Is the Threat of Reemployment Services More Effective than the Services Themselves? Evidence from Random Assignment in the UI System.”, The American Economic Review, 93(4), 1313–1327. [10] Bobonis, G. J. and F. Finan (2009). “Neighborhood peer effects in secondary school enrollment decisions”, The Review of Economics and Statistics, 91(4), 695–716. [11] Bourguignon, F., F. Ferreira, and P. G. Leite (2003). “Conditional cash transfers, schooling and child labor: Micro-simulating brazil’s bolsa escola program”, The World Bank Economic Review, 17(2). 31 [12] Brollo, F., K. M. Kaufmann and E. La Ferrara (2015), “The Political Economy of Program Enforcement: Evidence from Brazil”, mimeo, Bocconi University, Mannheim University and University of Warwick. [13] Bursztyn, L. and L. C. Coffman (2012). “The Schooling Decision: Family Preferences, Intergenerational Conflict, and Moral Hazard in the Brazilian Favelas”, Journal of Political Economy, 120(3), 359-397. [14] Cameron, L. and M. Shah (2014). “Mistargeting of Cash Transfers, Social Capital Destruction, and Crime in Indonesia”, Economic Development and Cultural Change, 62(2), 381-415. [15] Chetty, R., J. Friedman and E. Saez (2013), “Using Differences in Knowledge Across Neighborhoods to Uncover the Impacts of the EITC on Earnings”, American Economic Review, 103(7), 2683-2721. [16] Chioda, L., J. M. P. de Mello and R. Soarez (2016) ”Spillovers from Conditional Cash Transfer Programs: Bolsa Familia and Crime in Urban Brazil”, Economics of Education Review. [17] De Janvry, A., F. Finan, and E. Sadoulet (2011). “Local Electoral Incentives and Decentralized Program Performance”, Review of Economics and Statistics, 94(3), 672-685. [18] DeBrauw, A. and J. Hoddinott (2010). “Must conditional cash transfer programs be conditioned to be effective? the impact of conditioning transfers on school enrollment in Mexico”, Journal of Development Economics. [19] DeBrauw, A., D. Gilligan, J. Hoddinott and S. Roy (2015a). “The impact of Bolsa Familia on education”, World Development , 70(6): 303-316. [20] DeBrauw, A., D. Gilligan, J. Hoddinott and S. Roy (2015b). “Bolsa Familia and household labor supply”, Economic Development and Cultural Change , 63(3): 423-457. [21] Drago, F., F. Mengel, and C. Traxler (2015) “Compliance Behavior in Networks: Evidence from a Field Experiment”, IZA DP No. 9443, October 2015. 32 [22] Duflo, E. and E. Saez (2003). “The role of information and social interactions in retirement plan decisions: Evidence from a randomized experiment”, Quarterly Journal of Economics. [23] Kumar, A., A. Nair, A. Parsons and E. Urdapilleta (2006) “Expanding Bank Outreach through Retail Partnerships Correspondent Banking in Brazil”, World Bank Working Paper No. 85 . [24] Lalive, R., J. van Ours and J. Zweimüller (2005) “The Effect of Benefit Sanctions on the Duration of Unemployment”, Journal of the European Economic Association, 2005, 3(6): 1386-1417. [25] Linden, L. and K. Shastri (2012), “Grain Inflation: Identifying Agent Discretion in Response to a Conditional School Nutrition Program.”Journal of Development Economics, 99(1), 128-138. [26] Lindert, K., A. Linder, J. Hobbs, and B. de la Brière (2007). “The nuts and bolts of brazil’s bolsa famı́lia program: Implementing conditional cash transfers in a decentralized context”, The World Bank, Social Protection Working Paper No. 0709. [27] Lochner, L. (2007). “Individual perceptions of the criminal justice system”, American Economic Review, 97(1), 444–460. [28] Rincke, J. and C. Traxler (2011). “Enforcement spillovers”, The Review of Economics and Statistics, 93(4), 1224–1234. [29] Schady, N. and M. C.Araujo (2006). “Cash transfers, conditions, school enrollment, and child work: Evidence from a randomized experiment in Ecuador”, World Bank Policy Research Working Paper 3930, The World Bank. [30] Schultz, P. T. (2004). “School subsidies for the poor: Evaluating the mexican Progresa poverty program”, Journal of Development Economics. 33 [31] Todd, P. E. and K. I. Wolpin (2006). “Assessing the impact of a school subsidy program in Mexico: Using a social experiment to validate a dynamic behavioral model of child schooling and fertility”, American Economic Review. [32] Van den Berg, G. J., B. van der Klaauw, and J. C. van Ours (2004). “Punitive sanctions and the transition rate from welfare to work”, Journal of Labor Economics, 22, 211–241. 34 APPENDIX Figure 1: Timing of Noncompliance and Warning Figure 2: Response to warning .15 Fraction Noncompliant .2 .25 .3 Response to Warnings −1 0 1 2 3 Months since Warning 4 5 ® Warning 35 Figure 3: Response to warning, by delay Response to Warnings with Different Delays .1 .1 .1 Fraction Noncompliant .2 .3 .4 .5 .6 .7 Delay of 4 Months Fraction Noncompliant .2 .3 .4 .5 .6 .7 Delay of 3 Months Fraction Noncompliant .2 .3 .4 .5 .6 .7 Delay of 2 Months Noncompl. Warning 0 1 2 3 4 5 6 Noncompl. 0 1 5 6 7 Noncompl. 0 1 2 Warning 3 4 5 6 7 8 Delay of 6 Months .1 .1 Fraction Noncompliant .2 .3 .4 .5 .6 .7 Fraction Noncompliant .2 .3 .4 .5 .6 .7 Delay of 5 Months Warning 2 3 4 Noncompl. Warning 0 1 2 3 4 5 6 7 8 9 Noncompl. Warning 0 1 2 3 4 5 6 7 8 9 10 ® Table 1: Summary statistics Variable A. Own Warnings Noncompliance Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Warned Delay of Warnings B. Peer Warnings Frac of Classmates Warned Frac of Classmates Noncompl. No of Peers in School No of Peers in Grade No of Peers in Class Observations Mean Std. Dev. 8090769 8090769 8090769 8090769 8090769 8090769 8090769 267626 0.086 0.292 0.085 0.029 0.011 0.002 0.035 3.809 0.280 0.455 0.279 0.169 0.103 0.041 0.184 1.398 5914381 5914381 5914381 5914381 5914381 0.014 0.010 279.407 51.970 16.533 0.059 0.046 222.976 50.544 6.752 36 Table 2: Effect of own warnings on attendance Dependent Variable: Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Controls Household FE Time FE No. Obs. R-squared Noncompliance (in a Given Month) -0.0624*** (0.0000) -0.1303*** (0.001) -0.2050*** (0.002) -0.2656*** (0.003) -0.3477*** (0.008) Yes Yes Yes 8,090,769 0.23 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: fraction of male children, number of boys and girls in the household in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies and household fixed effects. 37 Table 3: Effect of own warnings, random variation in timing Dependent Variable: Timing: Warning Stage 1 * Timing Warning Stage 2 * Timing Warning Stage 3 * Timing Warning Stage 4 * Timing Warning Stage 5 * Timing Lag Warning Stage 1 * Timing Lag Warning Stage 2 * Timing Lag Warning Stage 3 * Timing Lag Warning Stage 4 * Timing Lag Warning Stage 5 * Timing Controls Household FE Time FE No. Obs. R-squared Noncompliance in a Given Month Early Late Diff (Early-Late) Coeff/(S.E.) Coeff/(S.E.) Diff/[p-val] -0.0253*** -0.0112*** -0.0141 (0.001) (0.001) [0.0000] -0.0474*** -0.0052*** -0.0422 (0.002) (0.002) [0.0000] -0.1267*** -0.0633*** -0.0634 (0.004) (0.003) [0.0000] -0.1887*** -0.1168*** -0.0719 (0.007) (0.006) [0.0000] -0.1641*** -0.0955*** -0.0686 (0.016) (0.013) [0.0002] -0.0356*** -0.0484*** (0.001) (0.001) -0.0895*** -0.1230*** (0.002) (0.002) -0.0930*** -0.1438*** (0.004) (0.003) -0.0877*** -0.1513*** (0.007) (0.006) -0.2134*** -0.2715*** (0.018) (0.015) Yes Yes Yes 7,551,546 0.23 0.0128 [0.0000] 0.0335 [0.0000] 0.0508 [0.0000] 0.0636 [0.0000] 0.0581 [0.0086] Notes: ”Early” (”Late”) indicates households that withdraw the transfer -and receive warnings, if applicable- in the first (second) half of the withdrawing period during a month. Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: fraction of male children, number of boys and girls in the household in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. All specifications include household fixed effects. 38 Table 4: Effect of classmates’ warnings Dependent Variable: Sample Benchmark Fraction of Classmates Warned Lag of Frac of Classmates Warned (1) -0.0112*** (0.002) -0.0231*** (0.002) Lead of Frac of Classmates Warned Fraction of Classmates Noncompliant Lag of Frac of Classmates Noncompliant 39 Max Frac of Siblings’ Classmates Warned Lag Max Frac of Siblings’ Classmates Warned Max Frac of Siblings’ Classmates Warned (Only Sibs at Other Schools) Lag Max Frac of Siblings’ Classmates Warned (Only Sibs at Other Schools) Controls for Own Warning Stage Controls Individual FE Time FE No. Obs. R-squared Yes Yes Yes Yes 5,914,114 0.28 Noncompliance in a Given Month All Children Children with Siblings Placebo Control for Benchmark Siblings’ Classmates Siblings’ Classmates Peers’ Noncompl. Other Schools (2) (3) (4) (5) (6) -0.0120*** -0.0100*** -0.0106*** -0.0085*** -0.0099*** (0.002) (0.002) (0.002) (0.003) (0.003) -0.0236*** -0.0199*** -0.0227*** -0.0189*** -0.0191*** (0.002) (0.002) (0.002) (0.002) (0.003) 0.0001 (0.002) 0.2807*** (0.021) 0.0952*** (0.009) -0.0047** (0.002) -0.0076*** (0.002) -0.0013 (0.002) -0.0061** (0.003) Yes Yes Yes Yes 5,628,700 0.27 Yes Yes Yes Yes 5,914,114 0.28 Yes Yes Yes Yes 4,902,847 0.28 Yes Yes Yes Yes 4,902,847 0.28 Yes Yes Yes Yes 4,902,847 0.28 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. All specifications include individual fixed effects. Table 5: Information content of classmates’ warnings Dependent Variable: Own Warning Stage: Frac Classmates Warned (WS 1) Frac Classmates Warned (WS 2) Frac Classmates Warned (WS 3) Frac Classmates Warned (WS 4) Frac Classmates Warned (WS 5) Controls Individual FE Time FE No. Obs. R-squared Noncompliance in Given Month 1 2 3 4 -0.0123*** -0.0139 -0.0118 0.0318 (0.003) (0.009) (0.016) (0.028) -0.0245*** -0.0203** -0.0171 -0.0693 (0.007) (0.009) (0.033) (0.044) -0.0188 -0.0308* -0.0501** -0.0392 (0.012) (0.017) (0.021) (0.051) -0.0197 -0.0386 -0.0787* -0.0590** (0.022) (0.026) (0.043) (0.027) 0.0274 0.0840 0.0392 0.1151 (0.036) (0.070) (0.088) (0.123) Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes 1,993,576 639.362 232.623 88.697 0.46 0.52 0.52 0.54 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. 40 Table 6: Effect of warnings of neighbors in the bank Dependent Variable: Frac in Bank Warned Lag of Frac in Bank Warned Lead of Frac in Bank Warned Frac Neighbors Warned Lag of Frac Neighbors Warned No of BFP Neighbors Controls for Own Warning Stage Controls Individual FE Time FE Observations R-squared Noncompliance in Given Month Benchmark Placebo Controls for Neighbors Warned (1) (2) (3) -0.0275*** -0.0275*** -0.0349*** (0.004) (0.004) (0.006) -0.0548*** -0.0547*** -0.0284*** (0.003) (0.003) (0.004) 0.0007 (0.004) 0.0164* (0.009) -0.0609*** (0.007) -0.0000*** -0.0000*** -0.0000*** (0.000) (0.000) (0.000) Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes 4,813,029 4,813,029 4,813,029 0.21 0.21 0.21 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies and individual fixed effects. 41 Online Appendix – Not For Publication A. Difference-in-Difference Analysis Further insights into the causal nature of the estimates can be gained by making use of variation in the delay of warnings across months. As discussed, the variation in delay is almost entirely explained by time and area fixed effects and there is no correlation with household or child characteristics. We thus estimate a difference-indifferences model that is the regression counterpart of figures 2 and 3. In particular, we compare two groups of households: both groups failed to comply in the same month, but they receive their warnings with different delays. We then compare the behavior of households who have already been warned (the “treated” households) with those who have not yet received their warning (controls). Let “Post” be an indicator taking value 1 in the calendar month(s) after the warning of the households receiving the earlier warnings and 0 otherwise. We estimate: Yht = α · T reatedht + β · Postht + γ(T reatedht · Postht ) + δXht + Dt + Dh + ht . (7) Our coefficient of interest is γ, which we expect to be negative if households react to warnings by decreasing noncompliance (i.e. increasing attendance) of their children. The results are reported in Appendix Table A.6 using different time horizons for pre- and post-warning. [Insert Appendix Table A.6] In the first column, we consider an interval of three months around a warning and let “Post” take value 1 in the month following the warning, and 0 in the month of the warning and the month before. The estimated coefficient suggests that households who receive a warning decrease their noncompliance rate by 3.8 percentage points compared to households that have not yet received their warning. This represents almost a 30 percent decrease over the mean. The estimates are of a similar order of magnitude if we expand the analysis to a period of five months around a 42 warning and define as “Post” the two months following the warning: the estimated coefficient in that case is −0.045, significant at the 1 percent level. To provide further evidence that this is actually identifying a behavioral response to the warning, we show that pre-trends are the same for both groups supportin the common trend assumption underlying our difference-in-difference analysis. In columns 3 and 4 we conduct the following falsification tests. We define as “Post” the month of the warning and use as pre-period the month before (column 3) or the two months before (column 4). If our negative estimate were picking up a differential trend for warned households, this would result in a negative coefficient on “Treated∗Post” in columns 3 and 4. The fact that we instead obtain small and insignificant coefficients on the interaction term lends credibility to the interpretation that our effect is a causal response to the receipt of the warning. 43 B. Appendix Figures and Tables Figures Figure A.1: Frequency of delays 0 .1 Fraction of Warnings .2 .3 .4 Delay of Warning Receipt 2 3 4 Delay (in Months) 5 6 ® i Figure A.2: Response to warning, by warning stage Response to Warnings 5 Fraction Noncompliant .2 .3 Warning −1 0 1 2 3 4 Months since Warning 5 Warning −1 0 1 2 3 4 Months since Warning 5 Fraction Noncompliant .1 .2 .3 Warning Stage 5 Fraction Noncompliant .2 .3 Warning −1 0 1 2 3 4 Months since Warning .1 .1 .1 Warning −1 0 1 2 3 4 Months since Warning Warning Stage 4 .1 Warning Stage 3 Fraction Noncompliant .2 .3 Warning Stage 2 Fraction Noncompliant .2 .3 Warning Stage 1 5 Warning −1 0 1 2 3 4 Months since Warning 5 ® ii Tables Table A.1: Summary statistics Variable Own Warnings Noncompliance Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Warned Delay of Warnings Frac Male N of Sisters 6 to 10 N of Brothers 6 to 10 N of Sisters 11 to 14 N of Brothers 11 to 14 N of Sisters 15 to 17 N of Brothers 15 to 17 Month Year 2009 Parents informed in Writing Parents informed in Meetings Parents informed in Home Visits Peer Warnings Frac of Peers Warned Frac of Peers Noncompl Max Fraction of Siblings’ Peers Warned All Siblings Siblings in Other Schools N of Peers in School N of Peers in Grade N of Peers in Class Predetermined Characteristics Head Female Head Married Head Single Head Age Head White Head Yrs of Educ Head Working Spouse Age Spouse White Spouse Yrs of Educ Spouse Working Dep Ratio Pct Indio HH Expenditures House Property Garbage Collected Running Water Electricity iii Observations Mean Std. Dev. 8090769 8090769 8090769 8090769 8090769 8090769 8090769 267626 8090769 8090769 8090769 8090769 8090769 8090769 8090769 8090769 8090769 5044299 5044299 5044299 0.086 0.292 0.085 0.029 0.011 0.002 0.035 3.809 0.530 0.206 0.222 0.250 0.271 0.154 0.172 6.538 0.492 0.790 0.940 0.703 0.280 0.455 0.279 0.169 0.103 0.041 0.184 1.398 0.387 0.386 0.400 0.412 0.432 0.340 0.360 2.864 0.500 0.408 0.169 0.457 5914381 5914381 0.014 0.010 0.059 0.046 4903020 4903020 5914381 5914381 5914381 0.014 0.013 279.407 51.970 16.533 0.058 0.058 222.976 50.544 6.752 478511 478511 478511 478511 478511 478511 478511 261318 261318 261318 261318 478511 478511 478511 478511 478511 478511 478511 0.933 0.337 0.531 38.063 0.182 4.517 0.510 40.924 0.167 3.519 0.738 1.092 0.002 2254.6 0.695 0.594 0.610 0.793 0.249 0.468 0.495 10.028 0.384 3.227 0.487 10.536 0.371 3.057 0.430 1.033 0.040 15356.5 0.453 0.487 0.482 0.398 Table A.2: Effect of own warnings, individual level Dependent Variable: Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Controls Individual FE Time FE No. Obs. R-squared Noncompliance (in a Given Month) -0.0336*** (0.0000) -0.0657*** (0.001) -0.0970*** (0.001) -0.1147*** (0.002) -0.1366*** (0.005) Yes Yes Yes 16,056,547 0.23 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies and individual fixed effects. iv Table A.3: Correlates of delay Dependent Variable: Delay of Warning (2) Coeff (Std.Err.) -0.0009 (0.001) -0.0005 (0.001) 0.0019 (0.001) -0.0009 (0.001) -0.0019 (0.002) 0.0002 (0.001) 0.0006 (0.002) -0.0033 (0.004) -0.0026 (0.004) 0.0009 (0.002) 0.0001 (0.000) 0.0000 (0.000) 0.0001 (0.001) -0.0003 (0.002) 0.0001 (0.000) 0.0000 (0.000) -0.0009 (0.001) 0.0001 (0.001) 0.0000 (0.000) -0.0023* (0.001) -0.0004 (0.001) -0.0012 (0.001) -0.0009 (0.001) Yes Yes Yes Yes (1) N of Girls 6 to 10 N of Boys 6 to 10 N of Girls 11 to 14 N of Boys 11 to 14 N of Girls 15 to 17 N of Boys 15 to 17 Head Female Head Married Head Single Head White Head Age Head Yrs of Educ Head Work Spouse White Spouse Age Spouse Yrs of Educ Spouse Work Dependency Ratio Expenditures House Property Garbage Collection Water Connection Electricity Time FE Municipality FE P-val of F-test (joint sig of HH charac) Observations R-squared 267,626 0.98 0.910 111,462 0.98 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. v Table A.4: Effect of own warnings, by highest warning stage reached Dependent Variable Highest WS Reached 1 Warning Stage 1 -0.1863*** (0.001) Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Controls Time FE Household FE P-Values of Tests: Coeff of WS 1 vs 2 Coeff of WS 2 vs 3 Coeff of WS 3 vs 4 Coeff of WS 4 vs 5 No. Obs. R-Squared Yes Yes Yes 2,977,415 0.14 Noncompliance in a Given Month 2 3 4 5 -0.1677*** -0.1716*** -0.1385*** -0.1464*** (0.002) (0.004) (0.009) (0.027) -0.5120*** -0.4907*** -0.2874*** -0.2960*** (0.003) (0.005) (0.012) (0.035) -0.9837*** -0.7029*** -0.6010*** (0.006) (0.017) (0.048) -1.1119*** -0.8757*** (0.020) (0.059) -1.2264*** (0.072) Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes 0.000 0.000 0.000 0.000 0.000 0.000 814,758 0.21 221,764 0.27 55,233 0.27 0.000 0.000 0.000 0.000 8,350 0.27 Notes: Initial warning stage is 0 (i.e. warning stage at the beginning of our period of observation, namely January 2008). Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: fraction of male children, number of boys and girls in the household in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. All specifications include household fixed effects. vi Table A.5: Balance test Timing: Time-Varying Characteristics N of Sisters 6 to 10 N of Brothers 6 to 10 N of Sisters 11 to 14 N of Brothers 11 to 14 N of Sisters 15 to 17 N of Brothers 15 to 17 No. Obs. Pre-Program Characteristics Head Married Head Single Head Female Head Age Head White Head Years of Education Head Work Spouse Age Spouse White Spouse Years of Education Spouse Work Dependency Ratio Pct Indio Expenditure House Property Garbage Collected Water Connection Electricity No. Obs. Early Mean/SE Late Mean/SE Diff/[p-val] 0.1643 (0.3432) 0.1738 (0.3529) 0.1894 (0.3643) 0.1949 (0.3708) 0.1157 (0.2984) 0.1186 (0.3017) 3,021,032 0.1644 (0.3421) 0.1738 (0.3521) 0.1902 (0.3654) 0.1948 (0.3687) 0.1142 (0.2953) 0.1191 (0.3037) 4,530,514 -0.0001 [0.588] -0.0001 [0.755] -0.0009 [0.002] 0.0001 [0.617] 0.0014 [0.000] -0.0003 [0.184] 7,551,546 0.3366 (0.4683) 0.5333 (0.4951) 0.9351 (0.2457) 38.0615 (10.0252) 0.1821 (0.3839) 4.5259 (3.2404) 0.5114 (0.4870) 40.9540 (10.5281) 0.1653 (0.3697) 3.5286 (3.0540) 0.7389 (0.4299) 1.0941 (1.0307) 0.0016 (0.0389) 2099.51 (15129.77) 0.6946 (0.4538) 0.5931 (0.4878) 0.6115 (0.4815) 0.7937 (0.3976) 188,970 0.3383 (0.4689) 0.5321 (0.4950) 0.9336 (0.2480) 38.0472 (10.0127) 0.1819 (0.3836) 4.5145 (3.2181) 0.5079 (0.4874) 40.9009 (10.5078) 0.1686 (0.3725) 3.5158 (3.0580) 0.7395 (0.4294) 1.0903 (1.0354) 0.0017 (0.0402) 2227.50 (15070.91) 0.6953 (0.4533) 0.5946 (0.4874) 0.6096 (0.4822) 0.7925 (0.3984) 284,527 -0.0017 [0.221] 0.0012 [0.429] 0.0015 [0.043] 0.0143 [0.630] 0.0003 [0.814] 0.0114 [0.251] 0.0036 [0.014] 0.0531 [0.209] -0.0032 [0.035] 0.0127 [0.311] -0.0006 [0.725] 0.0039 [0.207] -0.0001 [0.452] -127.987 [0.775] -0.0007 [0.620] -0.0016 [0.285] 0.0019 [0.192] 0.0012 [0.330] 472,029 Notes: ”Early” (”Late”) indicates households that withdraw the transfer -and receive warnings, if applicable- in the first (second) half of the withdrawing period during a month. vii Table A.6: Effect of own warnings, difference-in-differences Treat * Post Treat Post No. Obs. R-squared Main result Definition 1 Definition 2 Warning: t=0 Warning: t=0 Before: t=-1,0 Before: t=-2,-1,0 Post: t=1 Post: t=1,2 -0.0379*** -0.0447*** (0.002) (0.003) 0.0014 0.0061*** (0.001) (0.002) -0.1853*** -0.0153*** (0.001) (0.002) 763,847 458,967 0.05 0.03 Placebo Definition 1 Definition 2 Warning: t=0 Warning: t=0 Before: t=-1 Before: t=-2,-1 Post: t=0 Post: t=0 0.002 -0.0037 (0.003) (0.003) 0.0021 0.0078*** (0.002) (0.002) -0.2642*** -0.0269*** (0.002) (0.002) 458,224 309,942 0.06 0.03 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: fraction of male children, number of boys and girls in the household in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. viii Table A.7: Learning about attendance Dependent Variable: Parents Informed in: Warning Stage 1 * Parents Informed Warning Stage 2 * Parents Informed Warning Stage 3 * Parents Informed Warning Stage 4 * Parents Informed Warning Stage 5 * Parents Informed Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Controls Household FE Time FE No. Obs. R-Squared Noncompliance in a Given Month Writing Meeting Home Visit Benchmark -0.0003 -0.0021 0.0002 (0.001) (0.003) (0.001) -0.0008 -0.0039 0.0006 (0.003) (0.007) (0.002) -0.0100** 0.0204 -0.0066 (0.005) (0.013) (0.005) -0.0096 0.0472* -0.0009 (0.009) (0.024) (0.009) -0.0293 0.1017* 0.0159 (0.023) (0.054) (0.021) -0.0619*** -0.0640*** (0.003) (0.0000) -0.1284*** -0.1324*** (0.006) (0.001) -0.2106*** -0.2034*** (0.013) (0.002) -0.3002*** -0.2626*** (0.024) (0.004) -0.4269*** -0.3401*** (0.051) (0.0100) Yes Yes Yes Yes Yes Yes 5,044,299 5,044,299 0.24 0.24 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: fraction of male children, number of boys and girls in the household in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. All specifications include household fixed effects. ix Table A.8: Effect of classmates’ warnings Dependent Variable: Sample Own Warnings (1) Fraction of Classmates Warned Lag of Frac of Classmates Warned Lead of Frac of Classmates Warned Fraction of Classmates Failed Lag of Frac of Classmates Failed Max Frac of Siblings’ Classmates Warned Lag Max Frac of Siblings’ Classmates Warned x Max Frac of Siblings’ Classmates Warned (Only Sibs at Other Schools) Lag Max Frac of Siblings’ Classmates Warned (Only Sibs at Other Schools) Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Controls Individual FE Time FE No. Obs. R-squared -0.0110*** (0.000) -0.0190*** (0.001) -0.0229*** (0.001) -0.0214*** (0.002) -0.0249*** (0.004) Yes Yes Yes 5,914,114 0.28 Noncompliance in a Given Month All Children Children with Siblings Benchmark Placebo Control for Benchmark Siblings’ Peers Siblings’ Peers Peers’ Noncompl. Other Schools (2) (3) (4) (5) (6) (7) -0.0112*** -0.0120*** -0.0100*** -0.0106*** -0.0085*** -0.0099*** (0.002) (0.002) (0.002) (0.002) (0.003) (0.003) -0.0231*** -0.0236*** -0.0199*** -0.0227*** -0.0189*** -0.0191*** (0.002) (0.002) (0.002) (0.002) (0.002) (0.003) 0.0001 (0.002) 0.2807*** (0.021) 0.0952*** (0.009) -0.0047** (0.002) -0.0076*** (0.002) -0.0013 (0.002) -0.0061** (0.003) -0.0107*** -0.0110*** -0.0103*** -0.0092*** -0.0092*** -0.0092*** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) -0.0188*** -0.0192*** -0.0182*** -0.0175*** -0.0175*** -0.0175*** (0.001) (0.001) (0.001) (0.001) (0.001) (0.001) -0.0227*** -0.0232*** -0.0219*** -0.0222*** -0.0222*** -0.0222*** (0.001) (0.001) (0.001) (0.001) (0.001) (0.001) -0.0213*** -0.0213*** -0.0207*** -0.0217*** -0.0217*** -0.0217*** (0.002) (0.002) (0.002) (0.002) (0.002) (0.002) -0.0248*** -0.0270*** -0.0249*** -0.0242*** -0.0242*** -0.0242*** (0.004) (0.004) (0.004) (0.004) (0.004) (0.004) Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes 5,914,114 5,628,700 5,914,114 4,902,847 4,902,847 4,902,847 0.28 0.27 0.28 0.28 0.28 0.28 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. All specifications include individual fixed effects. Table A.9: Effect of classmates’ warnings (clustering of standard errors at the school level) Dependent Variable: Sample Own Warnings (1) Fraction of Classmates Warned Lag of Frac of Classmates Warned Lead of Frac of Classmates Warned Fraction of Classmates Noncompliant Lag of Frac of Classmates Noncompliant Max Frac of Siblings’ Classmates Warned Lag Max Frac of Siblings’ Classmates Warned xi Max Frac of Siblings’ Classmates Warned (Only Sibs at Other Schools) Lag Max Frac of Siblings’ Classmates Warned (Only Sibs at Other Schools) Warning Stage 1 Warning Stage 2 Warning Stage 3 Warning Stage 4 Warning Stage 5 Controls Individual FE Time FE No. Obs. R-squared -0.0111*** (0.000) -0.0193*** (0.001) -0.0231*** (0.001) -0.0218*** (0.002) -0.0248*** (0.004) Yes Yes Yes 5,833,861 0.28 Noncompliance in a Given Month All Children Children with Siblings Benchmark Placebo Control for Benchmark Siblings’ Peers Siblings’ Peers Peers’ Noncompl. Other Schools (2) (3) (4) (5) (6) (7) -0.0115*** -0.0124*** -0.0103*** -0.0109*** -0.0086*** -0.0101*** (0.002) (0.002) (0.002) (0.003) (0.003) (0.003) -0.0239*** -0.0244*** -0.0207*** -0.0233*** -0.0194*** -0.0197*** (0.002) (0.002) (0.002) (0.002) (0.003) (0.003) -0.0003 (0.002) 0.2860*** (0.022) 0.0964*** (0.009) -0.0049** (0.002) -0.0081*** (0.002) -0.0014 (0.003) -0.0063** (0.003) -0.0108*** -0.0112*** -0.0104*** -0.0093*** -0.0093*** -0.0093*** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) -0.0190*** -0.0195*** -0.0184*** -0.0178*** -0.0177*** -0.0177*** (0.001) (0.001) (0.001) (0.001) (0.001) (0.001) -0.0230*** -0.0234*** -0.0221*** -0.0224*** -0.0223*** -0.0223*** (0.001) (0.001) (0.001) (0.001) (0.001) (0.001) -0.0218*** -0.0218*** -0.0211*** -0.0220*** -0.0220*** -0.0220*** (0.002) (0.002) (0.002) (0.002) (0.002) (0.002) -0.0246*** -0.0269*** -0.0248*** -0.0241*** -0.0241*** -0.0241*** (0.004) (0.004) (0.004) (0.004) (0.004) (0.004) Yes Yes Yes 5,554,369 0.28 Yes Yes Yes 5,833,861 0.29 Yes Yes Yes 4,837,760 0.29 Yes Yes Yes 4,837,760 0.29 Yes Yes Yes 4,837,760 0.29 Yes Yes Yes Notes: Robust standard errors in parentheses (clustered at the school level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies. All specifications include individual fixed effects. Table A.10: Effect of warnings of neighbors in the bank (standard errors clustered at zip-code level) Dependent Variable: Fraction in Bank Warned Lag of Frac in Bank Warned Lead of Frac in Bank Warned Frac Neighbors Warned Lag of Frac Neighbors Warned No of BFP Neighbors Controls for Own Warning Stage Controls Individual FE Time FE No. Obs. R-squared Noncompliance in Given Month Benchmark Placebo Controls for Neighbors Warned (1) (2) (3) -0.0276*** -0.0276*** -0.0349*** (0.008) (0.009) (0.006) -0.0549*** -0.0548*** -0.0284*** (0.011) (0.011) (0.004) 0.0004 (0.008) 0.0160 (0.023) -0.0612** (0.030) -0.0000 -0.0000 -0.0000 (0.000) (0.000) (0.000) Yes Yes Yes Yes 4,797,371 0.21 Yes Yes Yes Yes 4,797,371 0.21 Yes Yes Yes Yes 4,797,361 0.21 Notes: Robust standard errors in parentheses (clustered at the zip-code level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies and individual fixed effects. xii Table A.11: Effect of warnings of neighbors in the bank Dependent Variable: Own Warning Stage Frac in Bank Warned (WS 1) Frac in Bank Warned (WS 2) Frac in Bank Warned (WS 3) Frac in Bank Warned (WS 4) Frac in Bank Warned (WS 5) Controls for Frac Neighbors Warned (By WS) Controls Individual and Time FE No. Obs. R-Squared Noncompliance in Given Month 1 2 3 4 -0.0190*** -0.0160 -0.0083 0.0916 (0.004) (0.016) (0.033) (0.071) -0.0640*** -0.0430*** -0.0914** -0.0598 (0.014) (0.008) (0.038) (0.056) -0.1121*** -0.0414 -0.0472*** -0.1801 (0.023) (0.040) (0.014) (0.121) -0.0506 -0.2090** -0.0476 -0.0397* (0.043) (0.098) (0.092) (0.023) -0.0218 -0.0301 0.5936 0.5250* (0.096) (0.192) (0.496) (0.300) Yes Yes Yes 3,842,873 0.27 Yes Yes Yes 1,169,159 0.32 Yes Yes Yes 420,041 0.34 Yes Yes Yes 155,993 0.36 Notes: Robust standard errors in parentheses (clustered at the household level). *** p<0.01, ** p<0.05, * p<0.1. Included controls are: age dummies, birth order dummies, number of brothers and sisters in the age categories 6 to 10, 11 to 15, and 16 to 18, month and year dummies and individual fixed effects. xiii
© Copyright 2026 Paperzz