Hidden Hidden Hunger: Invisible Heterogeneity in Soil Zinc, Crop Zinc, and Zinc Deficiency in Rural Uganda Leah EM Bevis⇤ Cornell University Preliminary Draft Comments Very Welcome Please Do Not Cite or Circulate March 12, 2015 Acknowledgements Special thanks to Clark Gray, PI on the NSF-funded survey that made this data collection possible, co-PIs Ephraim Nkonya, Darrell Shultze, Christopher B. Barrett and Leah VanWay, the Kampala IFPRI and HarvestPlus offices for housing me during the survey, and the entire group of Ugandan surveyors. Thanks to HarvestPlus DC, the Cornell International Institute for Food, Agriculture and Development (CIIFAD), and Cornell’s NSF-sponsored Food Systems and Poverty Reduction IGERT program for the funding that made this research possible. Also thanks to Ross Welch, Christine Hotz, Daniel Gilligan, Raymond Glahn and Anna-Marie Ball for their aid and guidance, to Tembi Williams for her research assistance, to Maia Call for her collaboration, and to Agaba Choice and Sentumbwe George for their aid in data cleaning. Thanks to Mike Rutzke, the Cornell Nutrient Analysis Laboratory and the USDA for their careful analysis of soil, crop and food samples. ⇤ Email: [email protected], Phone: 1.614.288.4008 1 Abstract Over two billion individuals worldwide are a↵ected by micronutrient deficiencies, or “hidden hunger,” with consequences including sustained loss of productivity, irreparably reduced physical and cognitive capacity, blindness, and increased infant and maternal mortality. Population estimates of micronutrient deficiencies, crucial for targeted interventions, commonly rely on individual food recalls or food supply data paired with Food Composition Tables (FCTs) that list “standard” food micronutrient contents. Yet in rural Uganda, FCT zinc standards ignore vast heterogeneity in food zinc content, and misrepresent even the mean zinc content of many crops. Using a unique dataset with food recall data for children under 5 and the mineral contents of household- and market-procured crop samples and household soil samples, as well as a much larger, country-representative panel dataset on agricultural production and food consumption, I examine how unobserved heterogeneity in food zinc content biases individual and population-wide estimates of zinc deficiency. Specifically, because market crops much lower in zinc than home-produced crops, families who rely on the market for staples are more likely to be under-counted as zinc-deficient. Observable household characteristics such as non-farm income, access to transportation and isolation are highly predictive of market reliance for staples. These associations may be useful when considering which families are likely be under-counted by standard zinc intake measures. Keywords: zinc deficiency, micronutrient malnutrition, markets, soils, childhood health 2 1 Introduction Beginning around the 1960s, scientists, nutritionists and policy-makers have gradually come to understand of the importance of vitamins and minerals — collectively, micronutrients — to human health. Micronutrient deficiencies, now known to a↵ect over 2 billion people, are increasingly viewed as a primary constraint to health and development worldwide (Kennedy, Nantel and Shetty, 2003; Black, 2003). While capable of irreversibly eroding human health and productive capacity, micronutrient deficiencies are difficult to diagnose and to treat because they often present with few or no clinical symptoms. For this reason, they are collectively termed “Hidden Hunger.” With a new, global awareness of micronutrient deficiencies came a realization that “hunger” or malnutrition was distributed far more widely and heterogeneously than previously understood. In the context of rural Uganda, I show that an additional layer of invisible heterogeneity, so far ignored in both the nutrition and economics literature, further complicates detection of zinc deficiency. Population estimates of micronutrient deficiencies commonly rely on individual food recalls or food supply data paired with Food Composition Tables (FCTs) that list “standard” food micronutrient contents. Yet in rural Uganda, FCT zinc standards ignore vast heterogeneity in food zinc content, and misrepresent even the mean zinc content of many crops. In particular, market-purchased crops are far lower in zinc than standards would suggest, while household-purchased crops can be up to fifty or a hundred times higher in zinc content than standards suggest. Such variation in food micronutrient content is generally ignored, as it is assumed to be an insignificant driver of intake. Yet in this context, accounting for heterogeneous food zinc content significantly shifts the distribution of zinc intake and the rate of zinc deficiency. Further, heterogeneity of food zinc content is not random — it is correlated with observable household characteristics such as geographic isolation, farm size, non-farm income, and distance to market. These patterns have the potential to bias standard methods of zinc intake calculation, such that zinc deficiency is systematically under- or over-estimated for specific types of children. In this paper, I begin by illustrating the substantial heterogeneity that exists in the zinc content of Ugandan crops. Particularly notable is the di↵erence between home-produced and market-purchased crops: market crops are far lower in zinc, on average, than home-produced crops, suggesting that children who rely on the market for their food intake may be at greater risk for zinc deficiency than other children, even if they consume identical foods. Knowing which children rely on the market, and which children rely on home-produced crops, is therefore potentially important. A nationally representative data-set is used to show that families who are closer to roads and markets, poorer families, and families who are reliant on non-farm income are most likely to source their foods from the market rather than from home-production. In the last section of the paper I construct expected zinc intake according to market crop zinc intake, home-produced crop zinc intake, and estimated reliance on home-sourced foods. I then compare the zinc intake calculated in this way with the standard zinc intake calculated via an FCT. While the sample size is too small to make strong claims the di↵erential between these intake variables, it does appear that 3 children who are particularly reliant on the market are more likely to be counted as “zinc adequate” when in fact they are zinc inadequate. Similarly, children who are particularly reliant on home-produced foods are more likely to be counted as zinc inadequate when they are in fact zinc adequate. This paper proceeds as follows. Section 2 provides background on the importance of childhood nutrition for later health and productivity, on childhood zinc deficiency, and on zinc deficiency in Uganda. Section 3 presents a model that illustrates how farmer cropping choices and marketing practices impact household zinc intake and household health, and highlights the di↵erential in crop zinc content between household-grown crops and market-purchased crops. Section 4 gives an overview of the two datasets used in this paper. Section 5 illustrates the heterogeneity found in crop zinc content, and shows that market-purchased crops are much lower in zinc than household-produced crops. Section 6 constructs child zinc intake using crop samples from households and markets, and predicted ratios of produced to purchased consumption for each relevant crop. (These ratio predictions are estimated using LSMS-ISA panel data.) It additionally explores which children are likely to be under-counted in terms of zinc deficiency. Section 7 concludes. 2 Background Childhood health is a primary determinant of adult productivity. This is especially true in a developing context where income depends on physical as well as cognitive capacity, and a growing literature documents the later-life health and productivity consequences of early childhood malnutrition in poor countries. Maluccio et al. (2009), for instance, examine the impact of childhood protein supplements on wages in Guatemala, and find that improved nutritional status below the age of two increases wages by 46 percent. Chen and Zhou (2007) estimate that early life exposure to famine is associated with decreased longevity and lower income in China. Victora et al. (2008) find that indicators for childhood malnutrition are associated with lower income in Brazil and lower assets in India. Yet, in many of these studies the mechanism connecting childhood diet (or even clinical indicators for childhood malnutrition) to later life outcomes is unclear. For instance, Maccini and Yang (2009) show that early life rainfall shocks in Indonesia lead to poor health and reduced levels of education, a causal association presumably but not obviously acting through reduced calorie intake. In such cases, it might be protein deficiency rather than reduced calorie intake driving later life impacts, or micronutrient deficiencies such as iron and/or zinc deficiency. The irreversibility of micronutrient malnutrition makes it a plausible potential mechanism connecting early childhood diet and later life productivity in many contexts. E↵ectively, early childhood micronutrient malnutrition is capable of degrading an individual’s human capital so much that it changes the production technology available to them as adults (Barrett, 2010; Carter and Barrett, 2006), essentially becoming a poverty trap mechanism (Azariadis and Stachurski, 2005). Severe iodine deficiency, for instance, causes mental retardation, and even mild iodine deficiency reduces cognitive abilities (Hetzel, 1990). Severe selenium deficiency in utero is associated with cretinism 4 (a condition of severely stunted physical and mental growth), and even mild selenium deficiency in pregnant women can have lifelong health impacts for their unborn children through miscarriage, preeclampsia and pre-term labor (Mistry et al., 2012). Vitamin A deficiency is a leading cause of acquired blindness in children (WHO, 2009). Zinc deficiency is one of the most common micronutrient deficiencies,1 with an estimated 2.6-3 billion people at risk (Hotz and Brown, 2004). It is also one of the most dangerous. Zinc deficiency causes abnormal labor and fetal abnormalities in pregnant women, retards physical growth and cognitive capacity in children, is associated with diarrhea and acute lower respiratory infections in children, and delays sexual maturity in adolescents (Prasad, 2003; Hotz and Brown, 2004). In fact, because zinc interacts with a vast number of human proteins (over 900 proteins, 11 times more than does iron), the symptoms of zinc deficiency are many and varied (Graham et al., 2007). Even adults lose muscle mass under zinc deficiency in order to release zinc for maintenance of vital organs, and adult zinc deficiency is associated with a number of diseases/conditions including chronic liver disease, diabetes, and macular degeneration (Prasad, 2003). Zinc deficiency is also associated with a loss of appetite, therefore contributing to deficiency in other nutrients (Hotz and Brown, 2004). Thus, zinc deficiency in children is capable of irreversibly degrading future productivity, while zinc deficiency in adults can decrease current productivity. Moreover, mild or moderate zinc deficiency is almost impossible to diagnose clinically, as it presents with few observable symptoms besides stunting. Zinc is so closely regulated by the body that it is difficult to diagnose even with blood analysis. (Ideally bone tissue is analyzed). For this reason, and because of the diverse functions in the human body that depend on zinc, Graham et al. (2007) call zinc deficiency the “ultimate hidden hunger.” Zinc deficiency is generally accepted to be common in Uganda, and particularly in Ugandan children (Bitarakwate, Mworozi and Kekitiinwa, 2004). Ndeezi et al. (2010) found that, in a group of 247 HIV positive children in Kampala, 54 percent had low serum zinc, defined as zinc less than 10 µmol/liter. Using that same cut-o↵, Bitarakwate, Mworozi and Kekitiinwa (2004) found that half of even healthy children in Kampala displayed low serum zinc status, and that low zinc status was even more prevalent in children with persistent diarrhea. A study by Bachou (1998) found that up to 90 percent of of adolescents in the West Nile region of Uganda had low hair zinc levels. Ndeezi et al. (2010) found that zinc supplementations reduced mortality by two thirds in children with severe pneumonia in Kampala, and Kikafunda et al. (1998) found that zinc supplementation improved child weight gain in preschool children from 3 low-income nursery schools Kampala. While no country-wide estimate of prevalence exists, zinc deficiency in Ugandan children has been found at rates of 50 to 90 percent (Ndeezi et al., 2010; Bachou, 1998), and has been associated with diseases ranging from diarrhea to HIV/AIDs. Tidemann-Andersen et al. (2011) find that dietary zinc intake in Uganda is low, and 1 Its prevalence surpasses that of iodine deficiency, su↵ered by slightly under 2 billion people (WHO, 2004), may be on part with that of vitamin A deficiency, impacting a third of preschool age children and fifteen percent of pregnant women (WHO, 2009), and approaches that of iron deficiency, estimated to impact between 4 and 5 billion people, with 2 billion severely deficient, or anemic (WHO, 2008). 5 work by Ecker, Weinberger and Qaim (2010) suggests that approximately half of all dietary zinc in Uganda intake stems from cereals. Dependence on cereals and other staples for zinc intake is common in semi-subsistence societies that consume primarily plant-based foods (Graham et al., 2007), and leaves families and children particularly at risk of zinc deficiency. In much of rural East Africa, zinc is consumed largely through staples rather than though animal-source foods (ASFs), (Ecker, Weinberger and Qaim, 2010). Work by Ecker, Weinberger and Qaim (2010) suggests that approximately half of all dietary zinc intake in Uganda stems from cereals, and Tidemann-Andersen et al. (2011) find that dietary zinc intake in the Kumi District of Uganda is low in part due to the low zinc content of staples. The authors compare sorghum, millet, maize, groundnut, soy bean and brown bean samples taken from local markets in Kumi to the same crops sampled in Kenya and Mali. They find that crops in Kumi are consistently lower in zinc than crops in Kenya and Mali. Tidemann-Andersen et al. (2011) also find that the zinc content of crops varies across counties, within Kumi District. A number of factors might account for such variation — crop varieties, methods of processing, or variation in soil zinc across counties. It is well known that, because crops uptake minerals from the soils in which they grow, variation in soil minerals is often transmitted to variation in crop minerals. Zinc is strongly transmitted from soils to crops, a fact that has been shown experimentally (Peck et al., 1980; Hipp and Cowley, 1971; Cakmuk and Erdal, 1996; Moraghan, 1994; Shivay, Kumar and Prasad, 2008) and observationally (Singh, 2009; Mayer et al., 2007). Such transmission has been shown to e↵ect the zinc status of rats (Welch, House and Allaway, 1974; House and Welch, 1989), and Mayer et al. (2007) show that the soil zinc concentration of rice paddies in rural Bangladesh e↵ects not only rice zinc content, but the zinc status of farming families who consume the rice. Risk of zinc deficiency is commonly gauged through estimations of national-level or individual-level zinc intake (Joy et al., 2014; Ecker, Weinberger and Qaim, 2010; Gibson and Heath, 2011). While serum zinc status is also measured in many cases, it is far more expense to collect blood samples than to conduct a food recall. Serum zinc is also considered a poor biomarker for individual-level zinc deficiency, as it reflects very recent zinc consumption rather than long-term zinc status (Gibson et al., 2008; Hess et al., 2007). (It does, however, provide a population-level estimate of zinc deficiency.) These estimations of zinc intake rely on FCTs to provide the zinc content of foods consumed. Yet evidence suggests that Uganda’s crops may be low in zinc, and that zinc content varies highly within crops (Tidemann-Andersen et al., 2011). This, combined with a heavy reliance on plant-based foods (Ecker, Weinberger and Qaim, 2010), suggests that zinc intake (and status) in Uganda might be lower and more varied than a it would appear when intake is constructed according to standard FCT zinc content values. 6 3 Model In this model, farmer i seeks to maximize household utility over each day t within agricultural season {0, T }. A household-specific utility function Ui takes the consumption vector Cit as its argument, as in equation 1, where Cit holds the consumption quantities of maize Citmz , sorghum/millet Citsg , sweet potato Citsw , cassava Citcs , beans Citbn , gnuts Citgn , and non-crop foods such as animal-source foods Cita . Farmers maximize utility by choosing optimal quantities of crop sold Mit = {Mitmz , Mitsg , Mitsw , Mitcs , Mitbn , Mitgn }, and purchased Pit = {Pitmz , Pitsg , Pitsw , Pitcs , Pitbn , Pitgn , Pita }.2 max Ui = f f Mit ,Pit t=T X t Ui (Cit ) (1) t=0 For each f 2 {mz, sg, sw, cs, bn, gn}, Citf is the consumption of crop f in time t. (Similarly, Cita is the consumption of non-crop foods in time t). Crop production occurs only once at the start of the season (i.e., we define t = 0 as the moment of harvest), realized as Fi0f for each crop f . No production occurs within the period {1, T }. Conversely, on any day t farmers may choose crop f purchasing quantity Pitf and crop f f selling quantity Mitf . If we allow Si0 to be the initial quantity of crop f in storage, at f t = 0, and if we allow SiT to be the closing quantity of crop f in storage, at t = T , then the maximization problem is constrained according to equation 2. t=T X Citf = t=0 T n X f Si0 + Fi0f + Pitf Mitf f SiT t=1 o 8f 2 {mz, sg, sw, cs, bn, gn} (2) In the Ugandan context however, storage from one season to another is rare. Additionally, storage data is unreliable to work with. In this model I therefore assume that storage from one season to the next is zero, and the farmer’s maximization problem is constrained according to equation 3. Equation 4 additionally requires that selling quantities are no larger than production quantities.3 t=T X Citf = t=0 t=T X t=0 t=T n X Fi0f + Pitf Mitf t=0 o 8f 2 {mz, sg, sw, cs, bn, gn} Mitf Fi0f 8f 2 {mz, sg, sw, cs, bn, gn} (3) (4) Equation 5 displays the household budget constraint for time t, where pit is the vector of crop purchasing prices in time t, and rit is the vector of crop selling prices in time t. Non-farm income gained at time t is given by N Fit . 0 pit Pit s=t n X s=0 0 N Fis + ris Mis 0 pis Pis o 2 (5) Superscripts indicate identical crops across consumption, selling and purchasing variables. I assume that farmers do not sell non-crop foods, i.e., no Mita exists. 3 This constraint implicitly rules out trading. 7 Given equations 1-5, optimal selling and purchasing quantities for each crop f will be defined as in equations 6 and 7, where pi and ri are defined as price vectors across both crops/non-crop food and time, N Fi is defined as the vector of non-farm incomes over time, and Xi includes household characteristics that help to define household preferences over food, i.e. characteristics that impact the form of Ui . Pitf = Pitf (Fi0f , pi , ri , N Fi , Xi ) (6) Mitf = Mitf (Fi0f , pi , ri , N Fi , Xi ) (7) While the farmer does not observe the zinc content of crops produced, sold and purchased, production and marketing decisions do impact the supply of zinc available to the household. Zinc intake depends on both food choice (consumption of maize vs. cassava) and also the zinc density of foods (consumption of low-zinc maize vs. high-zinc f maize), which we know to be heterogeneous. I define zm as the zinc density of food f at f market m, and zi as the zinc density of food f produced by household i. I assume that at any given time t, household i draws its consumption of crop f from either the market or the household — but not from both sources at once.4 The home consumption dummy variable HCitf defines the source of crop f at time t in the following manner: ( 1 if crop f is sourced from the home f HCit = 0 if crop f is sourced from the market Household zinc intake via crop f , Zitf , is therefore defined as in equation 8, relying on f market zinc density zm if HCitf = 1, and relying on household zinc density zif if HCitf = 0. f Zitf = zif ⇤ HCitf ⇤ Citf + zm ⇤ (1 HCitf ) ⇤ Citf (8) If HCitf is not observed at time t, expected Zitf might be defined as in equation 9, where f d HCif is the probability that HCit = 1. d f E[Zitf ] = zif ⇤ HCitf ⇤ Citf + zm ⇤ (1 d HCitf ) ⇤ Citf (9) Expected total zinc intake at time t, E[Zit ], is therefore defined as in equation 10. o X n f df d f a E[Zit ] = zi ⇤ HCit ⇤ Citf + zm ⇤ (1 HCitf ) ⇤ Citf + zm ⇤ Cita (10) f The key components of equation 10 have already been defined; Citf is the consumption of crop f in time t, Citf is the consumption of non-crop food a in time t, zif is the f household-specific zinc density of home-produced crop f , zm is the zinc density of crop f df f at market m. The only variable left to be explicitly defined is HC i = P rob(HCit = 1). 4 This assumption is supported by the LSMS-ISA data, where only 5 percent of consumed crop observations were sourced simultaneously from the home and the market. And even in these cases, the vast majority of the crop was usually consumed from one source. 8 If we assume that families source crop f from home production until they run out of all home-produced crop, then HCitf is defined as in equation 11. Even without this assumption, HCitf must necessarily rely on the quantities Fi0f , Mis , Cisf , and Pisf for all s < t.5,6 By substituting for Misf and Pisf according to equations 6 and 7, and because Cisf is itself a function of Fi0f , Mis , and Pisf for all s < t, a reduced form definition of df HCitf may be written as in equation 12. The probability HC i may be obtained as the prediction from this estimated model. HCitf = 1 i↵ s=t X (Fi0f Misf Cisf ) > 0, 0 otherwise (11) s=1 HCitf = HCitf (Fi0f , pi , ri , N Fi , Xi ) 4 (12) Data This paper utilizes two sets of data. The first set of data covers rural households across much of Uganda, and was collected by the University of North Carolina (UNC) and the International Food Policy Research Institute (IFPRI). The second set of data is a four-wave panel dataset from the Living Standard Measurement Study-Integrated Surveys on Agriculture Initiative (LSMS-ISA). The LSMS-ISA datasets — nationally representative, agriculturally intensive panel data sets — cover a number of countries in sub-Saharan Africa, but this paper utilizes only the panel dataset from Uganda. UNC-IFPRI Dataset The UNC-IFPRI data were collected during the summer of 2013 in nine districts of rural, agrarian Uganda.7 This paper utilizes five types of data from the survey: household survey data, plot-level soil samples, plot-level crop samples, market-level food and crop samples, and child survey and food recall data. While 424 households are represented in the household survey, soil samples are drawn from only 318 of these households, and crop samples from 282. Child data was gathered at 237 households, one child per household. Soil and crop samples were collected on all plots growing maize, sorghum, cassava, sweet potato, beans, or groundnuts. A total of 791 soil samples and 556 crop samples (of those 6 crops only) were gathered. In most cases crops were taken directly from the 5 While it is possible that HCitf indicates the position of some latent variable with respect to a defined threshold, as in equation 11, this is not necessarily true. If, for instance, families may freely choose to consume either home-produced crops or market-produced crops directly after harvest, then during this period HCitf might equal either 1 or 0, regardless of the value of the latent variable. 6 It may also be true that other variables determine HCitf . However, production, selling and purchasing quantities necessarily impact HCitf even if they do not define HCitf , given that HCitf = 1 when Fi0f = 0 or P f P f f Pis = 0. when Fi0 = Mis , and HCitf = 0 when 7 s<t s<t These districts are: Kabale, Iganga, Soroti, Lira, Luuka, Busiki, Amolatar, Dokolo and Serere. Data from these nine districts was collected alongside a larger survey that covered 21 districts. This larger survey was a follow-up round to a previous survey run in 2003. The data utilized in this paper, however, are cross-sectional. Information on the sampling strategy utilized in 2003 can be found in Nkonya et al. (2008). Essentially, rural households were randomly chosen within survey districts, but the survey districts themselves were chosen to represent various agro-ecological zones across Uganda. 9 field; if already harvested, they were sampled from storage.8 Both soil and crop sampling were conducted according to standard protocols for in-field, representative sampling. (See Appendix 1 for details.) For soils, 12-20 subsamples were taken from each plot, evenly distributed. Crop samples were collected similarly. Thus, soil and crop nutrient content can be viewed as plot-representative, and variation in nutrient content across plots is largely related to systematic factors (such as soil texture or pH) rather than random variation within plots. Certain crops and foods, not commonly grown at the household but important to zinc intake, were purchased at local, sub-county markets for later micronutrient analysis. One sample of each food item was taken at each of 32 markets, subject to availability, for a total of 357 samples of 13 food items.9 Soils, crops and foods were analyzed for total micronutrients, and soils were additionally analyzed for “available” micronutrients. Total nutrient quantity, obtained via a Vulcan 84 Digestion, reflects the total mineral quantity (measured in parts per million) in each soil, crop, or food sample. “Available” nutrient quantity, obtained via a Modified Morgan’s extraction, reflects the quantity of soil minerals (measured in parts per million or billion) actually available for plants to uptake. Minerals that are bound tightly in complexes or rock formations may not, for instance, be available to plants. Soil pH is one of the primary factors driving the availability of soil minerals. A Food Frequency Recall (FFQ) was conducted for a maximum of one child per household, and covered all food intake in the preceding week.10 Qualifying children were under 5 years of age, present at the time of the survey, and not exclusively breastfeeding.11,12 During the FFQ child care-takers stated how many times their child had consumed each of 53 selected, zinc-rich foods over the course of the last seven days, and the average portion size for each food consumed. More information on the FFQ can be found in Appendix 3. While the child health module was conducted at only 237 households, in 99 cases a second food recall was conducted.13 This extends the food intake dataset to 336 observations. Table 1 displays descriptive statistics for the children in the UNC-IFPRI sample. Of the 237 children in our sample, all but 4 were under 60 months (5 years) of age, all were 8 Analysis of nutrient content in field-crops vs. storage-crops shows no di↵erence between the two groups. I control for this factor in all relevant analysis, but it has no appreciable impact. 9 Sample food items were matooke (a variety of banana), fish (primarily dried Nile Perch and Tilapia), mukene (small dried fish), cowpeas, rice, cassava flour, maize flour, millet flour, eggs, avocado, milk, cooking oil, and chicken. 10 Food Frequency Recalls are a standard tool for capturing the dietary intake of one or two select micronutrients. 11 If a household had multiple children present who fit these criteria (e.g. a son, a niece, and a cousin), surveyors chose the biological child of the household head. If the household head had multiple biological children present, all fitting the necessary criteria, the surveyors chose the oldest child. 12 Selection on having a young child means that these households are not representative of their survey district. Appendix 2 explores this selection. 13 These 99 children were revisited partway through the survey, as their heights had been improperly recorded. (In theses cases, the digital survey tool was failing to record decimals. This technical problem was not endogenous to enumerator skill.) When enumerators returned to these households to re-measure the children, they also conducted a second food frequency survey. Some of these children were re-measured within a few days of the initial measurement, others over a month or so later. 10 under 66 months of age, and exactly half were males. Calorie intake averaged around 1,000 calories per day (with a median intake of 870 calories/day), generally adequate for children of this age. The median child ate 15 distinct food items or ingredients over the course of the week-long recall period, and during that same period ate meat or fish 2 times, ate milk or eggs 2 times, ate some form of cereal 14 times, ate some type of legume 11 times, and ate some sort of tuber 12 times. Because the UNC-IFPRI survey was conducted during one of Uganda’s two yearly harvest periods, the dietary diversity displayed in this sample likely represents the upper bound of dietary diversity for children in these areas of Uganda. LSMS-ISA Data Uganda’s LSMS-ISA panel includes four rounds of data, collected in 2005/6, 2009/10, 2010/11, and 2011/12. Because Uganda has two agricultural seasons, each family in the Ugandan panel is visited twice during each round, and the agricultural survey, including extensive measurement of agricultural production and sales, is conducted during each visit. The household survey, including a detailed section on household-level food consumption, is conducted only once per round — during the first visit for half of the families in each enumeration area, and during the second visit for the other half. Thus, over the course of 4 rounds, each taking about a year to complete, and each o↵set from the last round by about a month, consumption data has been collected during every week of the year.14 The food consumption module consists of a 7-day recall. For each of 56 common food items listed, families report whether that food item was consumed in the household during the last 7 days, if so how much of it was consumed in total, how much was consumed from purchases, and how much was consumed out of home production. The dummy variable HCitf is assigned a 1 if consumption of crop f was from home production, and a 0 if consumption of crop f was from market purchases.15 A total of 1,756 rural families are included in the Ugandan panel dataset, each observed 4 times.16 (Data from urban areas is discarded.) Because I use LSMS data to model HCitf for each of six crops, and because HCitf is specific to household i and crop f , the the panel dataset constructed is also unique by household, round, and crop consumed. This provides a total of 42,144 possible observations, or 7,024 possible observations per crop. Not every crop is visible in every round, however — if a family did not happen to consume that crop during the 7-day recall window of a particular round, HCitf will not be visible for that round.17 14 Round one was conducted from May 2004 - November 2006, round 2 from September 2009 - October 2010, round 3 from October 2010 - Sept 2011, and round 4 from November 2011 - October 2012. 15 A very small proportion of food is drawn from transfers rather than home production or purchases. f For these observations, HCit = 0. 16 Currently I keep only household observed in all four rounds. I could expand the data-set to include families viewed in only 2 or 3 rounds, and may do so in the future. 17 This selection into the data-set is largely driven by seasonal timing. If this is the only variable driving f f visibility of HCit , then any estimation of HCit is consistent, given that survey timing was chosen randomly by the LSMS team, and is therefore uncorrelated with other independent variables that f might drive HCit . If other variables drive selection into the data-set, such as wealth or tribe, and f these variables are correlated with the independent variables that drive HCit , then estimation is not consistent without adjustment for selection. 11 It is important to note that HCitf refers to the consumption of crop f for household i during week t. The subscript t is defined by the day of interview, and di↵ers across but is not defined by the panel round. For some households, t varies a great deal across rounds, while for others t is similar for all rounds. For a very small number of observations (less than 5 percent), the consumption of crop f is drawn both from market and household production simultaneously. In a few cases, households do truly source a single food item, e.g. maize flour, from both the market and home production within the given, 7-day recall period. However, because multiple forms of certain crops are observed (e.g. cassava tubers and cassava flour), these quantities are first aggregated into total kilograms per crop before HCitf is defined. Most of the apparent double-sourced crops are actually due to this aggregation — cassava tubers may have been drawn from home production, for instance, while cassava flour was purchased. In these cases, HCitf is assigned a 1 if the majority of consumption, by kilograms, was drawn from home production. Because this occurs for so few observations it is unlikely to change results, and is preferably to throwing away potentially non-random observations. df Table 2 displays variables used to predict HC i , from both the LSMS-ISA and the UNC-IFPRI datasets. While the probability of producing any particular crop is similar across the two data-sets, production quantity conditional on production di↵ers in a statistically significant manner. Given the di↵erences in both sampling area and year, this is to be expected. In fact, significant di↵erences in production exist within the LSMS-ISA panel, if production quantities are compared across rounds (results available upon request). It is also clear that the LSMS-ISA datasets are, on average, more isolated than the IFPRI-UNC households. They are both further from the nearest cities and further from the nearest all-weather roads.18 They are also less likely to own a bicycle. Households are larger in the IFPRI-UNC data, in part due to selection on having a child under 5 years old. Non-farm income is substantially higher in the LSMS-ISA data, which is curious given the more isolated nature of these households. However, the di↵erence is largely due to higher wage earnings in the LSMS-ISA data, and these earnings are particularly large in the central region, where no IFPRI-UNC households exist. [[I worry that this di↵erence is due to the survey instrument itself, and may in future exclude this variable from analysis.]] 5 Heterogeneity in Crop Zinc Content The nutrient density of food is important only if zinc content is significantly heterogeneous within particular foods. In our dataset, 76 percent of zinc is consumed through plant-based foods, primarily cereals and legumes. It is therefore important to examine the heterogeneity of zinc content in plant-based foods. Figures 1-5 do so, di↵erentiating between crops gathered at households and crops gathered at market, and marking zinc content values from the HarvestPlus FCT for the sake of comparison. These figures illustrate three points. First, the zinc content of crops is indeed highly 18 Distances to nearest city is measured by euclidean distance, created in both datasets using household and city GPS coordinates. Distance to nearest all-weather road is reported by IFPRI-UNC farmers, and created using GPS coordinates in the LSMS-ISA data. 12 heterogeneous. The highest-zinc sorghum samples contained 164 and 185 mg of zinc per gram of crop, over 100 times the HarvestPlus FCT estimate of 1.6. The highest-zinc maize samples contained 96 and 128 mg of zinc per gram of crop, 50-75 times the FCT estimate for unrefined maize, and 130-180 times the FCT estimates for refined maize. Second, market zinc content is far lower and less variable than household zinc content, for every crop. Median cassava zinc content drops by 40 percent at market (Figure 1), and median maize content drops by 83 percent (Figure 2). Figure 3 illustrates the zinc content of millet purchased at market and sorghum sampled at households, two cereals generally considered to have almost identical zinc content.19 The median zinc content of millet purchased from market is only 56 percent of the median zinc content of sorghum sampled from homes. Figure 4 illustrates the zinc content distribution of cowpeas purchased at market and beans sampled at households. While not strictly comparable, these crops are again considered to have similar zinc content, and the observed pattern is the same: market-purchased cowpeas are lower and less variable in zinc than home-produced beans. What drives the observed heterogeneity in household and market crops? As shown in Appendix 4, variation in the zinc content of household crop samples can be largely explained by soil zinc concentration and other soil conditions. While the precise model may vary across soil types, this type of soil-to-crop mineral transmission is commonly observed (Singh, 2009; Mayer et al., 2007; Chilimba et al., 2011). In these data interactions between total zinc concentration, pH, and extractable cadmium and manganese are the primary soil-level predictors of crop zinc. The di↵erence between household and market samples is more puzzling, and a novel finding. While investigation of this phenomenon is outside the scope of this paper, Appendix 5 holds some details on potential mechanisms. It is unlikely that processing, nutrient degradation over time, or plot-level selection explains the di↵erential. Selection on variety and farmer selection into marketing are the most plausible explanations. Third, these figures illustrate that HarvestPlus FCT standards are often poor proxies for median crop zinc content. For instance, the FCT zinc content standard is close to the median value for cassava grown at home. It dramatically over-estimates the zinc content of cassava purchased at market, however. The FCT zinc content standard over-states the median content of sweet potato grown at home by two thirds — and market purchased sweet potato may be even lower in zinc than home-produced sweet potato, given the trends observed in Figures 1-4. The median zinc content of home-produced maize is far higher than the FCT value for refined or unrefined maize flour, where-as the median zinc content of market-produced maize is significantly lower. The first point might lead us to expect that zinc intake estimates for any particular individual may often be wrong. The second two points suggest, further, that estimates of population-wide zinc deficiency rates might also be biased, either for entire countries or for entire subsets of individuals within a given country. 19 The HarvestPlus FCT lists millet and sorghum as having 1.7 and 1.6 mg of zinc per 100 grams of crop, respectively. 13 6 Constructing Zinc Intake Calculating expected zinc intake E[Zit ], as given in Equation 10, requires knowledge of crops consumed, Cit , and non-crop foods consumed Cita . These variables can be constructed directly from the UNC-IFPRI child food recall data. It is also necessary to a calculate or estimate the zinc density of non-crop foods consumed zm , the f market-specific zinc density of purchased crops zm , the household-specific zinc density df of produced crops z̄if , and the probability of home consumption for each crop f , HC i . The zinc density variables can be calculated or estimated using the UNC-IFPRI dataset df only, while predicting HC i requires the LSMS-ISA data. Zinc Density Variables a Zinc density of non-crop foods, zm , is calculated as the district-median zinc content of of market-sampled foods. These foods were chosen for sampling because they were (i) potentially important to zinc intake, but (ii) rarely produced at the home, e.g., fish, oil, milk, or chicken. For foods that are not important to zinc intake (most fruit and vegetables), market samples were not taken for zinc analysis. For these non-crop foods, a zm is given by the HarvestPlus FCT zinc content. For two other items (beef and goat), market samples were not legal to import and analyze, even though these items may a contribute substantially to zinc intake. For these non-crop foods also, zm is given by the HarvestPlus FCT zinc content. f Market-specific zinc density of purchased crops, zm , should ideally be calculated as the district-median zinc content of market-based crop samples. For maize, cassava, and sorghum/millet this is done, but market samples were not taken for the other three crops. Thus, HarvestPlus FCT values are used for the crops where market samples are not available. If anything, this likely over-estimates the zinc contribution of these market crops. Household-specific crop zinc density, z̄if , is predicted by a soils-based, household-specific model of crop zinc content. The model is parameterized by estimating — by crop, for hundreds of crop samples — the impact of soil zinc concentration and other soil characteristics on crop zinc content. This model is presented in panel 3 of Appendix 4. Predictions, as oppose to true crop sample values, were utilized because it was rare for all six crops — maize, sorghum, sweet potato, cassava, beans and groundnuts — to be sampled at the household level. However, if the relevant crop was sampled at a given household, then this sample zinc value is substituted for the predicted value. Estimating Home Consumption (HCiit ) Because data on food source is not gathered in the IFPRI-UNC data, LSMS-ISA data are used to parameterize a logit model for HCitf , defined by Equation 18. This model is df used to predict HC it for each household in the IFPRI-UNC data, using a technique similar to small area estimation. However, I estimate HCitf in the larger data-set and df predict HC it into the smaller data-set, rather than estimating in the smaller data-set to predict into the larger one as described by Elbers, Lanjouw and Lanjouw (2003) Table 3 estimates the impact of farm and household characteristics on HCitf , by crop, using a random e↵ects logit model. The production of crop f significantly, positively 14 influences HCitf , non-farm income negatively influences HCitf , bike ownership positively influences HCitf , and distance to road increases seasonal fluctuations in HCitf . Other estimation methods are under consideration. For instance, Appendix 6 additionally displays a conditional (household fixed e↵ects) logit model,20 and a probit model that adjusts for non-random selection. For the probit estimation, a binary production variable is used to identify selection into the data-set. While an admittedly less than perfect instrument, it is true that production dummies are insignificant in the intensity (home production ratio) equation, conditional on the log production variables already controlled for. Outcome predictions from these estimations strategies are poor, however, as explained in Appendix 6. The random e↵ects logit model is chosen to f df predict HC it within the IFPRI-UNC data, because it mostly closely predicts HCit in the LSMS-ISA data. df Thus, using the model displayed in Table 3, HC it is predicted for each crop f and each household i in the IFPRI-UNC dataset, according to interview date t. Expected child zinc intake E[Zit ] can therefore be constructed according to equation 10. Examining Zinc Intake Figure 6 illustrates the importance of correctly estimating P rob(HCitf ) by illustrating the distribution of expected zinc intake under three assumptions: (1) P rob(HCitf ) = 0 for all crops (making households totally dependent on market-purchased crops), (2) P rob(HCitf ) = 1 for all crops (making households totally dependent on home-produced df crops) and (3) P rob(HCitf )=HC it for all crops, as estimated via the LSMS-ISA model. Zinc intake stemming from the first assumption is henceforth referred to as E[Zit |HCitf = 0], and from the second assumption as E[Zit |HCitf = 1]. The third assumption leads to E[Zit ], as defined in equation 10. These various assumptions shift the zinc intake distribution dramatically. When zinc intake is defined by E[Zit |HCitf = 0], 56 percent of children are zinc inadequate, and mean zinc intake is 3.9 mg/day.21 When zinc intake is defined by E[Zit |HCitf = 1], only 35 percent of children are zinc deficient, and mean zinc intake rises to 6.1 mg/day. df When estimated HC it is used to define E[Zit ], 41 percent of children are zinc inadequate, and mean zinc intake is 5.4 mg/day. Generally, however, zinc intake is calculated under the assumption that crop zinc density is defined by a single value, procured from a Food Composition Table. In this case, di↵erentiating consumption from market and consumption from home is unnecessary. Zinc intake calculated in this manner is henceforth referred to as E[Zit |F CT ]. Figure 7 contrasts the probability distribution of E[Zit |F CT ] with the probability distribution of E[Zit ]. The distributions are similar, though not identical. According to 20 The conditional logit model maximizes likelihood by group, in this case household, conditional on the number of positive outcomes in that group. In this way it conditions on fixed household means without including dummy variables in the model, which are well known to bias coefficients in non-linear models. 21 “Zinc adequacy” is calculated by dividing zinc intake by the child-specific Recommended Daily Allowance (RDA) for zinc — i.e., the intake necessary to meet the needs of 95 percent of the relevant population. Children with adequacy ratios of less than 1 are intaking inadequate levels of zinc. The RDA for zinc ranges from 2 mg/day (children under 6 months) to 5 mg/day (children 4-8 years). 15 E[Zit |F CT ], 46 percent of children are zinc inadequate (rather than 41 percent), and mean zinc intake is 4.6 mg/day (rather than 5.4 mg/day). Figure 8 shows that, while the cumulative distributions of these two intake variables are again similar, E[Zit ] first order stochastically dominates E[Zit |F CT ]. It seems, therefore, that the standard method of calculating zinc intake leads to slight over-estimates of zinc deficiency in Uganda as a whole. These distributional similarities hide ordinal di↵erences between E[Zit ] and E[Zit |F CT ]. While it is true that for most children E[Zit |F CT ] < E[Zit ], for some children the opposite is true. The di↵erence between E[Zit ] and E[Zit |F CT ] increases df with probability of home consumption HC it , as seen in Figure 9. This is logical, given that average zif is generally higher than the FCT crop f zinc density (Figures 1-5). The di↵erence is also increasing in cereal consumption, again logical given that zif is particularly high, as compared to the FCT measures, for maize and sorghum/millet (Figures 2-3). Thirty-four children (9 percent of the sample) are zinc adequate according to E[Zit ], but zinc inadequate according to E[Zit |F CT ]. For these children, the standard FCT method of zinc intake calculation appears to under-estimate intake and over-estimate risk of deficiency. The majority of these children are in the Eastern region of Uganda.22 Why is zinc intake in Eastern Uganda most likely to be under-estimated by standard calculations? Household-produced crops tend to have heterogeneous zinc density in the Eastern region, largely due to heterogeneous concentrations of soil zinc (Figure 11) and heterogenous pH levels. This means that zinc intake from home produced crops is also particularly heterogenous in the Eastern region. Figures 12 illustrates this phenomenon by graphing the probability distribution of household-specific zinc intake less market-specific zinc intake (i.e., graphing the probability distribution of E[Zit |HCitf = 1] E[Zit |HCitf = 0]) for each region. The di↵erence between these two intake calculations is most variable in Eastern Uganda, where sourcing from home production rather than market might either increase or decrease zinc intake by up to 10 mg per day. Because auto-consumption is most prevalent in the Eastern region (Figure 13),23 E[Zit ] ends up particularly variable in the Eastern region. And while this leads to some over-estimation of zinc intake, it leads primarily to under-estimation of zinc intake for those children who are depending on extremely high-zinc, home-produced crops. Eighteen children (5 percent of the sample) are zinc adequate according to E[Zit |F CT ], but zinc inadequate according to E[Zit ]. For these children, the standard FCT method of zinc intake calculation appears to over-estimate zinc intake, and under-estimate risk of deficiency. The majority of these children are in the Northern region of Uganda; none of them are in the South-West district of Kabale.24 22 Twelve percent of children in the Eastern region are adequate according to E[Zit ], and inadequate according to E[Zit |F CT ]. This figure is 7 percent in the Northern region and 5.5 percent in the South-Western province of Kabale. df 23 While this figure illustrates average HCit across cereals and legumes, the same pattern is true for tubers, and for all individual crops except beans. 24 Eight percent of children in the Northern region are adequate according to E[Zit |F CT ], and inadequate according to E[Zit ]. This figure is 4.5 percent in the Eastern region and 0 percent in the South-Western district of Kabale. 16 Why is E[Zit |F CT ] particularly likely to under-estimate zinc intake in the Northern region, and why does it estimate zinc deficiency rates so well in Kabale? Figures 13 and 14 illustrates that auto-consumption is particularly low in the Northern region, making f Northern children highly dependent on market crops. Because average zm is lower than the FCT zinc density for most crops f , E[Zit |F CT ] is likely to under-estimate child zinc intake in areas like the North where consumption is highly dependent on market crops. Conversely, E[Zit |F CT ] will be unlikely to under-estimate zinc intake in areas where zinc intake is largely drawn from home production. When zinc deficiency is calculated according to E[Zit |F CT ], 23, 63, and 66 percent of children from Kabale, the North, and the East respectively are at risk of zinc deficiency. In Kabale, zinc intake is particularly dependent on beans,25 and Figure 14 shows that bean consumption is more likely to be sourced from home-production in Kabale than in any other region. In summary, E[Zit |F CT ] is likely to under-estimate zinc intake for children who are particularly dependent on market crops for zinc intake, and likely to over-estimate zinc intake for children who are particularly dependent on home-produced crops for zinc intake. Variation in soil zinc may marginally impact the accuracy of E[Zit |F CT ], but if so the e↵ect is hidden by the much larger di↵erential between children who rely on the market and children who rely on auto-consumption. 7 Conclusion Significant heterogeneity in food and crop zinc content exists in Uganda. Crop zinc content varies, first, with the characteristics of the soils in which it is grown. Even more significant, however, is the zinc content di↵erence between market crops and home-produced crops. While such heterogeneity is commonly ignored during the calculation of zinc intake, I show that zinc intake distributions and zinc deficiency estimates shift when heterogeneity is accounted for. Zinc intake in Uganda appears to be slightly higher would be apparent if zinc intake was calculated in the standard manner, via a Food Composition Table. The di↵erence is not huge, however — in this sample, zinc deficiency decreases by five percent when heterogeneity of crop zinc and food sourcing patterns are accounted for. More significant is how children switch from appearing zinc deficient to zinc adequate, or visa versa, between the two calculation methods. Children highly reliant on home-produced crops for their zinc intake, and particularly children who are reliant on home-produced crops and also live in an area with zinc-rich soils and zinc-rich crops, are likely to consume greater quantities of zinc than suggested by the standard calculation method. This appears to be the case with children in the Eastern region of Uganda. Conversely, children highly reliant on market-produced crops for their zinc intake are likely to consume less zinc than suggested by the standard calculation method. This appears to be the case with children in the Northern region of Uganda. These findings hinge largely on the di↵erential between home-produced crop zinc content and market-purchased crop zinc content. While this di↵erential appears across 25 The average child in Kable consumed beans X times during the recall period, where-as the average child from other regions consumed beans only Y times. 17 regions and crops in Uganda, such a comparison has never to my knowledge been made in other sub-Saharan African countries.26 Calculating zinc intake as I do in this paper requires detailed data on crop zinc content at least, at the household level and at the market level, and if possible soil zinc content and other soil characteristics. This type of data collection is surely prohibitively expensive for most organizations that wish to gauge zinc deficiency in poor, agricultural areas. This paper points, however, to large-scale patterns that might help to inform interpretation of standard zinc intake, at the very least. That is, standard zinc intake measures may consistently overestimate zinc deficiency in areas where zinc intake stems largely from home-produced crops, particularly if soils in that area are fertile (rich in nutrients) and neutral in pH (which increases zinc availability). On the other hand, standard zinc intake measures may consistently underestimate zinc deficiency in areas where zinc intake stems largely from market-produced crops. While reliance on home-produced crops vs. market-purchased crops varies by area, it also varies by observable household characteristics: in Uganda, families with greater levels of non-farm income and families close to markets and roads are likely to depend greatly on market-produced crops. Even in areas with generally high zinc intake, it may be these families who are “invisibly” zinc deficient. 26 However, initial investigation into the mechanism behind that di↵erential suggests that farmer selection into the market plays a role. If this is true, the di↵erential is likely to exist across sub-Saharan Africa, as a similar selection process exists in most countries. That is, in most countries it is the largest, wealthiest farmers who supply the majority of crops to market, and in low-input settings these farmers are also likely to have some of the lowest-nutrient soils. 18 References Azariadis, Costas, and John Stachurski. 2005. “Poverty traps.” Handbook of economic growth, 1: 295–384. Bachou, H. 1998. “Zinc status in rural West Nile adolescents.” Barrett, Christopher B. 2010. “Food systems and the escape from poverty and ill-health traps in Sub-Saharan Africa.” Ithaca, NY:Cornell University Press. Bitarakwate, Edward, Edison Mworozi, and Addy Kekitiinwa. 2004. “Serum zinc status of children with persistent diarrhoea admitted to the diarrhoea management unit of Mulago Hospital, Uganda.” African health sciences, 3(2): 54–60. Black, Maureen M. 2003. “Micronutrient Deficiencies and Cognitive Functioning.” Journal of Nutrition, 3927S–3931S. Cakmuk, I, and I Erdal. 1996. “Phytic acid-zinc molar ratios in wheat grains grown in Turkey.” Micronutr Agric, 2: 7–18. Carter, Michael R, and Christopher B Barrett. 2006. “The economics of poverty traps and persistent poverty: An asset-based approach.” The Journal of Development Studies, 42(2): 178–199. Chen, Yuyu, and Li-An Zhou. 2007. “The long-term health and economic consequences of the 1959–1961 famine in China.” Journal of health economics, 26(4): 659–681. Chilimba, Allan DC, Scott D Young, Colin R Black, Katie B Rogerson, E Louise Ander, Michael J Watts, Joachim Lammel, and Martin R Broadley. 2011. “Maize grain and soil surveys reveal suboptimal dietary selenium intake is widespread in Malawi.” Scientific reports, 1. Ecker, Olivier, Katinka Weinberger, and Matin Qaim. 2010. “Patterns and determinants of dietary micronutrient deficiencies in rural areas of East Africa.” African Journal of Agricultural and Resource Economics, 4(2): 175–194. Elbers, Chris, Jean O Lanjouw, and Peter Lanjouw. 2003. “Micro–Level Estimation of Poverty and Inequality.” Econometrica, 71(1): 355–364. Gibson, Rosalind, and Anne-Louise Heath. 2011. “Population groups at risk of zinc deficiency in Australia and New Zealand.” Nutrition and Dietetics, 68. Gibson, Rosalind S, Sonja Y Hess, Christine Hotz, and Kenneth H Brown. 2008. “Indicators of zinc status at the population level: a review of the evidence.” British Journal of Nutrition, 99(Supplement 3): S14–S23. Graham, Robin D, Ross M Welch, David A Saunders, Ivan Ortiz-Monasterio, Howarth E Bouis, Merideth Bonierbale, Stef De Haan, Gabriella Burgos, Graham Thiele, Reyna Liria, et al. 2007. “Nutritious subsistence food systems.” Advances in Agronomy, 92: 1–74. 19 Gre↵euille, Valérie, AP Polycarpe Kayodé, Christèle Icard-Vernière, Muriel Gnimadi, Isabelle Rochette, and Claire Mouquet-Rivier. 2011. “Changes in iron, zinc and chelating agents during traditional African processing of maize: e↵ect of iron contamination on bioaccessibility.” Food chemistry, 126(4): 1800–1807. Havlin, John, James D Beaton, Samuel L Tisdale, Werner L Nelson, et al. 2005. Soil fertility and fertilizers: An introduction to nutrient management. Vol. 515, Pearson Prentice Hall Upper Saddle River, NJ. Hess, Sonja Y, Janet M Peerson, Jaent C King, and Kenneth H Brown. 2007. “Use of serum Use of serum zinc concentration as an indicator of population zinc status.” Food and Nutrition Bulletin, 28(Supplement 3): 403S–429S. Hetzel, Basil S. 1990. “Iodine deficiency: an international public health problem.” 308–313. Washington, D.C.:International Life Sciences Institute, Nutrition Foundation. Hipp, BW, and WR Cowley. 1971. “Importance of the P-Zn interaction in okra production.” HortScience, 6: 211—212. Ho↵man, Vivian, Samuel Mutiga, Jagger Harvey, Michael Milgroom, and Rebecca Nelson. 2015. “A Market for lemons: Maize in Kenya.” College Park, MD. University of Maryland. Hotz, Christine, and Kenneth H Brown. 2004. Assessment of the risk of zinc deficiency in populations and options for its control. International nutrition foundation: for UNU. House, William A, and Ross M Welch. 1989. “Bioavailability of and interactions between zinc and selenium in rats fed wheat germ intrinsically labeled with 65Zn and 75Se.” Journal of Nutrition, 199: 16–921. Joy, JM, EL Ander, SD Young, CR Black, MJ Watts, ADC Chilimba, B Chilimba, EWP Siyame, AA Kalimbira, R Hurst, SJ Fairweather-Tait, AJ Stein, RS Gibson, PJ White, and MR Broadley. 2014. “Dietary mineral supplies in Africa.” Physiologia Plantarum. Kennedy, Gina, Guy Nantel, and Prakash Shetty. 2003. “The scourge of ”hidden hunger”: global dimentions of micronutrient deficiencies.” FNA ANA, 32: 8–16. Kikafunda, Joyce K, Ann F Walker, Eleanor F Allan, and James K Tumwine. 1998. “E↵ect of zinc supplementation on growth and body composition of Ugandan preschool children: a randomized, controlled, intervention trial.” The American journal of clinical nutrition, 68(6): 1261–1266. Maccini, Sharon, and Dean Yang. 2009. “Under the weather: Health, schooling, and economic consequences of early-life rainfall.” The American Economic Review, 1006–1026. Maluccio, John A, John Hoddinott, Jere R Behrman, Reynaldo Martorell, Agnes R Quisumbing, and Aryeh D Stein. 2009. “The impact of improving nutrition during early childhood on education among Guatemalan adults*.” The Economic Journal, 119(537): 734–763. 20 Mayer, Anne-Marie B, Michael C Latham, John M Duxbury, Nazmul Hassan, and Edward A Frongillo. 2007. “A food-based approach to improving zinc nutrition through increasing the zinc content of rice in Bangladesh.” Journal of Hunger & Environmental Nutrition, 2(1): 19–39. Mistry, Hiten D, Fiona Broughton Pipkin, Christopher WG Redman, and Lucilla Poston. 2012. “Selenium in reproductive health.” American journal of obstetrics and gynecology, 206(1): 21–30. Moraghan, JT. 1994. “Accumulation of zinc, phosphorus, and magnesium by navy bean seed.” Journal of plant nutrition, 17(7): 1111–1125. Ndeezi, Grace, James K Tumwine, Bjørn J Bolann, Christopher M Ndugwa, and Thorkild Tylleskär. 2010. “Zinc status in HIV infected Ugandan children aged 1-5 years: a cross sectional baseline survey.” BMC pediatrics, 10(1): 68. Nkonya, Ephraim, John Pender, Kayuki C Kaizzi, Edward Kato, Samuel Mugarura, Henry Ssali, and James Muwonge. 2008. Linkages between land management, land degradation, and poverty in Sub-Saharan Africa: The case of Uganda. Vol. 159, Intl Food Policy Res Inst. Peck, N. H., D. K. Grunes, R. M. Welch, and G. E. MacDonald. 1980. “Nutritional Quality of Vegetables Crops as A↵ected by Phosphorus and Zinc Fertilizers.” Agronomy Journal, 72: 528–534. Prasad, Ananda S. 2003. “Zinc deficiency.” Bmj, 326(7386): 409–410. Shivay, Yashbir Singh, Dinesh Kumar, and Rajendra Prasad. 2008. “E↵ect of zinc-enriched urea on productivity, zinc uptake and efficiency of an aromatic rice–wheat cropping system.” Nutrient Cycling in Agroecosystems, 81(3): 229–243. Sillanpää, Mikko. 1972. Trace elements in soils and agriculture. Food & Agriculture Org. Singh, MV. 2009. “Micronutrient Nutritional Problems in Soils of India and Improvement for Human and Animal Health.” Indian Journal of Fertilizer, 5(4): 11–16, 19–26, 56. Tidemann-Andersen, Ida, Hedwig Acham, Amund Maage, and Marian K Malde. 2011. “Iron and zinc content of selected foods in the diet of school children in Kumi district, East of Uganda: a cross-sectional study.” Nutr J, 10: 81. Victora, Cesar G, Linda Adair, Caroline Fall, Pedro C Hallal, Reynaldo Martorell, Linda Richter, and Harshpal Singh Sachdev. 2008. “Maternal and child undernutrition: consequences for adult health and human capital.” The Lancet, 371(9609): 340–357. Welch, Ross M, William A House, and William H Allaway. 1974. “Availability of zinc from pea seeds to rats.” Journal of Nutrition, 104(733): 999–1010. WHO. 2004. “Iodine status worldwide: WHO global database on iodine deficiency.” 21 WHO. 2008. “Worldwide prevalence of anaemia 1993-2005: WHO global database on anaemia.” WHO. 2009. “Global prevalence of vitamin A deficiency in populations at risk 1995-2005: WHO global database on vitamin A deficiency.” 22 Figures Figure 1: Heterogeneity in Cassava Figure 2: Heterogeneity in Maize Zinc Concentration Zinc Concentration Figure 3: Heterogeneity in Millet and Figure 4: Heterogeneity in Bean and Sorghum Zinc Concentration Cowpea Zinc Concentration Figure 5: Heterogeneity in Sweet Figure 6: Child Zinc Intake PDFs: Potato Zinc Concentration Three Estimations 23 Figure 7: Child Zinc Intake PDFs: Figure 8: Child Zinc Intake CDFs: FCT vs Estimated HC FCT vs Estimated HC Figure 9: Calculation Di↵erentials Figure 10: Calculation Di↵erentials by Estimated HCR by Cereal Consumption Figure 11: Soil Zn Concentration Figure 12: Household vs Market by Region Zinc Intake 24 f f Figure 13: Prob(HCi =1) by Region, Figure 14: Prob(HCi =1) by Region, Average over Grains & Legumes Only for Beans 25 Tables Table 1: Child and Household Descriptive Statistics (IFPRI-UNC) Median St Dev By Child Male (%) Age (months) Breastfeeding (%) 51 37 17 15 - By Intake Obs Dietary Diversity (distinct items/week) Calorie Intake (kcal/day) Consumption Meat & Fish (items/week) Consumption Eggs & Milk (items/week) Consumption Cereals (items/week) Consumption Legumes (items/week) Consumption Tubers (items/week) 15 870 2 2 14 11 12 7 697 4.1 5.5 8.9 9.8 8.6 Table 2: Household Descriptive Statistics (LSMS-ISA, IFPRI-UNC) Produced Maize (%) Maize Production (kg) Produced Sorghum/Millet (%) Sorghum/Millet Production (kg) Produced Sweet Potato (%) Sweet Potato Production (kg) Produced Cassava (%) Cassava Production (kg) Produced Beans (%) Beans Production (kg) Produced Groundnuts (%) Groundnuts Production (kg) Distance to Nearest All-Weather Road (km) Distance to Nearest City (km) Non-Farm Income (dollars) Household size (people) Bike (%) LSMS-ISA Mean* IFPRI-UNC Mean* t-stat (of equal means**) p-value (di↵=0) 0.61 226 0.33 120 0.49 400 0.48 400 0.58 130 0.28 90 0.63 275 0.54 188 0.46 390 0.45 500 0.65 90 0.26 120 0.65 2.86 4.34 20 0.76 6.84 0.92 10.49 1.92 2.78 .0.51 22.55 0.52 0.00 0.00 0.00 0.45 0.00 0.36 0.00 0.06 0.01 0.61 0.00 8.62 16.10 725.27 5.82 0.47 6.59 11.74 253.06 7.92 0.63 3.29 10.86 10.31 10.80 5.04 0.00 0.00 0.00 0.00 0.00 *Because production quantity is distributed log normally, medians are reported. **T-test of equal means, unequal variance — for production quantities log (kg) is tested 26 Table 3: Explaining Household Consumption Ratio by Random E↵ects Logit Household Production Maize (log kg) Sorghum/Millet (log kg) Sweet Potato (log kg) Cassava (log kg) Beans (log kg) Groundnuts (log kg) House Characteristics Household Size Non-farm income (log) Bike (dummy) Geography/Time Distance to City (km) (Distance to City)2 Distance to Road (km) (Distance to Road)2 (Dist to City)*Weeks (Dist to City)*Weeks2 (Dist to Road)*Weeks (Dist to Road)*Weeks2 Observations Number of hhid (1) Maize (2) Sorghum/Millet (3) Sweet Potato (4) Cassava (5) Beans (6) Groundnuts 0.260*** (0.0184) 0.0185 (0.0209) 0.0106 (0.0171) 0.0737*** (0.0151) -0.0390* (0.0209) 0.0621*** (0.0210) 0.0569*** (0.0217) 0.253*** (0.0233) 0.0647*** (0.0218) 0.0224 (0.0191) 0.0487* (0.0276) 0.108*** (0.0255) 0.0448** (0.0220) 0.0199 (0.0261) 0.173*** (0.0218) 0.0383** (0.0194) 0.116*** (0.0265) 0.0338 (0.0282) 0.0449** (0.0184) 0.0435** (0.0216) 0.0802*** (0.0180) 0.173*** (0.0156) 0.0904*** (0.0223) 0.0823*** (0.0222) 0.0466*** (0.0160) -0.0177 (0.0185) 0.0111 (0.0157) 0.0169 (0.0139) 0.314*** (0.0191) 0.0283 (0.0197) 0.0321 (0.0215) 0.0681*** (0.0249) 0.00999 (0.0206) 0.0484*** (0.0182) 0.00730 (0.0258) 0.373*** (0.0249) 0.0225 (0.0169) -0.0557*** (0.0144) 0.560*** (0.0944) -0.0255 (0.0213) -0.0572*** (0.0197) 0.342*** (0.120) 0.0672*** (0.0219) -0.0574*** (0.0194) 0.625*** (0.121) -0.0238 (0.0177) -0.0273* (0.0159) 0.485*** (0.0995) 0.0126 (0.0162) -0.0468*** (0.0136) 0.219** (0.0893) -0.0144 (0.0198) -0.0344** (0.0175) 0.288** (0.113) 0.0313 (0.0283) -0.000307 (0.000516) 0.0625** (0.0285) -0.00280*** (0.000723) -0.00371** (0.00166) 7.68e-05** (2.99e-05) 0.00297 (0.00184) -5.02e-05 (3.37e-05) 0.0491 (0.0339) -5.60e-05 (0.000629) 0.0841** (0.0375) -0.00317*** (0.000914) -0.00341 (0.00222) 5.42e-05 (4.20e-05) 0.00103 (0.00244) -4.80e-06 (4.54e-05) -0.0126 (0.0314) -0.000544 (0.000571) 0.0764** (0.0328) -0.00253*** (0.000789) 0.00287 (0.00205) -4.84e-05 (3.75e-05) -0.000750 (0.00221) 1.80e-05 (4.08e-05) -0.0261 (0.0279) 0.000412 (0.000484) 0.104*** (0.0301) -0.00322*** (0.000704) 0.000718 (0.00166) -1.55e-05 (3.05e-05) -0.000984 (0.00193) 1.84e-05 (3.52e-05) 0.0526** (0.0246) -0.00103** (0.000453) 0.0333 (0.0271) -0.00190*** (0.000682) -0.00119 (0.00141) 1.65e-05 (2.59e-05) 0.00212 (0.00174) -3.33e-05 (3.16e-05) 0.0681** (0.0309) -0.00113* (0.000581) 0.00823 (0.0333) -0.00116 (0.000835) -0.00250 (0.00191) 4.49e-05 (3.65e-05) 0.00352 (0.00220) -6.88e-05* (4.14e-05) 3,687 1,548 2,090 1,039 2,862 1,392 3,736 1,486 4,829 1,636 2,436 1,228 Standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 All estimations control for the producer price of all 6 crops (averaged by enumeration area), for round, and for weeks and weeks squared by region Appendix 1 Soil sampling was conducted according to standard protocols for in-field, representative soil sampling. Twelve to twenty sub-samples were taken from each plot, with a thin soil probe that reached down to 20 cm. In plots with very hard soil, occasionally an auger or a hoe was used to collect soil samples, rather than a soil probe. In such cases e↵ort was still made to gather soil down to 20 cm. Sub-samples were taken from randomly distributed locations around the plot, roughly following zig-zag patterns, but avoiding any “odd” patches of ground such as termite mounds or compost piles. After mixing all sub-samples together in a bucket, a representative quantity of 500 grams of soil was gathered for subsequent drying, grinding and micronutrient analysis. Crop samples were conducted in a similar manner. For grains and legumes, enumerators sub-sampled crops (e.g. bunches of beans, a kernel of maize) from 10 locations around the plot. As it was feared that farmers might object to 10 cassava or sweet potato plants being dug up, enumerators were told to sub-sample these crops from 5 or 6 plants across the plot. (It is possible that sub-samples were sometimes taken from an even smaller number of plants.) The total weight of each sample, once compiled from all sub-samples across the plot, was usually 1 kilogram or more. This large quantity was chosen in order to obtain a representative sample of the crop micronutrient status within each plot. This was important, because plant-to-plant variation in micronutrients can be large (Blair et al. 2013), and we hoped to capture plot-average micronutrient density for each crop sampled. A kilogram of crop matter, gathered across each plot, should suffice for this purpose (Glahn 2013 personal communication). 28 Appendix 2 Households with a child under 5 years of age were on average younger, more educated, and larger, with household heads more often male and more often married. They also appear to be slightly better o↵ in terms of assets. In Table A1, means are given according to the variable as measured. For variables distributed normally, the t-test was also conducted using the measured variable. For those variables distributed log normally, the t-test was instead conducted using the log of that variable. Log normal variables include cattle ownership, sheep/goat ownership, land hectares and land value, and all income variables. Table A1: Household Selection on Having a Child less than 5 Years of Age Head Married (%) Head Male (%) Head Age (years) Head Education (years) Spouse Education (years) Household Size Cattle (#) Sheep & Goats (#) Household owns Motorcycle (%) Household owns Bicycle (%) Agricultural Land (hectares) Agricultural Land Value ($) Household Income ($) Income from Livestock Products ($) Income from Selling Livestock ($) Crop Income Has Child Mean SD No Child Mean SD 0.84 0.82 45.83 6.08 4.35 7.92 1.88 1.86 0.09 0.64 3.21 2549 2131 268 303 1714 0.64 0.68 53.39 5.36 4.16 4.95 1.81 1.48 0.07 0.48 2.96 5900 2450 367 312 3805 29 0.37 0.39 12.72 3.61 3.52 2.29 3.98 2.74 0.29 0.48 3.49 3290 5178 532 728 1066 0.48 0.47 15.54 4.34 3.57 2.54 6.15 3.5 0.26 0.5 3.94 23432 5564 693 645 22267 T-stat -4.42*** -3.48*** 5.56*** -1.87** -0.48 -11.07*** 0.40 1.74** -0.77 -3.32*** -1.08 -0.21 -0.78 2.08** 1.20 -0.54 Appendix 3 Food Frequency Questionnaires (FFQs) are a common tool for procuring the dietary intake of a select group of vitamins or minerals, rather than attempting a to gather comprehensive data on all dietary intake. In an FFQ, an individual (or caretaker) gives the number of times that each item on a list of foods was eaten during a particular recall period — in our case one week. In some cases, and in our survey, an average portion size is also chosen for each food item. We illustrated portion sizes (small, medium and large) with a book of pictures, as shown in figures A2 and A3. Caretakers were also shown the physical plate on which the pictures were taken. Foods listen on an FFQ are chosen to represent the major dietary sources of the nutrients of interest, according to prior and more comprehensive food recall data. Our nutrients of interest were zinc and selenium, and the foods listened on our FFQ capture 98 percent of all zinc consumption in Ugandan children under five, according to detailed food recall data gathered in central and eastern Uganda by the Reaching End Users (REU) project, a HarvestPlus initiative introducing orange-fleshed sweet potato to farmers in rural Uganda (Hotz et al. 2012).27 It was not possible to easily gauge the proportion of selenium intake captured by our FFQ, because the food consumption table put together by HarvestPlus for the REU project did not list selenium density of foods. However, the major sources of selenium are quite similar to the major sources of zinc (Figures A4 and A5), and additional food items (dishes with mushrooms, dairy and fish) were added to the FFQ in order to capture intake of foods particularly rich in selenium. Thus, it seems likely that over 90 percent of selenium intake, at least, was also captured by our FFQ. Most food items listed in the FFQ were dishes (e.g., cooked cassava and beans), rather than food items (e.g., mango or groundnuts). Recipes for each dish were chosen from the HarvestPlus recipe list, a comprehensive list of all common recipes used in the REU project areas in central & eastern Uganda.28 Portion sizes were for each food item and dish were weighed out in grams, with weights having been chosen as quintiles in the FEU food recall data. 27 In using the REU food recall data to construct a list of foods for the FFQ, I followed protocols laid out by a technical document written by Christine Hotz, the primary architect of the REU Food Frequency Recall. That food recall was focused on vitamin A consumption, and so the recall itself was not useful for our purposes, but the methodology — and data to conduct it — was highly useful. 28 If multiple recipes were listed then the recipe most common in the REU food recall data was chosen. If a recipe was not listed (e.g. sorghum porridge, common in our survey district of Kabale but not in central or eastern Uganda), then I modified the closest recipe to it (e.g. maize porridge). 30 Figure A1: Portion Size Picture of Figure A2: Portion Size Picture of Mukene (Small Fish) Katogo (Cassava & Beans) Figure A3: Sources of Dietary Zinc Figure A4: Sources of Dietary Selenium Intake, by 2003 District Intake, by 2003 District 31 Appendix 4 Equation 10 models zijf , the zinc density of crop f grown on plot j by household i. It shows zijf to be a function of soil zinc, zij , and other soil characteristics that e↵ect the availability of soil zinc to the plant, availij . A reduced form is adopted for the estimation of equation 10, including interactions between soil zinc, crop variety, and the key soil characteristics that impact soil zinc availability. These characteristics are pH and pH2 , manganese (Mn) and cadmium (Cd). Soil zinc is also interacted with county fixed e↵ects, to capture environmental or unobserved soil characteristics impacting zinc availability. Included soil characteristics were chosen according to soil science literature on zinc availability. Soil pH is a primary determinant of soil zinc availability to plants. Sillanpää (1972) writes that the range of lowest zinc availability is pH 6-7, because in this range soil zinc tends to form insoluble calcium zincates which are less available to plants. Manganese also impacts zinc availability, as zinc and manganese form unavailable complexes within certain pH ranges (Havlin et al., 2005). Zinc often competes with cadmium for plant uptake, likely because the two cations share the same carrier site (Duxbury personal communication, Oct 2 2014 ). Other minerals and soil characteristics are also known to impact soil zinc availability (e.g., iron, sand content, calcium), and a number of extended model specifications were examined. However, it was found that once zinc interactions with pH, manganese and cadmium were included, additional soil characteristics add very little to the model R2 . Given a fairly small sample for most crops, these extraneous variables were dropped from the model. Table A2 summarizes this core model (panel 2), along with a model including only crop variety fixed e↵ects (panel 1), and a model including only soil characteristics (panel 3). In each case, column 1 pools all crops, while columns 2-7 give crop-by-crop estimations. Panels 2 and 3 give the elasticity of crop zinc with respect to soil zinc, ✏z , along with the confidence interval around this marginal e↵ect. All three panels give the associated R2 and observation number N. Panel 2 of Table A2 is the preferred specification, as it has the highest R2 values. However, if crop variety is not known, the model in panel 3 predicts crop zinc quite accurately using only soil characteristics and zinc-county interactions, the latter of which captures some regional variation in common crop varieties. Figure A5 illustrates crop zinc estimates derived from panel 3 model. 32 Table A2: Marginal E↵ect of (Log) Soil Zinc Concentration on (Log) Crop Zinc Content (1) All Crops (2) Maize (3) Sorghum (4) Sw. Potato (5) Cassava (6) Beans (7) ) Gnuts 1. Varieties R2 =0.587 N=586 R2 =0.058 N=235 R2 =0.025 N=228 R2 =0.222 N=119 R2 =0.126 N=235 R2 =0.086 N=228 R2 =0.049 N=119 2. Varieties & Soil Traits ✏z =0.221 [.061 .38] R2 =0.635 N=586 ✏z =0.138 [-.501 .778] R2 =0.505 N=235 ✏z =-0.107 [-.533 .318] R2 =0.354 N=228 ✏z =0.105 [-.432 .641] R2 =0.492 N=119 ✏z =0.283 [-.252 .819] R2 =0.526 N=235 ✏z =0.071 [-.239 .098] R2 =0.426 N=228 NA NA NA N=119 3. Soil Traits ✏z =0.285 [.1 .469] R2 =0.165 N=586 ✏z =0.127 [-.341 .595] R2 =0.421 N=235 ✏z =0.065 [-.195 .325] R2 =0.228 N=228 ✏z =0.146 [-.496 .788] R2 =0.280 N=119 ✏z =0.375 [-.219 .969] R2 =0.453 N=235 ✏z =-0.092 [-.22 .035] R2 =0.350 N=228 ✏z =0.371 [-.958 1.7] 0.973 N=119 Panel 1 Regressors: variety Panel 2 Regressors: Zn, variety, variety*Zn, pH, pH*Zn, pH2 , pH2 *Zn, county*Zn, Mn, Mn*Zn, Mn*Zn*pH, Cd, Cd*Zn, Cd*Zn*pH Panel 3 Regressors: Zn, pH, pH*Zn, pH2 , pH2 *Zn, county*Zn, Mn, Mn*Zn, Mn*Zn*pH, Cd, Cd*Zn, Cd*Zn*pH Figure A5: Crop Zinc Content Predictions) Appendix 5 It is unlikely that processing accounts for the di↵erence between household and market zinc content, as there was no di↵erence between the market vs. home processing of cassava, millet and sorghum, or beans and cowpeas. There was a slight di↵erence for maize; household-sampled maize was ground into unrefined maize flour, while maize flour purchased at market had been ground at local mills, and often refined or partially refined. But while refinement might lead to 10-50 percent decrease in maize zinc content, it is unlikely to account for an 83 percent decrease. HarvestPlus FCT values indicate that maize processing decreases zinc content by 60 percent, substantially less than the 83 percent di↵erential observed in our data. And even this value may be higher than appropriate, given that the HarvestPlus value for refined flour is derived from the USDA FCT value for refined, de-germed cornmeal. (This is NDB No 20022 in the most recent USDA table, Nutrient Database Release 26.) Locally-owned Ugandan mills de-husk, but do not de-germ, cereals, therefore preserving a greater proportion of zinc and other nutrients. In a study of such locally-owned mills in Benin, Gre↵euille et al. (2011) finds that processing dry maize grain (as is done in Uganda) decreases zinc content by only 11 percent, and processing wet, washed maize grain decreases zinc content by 54 percent. Degradation of nutrients is also unlikely to account for the di↵erential, given that metals do not degrade significantly over time. It is possible that farmers are more likely to sell larger crops to market, and keep smaller crops at home. Because cereal size is often negatively associated with cereal nutrient content, such selection might lower the mean nutrient content of market crops. Crop variety is similarly correlated with crop zinc (as well as crop size), and it is also possible that farmers sell particular varieties of crop to market, and keep other varieties at home. Given the soil-to-crop zinc transmission examined in Appendix 4, it seems plausible that farmers might sell crops di↵erentially according to their beliefs about plot-level soil nutrients. That is, they might be more likely to sell crops from less fertile (and lower zinc) plots, and to consume crops from fertile (and higher zinc) plots. This would mirror the findings of Ho↵man et al. (2015), who observed that farmers in Kenya were more likely to sell high-aflatoxin maize, and to keep low-aflatoxin maize for home consumption. However, farmers in Uganda generally mix crops from all fields during the drying process, which occurs before selling, and so it seems unlikely that they are di↵erentially selling by plot. It also seems unlikely that they can pick out low zinc or high zinc crops by eye after drying, except insofar as zinc is correlated with crop size. There is no other observable characteristic correlated with zinc, unless crops are highly zinc deficient, which they are not in this context. Soil macro- and micronutrients are lower on large plots and on large firms. Because land size is highly correlated with crop quantity sold to market, this suggests that crops at market are largely sourced from farms on the lower end of the fertility spectrum. Along with crop size and crop variety, this farm-level selection may help to explain the low zinc content of market crops. 34 Appendix 6 While Table 3 display random e↵ects logit estimation of home consumption ratio, other estimation strategies were considered. For instance, table A3 displays a conditional (fixed e↵ect) logit estimation of home consumption ratio, for each crop. Because household characteristics and location do not change substantially over time for the majority of households in our dataset, the coefficient estimates in these sections are imprecisely estimated. It is clear, however, that the production of a crop (a factor that does shift over time) is significantly, positively associated with the home consumption ratio of that crop. The o↵-diagonal coefficients in the top panel of Table A3 appear smaller and less significant than they do in Table 3 — suggesting that cross-crop associations in this table is biased by fixed household characteristics such as land quality or location. Table A4 displays a Heckman-adjusted probit with estimation of home consumption ratio, adjusting for selection into the dataset. The selection equation estimates the visibility of home consumption ratio via probit, with a dummy for production of the relevant crop being used to identify selection. This identifying variation may not be perfect, but it does hold that, conditional on log production being included in the intensity (home consumption ratio) equation, binary production is an insignificant predictor for most crops. Ultimately the random e↵ects logit was chosen over either of these models, as it predicted outcomes most accurately. Predictions from a conditional logit require strict assumptions about outcomes within groups — generally assuming either an identical mean for every group, or one and only one positive outcome per group. Thus, while the the conditional logit model is useful for regression analysis, it is very poor at prediction. The Heckman-adjusted probit model predicts probabilities similar to those of the random e↵ects logit model, but with slightly less accuracy. 35 Table A3: Explaining Household Consumption Ratio by Heckman-Adjusted Probit with District FE Household Production Maize (log kg) Sorghum/Millet (log kg) Sweet Potato (log kg) Cassava (log kg) Beans (log kg) Groundnuts (log kg) House Characteristics Household Size Non-farm income (log) Bike (dummy) Geography/Time (Dist to City)*Weeks (Dist to City)*Weeks2 (Dist to Road)*Weeks (Dist to Road)*Weeks2 Observations Number of hhid (1) Maize (2) Sorghum/Millet (3) Sweet Potato (4) Cassava (5) Beans (6) Groundnuts 0.119*** (0.0267) 0.0505 (0.0331) 0.0248 (0.0248) 0.0399* (0.0210) -0.0261 (0.0315) 0.0284 (0.0316) 0.00735 (0.0384) 0.168*** (0.0363) 0.0990*** (0.0354) 0.0230 (0.0317) -0.0189 (0.0563) 0.0395 (0.0498) -0.0116 (0.0448) 0.0228 (0.0461) 0.134*** (0.0395) 0.0447 (0.0341) 0.0779 (0.0531) -0.00513 (0.0576) -0.0170 (0.0283) 0.0291 (0.0332) 0.0740*** (0.0264) 0.121*** (0.0210) -0.0120 (0.0368) 0.0207 (0.0334) -0.00114 (0.0229) 0.0474* (0.0263) -0.000437 (0.0214) -0.0254 (0.0187) 0.125*** (0.0256) 0.000747 (0.0291) -0.0251 (0.0399) 0.0334 (0.0440) -0.0571* (0.0347) 0.0232 (0.0278) 0.0198 (0.0424) 0.180*** (0.0373) 0.0272 (0.0329) -0.0156 (0.0209) 0.410*** (0.154) 0.00841 (0.0603) -0.0497 (0.0329) 0.135 (0.265) 0.0728 (0.0501) -0.00736 (0.0338) 0.601** (0.242) -0.0165 (0.0372) 0.0313 (0.0227) 0.212 (0.163) 0.0169 (0.0303) -0.00151 (0.0179) 0.251* (0.137) 0.0224 (0.0428) -0.0288 (0.0266) -0.0862 (0.222) -0.00279 (0.00245) 5.71e-05 (4.37e-05) 0.000594 (0.00260) -1.45e-05 (4.81e-05) -0.00166 (0.00381) 3.11e-05 (6.73e-05) 0.000904 (0.00415) 1.62e-05 (7.74e-05) 0.00640* (0.00364) -0.000111* (6.31e-05) 0.000567 (0.00402) -3.13e-05 (7.51e-05) 0.00265 (0.00250) -5.23e-05 (4.42e-05) 0.00614* (0.00315) -0.000115** (5.78e-05) -0.00153 (0.00190) 1.87e-05 (3.45e-05) 0.000256 (0.00212) -3.67e-06 (3.92e-05) -0.000753 (0.00310) 2.73e-05 (5.74e-05) 0.00667* (0.00379) -0.000139* (7.27e-05) 1,664 554 831 291 701 270 1,528 492 2,244 674 932 350 Standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 All estimations control for the producer price of all 6 crops (averaged by enumeration area), for round, and for weeks and weeks squared by region Table A4: Explaining Household Consumption Ratio by Heckman-Adjusted Probit Household Production Maize (log kg) Sorghum/Millet (log kg) Sweet Potato (log kg) Cassava (log kg) Beans (log kg) Groundnuts (log kg) House Characteristics Household Size Non-farm income (dollars) Bike (dummy) Geography/Time Distance to City (km) (Distance to City)2 Distance to Road (km) (Distance to Road)2 (Dist to City)*Weeks (Dist to City)*Weeks2 (Dist to Road)*Weeks (Dist to Road)*Weeks2 Observations (1) Maize (2) Sorghum/Millet (3) Sweet Potato (4) Cassava (5) Beans (6) Groundnuts 0.0891*** (0.0174) 0.0257*** (0.00941) 0.00136 (0.00756) 0.0481*** (0.00671) -0.0341*** (0.00921) 0.0346*** (0.00937) 0.0402*** (0.00954) 0.0187 (0.0161) 0.0266*** (0.00970) 0.0115 (0.00847) 0.0363*** (0.0119) 0.0281** (0.0121) 0.0302** (0.0144) 0.00920 (0.0144) 0.119** (0.0594) 0.0187* (0.0110) 0.0597** (0.0298) 0.0174 (0.0143) 0.0426*** (0.00751) 0.00503 (0.00955) 0.0322*** (0.00824) 0.115*** (0.00666) 0.0129 (0.0127) 0.0789*** (0.00936) 0.0214*** (0.00676) -0.00736 (0.00773) 0.000600 (0.00673) 0.00469 (0.00594) 0.112*** (0.00831) 0.00670 (0.00847) 0.0126 (0.0157) 0.0397*** (0.0119) 0.00409 (0.0106) 0.0214* (0.0116) -0.00249 (0.0138) 0.144 (0.0893) 0.000480 (0.00732) -3.56e-05*** (9.10e-06) 0.218*** (0.0464) -0.0237*** (0.00920) -4.39e-05*** (1.18e-05) 0.0945* (0.0539) 0.0378*** (0.0114) -2.58e-05** (1.30e-05) 0.364*** (0.0650) -0.00296 (0.00747) -1.50e-05 (9.42e-06) 0.305*** (0.0406) -0.0108 (0.00668) -3.39e-05*** (8.01e-06) -0.0135 (0.0377) -0.00795 (0.0101) -8.47e-06 (1.07e-05) 0.0693 (0.112) 0.0143 (0.0119) -4.46e-07 (0.000211) 0.0363*** (0.0121) -0.00159*** (0.000297) -0.00181** (0.000727) 3.41e-05** (1.33e-05) 0.00123 (0.000821) -1.97e-05 (1.52e-05) 0.00998 (0.0150) 0.000140 (0.000265) 0.0185 (0.0165) -0.000859** (0.000406) -0.00105 (0.000970) 1.73e-05 (1.81e-05) 0.00135 (0.00106) -2.07e-05 (1.96e-05) -0.0102 (0.0169) -0.000323 (0.000342) 0.0408** (0.0205) -0.00127* (0.000683) 0.00189 (0.00124) -3.26e-05 (2.16e-05) -0.000577 (0.00119) 1.42e-05 (2.32e-05) -0.0187* (0.0111) 0.000495*** (0.000190) 0.0523*** (0.0125) -0.00126*** (0.000309) 0.000214 (0.000696) -5.31e-06 (1.28e-05) -0.00120 (0.000790) 2.02e-05 (1.45e-05) 0.0286*** (0.00997) -0.000556*** (0.000174) 0.0144 (0.0105) -0.000403* (0.000241) -0.000473 (0.000613) 7.00e-06 (1.14e-05) 6.96e-05 (0.000716) 1.31e-06 (1.31e-05) 0.0414*** (0.0141) -0.000648** (0.000256) 0.000843 (0.0159) -0.000785** (0.000381) -0.00134 (0.000931) 2.30e-05 (1.76e-05) 0.00254** (0.00106) -4.88e-05** (1.99e-05) 6,135 6,135 6,135 6,135 6,135 6,135 Standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 All estimations control for the producer price of all 6 crops (averaged by enumeration area), for round, for weeks and weeks squared by region, and for district dummies
© Copyright 2026 Paperzz