Distributional Effect of International Trade and Comparative Advantage in Labor Markets∗ Rodrigo Adão MIT September 20, 2015 Abstract This paper investigates the distributional consequences of international trade shocks in the context of the Brazilian labor market. First, I document a new set of facts about how differential exposure to commodity price shocks across educational groups and regions leads to differential outcomes in terms of sectoral employment and wages. Second, I show that such facts are qualitatively consistent with a two-sector Roy model where worker heterogeneity regarding comparative and absolute advantage in sector-specific tasks determines the structure of employment and wages. Third, I establish that the schedules of comparative and absolute advantage are nonparametrically identified from cross-regional variation in sectoral responses of employment and wages induced by sector demand shocks. Lastly, I build on this result to structurally estimate the model in the sample of Brazilian local labor markets. My structural estimates indicate that a 10% decrease in commodity prices causes counterfactual increases of 1.2% in the skill wage premium and of 5% in wage dispersion. ∗ Author contact information: [email protected]. I am extremely grateful to Daron Acemoglu, Arnaud Costinot and Dave Donaldson for invaluable guidance and support. I also thank Pol Antras, David Autor, Arthur Braganca, Ariel Burstein, Dejanir Silva as well as seminar participant at the MIT Labor Lunch and the MIT Macro-International Lunch. All errors are my own. 1 1 Introduction In an integrated world economy, shocks in a particular country have the potential to exert different effects across different workers within another country located on the other side of the globe. For example, if China implements a large-scale plan of infra-structure expansion, what is the effect on workers employed in the mining industry relative to those in the auto industry within the U.S.? Alternatively, if Saudi Arabia aggressively reduces oil production, what is the effect on workers residing in oil-rich regions relative to those in oil-poor regions within Brazil? More generally, if such shocks affect the relative world price of various products, which workers stand to gain or lose within a country? And how large are these distributional effects? Such distributional concerns would not be relevant if homogeneous workers were able to perfectly switch their sector of employment, their region of residence, and their level of skill. However, this is unlikely to be true. Consider how the negative shock to the world oil supply caused by Saudi Arabia’s policy affects different workers in Brazil. In order to expand production in response to higher oil prices, firms compete for the residents of oil-rich coastal regions with the necessary skills to perform the daily tasks in the offshore drilling industry, pushing their wages upwards. In contrast, workers without these skills and those unable to move to the oil-rich areas are not directly affected by the higher labor demand in the Brazilian oil industry. This simple example suggests that heterogeneous workers are differentially exposed to international trade shocks due to their limited ability to respond to changes in sectoral labor demand. In this article, I propose a unified empirical and theoretical framework to investigate how differential exposure to sectoral shocks across educational groups and regional markets leads to differential outcomes in terms of employment and wages in the context of the Brazilian labor market. The first contribution of the paper is to provide a new set of empirical facts that connect changes in the international price of agriculture and mining commodities to changes in the Brazilian structure of wages and employment between 1980 and 2010.1 First, I establish that aggregate movements in Brazilian wage dispersion are negatively related to the evolution of international commodity prices with the correlation being mainly driven by changes in wage differentials associated with workers’ level of education, sector of employment and region of residence. Second, I explore the variation in the importance of different commodity-oriented industries in employment across educational groups and regional markets to establish a causal relation between commodity prices and labor market outcomes. In each region, I measure exposure to international commodity price shocks for two worker groups: High-School Graduates (HSG) and High-School Dropouts (HSD). The exposure of a groupregion pair is defined as the interaction between the change in commodities’ world prices and the initial participation of corresponding commodities in the group’s labor payroll in the region. Armed with this measure, I document that regional economies more exposed to positive price shocks experienced stronger employment expansions in the commodity sector among both HSG and HSD. This employment expansion is associated with mixed responses in the commodity sector wage differential: it increases for HSG, but remains stable for HSD. Lastly, higher exposure to positive shocks is 1 Production of basic products constitutes an important part of the Brazilian economy. In 2010, agriculture and mining industries represented 58.5% of total exports and 19.9% of total employment. 2 related to stronger growth in the group’s average wage which, in turn, translates into a reduction in the HSG-HSD wage premium because of the relatively higher employment share in the commodity sector among Brazilian HSD. In order to explain this rich response pattern in labor market outcomes across groups and regions, I propose a Two-Sector Roy Economy as in Heckman and Honore (1990). Within each group and region, individuals are heterogeneous with respect to a skill bundle that, ultimately, determines their efficiency in the sector-specific tasks demanded by the commodity and the non-commodity sectors. Conditional on the price of these sector-specific tasks, the heterogeneity in sector efficiency translates into heterogeneity in potential sector wages, generating endogenous selection of an individual to the sector yielding her the highest labor income. To be more precise, the sector employment decision is intrinsically related to the worker’s comparative advantage defined as the worker’s efficiency in the commodity sector task relative to her efficiency in the non-commodity sector task. Whenever the worker’s comparative advantage is higher than the relative task price offered in the non-commodity sector, the worker decides to be employed in the commodity sector. Alternatively, the worker’s wage, given her employment decision, depends also on the worker’s absolute advantage defined as her efficiency in the task specific to the non-commodity sector. In this environment, an individual’s exposure to sectoral demand shocks is centrally dependent on her level of comparative advantage. To see this, consider a partial equilibrium comparative statics exercise in which a demand shock increases the task price in the commodity sector while not affecting the task price in the non-commodity sector. The shock increases the wage of all commodity sector employees, but it only affects the wage of those non-commodity sector employees that decide to switch into the commodity sector. These sector-switchers are the non-commodity sector employees whose level of comparative advantage is similar to the pre-shock ratio of sector-specific task price. Accordingly, the magnitude of the change in sector employment is determined by the amount of such workers in the economy as controlled by the comparative advantage distribution. To the extent that sector-switchers are distinct from sector-stayers in terms of sector-specific efficiency, the change in employment composition affects the average efficiency in the sector with the potential to attenuate or reinforce the direct effect of the task price change. The magnitude of this compositional effect is directly related to the difference between the average absolute advantage of sector-switchers and sector-stayers. In a group and region, the average wage response combines the gains of workers employed in the two sectors, depending on both the sector employment composition and the comparative advantage distribution. In line with this discussion, I show that two functions are sufficient to evaluate the impact of sectoral shocks on the average and variance of the log-wage distribution in a group and region. Specifically, (i) the distribution of comparative advantage; and (ii) the average absolute advantage conditional on the level of comparative advantage. The second contribution of this paper is to provide a novel result that establishes the nonparametric identification of such functions using cross-regional variation in sector labor demand. For each worker group, I assume that regions are segmented labor markets with parallel schedules of comparative and absolute advantage; yet, I allow regions to have different skill shifters to accommodate regional variation in the level of sector-specific task supply. Under this assumption, the schedules of comparative and absolute advantage are nonparametrically identified, respectively, from responses of sector employment and average sector wage to changes in 3 sector-specific task prices induced by exogenous shocks to sector labor demand in the cross-section of regions. The identification result is valid for an arbitrary number of observable worker groups and it imposes no parametric restrictions on the skill distribution. This nonparametric identification result constitutes a mapping between population moments and the unknown schedules of comparative and absolute advantage. Its main importance relies on informing the source of variation in the data that uncovers the economic relation of interest without additional restrictions than those implied by the theory.2 In practice, limited availability of exogenous variation in the data imposes the need of auxiliary functional form assumptions. With this in mind, the empirical application of the model builds on a parsimonious log-linear system with two structural parameters that capture separately dispersion in comparative and absolute advantage. The proposed system represents a strict generalization of the system implied by a skill distribution of the Frechet family that entails a single parameter to control both comparative and absolute advantage.3 The combination of the nonparametric identification result and the auxiliary functional form assumption leads to an empirical strategy to estimate the structural parameters of comparative and absolute advantage using Generalized Method of Moments (GMM). As in the reduced-form analysis, I consider HSG and HSD that are differentially exposed to international commodity price shocks across Brazilian regional labor markets. The implementation of such strategy requires measurement of sector-specific task prices for all groups and regions.4 An additional contribution of this paper is to provide a new methodology to estimate task prices based on the Roy model’s predicted relation between wage growth and initial sector employment across quantiles of the wage distribution for each triple of group-region-period. This methodology can be easily implemented as a first-step regression using repeated cross-section data on wage and employment at the individual level. The structural estimation delivers two main insights. First, between-sector worker mobility is low with the elasticity of relative sector employment to relative sector task price estimated to be around one for both HSG and HSD. Second, the pattern of selection is qualitatively different in the two groups. For HSD, an expansion in sector employment reduces average efficiency in both sectors, generating the stability in sector wage premium documented in the reduced-form analysis. However, a sector employment expansion among HSG is associated with a decrease in average efficiency in the commodity sector and an increase in the non-commodity, creating the strong responses in the sector wage premium obtained in the reduced-form regressions. I conclude the paper by applying the framework to answer one counterfactual question: “If commodity prices fall by 10%, how would Brazilian wage inequality be affected?" To answer this question, I combine the estimated schedules of comparative and absolute advantage with the sectoral allocation of workers across regions and groups in 2010. Since employment in the commodity sector is higher among HSD residing in poor regions, the model predicts an increase in wage inequality following the 2 As noted by Matzkin (2007), such identification results represent an important first step in the empirical analysis of almost any model as the credibility of empirical results would be significantly hindered if identification could only be achieved under restrictive parametric assumptions. 3 The Frechet distribution has been the basis of numerous recent applications of the Roy Model — see e.g., Hsieh, Hurst, Jones, and Klenow (2013); Burstein, Morales, and Vogel (2015); and Galle, Rodriguez-Clare, and Yi (2015). 4 Sector-specific task prices are not immediately available in survey datasets that only contain information on individual labor income. In the model, observable labor income corresponds to the product of the unit task price and the individual’s endowment of efficiency units. 4 shock. Precisely, the fall in commodity prices triggers a 1.2% increase in the HSG-HSD wage premium that is accompanied by a 5% increase in the between-component of log-wage variance in the country. The rest of the paper is organized as follows. Section 2 reviews the related literature. Section 3 presents the empirical facts about the adjustment of the Brazilian labor market following shocks to the international price of agriculture and mining commodities. Section 4 presents the Two-Sector Roy Economy and its implications for the equilibrium structure of employment and wages. Section 5 establishes the nonparametric identification of comparative and absolute advantage in a sample of segmented labor markets. Section 6 implements the identification strategy for High-School Graduates and High-School Dropouts differentially exposed to commodity price shocks across Brazilian regions. Section 7 presents the counterfactual exercise that quantifies the effect of changes in commodity prices on changes in Brazilian wage inequality. Section 8 offers some concluding remarks. 2 Related Literature Research on the distributive effects of international trade has traditionally built upon the stark predictions regarding the changes in relative wages across worker groups provided by neoclassical models of trade in Stolper and Samuelson (1941) and Jones (1975). Yet, empirical studies have failed to provide support for the predictions of these canonical models. For instance, a number of authors have documented (i) movements in wage inequality correlated in both developed and developing countries (Goldberg and Pavcnik, 2007), (ii) movements in the skill wage premium uncorrelated with changes in the relative price of skill intensive products (Lawrence and Slaughter, 1993) while correlated with changes in the skill intensity of production within industries (Bekman, Bound, and Machin, 1998), (iii) and diminished between-sector responses in employment and wages following trade shocks (Wacziarg and Wallack, 2004; and Goldberg and Pavcnik, 2007). Such evidence gave rise to a consensus that international trade shocks were, at best, secondary drivers of the changes in wage inequality between the early-1980s and the mid-1990s. However, the last decade has witnessed a renewed interest in the distributive effects of trade shocks motivated by, on the one hand, recent changes in the wage distribution within several countries and, on the other hand, the integration into the world economy of China and other developing countries.5 Due to the disconnection between empirical evidence and predictions of neoclassical models, this emerging literature has expanded the scope of analysis by examining broader dimensions of workers’ exposure to international trade shocks. The literature has evolved on several directions. More closely related to this paper is the recent reduced-form literature that establishes the impact on labor market outcomes of heterogeneous exposure to import competition in terms of sector of employment (Autor, Dorn, Hanson, and Song, 2014), level of education (Dix-Carneiro and Kovak, 2015b), and region of residence (Topalova, 2010; Kovak, 2013, Autor, Dorn, and Hanson, 2013; and Costa, Garred, and Pessoa, 2014). In this paper, I build on their reduced-form strategy to provide new evi5 Acemoglu and Autor (2011) document a rich transformation pattern in the wage distribution of the U.S. and other OECD countries. As noted by Hanson (2012), low- and middle-income countries experienced not only an increase in international trade volumes but also pronounced changes in their profile of traded goods. In fact, emerging countries increased their participation in world trade from 13.5% in 1990 to 29.6% in 2012. 5 dence regarding the effect of differential exposure to commodity price shocks on sectoral employment and wages across educational groups and regional labor markets. This reduced-form evidence is then combined with the structure of the Two-Sector Roy Economy to provide a methodology to quantitatively investigate the impact of trade shocks on the wage distribution. Following the explosion of firm-level work in the field of international trade, another strand of the literature has investigated the effect of international trade shocks on workers employed in different firms within-industries. These papers focus on how market integrations affects the within-industry matching of workers to firms and the subsequent implications for wage inequality (see e.g. Verhoogen, 2008; Helpman, Itskhoki, and Redding, 2010; Frias, Kaplan, and Verhoogen, 2012; Helpman, Itskhoki, Muendler, and Redding, 2015; and Burstein and Vogel, 2015). Another approach has been to investigate dynamic effects of international trade shocks, paying particular attention to the welfare costs implied by the transitional dynamics in the reallocation of workers across sectors (Kambourov, 2009; Artuç, Chaudhuri, and McLaren, 2010; Dix-Carneiro, 2014; and Dix-Carneiro and Kovak, 2015a) and, more recently, regions (Caliendo, Dvorkin, and Parro, 2015). This paper is closely related to the growing body of literature connecting changes in the economic environment to self-selection of heterogeneous workers. On the theory side, recent papers demonstrate the potential of the Roy model to generate a rich response pattern in wage inequality — see Ohnsorge and Trefler (2007), Costinot and Vogel (2010), Acemoglu and Autor (2011), and, for a comprehensive review, see Costinot and Vogel (2014). On the empirical side, the Roy model has been applied to investigate the consequences of selection to aggregate productivity and output (Lagakos and Waugh, 2013; Hsieh, Hurst, Jones, and Klenow, 2013; and Young, 2014). Closer in spirit, two papers impose a Frechet distribution of worker skills to quantify the portion of changes in between-group wage inequality associated with technological changes in the USA (Burstein, Morales, and Vogel, 2015) and import competition in Germany (Galle, Rodriguez-Clare, and Yi, 2015). I complement these recent quantitative applications by providing a characterization of wage inequality responses in a generic Two-Sector Roy Model where the skill distribution is completely unrestricted. The generality of the model sheds light on the different roles played by comparative and absolute advantage in determining the evolution of labor market outcomes both within- and between-groups. This increases the ability of the model to capture features of the data that are meaningful in the study of distributional effects of trade shocks.6 Lastly, this paper is related to the large literature on selection models in labor markets. Inspired by the seminal paper of Roy (1951), this literature investigates the consequences of self-selection based on unobservable characteristics to observable components of labor income. Evidence in favor of selection driven by multivariate skills is provided by Gibbons and Katz (1992), Neal (1995), Parent (2000), and Gibbons, Katz, Lemieux, and Parent (2005). In order to establish the Roy model’s identification in a single market, the most traditional approach was to impose parametric restrictions on the skill distribution as in Heckman and Sedlacek (1985). Yet, this approach relies heavily on the functional form imposed on the skill distribution since Heckman and Honore (1990) show that the Roy model cannot 6 Although remaining tractable in the multiple sector setting, the Frechet distribution has features that are unappealing in labor market analysis. Not only it requires a high degree of between-sector mobility but also it is unable to generate responses in between-sector wage differentials. In contrast, these restrictive properties are not implied by the log-linear system used in the empirical application of this paper. 6 be nonparametrically identified in a single cross-section of individuals. Following this negative result, they propose two main alternatives. The first assumes observable shifters in the location of the skill distribution for individuals in the same market; the so called identification at infinity reviewed in French and Taber (2011). The second relies on the variation of skill prices across markets with an identical skill distribution. In its majority, the selection literature builds on the first approach to identify hedonic wage regressions (e.g., Mulligan and Rubinstein, 2008); alternatively, this paper builds upon the second approach to investigate distributional consequences of sectoral labor demand shocks. To this end, I provide a nonparametric identification result that complements the elegant result of Heckman and Honore (1990). By allowing for regional skill shifters, I introduce task supply shocks that affect sector employment and sector wages across markets while endogenously correlated with sectorspecific task prices in equilibrium. For this reason, identification requires an instrumental variable that generates exogenous variation in sector-specific task prices. Accordingly, the assumption in this paper is weaker than the existence of a single skill distributions in all market assumed by Heckman and Honore (1990) to establish their nonparametric identification result. In this sense, I provide a generalization of Heckman and Honore’s (1990) result when regions are subject to parallel shocks in the skill distribution. 3 3.1 International Commodity Prices and the Brazilian Labor Market Data and Background Brazil, as primarily an exporter of basic commodities, is particularly exposed to shocks in the world price of agricultural and mining goods. Such exposure pattern is captured by the relative price of six commodity categories: Grains, Livestock, Soft Agriculture, Metals, Precious Metals, and Energy. Together, these commodity groups account for a large share of Brazilian exports, representing 53.8% of total exports in 1980 and 60.4% in 2012. As described in Appendix A, I construct price indices for each category using data on commodity transactions in the main exchange markets of the United States between 1980 and 2012. To replicate relative prices faced by producers in Brazil, international commodity prices are converted to Brazilian currency and deflated by the Brazilian consumer price index. The analysis of labor market outcomes rely on wage and employment data from the Brazilian Census performed by the Brazilian Institute of Geography and Statistics (IBGE) for 1980, 1991, 2000 and 2010. To focus on individuals with strong labor force attachment mainly affected by marketoriented forces, I consider a benchmark sample of full-time white male employed individuals aged 16-64.7 Individuals are divided into two observable groups: High-School Dropouts (HSD) and HighSchool Graduates (HSG). These two groups reflect the relevant margin of education in Brazil; among male white workers, the high-school graduation rate was 16.0% in 1980 and 51.4% in 2010. 7 I restrict the benchmark sample to include only white and male individuals because of the strong declines in gender and race wage differential between 1995 and 2010; see e.g. Ferreira, Firpo, and Messina (2014). The model presented below is not intended to speak directly to the behavior of these components of the wage structure and, therefore, I exclude their behavior from the analysis in this section. Appendix B provides robustness exercises including female and non-white individuals, indicating that all results are essentially the same when these additional worker groups are considered. 7 Figure 1: International Commodity Prices and Wage Dispersion in Brazil, 1980-2010 1.2 2.5 1.0 2.0 0.8 Variance of log wages Log of deflated international commodity prices (BRL) International Relative Commodity Prices, 1980-2012 3.0 1.5 1.0 0.5 Wage Dispersion in Brazil, 1980-2010 0.6 0.4 0.2 0.0 0.0 1980 1985 1990 1995 2000 Commodity Price Index Grains (corn, soybeans, wheat) Soft (coffee, cacoa, sugar, and others) Livestock (cattle, hogs, and others) 2005 2010 1980 1991 Residual Energy (crude oil) Metals (copper, lead, steel, tin, zinc) Precious Metals (gold and silver) 2000 2010 Between (education, sector, region, experience) Note. Left panel: six-month moving average of log international product price converted to Brazilian currency and deflated by the Brazilian consumer price index (Sept-1994 = 1). Right panel: variance of log wages among male white full-time workers in the Brazilian Census of 1980, 1991, 2000 and 2010. Wage decomposition implied by the regression of log real wage on a full set of dummies for high school graduation, employment in the commodity sector, and microregion of residence, and years of experience (0-39yrs). Details in Appendix A. Throughout the analysis, I classify individuals into a sector and a regional labor market. According to their self-reported industry of employment, I map workers either to the commodity sector or to the non-commodity sector. Industries specialized in the production of agricultural and mining products are included in the commodity sector, covering 13.9% of the national labor income in 1980 and 12.9% in 2010. All manufacturing and service industries are included in the non-commodity sector. In addition, I allocate individuals to a regional labor market based on their microregion of residence. Between 1991 and 2010, the Census contains a balanced panel of 558 microregions where each microregion corresponds to a set of economically integrated municipalities. In the 1980 Census, only a subset of these microregions can be constructed due to changes in administrative boundaries of municipalities and states between 1980 and 1991. Appendix A discusses details on the construction and measurement of labor market outcomes.8 Let us now turn to the aggregate trends in international commodity prices and Brazilian wage dispersion presented respectively on the left and right panels of Figure 1. Between 1980 and 2012, the evolution of relative commodity prices is U-shaped: the strong price decline of the 1980s was followed by a recovery of equivalent magnitude from the late 1990s until 2012. Notice that, while the price drop was homogeneous across commodity categories, the price recovery exhibited great betweencategory heterogeneity. In opposite direction, wage dispersion, after increasing in the 1980s, began a compression process in the 1990s that accelerated in the 2000s. This movement in wage dispersion can be decomposed into observable and residual components by regressing log wages on a full set of dummies for years of experience, level of education, sector of employment and microregion of residence. The decomposition indicates that observable worker attributes account for a sizeable part of the wage dispersion movement — i.e., 44.5% of the change in 1980-1991 and 78.1% in 1991-2010. 8 Table A1 presents the allocation of industries to sectors in each year of the Census. Also, Appendix A describes both the construction of microregions in the 1980 Census and the consequences of the changing municipality boundaries to the panel of microregions across years. 8 Figure 2: Components of Wage Dispersion in Brazil, 1980-2010 Return to Observable Worker Characteristics 1.2 0.4 1.0 Wage Differentials Variance of log wages Decomposition of Between-Component of Wage Dispersion 0.5 0.3 0.2 0.8 0.6 0.4 0.1 0.2 0.0 1980 HSG Wage Premium 1991 Region Wage Premium 2000 Sector Wage Premium 1980 2010 HSG Wage Premium Covariance 1991 2000 P90/P10 Region Wage Premium 2010 Sector Wage Premium Note. Left panel: decomposition of the between-component of the log-wage variance associated with the set of dummies for high school graduation, employment in the commodity sector, and microregion of residence. Right panel: estimated return to observable worker attributes implied by the log-wage regression. The dispersion in the 558 regional wage differentials is represented by the difference between 90th and 10th percentiles of the estimated regional fixed effects. To investigate further the negative aggregate correlation between commodity prices and the betweencomponent of wage dispersion, Figure 2 presents the full decomposition of the portion of log-wage variance associated with worker’s affiliation in terms of education-sector-region. The left panel indicates that the substantial contribution of this component to the 1991-2010 decline in wage dispersion is distributed across all its terms: 34.0% is related to the education dummy, 10.7% to the sector dummy, 23.9% to regional dummies, and 31.4% to the covariance of these terms. Importantly, the right panel shows that this fall in log-wage variance was driven by strong reductions in the differential return to observable worker attributes. This discussion can be summarized as follows. Fact 1. Between 1980 and 2010, international commodity prices and Brazilian wage dispersion are negatively correlated. A large fraction of the change in wage dispersion in the period is related to observable worker attributes with substantial movements in wage differentials associated with the worker’s level of education, sector of employment and region of residence. 3.2 Exposure to Commodity Price Shocks and Labor Market Outcomes The similarity between the aggregate trends in wage dispersion and commodity prices is suggestive of their interconnection. Yet, this correlation is potentially driven by confounding shocks afflicting the Brazilian economy in the period. This section addresses the causal effect of commodity prices on labor outcomes separately for High-School Dropouts and High-School Graduates. To this extent, I treat microregions as local subeconomies subject to differential price shocks according to their initial profiles of industry specialization. By comparing worker groups in regions differentially exposed to commodity price shocks, the reduced-form exercise sheds light on the adjustment pattern of wages and employment following sectoral labor demand shifts. In the exercise, I consider a balanced panel 9 of 518 microregions in the years of 1991, 2000 and 2010.9 3.2.1 Heterogeneity in Exposure to Commodity Price Shocks To capture heterogeneous exposure to fluctuations in international commodity prices, I construct a Bartik-Instrument for each microregion and group. Specifically, the shock exposure corresponds to the interaction between the commodity participation in the group-microregion wage bill on the initial year and the change in the international commodity price on the period. Precisely, the exposure of group g in microregion r to the commodity price shock is given by ∆Zg,r,t = ∑ φg,r ( j) · ∆ ln pt ( j) (1) j∈J C where ∆ ln pt ( j) is the log-change in the international price of product j between years t − 1 and t; and φg,r ( j) is the share of industry j in total labor payments of the commodity sector to individuals of group g in microregion r on the initial year of 1991. Intuitively, expression (1) entails a stronger response in the regional demand for worker groups specialized in the production of commodities experiencing stronger international price gains.10 Consequently, ∆Zg,r,t embeds two dimensions of exposure heterogeneity. First, cross-regional variation in initial industry composition creates exposure heterogeneity for individuals of the same group that reside in different microregions. Second, cross-industry variation in skill intensity generates exposure heterogeneity between groups in the same region. As shown in Table A4, these two variation sources are important in Brazil, giving rise to great exposure heterogeneity across both regions and groups. Figure 3 exhibits the pattern of spatial exposure to the commodity price shock among microregions for HSG (left panel) and HSD (right panel). Notice that shock exposure differs significantly for the two groups with a cross-region correlation of only .493. Moreover, Figure 3 indicates that HSG tend to have lower shock exposure than HSD — specifically, the median exposure is 4.7 log-points for HSG and 6.3 log-points for HSD. 9I restrict the sample to only include those microregions with positive employment in the commodity sector for all years and groups. As a result, the final sample contains 518 microregions that represented 98.4% of the country’s population on 1991. As discussed in Appendix A, there are two reasons to exclude the 1980-1991 period from the baseline sample. First, changing municipality borders only allows the construction of a subset of the microregions included in the 1991-2010 Censuses. Second, the 1980-1991 period was marked by severe economic turbulences in Brazil that have the potential to hinder the connection between international and domestic relative commodity prices — these events include hyperinflationary sprouts, suspension of foreign currency convertibility, and the adoption of restrictive internal controls on prices and wages. Nevertheless, Appendix B attests that similar results are obtained with a restricted sample of microregions spanning the entire 1980-2010 period. 10 The initial industry composition is indicative, as in Costinot (2009), of the region’s comparative advantage in production, reflecting local availability of natural resources like soil fertility and oil reserves. Accordingly, the intuition behind the labor demand response in expression (1) follows directly from the comparative static exercises in Costinot and Vogel (2010, 2014). Alternatively, Kovak (2013) connects production specialization to comparative advantage implied by the local endowment of industry-specific factors, deriving an expression similar to (1) for the response of regional labor demand to small price shocks. Recently, reduced-form strategies based on related measures of local exposure to trade shocks have become popular in the literature; see e.g. Topalova (2010), Kovak (2013), and Autor, Dorn, and Hanson (2013). 10 Figure 3: Heterogeneity in Exposure to Commodity Price Shock, 1991-2010 3.2.2 Reduced-Form Evidence In order to investigate the effect of exposure to the commodity price shock on labor market outcomes, I consider the following reduced-form specification: ∆Yg,r,t = β g · ∆Zg,r,t + Xg,r,t γg + v g,r,t (2) where ∆Yg,r,t is the change in a labor market outcome for individuals of group g in microregion r between years t − 1 and t; and Xg,r,t is a control vector of group-microregion characteristics potentially correlated with the exposure measure. In the baseline specification, the control vector includes macroregion-period dummies, the initial share of group labor income in the commodity sector, and the initial share of workers earnings at most the federal minimum wage. Also, microregions are weighted by their share in the national population of 1991 and standard errors are clustered by microregion to account for serially correlated errors. Conditional on the initial share of group labor income in the commodity sector, specification (2) relies exclusively on the variation in relative product prices within the commodity sector. Accordingly, the causal interpretation of model (2) requires that shocks in Brazilian regions are not large enough to affect the world price of basic commodities. A requirement especially plausible given the strong growth in Chinese imports of agriculture and mining products during the period; arguably, an exoge- 11 nous demand shock to relative prices of raw materials.11 Sectoral Wages and Employment. Columns (1)-(3) of Table 1 estimate equation (2) with the dependent variable being the commodity sector employment share of HSG in Panel A and HSD in Panel B. The positive significant coefficients in column (1) indicate that, for both HSG and HSD, higher commodity prices induce workers to reallocate from the non-commodity to the commodity sector. However, estimates suggest very limited between-sector mobility: a 10% increase in commodity prices cause the commodity sector employment share to increase by .31 p.p. for HSG and .77 p.p. for HSD. Compared to a region in the 10th percentile of shock exposure, these estimates imply that a region in the 90th percentile had a differential commodity sector expansion of 0.6 p.p. for HSG and 1.0 p.p. for HSD. To test robustness and potentially eliminate confounding effects, I augment the model with period dummies interacted with a set of initial labor market conditions in column (2) and a quadratic polynomial of the initial commodity sector size in column (3). These controls represent period-specific effects projected on initial region characteristics, capturing for example effects related to the introduction of cash transfer programs and secular differences in sector productivity growth. Although these additional controls absorb a large part of the cross-section variation in sector employment change, they only strengthen estimated coefficients. Turning to the impact of commodity prices on sectoral wages, columns (4)-(6) estimate model (2) with the commodity sector average wage premium as dependent variable. In column (4), the two groups exhibit different qualitative responses. The price shock triggers a significant positive response Table 1: Exposure to Commodity Price Shocks and Sector Employment and Wages Dependent Variable: Change in Commodity Sector Employment Share Change in Commodity Sector Average Log Wage Premium (1) (2) (3) (4) (5) (6) 0.031** (0.009) 0.038** (0.010) 0.038** (0.010) 0.298** (0.083) 0.437** (0.095) 0.416** (0.098) 0.283 0.343 0.405 0.143 0.198 0.220 Commodity price shock 0.077* (0.031) 0.101** (0.029) 0.099** (0.029) -0.110 (0.144) 0.061 (0.161) 0.059 (0.161) R2 0.508 1036 0.557 1036 0.559 1036 0.160 1036 0.218 1036 0.218 1036 Initial labor market conditions x period dummies No Yes Yes No Yes Yes Initial commodity sector size controls x period dummies No No Yes No No Yes A. High-School Graduates Commodity price shock R2 B. High-School Dropouts Baseline Controls Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regressions are weighted by the microregion share in national population on 1991. All models include ten macroregion-period dummies and the following baseline controls on 1991: share of group labor income in the commodity sector and share of workers earnings at most the federal minimum wage. Labor market conditions: cubic of per-capita income, quadratic of share of white individuals, share of employed individuals, share of individuals in urban areas, share of workers earnings at most the federal minimum wage, share of social security dependents (only HSD). Commodity sector size controls: quadratic polynomial of commodity sector share in group labor income. Standard Errors clustered by microregion. ** p<0.01, * p<0.05 11 Between 1992 and 2010, the average annual growth rate of Chinese imports was 17.2% for all products, 16.2% for Agriculture, and 28.3% for Mining. Over the period, Hanson (2012) provides a careful discussion of the transformation in the profile of international trade of emerging economies and, in special, China. To the extent that this transformation was mainly driven by internal changes in the production structure of China, this large demand shock represented an exogenous impulse to world commodity prices in the period. In Table B4 of Appendix B, I investigate the sensitivity of results when the exposure to commodity price shocks is instrumented with the exposure to Chinese commodity imports growth. 12 of the commodity sector wage differential for HSG; in contrast, there is a nonsignificant response for HSD. These conclusions are robust to the inclusion of additional controls in columns (5) and (6). In light of the results in Table 1, I state the following conclusion. Fact 2. Exposure to commodity price shocks is positively related to the commodity sector employment of both HSG and HSD. In addition, shock exposure is ambiguously related to the commodity sector wage premium, exhibiting a positive relation for HSG and a nonsignificant relation for HSD. Group Wages. Table 2 investigates the consequences of the commodity price shock to the average log wage of HSG in Panel A and HSD in Panel B. Columns (1)-(3) indicate that exposure to higher commodity prices is associated with higher average wages for the two worker groups. In columns (4)-(6), I access the robustness of this relation to the use of an alternative exposure measure based on the interaction of ∆Zg,r,t with the initial commodity sector size. As in Autor, Dorn, and Hanson (2013), this alternative measure explicitly incorporates differences in the overall sectoral allocation of workers across microregions and groups.12 Again, estimates indicate a positive response of average wages to higher commodity prices. This result suggests that, despite the mixed wage response at the sector level, the commodity price shock has a positive impact on the microregion’s labor demand, putting upward pressure on wages of HSG and HSD. Results in Table 2 imply a sizeable effect of the commodity price shock on the behavior of region Table 2: Exposure to Commodity Price Shocks and Group Wages Change in Average Log Wage Dependent Variable: (1) (2) (3) 0.296** (0.069) 0.283** (0.066) 0.288** (0.064) (4) (5) (6) 0.694* (0.342) 0.510* (0.260) 0.636* (0.292) 0.374 0.415 0.422 0.628** (0.204) 0.557** (0.208) 0.558* (0.221) A. High-School Graduates Commodity price shock Commodity sector size x Commodity price shock R2 0.399 0.433 0.440 0.271** (0.103) 0.280** (0.097) 0.268** (0.099) B. High-School Dropouts Commodity price shock Commodity sector size x Commodity price shock R2 0.555 0.624 0.636 0.552 1036 0.621 1036 0.634 1036 Initial labor market conditions x period dummies No Yes Yes No Yes Yes Initial commodity sector size controls x period dummies No No Yes No No Yes Baseline Controls Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regressions are weighted by the microregion share in national population on 1991 and include ten macroregion-period dummies. Baseline controls as in Table 1. Standard Errors clustered by microregion. ** p<0.01, * p<0.05 12 In this case, exposure to the price shock of each commodity is proportional to its overall participation in the regional wage bill of the worker group. In the model presented below, the response of group average wage is proportional to the overall sector allocation of group workers, being more closely related to the alternative exposure measure. In contrast, the model predicts that sectoral wages and employment within each group respond to sectoral demand shocks, being more closely related to ∆Zg,r,t . In Table B6, I attest that qualitative conclusions of the sector-level analysis remain valid if the alternative exposure measure is considered. 13 and education wage gaps in Brazil. To see this, consider the differential wage growth of a microregion in the 90th percentile of shock exposure relative to another in the 10th percentile. Estimates in column (4) predict that this differential growth is 1.9 p.p. for HSG and 4.2 p.p. for HSD. Higher relative wage gains for HSD follow from their higher exposure to commodity price shocks and, in this case, translate into a 2.2 p.p. reduction in the high-school wage premium. Fact 3. Exposure to commodity price shocks is positively related to the average wage of both HSG and HSD. Estimated coefficients suggest a sizeable impact of commodity prices on educational and regional wage differentials. Sensitivity Analysis. Having established Facts 2 and 3, Appendix B turns to an empirical investigation of the robustness of these results. In particular, I obtain similar conclusions if the baseline specification is extended to include additional sector composition controls, microregion-specific time trends, additional periods and additional worker groups (Tables B1-B3). To address concerns regarding the effect of supply conditions in Brazil on world commodity prices, I also attest that similar results are obtained when the exposure to commodity price shocks is instrumented with the exposure to Chinese commodity imports growth in 1991-2010 (Table B4). Lastly, I show that exposure to commodity price shocks is not related to the change in the labor supply of both native and migrant workers (Table B5). Following an increase in the world price of commodities, Facts 2 and 3 indicate that the adjustment pattern of wages and employment depends on the shock exposure across regions and groups. To be more precise, regional economies more exposed to the price shock experience a stronger employment expansion in the commodity sector for both HSG and HSD. Although shock exposure is related an increase in the average wage at the group level, there are mixed within-group responses in sector average wages. In the next section, I propose a Two-Sector Roy Economy that, following commodity price shocks, generates such adjustment pattern through the self-selection of heterogeneous individuals to sectors. This structural framework delivers a methodology to evaluate the impact of commodity price shocks on the wage distribution, allowing a quantitative investigation of the aggregate relation between the Brazilian wage structure and the international commodity prices emphasized in Fact 1. 4 4.1 Two-Sector Roy Economy Environment Consider a segmented labor market populated by workers of multiple groups, g, that self-select into either the commodity sector (k = C) or the non-commodity sector (k = N). In sector k, production depends on the sum of sector-specific tasks performed by all sector employees of a particular worker group. Following a long tradition in Roy-like models, assume that workers can only be employed in a single sector. Employees in each sector are paid a salary in exchange for each completed unit of the sector-specific task assigned to their group. Denote wkg as the unit price in sector k of the task performed by workers of group g. This notation allows, but does not require, sector-specific task prices to differ 14 across groups.13 Within each group, there is a continuum of heterogeneous individuals indexed by i ∈ I g . Individual i is endowed with a bivariate skill vector, ( TgC (i ), TgN (i )), that determines her effective supply of sector-specific tasks. Specifically, individual i if employed in sector k performs Tgk (i ) units of the task specific to sector k and earns a wage of wkg Tgk (i ). The analysis is simplified by working with a log-linear transformation of earnings so that the potential log-wage of individual i of group g in sector k is given by ω gk + ln Tgk (i ) where ω gk ≡ ln wkg . Assume that individuals choose to be employed in the sector that gives them the highest wage.14 Accordingly, individual i self-selects into sector K g (i ) where n K g (i ) = arg max ω gC + ln TgC (i ); o ω gN + ln TgN (i ) . (3) To characterize the structure of employment and wages in the Two-Sector Roy Economy, let us define individual i’s comparative advantage as s g (i ) ≡ ln( TgC (i )/TgN (i )), and absolute advantage as a g (i ) ≡ ln TgN (i ). In a given group, suppose individuals independently draw their skill vector from a common bivariate distribution such that s g (i ) ∼ Fg (s) and { a g (i )|s g (i ) = s} ∼ Hg ( a|s) (4) where α g (q) ≡ ( Fg )−1 (q) is the quantile function of comparative advantage. The skill distribution in (4) implies that, in a particular quantile q ∈ [0, 1], there is a set of individuals in group g whose level of comparative advantage is α g (q). Among these individuals, there is a conditional distribution of absolute advantage, Hg a|α g (q) , with average and variance respectively denoted by A g (q) ≡ E a g (i )|s g (i ) = α g (q) and Vg (q) ≡ Var a g (i )|s g (i ) = α g (q) . 4.2 Sectoral Wages and Employment: Comparative and Absolute Advantage The most direct implication of the preference structure in (3) is the endogenous sorting of workers to sectors. Due to the heterogeneity in sector-specific task efficiency of workers, this endogenous sorting decision has strong implications for the pattern of employment and wages in the two sectors of the economy. To analyze such pattern, I consider a graphical representation of the economy in terms of quantiles of the comparative advantage distribution. Figure 4 exhibits the average potential log wage in each sector for individuals distributed across different comparative advantage quantiles. If employed in the non-commodity sector, workers in quantile q perform, on average, A g (q) units of the sector-specific task, receiving in return an average potential 13 This particular formulation follows closely the environment in the extensive literature inspired by the seminal work of Roy (1951); see e.g. Heckman and Sedlacek (1985), Heckman and Honore (1990), and, more recently, Ohnsorge and Trefler (2007), Lagakos and Waugh (2013) and Young (2014). In Appendix C, I provide a detailed description of the production and market structures that generate such sector-specific task prices in general equilibrium. 14 Equivalently, one could assume that individuals derive utility solely from the optimal consumption bundle acquired with their labor income. By introducing within-group heterogeneity entirely on sector-specific labor efficiency, the distributive impact of a trade shock is completely captured by the behavior of observable labor income. In this environment, identical homothetic preferences imply that changes in the welfare distribution are directly related to changes in the wage distribution. Notice that this preference structure abstracts from private employment benefits and mobility costs. In Appendix D.3, I explore an extension that incorporates such features into the model. 15 log wage of ȲgN (q) = ω gN + A g (q). Alternatively, if employed in the commodity sector, they perform an average of α g (q) + A g (q) units of the sector-specific task and earn an average potential log wage of ȲgC (q) = ω gC + α g (q) + A g (q). In a particular quantile, the unique source of dispersion in potential sector wages is the dispersion of absolute advantage, Vg (q) — illustrated by the hump-shaped curves in quantile q1 .15 In Figure 4, A g (q) is strictly decreasing in q, implying that individuals in high quantiles of comparative advantage have, on average, lower levels of absolute advantage. However, the skill distribution in (4) does not impose any restriction on the relation between comparative and absolute advantage. Consequently, the behavior of A g (q) is free to change in an arbitrary way across comparative advantage quantiles. Figure 5 illustrates the other extreme case in which A g (q) is strictly increasing in q. As discussed below, these two cases have very distinct implications for the adjustment of sector average wages following sectoral demand shocks. q1 q q 1 1 Figure 4: Case I - A g (q) decreasing. Figure 5: Case II - A g (q) increasing. Sector Employment and Comparative Advantage. The preference structure in (3) implies that individuals self-select into the sector yielding the highest wage. For all individuals in a given quantile of comparative advantage, the sectoral employment choice can be determined from the comparison between the two sectoral curves of average potential wage in Figures 4-5.16 In high quantiles of comparative advantage, the relatively higher efficiency in the commodity sector task yields a relatively higher wage in that sector, implying self-selection into the commodity sector. In contrast, individuals 15 The two potential wage curves exhibit the single-crossing property because, by construction, α g (q) is increasing in q. Hence, they cross at most once. To simplify the analysis, I assume that task prices are such that these curves cross at least once. This is the case if Fg (.) has full support on R so that limq→0 α g (q) = −∞ and limq→1 α g (q) = ∞. In general equilibrium, this is also implied by restrictions on the production technology guaranteeing that, in both sectors, the marginal product of a task unit goes to infinity as the total task quantity goes to zero. 16 To formalize this claim, consider individual i with comparative advantage s (i ) = α ( q ). This worker has a potential g g log wage of ω gN + a g (i ) in the non-commodity sector and ω gC + α g (q) + a g,m (i ) in the commodity sector. For this individual, potential sector wages correspond to vertical shifts of those of a worker with the same level of comparative advantage but a different level of absolute advantage. Consequently, the sectoral choice of individual i with s g (i ) = α g (q) is identical to that of a hypothetical individual i0 with s g (i0 ) = α g (q) and a g (i0 ) = A g (q). 16 in low quantiles of comparative advantage obtain a relatively lower wage in the commodity sector, finding it optimal to self-select into the non-commodity sector. In equilibrium, the sector employment composition is determined by marginal individuals with comparative advantage equal to the relative task price, ω gN − ω gC . These marginal workers have exactly the same potential wage in the two sectors, being indifferent between them. Formally, individuals in quantiles of comparative advantage Q gN ≡ {q : q < l gN } are employed in the non-commodity sector and those in quantiles QCg ≡ {q : q ≥ l gN } are employed in the commodity sector where the employment share in the non-commodity sector, l gN , is determined by the unique intersection of the sectoral schedules of average potential wage, ω gN − ω gC = α g l gN . (5) This discussion highlights the central feature of the Two-Sector Roy Economy: workers are not readily substitutable between sectors. Because of the efficiency differential in sector-specific tasks, workers do not switch sectors to arbitrage the differential in sector-specific task prices in equilibrium. The sector employment composition in (5) reflects this heterogeneity in sector efficiency, depending on the shape of the schedule of comparative advantage, α g (.). Sector Average Wage and Average Sector-Specific Efficiency. The endogenous selection of individuals to sectors implies that the sector average wage embodies only the average labor income of individuals employed in the sector: Ȳgk = E[Ȳgk (q)|q ∈ Qkg ]. As a result, the average log wage in the non-commodity sector is given by ȲgN = ω gN + Ā gN (l gN ) s.t. Ā gN (l ) ≡ 1 l ˆ l A g (q) dq ; (6) 0 and the average log wage in the commodity sector is ȲgC = ω gC + ĀCg (l gN ) s.t. ĀCg (l ) 1 ≡ 1−l ˆ 1 [α g (q) + A g (q)] dq. (7) l In expressions (6)-(7), there are two determinants of the average wage in each sector. First, the sector-specific task price, ω gk , that directly affects the wage of sector employees. Second, the sector employment composition, l gN , that affects the average efficiency in the sector-specific task through the term Ākg l gN . Because of the heterogeneity in average sector-specific efficiency illustrated in Figures 4-5, the range of quantiles allocated to each sector has a direct effect over that sector’s average wage. In other words, the pattern of variation in average sector-specific task efficiency across comparative advantage quantiles —A g (.) in the non-commodity sector and α g (.) + A g (.) in the commodity sector— determines the average efficiency of sector employees summarized in the shape of Ākg (.). 4.3 Average and Variance of Group Log Wages In the Two-Sector Roy Economy, average and variance of the log-wage distribution of workers in group g are determined by their sectoral allocation decision, Qkg , as well as their schedules of comparative and absolute advantage, α g (q) and A g (q). By construction, the average wage in group g is Ȳg = 17 l gN · ȲgN + l gC · ȲgC and, therefore, expressions (6)-(7) imply ˆ Ȳg = where e g ≡ ´1 0 ω gC · l gC + ω gN · l gN 1 + l gN α g (q)dq + e g (8) A g (q)dq. Since individuals are not readily substitutable between sectors, they are solely exposed to the task price in their sector of employment. As a result, the average wage in (8) depends on the sector employment composition as captured by ω gC · l gC + ω gN · l gN . The other two terms reflect the average efficiency of workers in the two sector-specific tasks. While the term e g captures the average absolute advantage ´1 that affects efficiency in both sectors, the term l N α g (q)dq captures the comparative advantage that g affects only the efficiency of commodity sector employees. Among individuals of group g, there are two sources of wage dispersion: the between-sector average wage differential and within-sector wage dispersion. To be more precise, the law of total variance implies that 2 Vg = l gN l gC · ȲgC − ȲgN + l gN · VgN + l gC · VgC (9) where Vgk corresponds to the log-wage variance of individuals employed in sector k. As indicated in Figures 4-5, this within-sector wage variance combines the variation in average wage of individuals distributed across comparative advantage quantiles employed in the sector, Yg (q), and the variation in absolute advantage in any particular comparative advantage quantile, Vg (q). Consequently, the logwage variance in sector k is Vgk = E[Vg (q)|q ∈ Qkg ] + Var [Ȳgk (q)|q ∈ Qkg ]. By applying this expression into (9), the log-wage variance in group g can be written as i i h 2 h Vg = l gN l gC · ȲgC − ȲgN + l gN · Var A g (q)q < l gN + l gC · Var α g (q) + A g (q)q ≥ l gN + νg (10) where the variance is taken in the conditional uniform distribution of quantiles allocated to each sec´1 tor. In equation (10), the term νg ≡ 0 Vg (q) dq captures the portion of wage dispersion related to the absolute advantage dispersion within comparative advantage quantiles, being independent of the endogenous worker sorting for its symmetric effects on potential wages in the two sectors. 4.4 Comparative Statics: Sector Demand Shocks To better understand the mechanics of the Two-Sector Roy Economy, let us analyze the adjustment of labor market outcomes following a positive shock to task demand in the commodity sector. To simplify the analysis, I consider a demand-driven increase in ω gC while holding constant both ω gN and the skill distribution. With this exercise, I show how the model is capable of rationalizing the response of the Brazilian labor market to the commodity price shock (i.e., Facts 1-3 in Section 3). As illustrated in Figures 6-7, the increase in the commodity sector task price implies an upward shift in the commodity sector curve of potential average wage. Following the shock, individuals initially indifferent between the two sectors find it optimal to be employed in the commodity sector, reducing the employment share in the non-commodity sector by ∆l gN < 0. The magnitude of the between18 sector employment flow is directly related to the slope of the comparative advantage schedule: for the same variation in the relative task price, the sector employment response is smaller if the slope of the comparative advantage schedule, α g (.), is higher. This is immediately implied by equation (5): ∆ h ω gN − ω gC i ˆ l gN +∆l gN = l gN ∂α g (u) du. ∂q (11) q q 1 1 Figure 6: Positive selection into non-commodity Figure 7: Negative selection into non-commodity sector. sector. Although the non-commodity sector task price is constant, the implied outflow of non-commodity sector employees affects the sector’s employment composition and, consequently, the sector’s average wage. To the extent that the absolute advantage of sector-switchers differs from that of sector-stayers, the change in sector employment affects the sector average efficiency. If A g (q) is decreasing as illustrated in Figure 6, then sector-stayers have a higher level of absolute advantage than sector-switchers (i.e., blue area). In this case, the outflow of workers leaves the non-commodity sector with employees whose average absolute advantage is relatively higher, raising the average wage in the sector. However, this is not the only possibility. In Figure 7, A g (q) is increasing so that the outflow of workers lowers the average wage in the non-commodity sector. To formalize this discussion, notice that expression (6) yields ˆ ∆ȲgN − ∆ω gN = l gN +∆l gN l gN ∂ Ā gN (u) ∂q ˆ l gN +∆l gN du = l gN 1 A g (u) − Ā gN (u) du. u (12) By expression (12), the implied response in the non-commodity sector average efficiency, ∆ȲgN − ∆ω gN , depends on the slope of Ā gN (.) that is directly related to the difference between the absolute advantage of sector-switchers, A g (l gN ), and sector-stayers, Ā gN (l gN ). Following the reduction in noncommodity sector employment, this compositional effect has an ambiguous sign; for example, it is positive if A g (.) is decreasing (i.e., Figure 6) or negative if A g (.) is increasing (i.e., Figure 7). 19 In the commodity sector, the shock has two effects on the average wage. There is an increase in the log-wage of commodity sector employees implied by the growth of the the commodity sector task price. The magnitude of this increase is represented by the vertical shift of the curve of potential average wage in the commodity sector. In addition, there is a compositional effect driven by the inflow of new employees whose sector-specific efficiency differs from that of original employees in the commodity sector. As discussed above for the non-commodity sector, the sign of this effect is ambiguous, being controlled by the slope of ĀCg (.).17 As a consequence of the shock, the upper-envelope of the two curves of potential sector wage shifts upwards and, therefore, the average group wage increases. Formally, expression (8) yields " ∆Ȳg = ∆ω gC · l gC + ∆ω gN · l gN + αg l gN + ∆l gN ˆ · ∆l gN − l gN +∆l gN l gN # α g (q) dq . (13) The first two terms correspond to the positive wage gain implied by the shock if workers were to remain in their initial sector of employment. Yet, the change in sector composition associated with the shock introduces an additional source of average wage variation: the compositional effect captured by the term in brackets. For the positive shock considered, the compositional effect is also positive because individuals only move into the commodity sector to take advantage of its higher wage growth.18 At last, notice that this compositional effect is second-order: for small price shocks, sector-switchers are the marginal individuals with the same potential wage in the two sectors so their reallocation does not affect the log-wage distribution of the group. Proposition 1 summarizes the predictions of the model following an exogenous increase in the commodity sector task price. Proposition 1. Suppose an increase in the commodity sector task demand such that ∆ω gC > ∆ω gN ≥ 0. Then, i. There is an increase in the employment share of the commodity sector whose magnitude is controlled by the slope of the schedule of comparative advantage, α g (.). ii. There is an ambiguous change in the average wage of both sectors due to the existence of compositional effects controlled by the slope of the schedule of sector average efficiency, Ākg (.). iii. There is an increase in the average group wage whose magnitude depends on the initial sector allocation of workers in the group. 17 In the commodity sector, the average efficiency in the sector-specific task combines the schedules of comparative and absolute advantage. From expression (7), ˆ ∆ȲgC − ∆ω gC = l gN +∆l gN l gN ∂ ĀCg (u) ∂q ˆ du = − l gN +∆l gN l gN 1 α g (u) + A g (u) − ĀCg (u) du. 1−u According to this expression, changes in the employment composition have ambiguous consequences to the commodity sector average efficiency. The magnitude of this compositional effect is related to the difference between the sector-specific efficiency of sector-switchers, α g (l gN ) + A g (l gN ), and sector-stayers, ĀCg (l gN ). In Figures 6 and 7, the average efficiency in the commodity sector is increasing in q. Thus, an employment expansion triggers the absorption of individuals with lower sector-specific efficiency, reducing the average wage in the commodity sector. 18 This follows immediately from ∆l N < 0 and the fact that α (.) is increasing. g g 20 Taking the commodity price increase as a demand-driven shock to sector-specific task prices such that ∆ω gC > ∆ω gN ≥ 0, Proposition 1 can be readily used to interpret Facts 2 and 3 in Section 3.19 Consistent with Fact 2, the model predicts a positive response in the commodity sector employment share and an ambiguous response of the commodity sector wage premium. Through the lens of the TwoSector Roy Model, this pattern of adjustment is related to the existence of compositional effects implied by the between-sector worker reallocation. This compositional effect may reinforce or diminish the direct effect of the task price change. If there is positive selection into the non-commodity sector (Figure 6), then the compositional effect increases the average wage in the non-commodity sector — as, for example, in the weak response of the commodity sector wage premium for HSD. However, to the extent that workers are negatively selected into the non-commodity sector (Figure 7), the compositional effect reinforces the direct price effect — as, for example, in the strong response of the commodity sector wage premium for HSG. Despite these mixed wage responses at the sector level, the model also predicts, as in Fact 3, a gain in the average wage of all worker groups. At the group level, the change in the average wage combines the changes in sector-specific task prices using the sectoral allocation of workers. This creates a link between the magnitude of the average wage response and the sector employment composition. In order to move from this qualitative analysis to a quantitative examination of the impact of the commodity price shock on the Brazilian wage structure, it is necessary to recover the schedules of comparative advantage, α g (.), and absolute advantage, A g (.). To be more precise, the knowledge of these schedules allows the counterfactual decomposition of the log-wage variance based on equation (10), permitting the quantification of the commodity price shock contribution to the wage dispersion behavior in Brazil (Fact 1). 5 Identification of Comparative and Absolute Advantage This section investigates the nonparametric identification of the schedules of comparative and absolute advantage in the Two-Sector Roy Economy. This result establishes a mapping between population data and the unknown functions α g (.) and A g (.) without additional restrictions than those implied by the theory. In this sense, such result represents an important first step in the empirical analysis of the model. It indicates the source of variation in the data that uncovers the main economic relation of interest, focusing on robust predictions of the model. Alternatively, as noted by Matzkin (2007), the credibility of empirical results would be significantly hindered if identification could only be achieved under restrictive parametric assumptions. 5.1 Assumptions In order to establish identification of comparative and absolute advantage in the Two-Sector Roy Economy of Section 4, it is necessary to make additional assumption regarding observable labor market 19 For a particular worker group g in microregion r, assume that changes in sector-specific task prices are proportional to k the exposure to the commodity price shock: ∆ω g,r,t = βkg · ∆Zg,r,t such that βCg > β N g ≥ 0. If the commodity price shock affects sector task demand, then this assumption can be interpreted as the linear projection of the change in sector-specific k , on the shock exposure, ∆Z task prices, ∆ω g,r,t g,r,t . 21 outcomes as well as their relation to unobservable variables. Panel of Segmented Labor Markets. Consider a country constituted of regions, r, in different years, t. Each region-year pair is a segmented labor market equivalent to the Two-Sector Roy Economy of C , ω N ).20 Section 4. Assume that sector-specific task prices are observable variables denoted by (ω g,r,t g,r,t In equilibrium, these prices are jointly determined by supply and demand for sector-specific tasks. Let D g,r,t denote a vector of task demand conditions in the market, including, among other variables, final product prices, and shifters of production technology and trade costs. In addition, assume that task supply is determined by a skill distribution satisfying the following regularity conditions. Assumption 1. [Skill Distribution] Suppose individual i of group g in region r at year t, i ∈ I g,r,t , indepen dently draws s g (i ), a g (i ) from a bivariate distribution such that the following decomposition holds. i. Comparative Advantage: s g (i ) = s̃ g (i ) + ũ g,r,t and {s̃ g (i )} ∼ Fg (s) where ũ g,r,t, is a group-region-time shifter of comparative advantage. ii. Absolute Advantage: e { a g (i )|s̃ g (i ) = s} ∼ Hg,r,t ( a|s) = µHga ( a|s) + (1 − µ) Hg,r,t ( a) e ( a ) ≡ H e ( a | ũ where Hg,r,t g,r,t , θg,r,t ) is a group-region-time mixing distribution of absolute advantage ´ e ( a ). such that ṽ g,r,t ≡ (1 − µ) a dHg,r,t The regularity conditions in Assumption 1 guarantee that the pattern of sector selection is identical in every market. Intuitively, they impose parallel curves of potential sector wage across both regions and years; leaving nevertheless the shape of these curves completely unrestricted. In terms of the notation introduced in Section 4, Assumption 1 implies that the schedules of comparative and absolute advantage are respectively given by α g,r,t (q) = α g (q) + ũ g,r,t (14) A g,r,t (q) = A g (q) + ṽ g,r,t (15) where α g (q) ≡ ( Fg )−1 (q) and A g (q) ≡ µ ´ a dHga a|α g (q) . Equations (14)-(15) attest the importance of Assumption 1 for its dimensionality reduction implication: cross-market heterogeneity in labor market outcomes is generated by shocks to sector task supply in ũ g,r,t , ṽ g,r,t and sector task demand in Z g,r,t . In this context, ũ g,r,t is a shifter of comparative advantage, representing in the model a supply-driven shock to sector employment across markets. e ( a ). In addition, ṽ g,r,t is a shifter of the overall task efficiency implied by the mixing distribution Hg,r,t 20 In this Section, I treat sector-specific task prices as, for all purposes, observable variables determined in the equilibrium of the Two-Sector Roy Economy. Section 6.1 provides a methodology to estimate task prices based on the model’s predicted relation between wage growth and initial sector employment across quantiles of the wage distribution. 22 Notice that this market-specific component of the absolute advantage distribution is independent of comparative advantage, capturing supply shocks to labor efficiency not directly related to sector employment.21 To move towards an empirical application of the model in Section 4, I make an additional assump tion regarding the structure of the supply shocks ũ g,r,t , ṽ g,r,t . In particular, I assume that they combine observable and unobservable components as follows. Assumption 2. [Error Structure] Assume that the shifters of comparative and absolute advantage, ũ g,r,t , ṽ g,r,t , can be written as ũ g,r,t = X g,r,t γug + u g,r,t , ṽ g,r,t = X g,r,t γvg + v g,r,t where X g,r,t is an observable vector of group-region-year variables; and u g,r,t , v g,r,t is an unobservable vector of and group-region-year supply shocks. Also, normalize the shifters so that E[ũ g,r,t ] = E[ṽ g,r,t ] = 0 Instrument: Sector Demand Shifter. As discussed above, labor market outcomes in equilibrium are determined by both supply and demand conditions in each region-year pair. Accordingly, in order to recover the task supply structure represented by the schedules of comparative and absolute advantage in α g (.) and A g (.), it is necessary an observable shifter of sector task demand that varies across regions and years. Hence, I assume the existence of an observable subvector Z g,r,t in the vector of sector demand shifters, D g,r,t , that satisfies the following exogeneity restriction. Assumption 3. [Exogeneity] E u g,r,t |Z g,r,t , X g,r,t = E v g,r,t |Z g,r,t , X g,r,t = 0. Assumption 3 is the usual exogeneity restriction of an instrument: conditional on X g,r,t , Z g,r,t is mean independent from unobservable shocks to sector task supply, (u g,r,t , v g,r,t ). In this case, the instrument must be a regional shock to sector demand that is uncorrelated with regional shifters of comparative and absolute advantage. Following the intuition of the comparative statics exercise in Section 4, Z g,r,t generates exogenous variation in sector-specific task prices whose quantitative implications for sector employment and wages are directly connected to the schedules of comparative and absolute advantage. As standard in the nonparametric identification literature, I impose an additional completeness condition on the instrument vector, Z g,r,t . h i N ,X Assumption 4. [Completeness] For any f (.) with finite expectation, E f (l g,r,t ) Z , X g,r,t g,r,t g,r,t = 0 imN ,X plies that f (l g,r,t g,r,t ) = 0 almost surely. Assumption 4 is the equivalent of a rank requirement in the context of nonparametric models. Newey and Powell (2003) show that it is necessary and sufficient for identification of nonparametric 21 For group g, the heterogeneity in the skill distribution across markets is controlled by ũ g,r,t and θg,r,t . The term ũ g,r,t affects market-specific components of both comparative and absolute advantage, allowing for correlated supply-driven movements in sector employment and overall labor efficiency. The vector θg,r,t allows for a rich pattern of variation in labor efficiency across markets, capturing differences in the mass of workers at various ranges of absolute advantage through e ( a ). Hence, at a particular part of the income distribution, variation in θ the distribution Hg,r,t g,r,t introduces supply-driven shocks to income growth across markets. 23 instrumental variable models of the class considered in this paper. Intuitively, this condition guaranN to tees that the instrument Z g,r,t induces enough variation in the endogenous sector composition l g,r,t uniquely discriminate the function consistent with the underlying data generating process. 5.2 Identification of Comparative and Absolute Advantage Under Assumptions 1 and 2, the conditions determining the employment share and the average wage in the non-commodity sector — i.e., equations (5)-(6) — immediately imply that h i N C N ω g,r,t − ω g,r,t = α g l g,r,t + X g,r,t γug + u g,r,t h i N N N Ȳg,r,t − ω g,r,t = Ā gN l g,r,t + X g,r,t γvg + v g,r,t where, by construction, A g (q) = ∂ ∂q h (16) (17) i q · Ā gN (q) . The variation in the instrument vector Z g,r,t creates variation in the sector employment composition that is orthogonal to variation in the shifters of the skill distribution across markets. As in the labor market adjustment to demand shocks described by Proposition 1 of Section 4, this exogenous demand variation gives rise to responses in sector employment and sector average wage that are quantitatively connected to the economy’s skill distribution. To be more precise, the schedule of comparative advantage, α g (.) in equation (16), controls the magnitude of the change in the relative task price associated with the change in sector employment implied by the exogenous demand shock. The subsequent change in the composition of workers employed in the non-commodity sector triggers a change in the N − ω N ]. In equation (17), this compositional effect is sector’s average efficiency as measured by [Ȳg,r,t g,r,t controlled by the function Ā gN (.) which is directly related to the schedule of absolute advantage, A g (.). In line with this discussion, an instrument satisfying Assumptions 3-4 identifies α g (.) and A g (.) respectively from equations (16) and (17). Proposition 2. Consider a panel of segmented labor markets such that Assumptions 1-2 hold. If there is an instrument vector Z g,r,t satisfying Assumptions 3-4, then the schedules of comparative advantage, α g (.), and absolute advantage, A g (.), are nonparametrically identified respectively from equations (16) and (17). Proof. As a direct application of Lemma 1 in Appendix D.1, the functions α g (.) and Ā gN (.) are identified from equations (16) and (17) with the instrument Z g,r,t . Using the definition of Ā gN (.), we immediately h i ∂ q · Ā gN (q) . identify A g (.) such that A g (q) = ∂q In the Two-Sector Roy Economy, comparative and absolute advantage immediately determine efficiency in the task specific to the commodity sector and, consequently, the average efficiency in that sector. As a result, there is one overidentification restriction in the model. To see this, notice that, under Assumptions 1 and 2, equation (7) yields h i C C N Ȳg,r,t − ω g,r,t = ĀCg l g,r,t + X g,r,t (γug + γvg ) + (v g,r,t + u g,r,t ) 24 (18) i h ∂ where, by definition, α g (q) + A g (q) = − ∂q (1 − q) · ĀCg (q) . From equation (18), the instrument Zg,r,t satisfying 3-4 identifies ĀCg (.) which is directly related to the sum of the schedules of comparative and absolute advantage, α g (.) + A g (.). Proposition 3. Consider a panel of segmented labor markets such that Assumptions 1-2 hold. If there is an instrument vector Z g,r,t satisfying Assumptions 3-4, then the sum of the schedules of comparative and absolute advantage, α g (.) + A g (.), is nonparametrically identified from equation (18). Proof. As a direct application of Lemma 1 in Appendix D.1, the function ĀCg (.) is identified from equations (18) with the instrument Z g,r,t . Using the definition of ĀCg (.), we immediately identify i h ∂ A g (.) + α g (.) such that α g (q) + A g (q) = − ∂q (1 − q) · ĀCg (q) . 6 Empirical Application The above results establish identification of the schedules of comparative and absolute advantage using cross-market variation in sectoral labor demand. Armed with these theoretical results, I now turn to the estimation of comparative and absolute advantage from the adjustment of labor market outcomes in Brazilian microregions differentially exposed to international commodity price shocks between 1991 and 2010. The empirical application builds directly upon the reduced-form results of Section 3: precisely, the impact of the commodity price shock on between-sector responses of employment and wages. A detailed description of the data used in this section is given in Appendix A. 6.1 Sector-Specific Task Prices In order to apply the results of Section 5, it is first necessary to measure sector-specific task prices in each region and year. In this section, I propose a methodology to estimate task prices using promptly available information on labor income and employment at the individual level. In the Two-Sector Roy Economy, individuals are not readily substitutable between sectors, implying that their wages are only exposed to changes in the task price of their own sector of employment. Following task price shocks, this observation implies that, across different parts of the wage distribution, variation in the pre-shock sector employment composition translates into variation in the growth of wages. Intuitively, if all individuals at the bottom of the distribution are employed in the commodity sector, then the wage gain at the bottom is entirely attributed to the change in the task price of the commodity sector. In such case, an increase in the task price of the non-commodity sector has no impact on the wage of individuals at the bottom of the wage distribution. To formalize this intuition, let Yg,r,t (π ) denote the π-quantile of the log-wage distribution of group g in region r at year t. For small shocks, I show in Appendix D.2 that the wage growth between periods t0 and t in quantile π of the log-wage distribution is given by h i C N C N ∆Yg,r,t (π ) = ∆ω g,r,t + ∆ω g,r,t − ∆ω g,r,t · l g,r,t (π ) + µ g,r,t · X̃ g,r,t (π ) + ∆v g,r,t (π ) 0 25 (19) N ( π ) is the initial employment share of the nonwhere, at quantile π of the log-wage distribution, l g,r,t 0 commodity sector, X̃ g,r,t (π ) is a set of observable controls, and ∆v g,r,t (π ) is a productivity shock.22 For each triple of group-region-period, equation (19) implies that changes in task prices can be consistently estimated from the relation across wage distribution quantiles between the initial sector N ( π ), and the wage growth, ∆Y composition, l g,r,t g,r,t ( π ). In this context, an estimator based on equa0 tion (19) relies on the assumption that, conditional on the set of controls X̃ g,r,t (π ), pre-shock variation in sector employment composition is uncorrelated with variation in labor efficiency shocks among individuals with different levels of labor income in a particular group-region-period. Additionally, such estimator hinges on a central feature of the Roy Model embedded in equation (19): the indifference of marginal individuals between the two sectors. For small price shocks, sector-switchers are the marginal individuals with the same potential wage in the two sectors so their reallocation has no first-order impact on the group’s log-wage distribution.23 Armed with the model’s prediction in equation (19), I proceed to estimate task price changes by regressing wage growth between two years on the initial year’s sector employment composition in a set of wage distribution quantiles for each group-region-period. Towards this end, I use Census data N ( π ) and ∆Y to construct l g,r,t g,r,t ( π ) in various quantiles of the log-wage distribution. In the baseline 0 specification, I divide workers into bins of 1 p.p. width computed from the log-wage distribution quantiles for a group-region-period.24 The implementation of expression (19) allows for a vector of observables variables that vary with the position in the wage distribution. Accordingly, the baseline specification includes the following dummy variables as nonparametric controls: (i) indicator that wage percentile is at the bottom, middle or top of the log-wage distribution; (ii) indicator that wage percentile is below the federal minimum wage (pre-year and post-year). These dummies capture, for example, differential efficiency gains for workers in distant parts of the wage distribution, and income gains generated by bunching around the minimum wage. In this specification, sector-specific task prices are identified from the variation in pre-shock sector employment in small neighbourhoods of the log-wage distribution of workers in the same group-region-period. As in Section 3, I focus on the sample of male white full-time worker 22 In Appendix D.2, I show that equation (19) is generated by a first-order expansion of the implicit equation defining Yg,r,t (π ). In this context, ∆v g,r,t (π ) is a shock to the absolute advantage of individuals spread across quantiles of the logwage distribution. It is introduced by shocks to (ũ g,r,t , θg,r,t ) that affect the market-specific mixing distribution of absolute e ( a ) ≡ H e ( a | ũ advantage, Hg,r,t g,r,t , θg,r,t ) in Assumption 1. The change in the mixing distribution of absolute advantage has consequences for the labor efficiency of individuals at different income levels. As a result, the process generating innovations in (ũ g,r,t , θg,r,t ) creates, through ∆v g,r,t (π ), idiosyncratic shocks to wage growth across quantiles of the wage distribution. 23 Expression (19) is modified whenever there exists a wedge in sector potential wage of sector-switchers. This is the case in the presence of non-monetary benefits of employment. To the extent that sector-switchers are spread over the wage distribution, the wedge affects wage gains across quantiles. Consistent with this intuition, there is a new term in equation (19) that is proportional to the fraction of sector-switchers among individuals at the π-quantile of the log-wage distribution. For a detailed analysis of this case, see Appendix D.3. 24 In principle, equation (19) can be implemented with any division of individuals into quantiles; in practice, however, this choice entails a tradeoff. On the one hand, a coarse discretization yields a low number of quantiles with potentially little variation in initial sector employment to precisely estimate task prices. On the other hand, a refined discretization exacerbates measurement error of sector employment in each quantile because of the low number of sampled individuals in each sector. Also, extreme values of income could be generated by measurement error, affecting wage growth in the tails of the wage distribution. With these considerations in mind, the estimation is implemented with 88 percentile bins of 1 p.p. width between the 6th and the 94th percentiles. Below, I show that similar results are obtained if bins of 2 p.p. width are used. See Appendix A for more details on data construction. 26 divided in two educational groups, High-School Graduates (HSG) and High-School Dropouts (HSD). Table 3: Summary Statistics: Estimated Sector-Specific Task Prices, 1991-2010 Log change in commodity sector task price Log change in non-commodity sector relative task price R2 Mean SD Mean SD Mean (1) (2) (3) (4) (5) 1991 - 2000 0.320 0.370 -0.151 0.347 55.5% 2000 - 2010 0.150 0.645 -0.306 0.609 75.8% 1991 - 2000 0.524 0.579 -0.364 0.619 71.5% 2000 - 2010 0.440 0.579 -0.360 0.634 83.0% A. High-School Graduates B. High-School Dropouts Note. Sample of 518 microregions in 1991-2000 and 2000-2010. Statistics are weighted by the microregion share in national population on 1991. Worker sample of male white individuals distributed in 1 p.p. percentile bins of hourly wage (N = 88). Baseline specification includes the following dummies: (i) indicator that income percentile is below the federal minimum wage (pre and post years); (ii) indicator that income percentile belongs to range P6-P30 and P30-P75. Table 3 presents the summary statistics of estimated task prices implied by the baseline specification in 2,072 group-region-period triples. Columns (1)-(2) display statistics of the estimated task price C , and columns (3)-(4) of the estimated relative task price of the nonin the commodity sector, ∆ω g,r,t N − ∆ω C . The commodity sector task price presented robust growth in both commodity sector, ∆ω g,r,t g,r,t periods. Between 1991 and 2010, the average increase was 47.0 log-points for HSG and 96.4 log-points for HSD. Simultaneously, the relative task price in the non-commodity sector decreased sharply. Lastly, column (5) reports the average R2 of the estimation in the sample of microregions. A large fraction of the variation in wage growth across quantiles of the earnings distribution is captured by equation (19); in the two periods, the average R2 is above 55% for HSG and 71% for HSD. To address robustness to implementation choices, Table 4 presents the correlation between task price estimates implied by different specifications of equation (19) and those implied by the baseline specification. Columns (1)-(3) and (5)-(7) indicate a high correlation between estimates obtained with different control sets. Notice that, when minimum wage controls are omitted, estimated task prices are extremely similar to those of the baseline specification. This suggests that quantile range controls absorb much of the variation captured in the minimum wage dummies. Columns (4) and (8) attest that the particular choice of bin width has little impact on estimates: the correlation is above .88 between baseline estimates and those obtained with a coarser discretization of 2 p.p. bins. 6.2 Parametric Restrictions: Log-Linear System The nonparametric identification in Section 5 is an asymptotic result. In practice, limitations in the availability of exogenous variation in the data may prevent the implementation of a fully flexible estimator capable of nonparametrically recovering the functions of interest. In such cases, auxiliary 27 Table 4: Estimated Sector-Specific Task Prices, Correlation with Benchmark Specification Commodity sector task price Non-commodity sector relative task price (1) (2) (3) (4) (5) (6) (7) (8) 0.776 0.855 0.973 0.926 0.807 0.874 0.969 0.916 0.802 0.914 0.960 0.886 0.812 0.912 0.960 0.893 No No Yes No No Yes Yes Yes No No Yes No No Yes Yes Yes Yes No Yes No Yes No No Yes Yes No Yes No Yes No No Yes A. High-School Graduates Correlation with baseline estimates B. High-School Dropouts Correlation with baseline estimates Baseline Controls Percentile below federal minimum wage Percentile in bottom, middle or top of wage distribution Discretization of wage distribution Bins of 1 p.p. (N = 88) Bins of 2 p.p. (N = 44) Note. Sample of 518 microregions in 1991-2000 and 2000-2010. Statistics are weighted by the microregion share in national population on 1991. Baseline estimates based on the discretization of the wage distribution in 88 bins of 1 p.p. width, including indicator dummies of percentile bins below the federal minimum wage (pre and post years) and percentile bins in bottom, middle or top of the wage distribution (P6-P30 and P30-P75). functional form assumptions on α g (.) and A g (.) are particularly useful to increase estimation precision. It is important however that these parametric assumptions do not impose artificial restrictions on the model. In the Two-Sector Roy Model, it is particularly relevant that functional forms allow for separate roles for comparative and absolute advantage since they are related to distinct predictions of the model. Accordingly, the benchmark specification in the empirical application below is based on the following log-linear version of system (16)-(17). Assumption 5. [Log-Linear System] Suppose schedules of comparative and absolute advantage in system (16)-(17) are given by α g (q) = α g · [ln (q) − ln (1 − q)] and A g (q) = Ã g + A g · ln (q) where α g ≥ 0 and A g ∈ R. Assumption 5 commands constant-elasticity schedules of comparative and absolute advantage. Following the discussion in Section 4, the positive parameter α g controls the dispersion of comparative advantage; alternatively, the parameter A g controls the pattern of variation in average absolute advantage of individuals distributed across quantiles of comparative advantage. In the empirical application, these parametric restrictions are useful for two main reasons. First, they simplify the instrument’s rank requirement since, as discussed below, they allow the use of standard instrumental variable estimators. Second, parameters α g and A g can be interpreted as the average elasticity of the comparative and absolute advantage schedules. In other words, these parameters correspond to the local treatment effect induced by the labor demand instrument in the cross-section of markets.25 25 In the spirit of the series estimator proposed by Newey and Powell (2003), the log-linear system could be augmented to include higher-order polynomials. In the limit, such expansion would recover nonparametrically functions α g (.) and A g (.). Yet, the implementation of this estimator is not feasible in most applications. As pointed out by Newey (2013), the estimation of nonlinearities tend to be accompanied by sharp increases in standard errors, requiring multiple strong instruments. The application in this paper is no exception and, for this reason, the constant-elasticity specification in Assumption 5 is particularly attractive. 28 It is important to notice that the system in Assumption 5 is a strict generalization of the system obtained under a Frechet distribution of skills that form the basis of numerous recent empirical applications of the Roy Model — see e.g., Hsieh, Hurst, Jones, and Klenow (2013); Burstein, Morales, and Vogel (2015); and Galle, Rodriguez-Clare, and Yi (2015). As discussed in Appendix D.4, the Frechet distribution leads to a similar log-linear system, but it entails a single parameter to control both comparative and absolute advantage. In terms of the system above, the Frechet distribution requires that α g = − A g where α g < 1. These restrictions have strong consequences to the model’s predictions regarding the responses of employment and wages following sector demand shocks. Namely, they impose constraints not only on the magnitude of the between-sector reallocation but also on the pattern of selection into both sectors. In fact, α g = − A g rules out responses in sector wage differentials, being unable to replicate the positive impact of commodity prices on the the commodity sector wage premium documented in Section 3. In contrast, the more general log-linear system in Assumption 5 contains parameters that separately control comparative and absolute advantage. Specifically, α g is positive to reflect a positively sloped comparative advantage schedule and A g is free to capture both positive and negative selection in the absolute advantage schedule.26 6.3 Estimation Procedure Now we are ready to propose an estimator for the schedules of comparative and absolute advantage directly related to the identification result in Proposition 2. To this extent, I take advantage of the parametric restrictions in Assumption 5 to construct a consistent GMM procedure with moment conditions that use the differential exposure of Brazilian microregions to the variation in international commodity prices in the two period windows of 1991-2000 and 2000-2010. To this end, let us combine equations (16)-(17) with the functional forms in Assumption 5 to write the following first-difference system: . C N C N l g,r,t ∆ω g,r,t − ∆ω g,r,t = α g · ∆ ln l g,r,t +∆X g,r,t γug + ∆u g,r,t (20) N N N ∆Ȳg,r,t − ∆ω g,r,t = A g · ∆ ln l g,r,t +∆X g,r,t γvg + ∆v g,r,t (21) where X g,r,t is a control vector of group-microregion-period variables that include group-microregion k fixed effects; and ∆ω g,r,t is the changes in the task price of sector k estimated with the procedure de- scribed in Section 6.1. Also, the overidentification restriction in Proposition 3 delivers an additional equation to the sys- 26 Appendix D.4 provides a detailed discussion on the pattern of sector selection implied by the Frechet distribution. While the restriction of α g = − A g is a direct implication of assuming a Frechet distribution, the restriction of α g < 1 is necessary to guarantee a finite supply of tasks in each sector. In Appendix D.4, I propose an alternative skill distribution that delivers the log-linear model in Assumption 5 with the sole restriction of a positively sloped comparative advantage schedule (i.e., α g positive). Appendix D.4 also discusses the system implied by normally distributed skills — as in Roy (1951), Heckman and Sedlacek (1985), and Ohnsorge and Trefler (2007). Although the normal distribution leads to distinct functional forms, the implied system also entails two parameters that parametrize the slopes of the schedules of comparative and absolute advantage, allowing for a flexible selection pattern. 29 tem. Specifically, equation (18) under Assumption 5 implies that " C C ∆Ȳg,r,t − ∆ω g,r,t = − αg + Ag · ∆ N l g,r,t C l g,r,t # N C ln l g,r,t − α g · ∆ ln l g,r,t + ∆X g,r,t γeg + ∆e g,r,t (22) where X g,r,t is the same control vector as above. Conditional on the parameter vector Θ g ≡ (α g , A g , γug , γvg , γeg ), equations (20)-(22) immediately al low the computation of the vector of structural errors: e g Θ g ≡ ∆u g,r,t , ∆v g,r,t , ∆e g,r,t r,t . Combined with a matrix of instruments W g ≡ ∆Z g,r,t , ∆X g,r,t r,t satisfying Assumption 3, the error vector e g (Θ g ) provides moment conditions for the consistent estimation of Θ g with the following GMM estimator: Θ̂ g = arg min e g (Θ g )0 W g ΦW0g e g (Θ g ) Θg (23) where Φ is a matrix of moment weights.27 As in Section 3, microregions are weighted by their share in the national population of 1991 and standard errors are clustered by microregion to account for serially correlated errors. To build instruments for the estimation of the parameter vector, I rely on a generalization of the regional exposure to commodity price shocks of Section 3. Precisely, I consider the following set of instruments: ∆Z g,r,t = ∆F φg,r ( j) · ln pt ( j) j∈J C (24) where F(.) is a quadratic polynomial function. Similarly to Section 3, ln pt ( j) is the international logprice of commodity category j at year t; and φg,r ( j) is the share of industry j in total labor payments of the commodity sector to individuals of group g in microregion r on the initial year of 1991. In the empirical application, the instrument vector contains five major commodity groups: Grains, Soft Agriculture, Livestock, Mining (Metals and Precious Metals), and Energy.28 As in Section 3, the control vector ∆X g,r,t contains ten macroregion-period dummies, initial commodity sector size controls, and initial labor market conditions interacted with period dummies.29 6.4 Results Table 5 presents the structural estimates obtained with the procedure described in Section 6.3. Column (1) reports the structural parameters implied by the estimation of equations (20)-(21) under the 27 In the main specification, I use the two-step optimal GMM weights. The sensitivity analysis in Appendix E shows that similar results are obtained with other matrices of moment weights. 28 In principle, one could use any non-linear function of the six commodity categories. Yet, the inclusion of too many instruments may lead to the well known weak instrument problem. To maintain a parsimonious number of instruments, I adopt two simplifications. First, I aggregate the two categories with lowest employment participation, Metals and Precious Metals, into a single category, Mining. Second, I consider a quadratic polynomial function, yielding two instruments per commodity category. In Appendix E, I show that similar results are obtained with alternative specifications for the instrument vector. 29 Labor market conditions: quadratic polynomial of per-capita income, share of white individuals, share of employed individuals, share of formal sector employees, share of individuals earnings less than the federal minimum wage, and share of social security earners (only HSD). Commodity sector size controls: quadratic polynomial of commodity sector share in group labor income and dummy for commodity sector share in group labor income in the bottom and top deciles of national distribution. 30 parameter restriction imposed by a skill distribution of the Frechet family (i.e., α g = − A g ). For both groups, estimated parameters indicate a steep schedule of comparative advantage that is associated with a low degree of between-sector employment reallocation.30 Evaluated at the national sector employment composition of 2010, estimates imply that, following a 10% increase in the relative task price in the commodity sector, the share of workers employed in the commodity sector increases from 5.14% to 5.63% for HSG and from 19.94% to 21.65% for HSD. Table 5: Structural Parameters: Log-Linear System Frechet model 𝛼𝑔 = −𝐴𝑔 Log-linear model (1) (2) Log-linear model with overidentification (3) 𝛼𝐻𝑆𝐺 1.040** (0.195) 1.051** (0.192) 0.835** (0.212) 𝐴𝐻𝑆𝐺 -1.040** (0.195) 2.483* (0.976) 1.966* (0.935) - 0.000 0.005 𝛼𝐻𝑆𝐷 0.963** (0.151) 1.873* (0.773) 0.916* (0.399) 𝐴𝐻𝑆𝐷 -0.963** (0.151) -0.888** (0.146) -0.727** (0.142) - 0.208 0.644 A. High-School Graduates Test of Frechet restriction (p-value) B. High-School Dropouts Test of Frechet restriction (p-value) Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. Two-Step GMM estimator with microregions weighted by their share in the 1991 national population. Excluded instruments: quadratic polynomial of local exposure to international product prices. Control vector includes macroregion-period dummies, initial commodity sector size controls, and initial labor market conditions interacted with period dummies (full description in Section 6.3). Standard Errors clustered by microregion. ** p<0.01, * p<0.05 As discussed above, the Frechet distribution entails a very particular pattern of selection that rules out responses in sector wage differentials. Such feature is inconsistent with the reduced-form evidence for HSG presented in Section 3. To allow for a more flexible behavior of sector average wages, column (2) presents the estimation of the log-linear model in equations (20)-(21). Indeed, this additional degree of freedom is important for HSG as the estimated parameter has the inverse sign and the Frechet restriction is rejected at usual significance levels. Among HSG, the estimated parameters indicate negative selection into the non-commodity sector with curves of potential sector wages similar to those in Figure 5. In contrast, the estimated parameters for HSD imply similar qualitative predictions of those of the Frechet model, yielding positive selection into both sectors with curves of potential sector wages similar to those in Figure 4. In this case, the Frechet restrictions cannot be rejected at usual significance levels. Lastly, column (3) presents the implementation of the full estimation procedure that includes the overidentifying equation (22). The overidentification equation yields more precise estimates for HSD 30 In fact, it is not possible to reject the hypothesis that α = 1 at the usual significance levels. Thus, estimated parameters g suggest higher dispersion of comparative advantage than that allowed by the Roy model with a Frechet distribution of skills. 31 and has little effect on estimates for HSG. In this case, comparative advantage parameters indicate between-sector employment reallocation whose magnitude is similar to that of the Frechet model in column (1). Nevertheless, the slope of the absolute advantage schedule is similar to that reported in column (2), indicating a response in the sector wage premium that is positive for HSG and almost zero for HSD. 7 Counterfactual Exercise: Effect of International Commodtiy Prices on the Wage Distribution To conclude, I use the estimated schedules of comparative and absolute advantage to investigate the consequences to the Brazilian wage distribution of shocks in international commodity prices. Precisely, I ask: “In 2010, how the wage structure would change if commodity prices fall by 10%?" The counterfactual exercise takes into account the variation in shock exposure across microregions and groups to compute the implied change in the national dispersion of wages. 7.1 First-Step: From Changes in Sector-Specific Task Prices to Changes in the Log-Wage Distribution As a first step to answer this question, I consider responses in the average and the variance of the log-wage distribution among workers of group g in region r following a change in sector-specific task C , ∆ω N ), triggered by a sector demand shock — in this case, the change in international prices, (∆ω g,r g,r commodity prices. To this extent, I consider the estimated schedules of comparative and absolute advantage, obtained in Section 6, while holding constant the structure of task supply (i.e. ∆ṽ g,r = ∆ũ g,r = 0). In this environment, equations (20)-(22) immediately deliver the changes in sector employment, k , and sector average wages, ∆Ȳ k , implied by the counterfactual change in sector-specific task ∆l g,r g,r prices. Together with equation (8) in Section 4, we obtain the counterfactual change in the average log-wage: ˆ N N C N ∆Ȳg,r = ∆ω gC + ∆ω g,r · l g,r )∆l g,r − αg + α g log(l g,r /l g,r | {z } | Direct Effect N + ∆l N l g,r g,r N l g,r N [log(q) − log(1 − q)] dq + ∆ω g,r ∆l g,r {z Compositional Effect (25) } N − ∆ω C is the counterfactual change in relative task prices; and l N is the observed where ∆ω g,r ≡ ∆ω g,r g,r g,r employment share of the non-commodity sector among workers of group g in region r. In equation (25), the first term is the direct effect on the wage of sector employees if they are unable to reallocate between sectors. Notice that this direct effect depends solely on the pre-shock employment composition, being independent from estimated parameters. The second term is the compositional effect generated by the reallocation of workers in response to the shock which, intuitively, depends on the slope of the comparative advantage schedule. As noted in Section 4, this compositional effect is negligible for small price shock since, in such cases, only workers with very similar potential wages in the two sectors choose to switch their sector of employment. 32 To compute the change in log-wage variance among workers of group g in region r, we rely on the model’s predicted response obtained from equation (10) in Section 4: 2 2 N N C C C N C N N C C N ∆Vg,r = l g,r + ∆l g,r l g,r + ∆l g,r · Ȳg,r − Ȳg,r + ∆Ȳg,r − ∆Ȳg,r − l g,r l g,r · Ȳg,r − Ȳg,r ˆ lg,r N h i2 h i2 2 2 k k N N k N + (l g,r + ∆l g,r ) Ākg (l g,r + ∆l g,r ) − l g,r Ākg (l g,r ) (α g + A g ) log(q) − α g log(1 − q) − A g log(q) dq − ∑ N + ∆l N l g,r g,r k =C,N (26) C − Ȳ N is the commodity sector wage premium that is observed among workers of group g where Ȳg,r g,r in region r; and Ākg (.) is the schedules of average efficiency in sector k estimated in Section 6 (i.e., the right hand side of equations (21)-(22)). The log-wage variance has a between-sector component that responds to the change in sector wage premium as captured by the term in the first row of equation (26). The term in the second row captures the variance in log-wages within the two sectors of the economy. This is directly related to the estimated schedules of comparative and absolute advantage that govern the dispersion of average sector-specific efficiency across the comparative advantage quantiles allocated to each sector. 7.2 Second-Step: Pass-Through from Changes in Commodity Prices to Changes in SectorSpecific Task Prices In order to implement equations (25)-(26), it is necessary to evaluate how the change in commodity prices translates into changes in sector-specific task prices for each group and microregion. In this section, I focus on the reduced-form pass-through of the shock exposure to sector-specific task prices as captured by the following equation: k ∆ω g,r,t = βkg · ∆Zg,r,t + ∆X g,r,t γkg + ∆e g,r,t (27) where X g,r,t is the control vector of group-microregion-period variables described in Section 6.3. As above, microregions are weighted by their share in the national population of 1991 and standard errors are clustered by microregion. Table 6 reports the estimation of equation (27) in the sample of Brazilian microregions in 1991-2000 and 2000-2010. Estimates in columns (1)-(4) indicate that exposure to the commodity price shock is associated with a higher task price in the commodity sector. With the full control vector used in the structural estimation, the estimated pass-through is 0.9-1.2 for HSD and 1.2-1.3 for HSG. In addition, columns (5)-(8) investigate the relation between shock exposure and non-commodity sector task prices, implying a pass-through of .25-.35 for HSG and 0-.10 for HSD. In response to higher commodity prices, these estimates suggest stronger gains in the commodity sector task price which, in turn, imply an increase in the relative task price of the commodity sector. Combined with the commodity sector employment expansion, this change in sector-specific task prices yield the structural estimation above. In the counterfactual analysis, I calibrate the pass-through from commodity prices to sector-specific task prices using the estimates in Table 6. To focus on supply-driven movements in inequality, I assume that the pass-though is identical for HSG and HSD. Taking an intermediate value of the estimates for the two groups, the pass-though from international commodity prices to task prices is set to be 1.10 in 33 the commodity sector and 0.25 in the non-commodity sector. Table 6: Exposure to Commodity Price Shocks and Sector-Specific Task Prices Change in task price of commodity sector Dependent Variable: Change in task price of non-commodity sector (1) (2) (3) (4) (5) (6) (7) (8) 1.407** (0.394) 1.120** (0.390) 1.096** (0.392) 0.962** (0.359) 0.354** (0.071) 0.323** (0.069) 0.324** (0.069) 0.282** (0.062) 0.524 0.570 0.585 0.598 0.558 0.585 0.586 0.592 Commodity price shock 1.599* (0.730) 1.292* (0.571) 1.268* (0.574) 1.274* (0.571) -0.056 (0.112) 0.028 (0.092) 0.008 (0.091) -0.014 (0.089) R2 0.644 0.689 0.690 0.690 0.461 0.555 0.576 0.576 Initial commodity sector size controls Yes Yes Yes Yes Yes Yes Yes Yes Initial labor market conditions x period dummies No Yes Yes Yes No Yes Yes Yes Initial commodity sector size controls x period dummies No No Yes Yes No No Yes Yes Initial size of manufacturing sector x period dummies No No No Yes No No No Yes A. High-School Graduates Commodity price shock R2 B. High-School Dropouts Structural Estimation Controls Additional Controls Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regressions are weighted by the microregion share in national population on 1991 and include ten macroregion-period dummies. Structural estimation controls described in Section 6.3. Standard Errors clustered by microregion ** p<0.01, * p<0.05 7.3 Counterfactual Predictions In any microregion r on 2010, we are interested in the counterfactual change of the wage distribution implied by a 10% decrease in international commodity prices. Using the pass-through obtained in Section 7.2, I compute the change in sector-specific task prices triggered by the price shock: k = βkg · 0.10 ∆ω g,r where βCg = 1.10 and β N g = 0.25. For every group g and region r, equations (25)-(26) yield the counterfactual change in the average and the variance of log-wages conditional on estimated parameter of comparative and absolute adC , ∆ω N ). Table 7 vantage, (α g , A g ), and the counterfactual changes in sector-specific task prices, (∆ω g,r g,r report these counterfactual responses at the national level where the aggregate log-wage variance is computed with the total variance formula and the microregion’s employment share in 2010. Table 7 also reports the bootstrapped 90% confidence intervals associated with such changes.31 As shown in column (1), the fall in commodity prices causes average wage losses for the two worker groups. Yet, the wage loss is more pronounced among HSD due to their higher employment share in the commodity sector — the 2010 national employment share of the commodity sector was 5.1% for HSG and 19.9% for HSD. Consequently, the 10% decrease in commodity prices leads to an increase in 31 To compute confidence intervals, I take R draws from the asymptotic normal distribution of the GMM estimator implied by the estimates in Table 5. With each parameter draw, I compute counterfactual changes in the average and the variance of the log-wage distribution using equations (25)-(26). The confidence interval bounds correspond to the 5th and the 95th percentiles of the counterfactual changes obtained from the R parameter draws. In Table 7, I set R = 300. 34 Table 7: Effect of 10% decrease in commodity prices, benchmark estimates Change in group average wage Percentual change in the between component of log-wage variance national average between-region within-group between sector within region-group Total (1) (2) (3) (4) 1.09 [1.09; 1.09] 11.09 [-2.35; 48.31] 12.18 [-1.26; 49.40] -4.13 [-4.15; -4.02] 3.26 [3.10; 3.34] -3.72 [-12.88; 13.65] -0.47 [-9.54; 16.45] -3.52 [-3.52; -3.51] 2.76 [2.57; 2.80] 1.71 [-2.14; 12.22] 4.48 [0.62; 14.91] Panel A. High-School Graduates -2.94 [-2.94; -2.92] Panel B. High-School Dropouts Panel C. All Workers Note. Initial labor market outcomes in 518 microregions on 2010. Estimated parameters of the Log-linear model in column (3) of Table 5. For each worker group, columns (2)-(4) report the percentual change in the between-component of log-wage variance implied by the decomposition presented in Figure 1. Pass-through of commodity prices to task prices equal to 1.1 in the commodity sector and 0.25 in non-commodity sector. Each response is accompanained by its 90% confidence interval computed with the bootstrap procedure described in the main text. the HSG wage premium of approximately 1.2%. Notice that confidence intervals are extremely tight, reflecting the quantitative dominance of the parameter-free direct effect in equation (25). Columns (2)-(4) present the percentage change in the between component of the log-wage variance implied by the shock.32 Column (2) shows the increase in wage dispersion associated with the change in average regional wages. Because areas specialized in commodity production had lower wages on 2010, the negative shock triggers an increase in the log-wage dispersion within both groups. At the national level, this effect is reinforced by the higher shock exposure of low-wage workers in the group of HSD. Column (3) shows the shock effect on the log-wage variance within groups and regions computed with equation (26). For HSG, the negative impact of the shock on the commodity sector wage differential leads to an increase in the log-wage variance. In contrast, the weaker response for HSD leads to a decrease in the log-wage variance. Lastly, column (5) combines these components to obtain the counterfactual change in total wage dispersion: a 10% decrease in commodity prices implies a 4.5% increase in the Brazilian log-wage variance. 8 Conclusion This paper contributes to our understanding of the role of comparative advantage in labor markets in the response of within- and between-wage inequality following international trade shocks. This paper establishes a reduced-form relation between exposure to international price shocks and labor market outcomes that asks for differential effects depending on the worker’s affiliation to groups, regions and sectors. Such structure is provided by a Two-Sector Roy Model where the distribution of worker 32 The between component of the log-wage variance is obtained form the log-wage decomposition presented in Figure 2 of Section 3. 35 heterogeneity in terms of sector-specific efficiency governs the sectoral allocation in various groups and regions. Using robust predictions of the model, I show that this distribution is nonparametrically identified from cross-regional variation in employment and wages induced by exogenous variation in sector labor demand. The application of this novel strategy to Brazilian regions suggest a rich estimated pattern of worker selection that cannot be captured by traditional models. In the counterfactual analysis, the estimates of comparative and absolute advantage attest the importance of selection forces in the behavior of wage differentials associated with educational groups, regions and sectors. References D. Acemoglu and D. H. Autor. Skills, tasks and technologies: Implications for employment and earnings. Handbook of Labor Economics, 4:1043–1171, 2011. E. Artuç, S. Chaudhuri, and J. McLaren. Trade Shocks and Labor Adjustment: A Structural Empirical Approach. The American Economic Review, pages 1008–1045, 2010. D. H. Autor, D. Dorn, and G. H. Hanson. The China Syndrome: Local Labor Market Effects of Import Competition in the United States. The American Economic Review, 103(6):2121–2168, 2013. D. H. Autor, D. Dorn, G. H. Hanson, and J. Song. Trade Adjustment: Worker-Level Evidence. The Quarterly Journal of Economics, 129(4):1799–1860, 2014. E. Bekman, J. Bound, and S. Machin. Implications of Skill-Biased Technological Change: International Evidence. The Quarterly Journal of Economics, 113(4):1245–1279, 1998. G. J. Borjas. Self-Selection and the Earnings of Immigrants. The American Economic Review, 77(4):531–553, 1987. A. Burstein and J. Vogel. International trade, technology, and the skill premium. Manuscript, Columbia University and UCLA, 2015. A. Burstein, E. Morales, and J. Vogel. Accounting for Changes in Between-Group Inequality. Technical report, Working Paper, 2015. L. Caliendo, M. Dvorkin, and F. Parro. The Impact of Trade on Labor Market Dynamics. Technical report, National Bureau of Economic Research, 2015. F. Costa, J. Garred, and J. P. Pessoa. Winners and Losers from a Commodities-for-Manufactures Trade Boom. Technical report, Centre for Economic Performance, LSE, 2014. A. Costinot. An elementary theory of comparative advantage. Econometrica, 77(4):1165–1192, 2009. A. Costinot and J. Vogel. Beyond Ricardo: Assignment Models in International Trade. Technical report, National Bureau of Economic Research, 2014. A. Costinot and J. E. Vogel. Matching and Inequality in the World Economy. Journal of Political Economy, 118(4):747–786, 2010. R. Dix-Carneiro. Trade liberalization and labor market dynamics. Econometrica, 82(3):825–885, 2014. R. Dix-Carneiro and B. K. Kovak. Trade reform and regional dynamics: evidence from 25 years of brazilian matched employer-employee data. Technical report, National Bureau of Economic Research, 2015a. R. Dix-Carneiro and B. K. Kovak. Trade liberalization and the skill premium: A local labor markets approach. Technical report, National Bureau of Economic Research, 2015b. 36 F. H. Ferreira, S. Firpo, and J. Messina. A more level playing field? Explaining the decline in earnings inequality in Brazil, 1995-2012. 2014. E. French and C. Taber. Identification of Models of the Labor Market. Handbook of Labor Economics, 4:537–617, 2011. J. A. Frias, D. S. Kaplan, and E. Verhoogen. Exports and within-plant wage distributions: Evidence from mexico. American Economic Review, 102(3):435–40, 2012. S. Galle, A. Rodriguez-Clare, and M. Yi. Slicing the Pie: Quantifying the Aggregate and Distributional Effects of Trade. UC Berkeley, mimeo, 2015. R. Gibbons and L. Katz. Does unmeasured ability explain inter-industry wage differentials? The Review of Economic Studies, 59(3):515–535, 1992. R. Gibbons, L. F. Katz, T. Lemieux, and D. Parent. Comparative Advantage, Learning, and Sectoral Wage Determination. Journal of Labor Economics, 23(4):681–724, 2005. P. K. Goldberg and N. Pavcnik. Distributional Effects of Globalization in Developing Countries. Journal of Economic Literature, 45(1):39–82, 2007. G. H. Hanson. The Rise of Middle Kingdoms: Emerging Economies in Global Trade. The Journal of Economic Perspectives, 26 (2):41–63, 2012. J. J. Heckman and B. E. Honore. The Empirical Content of the Roy Model. Econometrica, pages 1121–1149, 1990. J. J. Heckman and G. Sedlacek. Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of SelfSelection in the Labor Market. The Journal of Political Economy, 93(6):1077, 1985. E. Helpman, O. Itskhoki, and S. Redding. Inequality and unemployment in a global economy. Econometrica, 78(4):1239–1283, 2010. E. Helpman, O. Itskhoki, M.-A. Muendler, and S. J. Redding. Trade and inequality: From theory to estimation. Technical report, National Bureau of Economic Research, 2015. C.-T. Hsieh, E. Hurst, C. I. Jones, and P. J. Klenow. The Allocation of Talent and US Economic Growth. Technical report, National Bureau of Economic Research, 2013. R. W. Jones. Income distribution and effective protection in a multicommodity trade model. Journal of Economic Theory, 11(1): 1–15, 1975. G. Kambourov. Labour market regulations and the sectoral reallocation of workers: The case of trade reforms. The Review of Economic Studies, 76(4):1321–1358, 2009. B. K. Kovak. Regional Effects of Trade Reform: What is the Correct Measure of Liberalization? The American Economic Review, 103(5):1960–1976, 2013. D. Lagakos and M. E. Waugh. Selection, Agriculture, and Cross-Country Productivity Differences. The American Economic Review, 103(2):948–980, 2013. R. Z. Lawrence and M. J. Slaughter. International trade and American wages in the 1980s: giant sucking sound or small hiccup? Brookings papers on economic activity. Microeconomics, pages 161–226, 1993. R. L. Matzkin. Nonparametric Identification. Handbook of Econometrics, 6:5307–5368, 2007. C. B. Mulligan and Y. Rubinstein. Selection, investment, and women’s relative wages over time. The Quarterly Journal of Economics, 123(3):1061–1110, 2008. 37 D. Neal. Industry-specific human capital: Evidence from displaced workers. Journal of labor Economics, pages 653–677, 1995. W. K. Newey. Nonparametric Instrumental Variables Estimation. The American Economic Review, 103(3):550–556, 2013. W. K. Newey and J. L. Powell. Instrumental Variable Estimation of Nonparametric Models. Econometrica, 71(5):1565–1578, 2003. F. Ohnsorge and D. Trefler. Sorting It Out: International Trade with Heterogeneous Workers. Journal of Political Economy, 115 (5):868–892, 2007. D. Parent. Industry-specific capital and the wage profile: Evidence from the national longitudinal survey of youth and the panel study of income dynamics. Journal of Labor Economics, 18(2):306–323, 2000. A. D. Roy. Some Thoughts on the Distribution of Earnings. Oxford Economic Papers, 3(2):pp. 135–146, 1951. W. Stolper and P. A. Samuelson. Protection and Real Wages. Review of Economic Studies, 9(1):58–73, 1941. P. Topalova. Factor Immobility and Regional Impacts of Trade Liberalization: Evidence on Poverty from India. American Economic Journal: Applied Economics, 2(4):1–41, 2010. E. A. Verhoogen. Trade, quality upgrading, and wage inequality in the Mexican manufacturing sector. The Quarterly Journal of Economics, 123(2):489–530, 2008. R. Wacziarg and J. S. Wallack. Trade liberalization and intersectoral labor movements. Journal of International Economics, 64 (2):411–439, 2004. A. Young. Structural Transformation, the Mismeasurement of Productivity Growth, and the Cost Disease of Services. The American Economic Review, 104(11):3635–3667, 2014. 38 A Data Construction and Measurement Commodity Price Data. To capture Brazil’s exposure to world prices of basic products, I build price indices for six commodity categories. The first source of international commodity prices is the Commodity Research Bureau that publishes price indices by commodity group based on product spot prices in the main exchange markets in the United States. In the paper, I use those groups with sizeable employment participation in Brazil: Grains (corn, soybeans, and wheat), Soft Agriculture (cocoa, coffee, sugar, orange juice and others), Livestock (hides, hogs, lard, steers, tallow), Metals (copper scrap, lead scrap, steel scrap, tin, zinc). In addition, I build price indices for two commodity groups using future prices in the New York Mercantile Exchange: Precious Metals (gold and silver) and Energy (crude oil). These series of international nominal prices were converted into local currency using the nominal exchange rate and deflated by the Brazilian consumer price index (IPCA).33 To avoid short-term price volatility, the reduced-form analysis relies on the average price in the six months preceding the process of data collection of the Census. Specifically, I use the average price between March and August of each year. Labor Market Data. Data on labor market outcomes come from publicly available long versions of the Brazilian Census performed by the Brazilian Institute of Geography and Statistics (IBGE) for the years of 1980, 1991, 2000, and 2010. From the Census, I extract a sample of full-time workers aged between 16 and 64. Full-time workers are defined as those reporting more than 35 weekly worked hours. I restrict the sample to workers with calculated experience between 0 and 39 years. Experience is defined as the individual’s age minus a predicted initial working age that equals 23 for college graduates, 18 for high-school graduates and 15 for those with only primary education. The benchmark sample is further restricted to include only white male workers. This restriction allows us to focus on individuals with strong labor force attachment that are not directly subject to changes in employer prejudice attitudes towards female and non-white individuals. In fact, Ferreira, Firpo, and Messina (2014) show that gender and race wage differentials were important drivers of recent changes in the Brazilian wage inequality. In robustness exercises, I extend the benchmark sample to also include female and nonwhite workers. Individuals in the sample are allocated to sectors according to their self-reported industry of employment. Table A1 shows the industry classification used in this paper together with corresponding industry codes in each year of the Census. I use crosswalk tables publicly provided by the IBGE to link the different activity codes in the Censuses of 1980-1991, 2000 and 2010. The division of industries in the commodity sector accommodates available information on international prices as described above. Regional Labor Markets. As a regional labor market unit, I use the microregion concept created by the IBGE in the 1991 Census. Each of the 558 microregions corresponds to a set of economically integrated municipalities with interconnected labor markets. This definition of regional labor markets in Brazil is similar to Commuting Zones in the United States, being recently used in a series of papers analyzing the response of local labor markets to aggregate trade shocks (e.g. Kovak, 2013, and Dix-Carneiro and 33 All commodity price series were downloaded from the Global Financial Database. In the end of 2008, the soft and grains indices were unified under the foodstuff index. Thus, I build these series for 2009-2010 using each index description. Series of nominal exchange rate and IPCA were downloaded from the IPEADATA. 39 Table A1: Industry Classification and Census Activity Codes Industry 1980-1991 Census (Atividades) 2000 Census (CNEA-Dom) 2010 Census (CNEA-Dom 2.0) Commodity Sector Grains (corn, soybeans, wheat) Soft (coffee, cocoa, sugar, and others) 20; 21; 22 11; 12; 14-17; 23; 24 1102; 1103; 1107 1104; 1105; 1110-1116; 2001; 2002; 15022; 15042 1201-1205; 1208; 1209; 1300; 1402; 5001; 5002; 15010; 15030 1102; 1103; 1107 1104; 1105; 1110-1116; 10022; 10093 Livestock (cattle, hogs, and others) 26; 27; 41; 42 Metals (copper, lead, steel, tin, and zinc) Precious Metals (gold and silver) Energy (crude oil) Other agriculture and mining 58 55 51 13; 18-19; 25; 28; 29; 31-37; 50; 52-54; 56; 57; 59; 581 13002 13001 11000 1101; 1106; 1108; 1109; 1117; 1118; 1206; 1207; 1401; 10000; 12000; 14001-14004 7002 7001 6000 1101; 1106; 1108; 1109; 1117-1119; 1206; 1207; 1401; 5000; 8001-9000 Manufacturing 100-300 15021; 15041; 15043; 15050; 16000-37000 10021; 10091; 10092; 10099-32999; 38000 Non-Tradable Goods and Services 340-901 1500; 40010-99000 1201-1205; 1208; 1209; 1402; 1999; 3001; 3002; 10010; 10030 Non-Commodity Sector 1500; 33001-37000; 39000-99000 Kovak, 2015a,b). Despite the sharp increase in the number of municipalities between 1991 and 2010, the IBGE maintained the same microregion definition in the Censuses of 1991, 2000 and 2010.34 In the 1980 Census, the microregion variable does not exist so I create it from existing municipalities in 1980. Because of the change in municipality borders between 1980 and 1991, it is only possible to replicate a subset of the microregions using historical administrative borders. To be more precise, I recover 540 microregions in the 1980 Census compared to the 558 microregions in the 1991 Census.35 Wage Distribution. As a measure of individual wage, I use the self-reported monthly labor income in the main job. Table A2 presents the trends in log wage inequality among male white full-time workers. As discussed in the main text, the variance of log wage has an inverted U-shape in the period: it increased in the 1980s and decreased in the 1990s and 2000s. To attest that the behavior of log wage variance is not entirely related to individuals with extreme wage values, I consider the change in the differential between percentiles of the wage distribution. Specifically, the P90-P10 wage gap increased by 40 log-points in 1980-1991 and decreased by 66.3 log-points in 1991-2010. In terms of observable worker characteristics, the wage premium associated with two attributes were particularly important in the wage compression process of 1991-2010: educational and sectoral wage differentials fell respectively by 32.1 and 19.0 log-points in the period. To further investigate the movement in wage inequality, I perform a wage variance decomposition with a regression of log wages on a full set of dummies for years of experience, level of education, sector of employment and microregion of residence. Table A3 presents the result of this decomposition 34 There were 4,491 municipalities in 1991 and 5,565 in 2010. Out of the 1074 municipalities created in the period, 998 municipalities had parent municipalities in a single microregion and, therefore, they were allocated to this microregion. The other 76 municipalities had parent municipalities in more than one microregion. These municipalities, that represented .33% of employment in 2000, were allocated to the microregion of the parent municipality that ceded the highest population share to the new municipality. This procedure adopted by the IBGE minimizes any measurement error implied by the border change. In fact, all results in the paper are robust to using a sample of 491 microregions built by aggregating microregions such as to keep borders unchanged in the period. 35 Out of the 3,991 municipalities in the 1980 Census, I am able to link 3,938 municipalities to at least one of the 4,491 municipalities in the 1991 Census. With these linked municipalities, I construct microregions in 1980 using the microregion assigned to corresponding municipalities in 1991. The main problem of this method is the existence of new municipalities in 1991 that belonged to a different microregion than their parent municipalities in 1980. This is the case for 85 of the 500 municipalities created between 1980 and 1991, accounting for .67% of total employment in 2000. 40 Table A2: Trends in Brazilian Log Wage Inequality, 1980-2010 1980 1991 2000 2010 Variance 0.893 1.095 1.017 0.781 P90th - P50th P50th - P10th P90th - P10th 1.378 0.944 2.322 1.476 1.247 2.722 1.492 1.092 2.584 1.386 0.673 2.060 High-school wage differential 1.161 1.046 0.965 0.725 Non-commodity sector wage differential 0.388 0.422 0.358 0.231 Note. Sample of male white full-time workers extracted from the Brazilian Census of 1980, 1991, 2000 and 2010. Education and sector wage differentials implied by a regression of log real wage on a full set of dummies for high-school degree, employment in the commodity sector, microregion of residence, and experience (0-39yrs). between 1980 and 2010. The education-sector-region component is the main driver of wage dispersion, accounting for 49.8% of the 1980-1991 growth and 69.1% of the 1991-2010 decline. Notice that this component’s contribution is distributed across all its three terms. Conclusions in Table A3 are related to results reported elsewhere in the literature. In particular, Ferreira, Firpo, and Messina (2014) highlight the importance of falling educational and state wage gaps for the 1995-2012 decrease in wage inequality observed among workers in the PNAD survey. Using administrative data for formal sector employees, Helpman, Itskhoki, Muendler, and Redding (2015) conclude that observable worker attributes account for roughly half of the increase in log wage variance between 1986 and 1995. Table A3: Decomposition of the Variance of Log Wages, 1980-2010 1980 1991 2000 2010 Overall Residual Between Education-Region-Sector Education Region Sector Covariance (education, region, sector) Experience Covariance (experience, education-region-sector) 0.893 1.095 1.017 0.781 0.481 0.412 0.593 0.502 0.581 0.436 0.524 0.257 0.356 0.457 0.387 0.240 0.181 0.059 0.027 0.089 0.205 0.107 0.029 0.117 0.209 0.073 0.017 0.088 0.131 0.055 0.006 0.049 0.095 -0.040 0.077 -0.033 0.099 -0.050 0.059 -0.042 Note. Sample of male white full-time workers extracted from the Brazilian Census of 1980, 1991, 2000 and 2010. Wage decomposition implied by a regression of log real wage on a full set of dummies for high-school degree, employment in the commodity sector, microregion of residence, and experience (0-39yrs). Sample of Microregions in the Reduced-Form Exercise. In the reduced-form exercise, I select a baseline sample of 518 microregions with positive employment in the commodity sector for all groups in the 1991-2010 period, covering 98.4% of the country’s population on 1991. The 1980-1991 period is excluded from the baseline sample mainly because of the turbulent economic environment in Brazil during the 1980s. The decade was marked by hyperinflationary episodes, suspension of foreign currency convertibility, and the adoption of restrictive internal controls on prices and wages. In this environment, it is not clear that relative international prices were very informative about relative prices faced by domestic producers when deciding resource allocation. More normal economic conditions 41 returned after the series of structural reforms implemented in 1993-1994 that brought monetary stabilization, eliminated price controls, and restored full currency convertibility. In any case, robustness exercises attest that similar results hold in the extended sample spanning the entire 1980-2010 period. Bartik-Instrument Weights. To build the group-region exposure to international commodity prices, I compute total labor income by industry as the weighted sum of monthly wages of individuals reportk ( j ) as the total ing to have their main job in the industry using Census sampling weights. Denote Yg,r,t labor payments of industry j for workers of group g in microregion r on year t. The initial industry participation in sector labor payments to group g in microregion r is k φg,r ( j) ≡ Yg,r,1991 ( j)/ ∑ k Yg,r,1991 ( j) j∈J k where J k is the set of industries in sector k. Table A4 reports summary statistics of industry composition in the sample of 518 microregions on 1991. Columns (1) and (3) indicate that regions, on average, have a large fraction of their work force allocated to the commodity sector with agriculture accounting for the bulk of the sector’s labor expenditure. Importantly, columns (2) and (4) document great heterogeneity in industry composition across microregions. Comparing columns (1) and (3), it is possible to identify different exposure patterns for the two groups. While HSD are more likely to be employed in the production of grains and soft agricultural items, HSG are more likely to be employed in the production of livestock and crude oil. Table A4: Summary Statistics: Labor Income Share by Industry in Brazil, 1991 High-School Graduates Industry High-School Dropouts Mean (1) SD (2) Mean (3) SD (4) 9.0% 9.6% 21.6% 19.7% 4.7% 13.0% 35.5% 3.0% 1.0% 8.4% 34.3% 11.6% 16.1% 21.1% 7.2% 4.0% 17.1% 20.7% 10.1% 19.5% 26.8% 1.6% 1.8% 2.3% 37.9% 16.3% 18.0% 15.6% 4.2% 4.7% 6.2% 19.8% 2. Manufacturing 16.1% 10.7% 18.1% 11.3% 3. Non-Tradable Goods and Services 74.9% 10.8% 60.4% 14.8% 1. Commodity Sector Grains (corn, soybeans, wheat) Soft (coffee, cocoa, sugar and other) Livestock (cattle, hogs, and others) Metals (copper, lead, steel, tin, and zinc) Precious Metals (gold and silver) Energy (crude oil) Other agriculture and mining Note. Sample of male white full-time workers extracted from the Brazilian Census of 1991. Statistics are weighted by the microregion share in national population on 1991. Regional Labor Market Outcomes. For each triple of group-region-year, I compute labor market outcomes used in the reduced-form exercise: sector employment, sector average log wage and average log wage. To calculate wage outcomes, I estimate wage regressions separately for each year using the entire sample of workers in the country. Specifically, I regress the log monthly wage on a full set of experi42 ence dummies (0-39yrs) interacted with dummies for female and white workers. The residual of this regression corresponds to a wage measure adjusted for variation in these demographic characteristics across groups and microregions. For each triple of group-region-year, the sector average log wage is the weighted average of the adjusted wage among individuals reporting to have their main job in the sector. Lastly, the average log wage is the weighted average of the adjusted wage among all individuals in the triple of group-region-year. In both cases, the computation uses Census sampling weights. Sector employment corresponds to the sum of efficiency-adjusted hours supplied by sector employees in a group-region-year. First, the efficiency adjustment is performed by multiplying individual weekly hours by a time-invariant measure of relative wage for each cell of sex-race-educationk , is the weighted sum of efficiency-adjusted experience.36 Second, the total sector employment, Hg,r,t hours of individuals reporting to have their main job in the sector computed with Census sampling k k / ( H C + H N ). weights. Thus, sector employment share is defined as l g,r,t ≡ Hg,r,t g,r,t g,r,t To obtain total labor supply, I use the aggregate amount of efficiency-adjusted hours in a groupC + H N . Lastly, the labor supply of immigrants is computed exactly as above region-year: H̄g,r,t ≡ Hg,r,t g,r,t in a restricted sample of individuals identified as non-native residents of each microregion.37 Task Price Estimation Data. In the structural exercise, the estimation of sector-specific task prices requires initial sector employment and wage growth across percentiles of the wage distribution for each triple of group-region-period. To this extent, I compute percentiles of log hourly wage from the sample of individuals in a group-region-period using Census sampling weights. Individuals are distributed across percentile bins according to their log hourly wage — in the benchmark specification, I consider bins with width of 1pp. The sector employment share in each percentile bin corresponds to the fraction of efficiency-adjusted hours reported by sector employees in that percentile bin. Additionally, the wage growth in each percentile corresponds to the difference in the average hourly wage of individuals in that percentile bin between two consecutive years. Since extreme wage values are more likely to be generated by measurement error, I ignore the wage distribution tails by restricting the estimation to bins between the 6th and the 94th percentiles. Thus, the task price estimation is based on 88 percentile bins for each of the 2,072 group-region-period triples. Chinese Imports Data. The series of Chinese imports by industry used in the robustness exercise below is constructed from imports information at the 2-digit SITC (rev3) industry level in the Comtrade database. I use China imports from the rest of the world in Food & live animals, Crude material, Mineral fuel, and Animal/Veg Oil. I map the 26 industries in these groups to the six commodity categories on Table A1 (crosswalk available upon request). 36 I consider 48 cells based on two sex groups, two race groups (white and non-white), three educational groups (highschool dropouts, high-school graduates and college graduates) and four experience groups (0-9, 10-19, 20-29 and 30-39 years). For each cell, the relative wage is the average hourly wage divided by the average wage of female non-white high-school dropouts with 0-9 years of experience. The cell weight is the average relative wage across regions and years (1991, 2000, 2010). In the 1980 Census, weekly hours are only reported in ranges. To compute efficiency-adjusted hours, I assign 45 and 54 weekly hours to individuals reporting respectively 40-48 and 49+ hours. 37 Given the information in the Census, I can only identify as microregion natives those individuals satisfying one out of two conditions. First, they were born in the same municipality that they current live in. Second, if they were born in a different municipality, then I also consider microregion natives those that moved into the current municipality from another municipality in the same microregion during the previous ten years. 43 B Reduced-Form Evidence: Sensitivity Analysis Having established the main results in Section 3, I turn to an empirical investigation of the robustness of these results taking as starting point the detailed specification with period dummies interacted with macroregion dummies, initial labor market conditions and initial commodity sector size controls. Additional Controls. Table B1 extends the baseline model with additional demographic controls in the initial period. In columns (2), (6) and (10), I investigate the effect of the increase in employment formalization in Brazil in 1991-2010 — for the share of formalized employees increased from 72.3% in 1991 to 82.3% in 2010. To the extent that formalization was lower in rural areas, this increase is potentially correlated with regional exposure to commodity price shocks. I address this concern with the inclusion of the initial share of formal sector employees in the microregion.38 Yet, these controls have almost no impact on estimated coefficients. The other two variables refine the set of initial sector composition controls, capturing more flexibly confounding effects related to heterogeneity in regional exposure to sectoral shocks. I introduce the initial share of group labor income in manufacturing industries (columns 3, 7 and 11) and in agriculture and mining products without international prices (columns 4, 8 and 12). By interacting these variables with period dummies, I allow for an arbitrary change in the price of these products relative to the price of internationally traded commodities. In both cases, coefficients are essentially the same for HSG, but they are slightly lower for the wage and sector employment responses of HSD remaining nevertheless statistically significant at 5%. Additional Period. Table B2 estimates the model in the extended sample spanning the entire period of 1980-2010. As argued above, the peculiar economic conditions in Brazil could potentially weaken the connection between domestic and international commodity prices during the 1980s. Yet, columns (2) and (5) indicate very similar responses in terms of group wages and commodity sector employment. Differences arise for the response of the commodity sector wage differential in column (8). In this case, the coefficient for HSG falls by 40%, moving towards the lower bound of the baseline confidence interval. For HSD, we obtain a higher and more precise coefficient compared to the nonsignificant coefficient implied by the baseline specification. Compared to the period of 1991-2010, the 1980s exhibit another important difference: commodity prices experienced strong losses in the decade. Taking advantage of this qualitatively different price behavior, columns (3), (6) and (9) estimate the model using only within-microregion variation across time — that is, I include microregion-specific time trends in the model. This specification addresses concerns that shock exposure is picking up secular trends in microregions specialized in the commodities with larger price gains in 1991-2010. Although the microregion fixed effects absorb much of the cross-section variation in labor market outcomes, they have little effect on estimated coefficients.39 38 Restricting the sample to formal sector employees creates a selection problem because low-wage individuals are more likely to drop out of the sample which attenuates wage gains in the period. To avoid this issue, I control for the pre-sample formal sector size which captures any correlation between production specialization and employment formalization. 39 In omitted exercises, I have estimated the model with micreoregion fixed effects in the baseline sample of 1991-2010. In this case, standard errors become five to ten times higher. This increase is related to the high correlation in shock exposure between 1991-2000 and 2000-2010 — the autocorrelation of shock exposure is 0.734. Consequently, there is little within- 44 Additional Worker Groups. In Table B3, I extend the sample to include female and non-white individuals. With this exercise, I evaluate whether these additional worker groups exhibit similar behaviors in the labor market. This possibility is specially relevant given the large changes in gender and race wage gaps in the period (Ferreira, Firpo, and Messina, 2014). In columns (2), (6) and (10), I include female white individuals without significant changes in estimated coefficients. The inclusion of non-white male individuals entails a more intricate change in estimated coefficients as shown in columns (3), (7) and (11). Responses of sector employment and wages became weaker for HSG in Panel A, but the opposite is true for HSD in Panel B. These different estimated responses are likely related to differences between white and non-white individuals in terms of unobservable characteristics driving their sectoral allocation. This is particularly important among HSG because of the extremely low high-school graduation rate among non-white individuals in Brazil. IV: Exposure to Chinese Imports Growth. Now I address concerns regarding endogeneity of relative commodity prices introduced if Brazilian regions are large enough to affect the world price of basic commodities. For example, a negative shock to a region’s commodity sector labor supply would simultaneously imply a decrease in sector employment and an increase in world prices, introducing a negative bias in the OLS estimates. Motivated by the strategy in Autor, Dorn, and Hanson (2013), I propose an instrument based on the regional exposure to Chinese imports growth. That is, I instrument ∆Zg,r,t with the following variable: ∆Mg,r,t = ∑ φg,r ( j) · ∆ ln Mt ( j) j∈J C where Mt ( j) is the total value of Chinese imports of product j in year t. Table B4 presents the estimation of this model by 2SLS. Estimated coefficients for HSG are essentially the same albeit more imprecise. For HSD, there is an increase in point estimates, but this change is not significant given the even stronger increase in standard errors. Shock Exposure and Labor Supply. The labor demand shock captured in the exposure variable has the potential to affect the total quantity of hours supplied by workers in a microregion. Such response can be related to changes in the labor supply of native workers and/or changes in the flow of immigrant workers. To investigate the labor supply response to commodity price shocks, Table B5 estimates model (2) for the log change in total hours supplied by all workers in columns (1)-(2) and immigrant workers in columns (3)-(4). Among HSD, shock exposure is not related to the labor supply of both native and non-native workers. For HSG, columns (1) and (3) indicate a weak negative relation between their labor supply and the commodity price shock. Yet, this negative relation disappears once the initial educational composition of the labor force is included in columns (2) and (4). These controls capture the educational achievement convergence across Brazilian microregions in the period, eliminating the relation between labor supply growth and shock exposure.40 Overall, Table B5 suggests limited responses in regional labor supply following commodity price shocks. microregion exposure variation to precisely estimate the coefficient of interest. When the 1980-1991 period is included, there is a significant increase in exposure variation within microregions, leading to the more precise results in Table B2. 40 In omitted exercises, I attest that educational compositional controls do not affect the results in Tables 1 and 2. 45 Alternative Measure of Shock Exposure. Table B6 investigates the robustness of sector-level responses to the use of the alternative exposure measure computed with the overall industry participation in the group-region wage bill. Columns (1)-(3) indicate that employment in the commodity sector is positively related to this alternative measure of price shock exposure. Also, predicted elasticities are similar to those in the baseline specification: using the average commodity sector size for each group, estimated coefficients in column (1) imply that a 1% increase in commodity prices cause the commodity sector employment share to increase by .029 p.p. for HSG and .069 p.p. for HSD. Column (4)-(6) perform the same exercise for the change in the commodity sector wage premium. Similar to the baseline specification, there is a significant response in sector wage differential for HSG. However, the alternative measure also predicts a significant positive response in the sector wage premium for HSD — notwithstanding the coefficient remains lower than its counterpart for HSG. 46 47 0.636 R2 Yes Initial commodity sector size controls x period dummies No No No Initial size of manufacturing sector x period dummies Initial size of other agriculture and mining in the commodity sector x period dummies No Yes No Yes Yes 0.645 0.201* (0.086) 0.443 0.277** (0.060) (3) Yes No No Yes Yes 0.639 0.217* (0.101) 0.440 0.283** (0.067) (4) No No No Yes Yes 0.559 1036 0.099** (0.029) 0.405 0.038** (0.010) (5) No No Yes Yes Yes 0.565 1036 0.097** (0.029) 0.406 0.039** (0.010) (6) No Yes No Yes Yes 0.560 1036 0.089** (0.029) 0.407 0.039** (0.010) (7) Yes No No Yes Yes 0.566 0.067* (0.030) 0.406 0.036** (0.010) (8) Change in Commodity Sector Employment Share No No No Yes Yes 0.218 1036 0.059 (0.161) 0.220 0.416** (0.098) (9) No No Yes Yes Yes 0.221 1036 0.062 (0.162) 0.223 0.420** (0.098) (10) No Yes No Yes Yes 0.221 1036 0.080 (0.163) 0.220 0.410** (0.100) (11) Yes No No Yes Yes 0.221 0.000 (0.171) 0.221 0.414** (0.100) (12) Change in Commodity Sector Average Log Wage Premium Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regressions are weighted by the microregion share in national population on 1991 and include ten macroregion-period dummies. Baseline controls as in Table 1. Standard Errors clustered by microregion. ** p<0.01, * p<0.05 No Yes No Yes Yes 0.637 Initial size of formal sector x period dummies Additional Controls Yes Initial labor market conditions x period dummies Baseline Controls 0.268** (0.098) 0.441 0.440 0.268** (0.099) 0.286** (0.064) (2) 0.288** (0.064) (1) Change in Average Log Wage Commodity price shock B. High-School Dropouts R2 Commodity price shock A. High-School Graduates Dependent Variable: Table B1: Exposure to Commodity Price Shocks and Labor Market Outcomes, Additional Controls 48 No Extended sample: including 1980-1991 Yes Yes No Yes Yes 0.624 0.235** (0.054) 0.500 0.214** (0.049) (2) Yes Yes Yes Yes Yes 0.706 0.285** (0.079) 0.597 0.231** (0.069) (3) No Yes No Yes Yes 0.559 1036 0.099** (0.029) 0.405 0.038** (0.010) (4) Yes Yes No Yes Yes 0.490 1036 0.071** (0.021) 0.319 0.051** (0.013) (5) Yes Yes Yes Yes Yes 0.603 1036 0.068* (0.028) 0.417 0.062** (0.018) (6) Change in Commodity Sector Employment Share No Yes No Yes Yes 0.218 1036 0.059 (0.161) 0.220 0.416** (0.098) (7) Yes Yes No Yes Yes 0.259 1036 0.339** (0.095) 0.223 0.234* (0.099) (8) Yes Yes Yes Yes Yes 0.426 1036 0.417** (0.125) 0.322 0.218+ (0.132) (9) Change in Commodity Sector Average Log Wage Premium Note. Stacked sample of 518 microregions in baseline sample and 503 microregions in extended sample. All regressions are weighted by the microregion share in national population on 1991 and include macroregion-period dummies and initial share of group labor income in the commodity sector. Labor market conditions as in Table 1. Industry composition measured in the initial period of 1991 for baseline sample and of 1980 for extended sample. Commodity sector size controls in extended sample: share in group labor income of other agriculture and mining industries in the commodity sector, and quadratic polynomial of the share in group labor income of the commodity sector (columns 2-3). Standard Errors clustered by microregion. ** p<0.01, * p<0.05, + p<0.1 Yes Baseline sample: 1991-2000 and 2000-2010 Sample Period Microregion-specific linear time trend No Yes Additional Controls Yes Initial commodity sector size controls x period dummies 0.636 Initial labor market conditions x period dummies Baseline Controls R2 Commodity price shock 0.268** (0.099) 0.440 R2 B. High-School Dropouts 0.288** (0.064) (1) Change in Average Log Wage Commodity price shock A. High-School Graduates Dependent Variable: Table B2: Exposure to Commodity Price Shocks and Labor Market Outcomes, Additional Period, 1980-2010 49 Yes No No Including female Including non-white No Yes Yes Yes Yes 0.650 0.295** (0.099) 0.539 0.240** (0.064) (2) Yes No Yes Yes Yes 0.653 0.317** (0.110) 0.561 0.303** (0.062) (3) Yes Yes Yes Yes Yes 0.664 0.338** (0.108) 0.617 0.260** (0.063) (4) No No Yes Yes Yes 0.559 1036 0.099** (0.029) 0.405 0.038** (0.010) (5) No Yes Yes Yes Yes 0.591 1036 0.113** (0.031) 0.420 0.032** (0.008) (6) Yes No Yes Yes Yes 0.620 0.113** (0.031) 0.410 0.028** (0.009) (7) Yes Yes Yes Yes Yes 0.652 1036 0.121** (0.034) 0.441 0.022** (0.008) (8) Change in Commodity Sector Employment Share No No Yes Yes Yes 0.218 1036 0.059 (0.161) 0.220 0.416** (0.098) (9) No Yes Yes Yes Yes 0.254 1036 0.091 (0.149) 0.280 0.469** (0.100) (10) Yes No Yes Yes Yes 0.258 0.234* (0.099) 0.276 0.359** (0.099) (11) Yes Yes Yes Yes Yes 0.285 1036 0.227* (0.090) 0.359 0.431** (0.101) (12) Change in Commodity Sector Average Log Wage Premium Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regression are weighted by the microregion share in national population on 1991 and include ten macroregion-period dummies. Baseline controls as in Table 1. Standard Errors clustered by microregion. ** p<0.01, * p<0.05 Yes Baseline sample: male / white Additional worker groups Yes Initial commodity sector size controls x period dummies 0.636 Initial labor market conditions x period dummies Baseline Controls R2 Commodity price shock 0.268** (0.099) 0.440 R2 B. High-School Dropouts 0.288** (0.064) (1) Change in Average Log Wage Commodity price shock A. High-School Graduates Dependent Variable: Table B3: Exposure to Commodity Price Shocks and Labor Market Outcomes, Additional Worker Groups Table B4: Exposure to Commodity Price Shocks and Labor Market Outcomes, 2SLS using Exposure to Chinese Imports Growth Change in Average Log Wage Dependent Variable: Change in Commodity Sector Employment Share Change in Commodity Sector Average Log Wage Premium (1) (2) (3) (4) (5) (6) 0.286** (0.064) 0.262** (0.095) 0.039** (0.010) 0.035+ (0.020) 0.420** (0.098) 0.398+ (0.205) A. High-School Graduates Commodity price shock F-stat of excluded variable 36.1 36.1 36.1 B. High-School Dropouts Commodity price shock 0.268** (0.098) 0.746** (0.271) F-stat of excluded variable 0.097** (0.029) 0.261* (0.109) 28.6 0.062 (0.162) 0.575 (0.364) 28.6 28.6 Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. 2SLS estimator where the instrument is the microregion's exposure to growth in Chinese imports. All regressions are weighted by the microregion share in national population on 1991. Regressions include period dummies interacted with macroregion dummies, initial formal sector size and baseline controls described on Table 1. Standard Errors clustered by microregion. ** p<0.01, * p<0.05, + p<0.1 Table B5: Exposure to Commodity Price Shocks and Labor Supply Change in Log of Total Labor Supply Dependent Variable: Change in Log of Immigrants' Labor Supply (1) (2) (3) (4) -0.529+ (0.271) 0.051 (0.114) -0.535+ (0.279) 0.055 (0.131) 0.686 0.811 0.649 0.753 Commodity price shock 0.006 (0.173) 0.209 (0.156) 0.005 (0.221) 0.226 (0.202) R2 0.847 1036 0.877 1036 0.845 1036 0.869 1036 Initial labor market conditions x period dummies Yes Yes Yes Yes Initial commodity sector size controls x period dummies Yes Yes Yes Yes Initial educational composition of labor force x period dummies No Yes No Yes A. High-School Graduates Commodity price shock R2 B. High-School Dropouts Baseline Controls Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regressions are weighted by the microregion share in national population on 1991 and include ten macroregion-period dummies. Baseline controls as in Table 1. Educational composition of labor force: share of colloge graduates and share of high-school graduates and some college. Standard Errors clustered by microregion. + p<0.1 50 Table B6: Exposure to Commodity Price Shocks and Sector Labor Outcomes, Alternative Measure Dependent Variable: Change in Commodity Sector Employment Share Change in Commodity Sector Average Log Wage Premium (1) (2) (3) (4) (5) (6) Commodity sector size x Commodity price shock 0.193* (0.094) 0.196* (0.089) 0.158+ (0.087) 1.971* (0.858) 2.032* (0.870) 1.365+ (0.703) R2 0.282 0.341 0.402 0.143 0.195 0.215 Commodity sector size x Commodity price shock 0.183* (0.088) 0.175+ (0.092) 0.161+ (0.092) 0.683** (0.227) 1.075** (0.244) 1.068** (0.241) R2 0.507 1036 0.555 1036 0.557 1036 0.164 1036 0.227 1036 0.228 1036 Initial labor market conditions x period dummies No Yes Yes No Yes Yes Initial commodity sector size controls x period dummies No No Yes No No Yes A. High-School Graduates B. High-School Dropouts Baseline Controls Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. All regressions are weighted by the microregion share in national population on 1991 and include ten macroregion-period dummies. Baseline controls as in Table 1. Standard Errors clustered by microregion. ** p<0.01, * p<0.05, + p<0.1 51 C Two-Sector Roy Economy: Competitive Equilibrium In this section, I describe the production and market structures that generate generate the demand for sector-specific tasks with log-prices given by (ω gk , ω gN ). To this extent, I assume that each market is a small open economy with a segmented competitive labor market. Specifically, goods are freely traded in the world market but factors of production are not. Therefore, product prices are exogenously determined internationally and factor prices are endogenously determined locally. C.1 Demand of Sector-Specific Tasks Suppose sector k is constituted of multiple competitive industries, j ∈ J k . In each industry j of sector k, the production technology requires workers of group g to perform a sector-specific task. The production function combines the sum of the sector-specific task produced by all industry employees, k,j { Tg } g , and other industry-specific inputs, Xk,j . Formally, the production function is q k,j =Q k,j k,j {T }g , X k,j k,j |Bm ˆ Tgk s.t. k,j = i ∈I g Tgk (i ) · L g (i ) di (28) k,j where L g (i ) is an indicator function that equals one if individual i is employed in industry j of sector k; and Xk,j is the vector of industry-specific inputs. In equation (28), the central restriction is that production in every industry j of sector k requires the same sector-specific task. Thus, individuals are perfect substitutes across industries within the sector. Notice that the production function allows, but does not require, tasks of different groups to be imperfect substitutes. Firms take as given the unit log-price of the sector-specific task, ω gk . Hence, the demand for the k,j k,j sector-specific task of group g in industry j of sector k is Tg ({wkg } g |Zk,j ) = Tg such that ω gk = ln pk,j + ln k,j ∂Qk { Tg } g , X̄k,j Bk,j ∂Tgk for all g where Dk,j ≡ ln pk,j , Bk,j , X̄k,j is the vector of industry-sector demand shifters with X̄k,j denoting the endowment of the industry-specific factor. Lastly, the sector-specific task demand combines the task demand of all industries in the sector: Tgk {ω gk } g Dk = ∑ k,j Tg {ω gk } g Dk,j (29) j∈J k where Dk ≡ Dk,j j∈J k is the vector of sector demand shifters.41 41 The simplicity of this production structure sheds light on the distributive channel implied by worker heterogeneity in the Two-Sector Roy Economy. Accordingly, one could consider alternative environments with a generic demand for sectorspecific tasks as in the case of non-competitive product markets and additional mobile factors of production. 52 C.2 Supply of Sector-Specific Tasks In the analysis of Section 4, I have focused on the structure of employment and wages in the TwoSector Roy Economy. Here, I describe the consequences of this structure to the supply of sector-specific tasks. Recall that preferences in (3) imply individuals with comparative advantage below α g (l gN ) are employed in the non-commodity sector. As a result, the total supply of tasks in the non-commodity sector is T̄gN l gN ˆ ≡ Īg · l gN E exp( a g (i ))|s g (i ) = α g (q) dq (30) 0 where Īg is the mass of individuals of group g; and E exp( a g (i ))|s g (i ) = α g (q) is determined by the conditional distribution of absolute advantage in (4). Analogously, individuals with comparative advantage above α g (l gN ) are employed in the commodity sector. Thus, T̄gC l gN ˆ ≡ Īg · 1 l gN exp(α g (q)) · E exp( a g (i ))|s g (i ) = α g (q) dq. (31) The supply of sector-specific tasks depends on both the skill distribution and the endogenous sector employment composition. As discussed in Section 4, the sector employment composition, l gN , is determined by the free mobility condition (5) which I repeat here to facilitate exposition: ω gN − ω gC = α g l gN . C.3 (32) Competitive Equilibrium Market Clearing. The competitive equilibrium requires the equalization of supply and demand for tasks in each sector. For group g in sector k, this is equivalent to Tgk {ω gk } g Dk = T̄gk l gN (33) where Tgk {ω gk } g Zk is the task demand in (29), and T̄gk l gN is the task supply in (30)-(31). The following definition outlines the competitive equilibrium of the Two-Sector Roy Economy. Definition 1. [Competitive Equilibrium] The competitive equilibrium in the Two-Sector Roy Economy is determined by the vector of sector-specific task prices, {(ω gC , ω gN )} g , such that, for all g and k, (i) sector-specific task demand, Tgk {ω gk } g Dk , is given by (29); (ii) sector-specific task supply, T̄gk l gN , is given by (30)-(31); (iii) sector employment composition, l gN , is given by (32); and (iv) market clearing is given by (33). 53 D Theory: Auxiliary Results and Model Extensions D.1 Lemma 1 Lemma 1. Consider a model of the form: em = ym − Φ (lm ) − Xm γ. Assume there is an instrumental variable Zm satisfying Assumption 4. If E[em | Zm , Xm ] = 0, then the function Φ(.) is identified up to a constant. With the normalization E[Xm γ] = 0, the constant in Φ(.) is also identified. Proof. First, E[em | Zm , Xm ] = 0 implies that E [Φ (lm ) + Xm γ| Zr,t , Xr ] = E [ym | Zm , Xm ]. Now let us proceed by contradiction. Suppose there exist Φ̃ (lm ) and γ̃ such that E Φ̃ (lm ) + Xm γ̃| Zm , Xm = E [ym | Zm , Xm ]. By Assumption 4, Φ (lm ) − Φ̃ (lm ) = Xm (γ − γ̃) almost surely. Take m and m0 such that Xm = Xm0 . The condition above implies that, for all lm and lm0 , we must have Φ (lm ) − Φ̃ (lm ) = Φ (lm0 ) − Φ̃ (lm0 ). Thus, Φ(.) is identified up to a constant. To determine this constant, we can use the normalizations E[em ] = E[Xm γ] = 0 which imply that E[Φ̃ (lm )] = E[ym ]. D.2 Derivation of Equation (19) The analysis in this section considers a shock to endogenous and exogenous variables in an arbitrary region and year; accordingly, I drop indices r and t to simplify the notation. Define individual i’s log-wage as y g (i ) such that y g (i ) = max{ω gC + s g (i ) + a g (i ); ω gN + a g (i )}. Under Assumption 1, this implies that the log-wage distribution of group g is given by Pr [y g (i ) ≤ y] = Pr [y g (i ) ≤ y; s g (i ) ≤ ω gN − ω gC ] + Pr [y g (i ) ≤ y; s g (i ) > ω gN − ω gC ] ˆ lgN ˆ 1 h i h i N Pr a g (i ) ≤ y − ω gC − α g (q) − ũ g α g (q) dq = Pr a g (i ) ≤ y − ω g α g (q) dq + (34) l gN 0 where Pr [ a g (i ) ≤ a|s̃ g (i ) = s] ≡ µHga ( a|s) + (1 − µ) H e ( a|u g , θg ). Recall that, by construction, Yg (π ) is implicitly defined as the solution of Pr y g (i ) ≤ Yg (π ) = π. Taking a first-order expansion of this equation, f gN Yg (π ) + where f gN (y) ≡ f gC Yg (π ) ∂Pr [y g (i )≤y; s g (i )≤ω g ] , ∂y ∆Yg (π ) = f gN Yg (π ) · ∆ω gN + f gC Yg (π ) · ∆ω gC − ∆v0g (π ) f gC (y) ≡ ∂Pr [y g (i )≤y; s g (i )>ω g ] , ∂y and ω g ≡ ω gN − ω gC . In this expression, the first-order impact of the endogenous change in l gN is eliminated by the employment condition (5), reflecting the fact that marginal workers are indifferent between the two sec54 tors. The term ṽ˜ g (π ) incorporates changes to other exogenous parameters of the skill distribution: "ˆ ∆v0g (π ) ≡ (1 − µ ) l gN 0 ∇θ H ˆ " + f gC Yg (π ) + (1 − µ) 0 l gN e Yg (π ) − ω gN ũ g , θg ∇u H e ˆ dq + 1 l gN ∇θ H ˆ N Yg (π ) − ω g ũ g , θg dq + e 1 l gN Yg (π ) − ω gC ∇u H e # − α g (q) − ũ g ũ g , θg dq · ∆θg Yg (π ) − ω gC − α g (q) − ũ g ũ g , θg dq !# i h To obtain equation (19), notice that, by definition, l gN (π ) ≡ P s g (i ) ≤ ω gN − ω gC y g (i ) = Yg (π ) which is equivalent to i h P y g (i ) = Yg (π ); s g (i ) ≤ ω gN − ω gC f gN Yg (π ) N l g (π ) = = N P y g (i ) = Yg (π ) f g Yg (π ) + f gC Yg (π ) Thus, ∆Yg (π ) = ∆ω gC · l gC (π ) + ∆ω gN · l gN (π ) + ∆v00g (π ) i h where ∆v00g (π ) ≡ −∆v0g (π ) f gN Yg (π ) + f gC Yg (π ) . Finally, equation (19) is obtained by assuming a particular structure of observable and unboservable components of ṽ g (π ): ∆v00g (π ) ≡ µ g · X̃ g (π ) + ∆v g (π ). D.3 Model Extensions: Non-monetary Employment Benefits This section extends the Two-Sector Roy Economy of Section 4 by incorporating non-monetary employment benefits — a reduced-form for work conditions and switching cost. The environment of Section 4 remains the same except that now I allow workers to derive utility from both consumption and employment. In particular, assume that, if employed in sector k, a consumption bundle c yields utility τgk · u(c) where u(.) is homogeneous of degree one. Thus, individual i’s payoff of employment in sector k is given by Ugk (i ) = τgk · wkg Tgk (i ) P (35) where τgk is the private benefit of being employed in sector k for workers of group g; and P is the price index (i.e., the cost of achieving one utility unit). Without loss of generality, normalize τgC ≡ 1 and denote τg ≡ ln τgN . Assuming that workers maximize their total utility, individual i chooses to be employed in sector K g (i ) such that n K g (i ) = arg max ω gC + ln TgC (i ); o τg + ω gN + ln TgN (i ) . Following the same steps of Section 4, the employment share in the non-commodity sector is determined by the following condition: ω gN − ω gC = α g l gN − τg . 55 (36) ∆ũ g . Intuitively, the choice of employment balances potential earnings against amenities of the sector. These amenities are captured by the wedge τg . Those individuals with strong comparative advantage in the commodity sector (i.e., high s g (i )) obtain higher income in the commodity sector compared to the non-commodity sector. To the extent that the higher income compensates for any non-monetary benefit, these individuals choose to be employed in the commodity sector. Hence, every individual with s g (i ) above the threshold ω gN − ω gC + τg,m self-selects into the commodity sector. The private benefit only affects the allocation of individuals to sectors. Conditional on the sector allocation, workers have the same sector-specific task efficiency. Hence, the remaining equilibrium conditions are not modified with the average sector wage is still given by (6)-(7). The main modification implied by private employment benefits is the introduction of compositional effects in the aggregate wage distribution. To see this, notice that expression (34) still determines Yg (π ), but its first-order approximation contains one extra term: f gN Yg (π ) + f gC Yg (π ) ∆Yg (π ) = f gN Yg (π ) · ∆ω gN + f gC Yg (π ) · ∆ω gC − ṽ˜ g (π )+ i h + Hga Yg (π ) − ω gN − τg α g (l gN ) − Hga Yg (π ) − ω gN α g (l gN ) ∆l gN . The extra term in this equation is directly implied by equation (36). It is zero if τg = 0; however it cannot be ignored if τg 6= 0. To obtain a clear interpretation for this term, consider the fraction of marginal workers with a wage of Yg (π ): ψg (π ) ≡ P[s̃ g (i ) = α g (l gN )y g (i ) = Yg (π )]. These marginal workers are the sector-switchers for small shocks. Using Bayes’ rule, P y g (i ) = Yg (π )s̃ g (i ) = α g (l g ) P y g (i ) = Yg (π )s̃ g (i ) = α g (l g ) ψg ( π ) = ´ 1 = f gN Yg (π ) + f gC Yg (π ) 0 P y g (i ) = Yg ( π ) α g ( q ) dq where the numerator is the wage density conditional on a comparative advantage equal to α g (l gN ). h i Notice that P y g (i ) = Yg (π )s̃ g (i ) = α g (l gN ) = h g Yg (π ) − ω gN α g (l gN ) because the wage distri bution among marginal workers is Hg y − ω gN α g l gN .42 Thus, h g Yg (π )|α g (l gN ) . ψg ( π ) = N f g Yg (π ) + f gC Yg (π ) Using this expression, we have that Hg Yg (π ) − ω gN − τg α g (l gN ) − Hg Yg (π ) − ω gN α g (l gN ) ≈ −τg · ψg (π ). f gN Yg (π ) + f gC Yg (π ) 42 This expressions relies on the assumption that marginal workers are allocated to the non-commodity sector since they are indifferent between the two sectors. If instead they are allocated to the commodity sector, we would have Hg y − ω gN − τg α g l gN and the same steps lead to an identical final expression. 56 Hence, the wage gain at quantile π is, up to a first-order approximation, given by: ∆Yg (π ) = ∆ω gC · l gC (π ) + ∆ω gN · l gN (π ) − ∆l g · [τg ψg (π )] + ṽ g (π ) (37) where ψg (π ) is the share of sector-switchers among workers at the π-percentile of the earnings distribution. The additional term in (37) captures the compositional effect introduced by the difference in sector potential wage of marginal workers in the presence of non-monetary employment benefits. Accordingly, the compositional effect is proportional to the fraction of sector-switchers among individuals at wage quantile π, ψg (π ), and their between-sector wage wedge, τg . D.4 Parametric Restrictions on the Distribution of Comparative and Absolute Advantage This section discusses prominent distributional assumptions that determine the form of α g (.) and Ākg (.). To simplify notation, we omit subscripts for groups, regions and years. D.4.1 Normal Distribution Particularly important in the selection literature is the case of log-normally distributed sector-specific skill (Roy, 1951; Heckman and Sedlacek, 1985; Borjas, 1987; Ohnsorge and Trefler, 2007; and Mulligan and Rubinstein, 2008). That is, assume that sector-specific skill is independently drawn from a bivariate normal distribution: C N " ln T (i ), ln T (i ) ∼ N µC µN # " ; σC2 σCN σCN 2 σN #! . Because the comparative advantage of individual i is defined as s(i ) = ln T C (i ) − ln T N (i ), it is 2 − 2σ straight forward to conclude that s(i ) ∼ N (µ, σ2 ) where µ ≡ µC − µ N and σ2 ≡ σC2 + σN CN . As a 2 . Thus, the distribution result (s(i ), a(i )) is jointly normal with covariance of Cov(s(i ), a(i )) = σCN − σN of a(i ) conditional on s(i ) = s is normal with conditional mean given by E [ a(i )|s(i ) = s] = µ̃ + ρ · s s.t. µ̃ ≡ (1 + ρ)µ N − ρµC , and the variance is V [ a(i )|s(i ) = s] = By definition, F (s) ≡ Φ s−µ σ σ22 ≡ 2 σN ρ≡ 2 σCN − σN ; 2 − 2σ σC2 + σN CN 2 2 σCN − σN − 2 . 2 − 2σ σC + σN CN where Φ(.) is the CDF of the standard normal distribution. Thus, α ( q ) ≡ F −1 ( q ) = µ + σ · Φ −1 ( q ). 57 (38) Also, notice that Ā N (l N ) ≡ Because 1 lN ´ lN 0 ˆ 0 l ρ E a(i )|s(i ) = α g (q) dq = µ̄ + N l ˆ lN α(q)dq = (µ̃ + ρµ) + 0 ρσ lN ˆ lN Φ−1 (q)dq. 0 Φ−1 (q)dq = φ Φ−1 (l N ) , φ Φ −1 ( l N ) Ā (l ) = µ̄ − (ρσ) · lN N (39) where µ̄ ≡ (µ̃ + ρµ) = µ N . For completeness, consider the average task efficiency in the commodity sector: 1 ĀC (l N ) ≡ 1−l ˆ 1 l φ Φ −1 ( l N ) . α(q) + E [ a(i )|s(i ) = α(q)] dq = (µ + µ̄) + σ(1 + ρ) · 1 − lN Equations (38)-(39) illustrate the connection between the parameters governing the skill distribution and the schedules of comparative and absolute advantage. First, the dispersion of comparative advantage, σ, controls the magnitude of the between-sector reallocation of individuals in response to changes in the relative task price. Second, the sensitivity of the mean absolute advantage to the comparative advantage, ρ, controls the compositional effect of employment on sector average wage. D.4.2 Extreme Value Distribution Recent papers have adopted a skill distribution of the Frechet family (Hsieh, Hurst, Jones, and Klenow, 2013; Burstein, Morales, and Vogel, 2015; Galle, Rodriguez-Clare, and Yi, 2015). The main advantage of this distribution is its tractability in the multi-dimensional problem of sectoral choice, allowing for an analytical characterization of the equilibrium with an arbitrary number of sectors. As discussed below, this tractability comes at a price: it imposes a restrictive pattern of selection across sectors. Specifically, assume that skill in each sector is independently draw from a Frechet distribution: " T C (i ), T N (i ) ∼ exp − ∑ Tk −κ # k =C,N where I assume that κ > 1 to guarantee finiteness of first-order moments. First, consider the distribution of comparative advantage: ˆ F (s) ≡ Pr [s(i ) < s] = ∞ −∞ e − e −κ ( a + s ) κe −κa −e−κa e ˆ da = ∞ −∞ κe−κa e−(1+e Define x ≡ (1 + e−κs )e−κa such that dx = −κ (1 + e−κs )e−κa da. Thus, F (s) = α(q) ≡ F −1 1 (q) = ln κ 58 q 1−q −κs ) e−κa 1 1+e−κs da. and, therefore, . (40) Second, consider the joint distribution of absolute and comparative advantage: ˆ Pr [ a(i ) < ā; s(i ) < s] = ā −∞ κe−κa e−(1+e −κs ) e−κa da = −κs −κ ā 1 e−(1+e )e . − κs 1+e To obtain the average task efficiency, notice that the skill distribution in the non-commodity sector is h Pr a(i ) < a|s(i ) < α l N i =e −κα l N − 1+e ( ) e−κa 1 = e− l N e −κa = e−e −κ a+ κ1 ln l N ( ) where the second equality follows from the definition of α(.). Since this is a Gumbel distribution with parameters β ≡ 1/κ and µ ≡ − κ1 ln l N , the average task efficiency in the non-commodity sector is Ā N (l ) = γ 1 − ln l N κ κ where γ is the Euler-Mascheroni constant. From this expression, we can recover A(q) = γ−1 1 − ln l N . κ κ (41) Analogously, the skill distribution in the commodity sector is h i −κ ( aC + κ1 ln(1−l N )) Pr ln T C (i ) < aC |s(i ) > α l N = e−e ⇒ γ 1 ĀC l N = − ln 1 − l N . κ κ The schedules of comparative and absolute advantage in equations (40)-(41) are fully characterized by the skill dispersion parameter, κ. If skill dispersion is low (i.e., κ is high), then a small variation in the relative task price is associated with a large response of sector employment. In addition, a sector employment expansion causes a decrease in the average sector efficiency whose magnitude is also controlled by the skill dispersion. In other words, the extreme value distribution only allows for positive selection in both sectors. This very particular pattern of selection has strong implications for the wage distribution, implying that both sectors exhibit the same distribution of labor earnings. Specifically, the log-wage distribution in sector k is G N ( y ) = e−e −κ y−ω N + κ1 ln l N ( ) = e−e −κ y−ω C + κ1 ln(1−l N ) ( ) = GC (y) where the second equality follows from the employment equation in (5). D.4.3 Log-Linear System: A Skill Distribution Example In this section, I describe a distribution that delivers the log-linear functional forms in Assumption 5. To avoid infinite moments, assume that comparative advantage has a modified logistic distribution 59 with an upper bound. In particular, ( s (i ) ∼ F ( s ) = 1 1 1+ e − α s if s < S̄ 1 if s ≥ S̄ where α > 0 and s ∈ (−∞, S̄] such that S̄ ≡ −α ln[ε/(1 − ε)] and ε ≥ 0. Immediately, ( α(q) = α ln [q/(1 − q)] if q < 1 − ε α ln [(1 − ε)/ε] if q ≥ 1 − ε . where ε ≥ 0. Although the comparative advantage distribution has finite moments for every ε ≥ 0 and α > 0, this is not necessarily true for its moment generating function. Accordingly, the upper bound in the support implies a well defined moment generating function for all ε > 0 and, therefore, a finite supply N < 1 − ε and, of sector-specific tasks. Notice that positive employment in both sectors implies that l g,r therefore, the empirically relevant portion of the quantile function is that in Assumption 5. Also, assume that the conditional distribution of absolute advantage is normal with a linear conditional mean: { a(i )|s̃(i ) = α(q)} ∼ N A g (q), σ2 Thus, 1 Ā (l ) ≡ N l N ˆ N lN s.t. A(q) ≡ Ā + A · ln q. A(q) dq = A N + A · ln l N 0 where A N = ( Ā − A). N < 1 − ε and With positive employment in both sectors, l g,r,t 1 Ā (l ) ≡ 1 − lN C N ˆ 1 α(q) + A(q) dq = AC − (α N + A) · lN lN ln l N − αC · ln(1 − l N ). 1 − lN where AC ≡ ( Ā − A) + αe · ln [(1 − ε)/ε]. E Structural Estimation: Sensitivity Analysis In Table E1, I evaluate the robustness of the structural results of Section 6 to specific choices of the estimation procedure regarding the moment weighting matrix and the set of instruments. In this investigation, I focus on the log-linear model with the overidentification restriction. Column (1) replicates the baseline specification with the Two-Step GMM estimator and the vector of instruments built from the quadratic polynomial of the exposure to international prices in Grains, Soft Agriculture, Livestock, Mining and Energy. Columns (2)-(3) estimate the model with alternative moment weighting matrices. Specifically, column (2) imposes that structural errors in the three equations are independent and, in addition, column (3) imposes that structural errors are homoskedastic (i.e., 2SLS weights). In both cases, point estimates are similar albeit, as expected, more imprecise. 60 Table E1: Structural Parameters, Specification Robustness Baseline Choice of Moment Weight Matrix Choice of Instrument Vector (1) (2) (3) (4) (5) (6) 0.835** (0.212) 0.879** (0.185) 0.997** (0.237) 0.924** (0.359) 0.698** (0.219) 0.933** (0.110) 1.966* (0.935) 1.759* (0.834) 2.032* (0.978) 1.501+ (0.873) 1.605+ (0.897) 2.595* (0.904) 0.916* (0.399) 1.302* (0.701) 1.475+ (0.879) 0.394 (0.536) 0.823 (0.568) 1.030* (0.460) -0.727** (0.142) -0.640** (0.134) -0.795** (0.147) -0.811** (0.190) -0.552** (0.212) -0.617** (0.146) A. High-School Graduates 𝛼𝐻𝑆𝐺 𝐴𝐻𝑆𝐺 B. High-School Dropouts 𝛼𝐻𝑆𝐷 𝐴𝐻𝑆𝐷 Matrix of Moment Weights Optimal GMM weights Yes No No Yes Yes Yes Optimal weights under independent equations No Yes No No No No 2SLS weights under independent equations No No Yes No No No Exposure to international prices in Grains, Soft, Livestock, Mining and Energy Yes Yes Yes Yes Yes Yes Squared exposure to international prices in Grains, Soft, Livestock, Mining and Energy Yes Yes Yes No No Yes Sum of squared exposure to international prices in Grains, Soft, Livestock, Mining and Energy No No No No Yes No Sum of cubic and quartic of exposure to international prices in Grains, Soft, Livestock, Mining and Energy No No No No No Yes Vector of instruments Note. Stacked sample of 518 microregions in 1991-2000 and 2000-2010. GMM estimator with microregions weighted by their share in the 1991 national population. All columns correspond to the log-linear model with the overidentification equation. Control vector includes macroregion-period dummies, initial commodity sector size controls, and initial labor market conditions interacted with period dummies (full description in Section 6.3). Standard Errors clustered by microregion.** p<0.01, * p<0.05, + p<0.1 In columns (4)-(6), I estimate the model with different instrument sets. The instrument vector in column (3) is restricted to contain only the exposure to commodity price shocks by category. In this case, estimates are similar for HSG, but the comparative advantage parameter for HSD is lower and more imprecise. Similar conclusions are obtained with the inclusion of the sum of the squared exposure to commodity price shocks in column (5). Thus, the disaggregated squared exposure is the main source of identification of the comparative advantage parameter for HSD. Lastly, column (6) augments the instrument vector to also include the cubic and quartic of the microregion’s total exposure to the price shock. In this case, results are very similar, indicating that there is little additional gain from the use of higher order polynomials in (24). 61
© Copyright 2026 Paperzz