Urbanization, Internal Migration, and Occupational Mobility in Victorian Britain Jason Long Current version: November, 2001 Victorian Britain experienced the most rapid and thorough urbanization the world had yet seen. People flocked from the countryside to the cities to such an extent that, by mid-century, Great Britain had become the first predominantly urban society in history. This study addresses three related questions: Who were the rural-to-urban migrants, what forces drove them, and did moving to the city pay off in terms of attaining higher socioeconomic status? I answer these questions with the aid of a new dataset of 28,000 individuals matched between the 1851 and 1881 Censuses of the Population of England and Wales. I build a structural econometric model to assess the effect of migration and the nature of migrant selectivity, controlling for the endogeneity of the migration decision. I find that those who left the countryside for the cities did not come from the bottom of the economic and social ladder; rather, they were better off initially than those who remained in rural areas. Urban migrants were positively selected: their prospects in the urban labor market were superior to those who did not move. Moreover, their prospects in the rural labor market were also superior. Finally, the decision to move was, by and large, a fruitful one. The treatment effect of urban migration was large and positive across all socioeconomic strata. On average, people who moved to the city were substantially more successful in improving their socioeconomic status than they would have been had they remained in rural areas, and they were more likely to experience upward intergenerational occupational mobility. When a man is tired of London, he is tired of life; for there is in London all that life can afford. Samuel Johnson1 The scope and acceleration of urbanization over the last two centuries are striking. In 1800, only 2 percent of world population was urbanized (living in places of 20,000 or more inhabitants). Since then the rate of urbanization has dramatically risen, so that today approximately half of the world’s populace lives in cities. In recent decades this urban growth has been driven largely by the developing world, and nearly all of the projected expansion of the next 30 years is expected to accrue to less developed countries. In many cases, the migrants who fill these cities come from and enter into poverty. Though this has recently begun to change, for most of the 20th century rural-to-urban migration has been dominated by poor, unskilled individuals with essentially non-existent opportunities in their Jason Long, Department of Economics, Northwestern University, 2003 Sheridan Rd., Evanston, IL 60208-2600. E-mail: [email protected]. I would like to thank my advisors, Joel Mokyr, Joe Ferrie, and Joe Altonji, for their helpful comments. I also benefited from conversations with Henry Siu, Jim Sullivan and Chris Taber, and from input from workshop participants at Northwestern University. Justin Hayes and Humphrey Southall provided me with data. This research was greatly aided by a Northwestern University Graduate Research Grant and a Pew Younger Scholars Program Fellowship. 1 Boswell, Life. places of origin. According to Robert McNamara, former President of the World Bank: 2 The cities are filling up and urban unemployment steadily grows…the “marginal men,” the wretched strugglers for survival on the fringes of farm and city, may already number more than half a billion, by 1990 two billion. Can we imagine any human order surviving with so gross a mass of misery piling up at its base? Empirical evidence supports McNamara’s assessment. Recent work has revealed that internal migration within a developing country works like a lottery: most individuals who move experience negative returns while a few reap a very large payoff.3 Considered as a group, nearly all rural-urban migrants fare very poorly, and experience negative returns to migration. In short, rural-urban migration in the modern developing world is at best a rational but highly risky proposition, and at worst an irrational, detrimental decision. Just as urbanization over the last fifty years has been largely a phenomenon of the developing world, so it was in the 19th century, when Great Britain was at the forefront of the then developing world. Through the 1800s, Great Britain experienced the most rapid and thorough urbanization the world had ever seen.4 The population underwent a tremendous redistribution in the wake of the Industrial Revolution. The 1851 Census of the Population of England and Wales revealed a watershed event: for the first time in the history of any significant nation, more people lived in towns than in the countryside.5 Furthermore, 38 percent of Britain’s populace lived in cities of 20,000 or more, and by 1881, this figure topped 50 percent. From 1801 to 1911, the urban population increased nearly by a factor of ten, from 3.5 to 32 million. “Galloping urbanization” is Bédarida’s description.6 The primary city in Great Britain, indeed in the world, was London. In 1825 it passed Beijing to become the world’s largest city, and by 1900 it counted 6,480,000 occupants, 42 percent more than second place New York. It is difficult to say with any precision how important were the dual forces of natural increase and ruralurban migration to the growth of the cities. What is clear is that both were strong. It is the latter that is the focus of the current study. Rural exodus had been going on for some time (fueling the growth of London as early as the 17th century), but it picked up considerable speed in the 1830s and 1840s and did not slow until the 1880s. It has been estimated that from 1841 to 1901, the rural areas of England and Wales lost more than 4 million people from internal migration, 3 million of whom headed for the towns, at a rate of more than half a million per decade.7 They 2 Qtd. in Todaro, Economic Development, p. 264. Tunali, “Rationality of Migration.” 4 The Netherlands experienced one of the first sustained periods of substantial urbanization, first in the south from 1475 to 1550 and then in the north, to an even greater degree, between 1580 and the 1670s. Only the province of Holland, however, was ever more than half urban. The country as a whole reached its maximum level of urbanization, 42 percent, in 1675, then fell to a level of 35 percent by 1815. Neither the rate nor the level of Dutch urbanization matched that of England, Wales, and Scotland in the 19 th century. See de Vries and van der Woude, First Modern Economy, pp. 60-63. 5 Towns being places of more than 2,500 inhabitants. See the “Data” section for a discussion of the definition of “towns” and “cities.” 6 Bédarida, Social History. 7 Crouzet, Victorian Economy, p. 93. 3 2 left to escape disaster. They left to seek an education, a spouse, adventure, anonymity, or just the change in locality itself. Most importantly, they left in search of employment—either a better job or simply any job at all.8 As Britain’s cities grew during the 19th century, so did its economy. From 1830 to 1900, Britain’s GNP grew at an annual rate of 2.5 percent, while GNP per capita grew 1.44 percent per year. 9 Certainly the technological advancements of the Industrial Revolution were a primary force behind this growth, but other factors were at work as well. Among these were urbanization, and rural-urban migration in particular. Modern economic growth theory, with its emphasis on increasing returns and external scale economies, has recognized that the localized information and knowledge spillover of urbanization make cities the “engines of economic growth in an economy.”10 The movement of people from Britain’s rural areas to the cities represented a movement of labor from areas of lower marginal product to higher, resulting in efficiency gains for the national economy. Rural-urban migration was an important ingredient in the rise of Great Britain to wealth and economic modernity; as such, it has received extensive attention in the literature since the very days during Victoria’s reign when it was occurring. There is still much we do not know, however. We know very little about the migration decision from the level of the individual migrant: what individual-specific factors drove them, whether they were the best or the worst of the rural labor pool, and most significantly, what were the returns to moving from a rural to an urban area. We do not know these things because we have always lacked adequate longitudinal data with which to observe the actual migration decision. In this study I fill this gap by using nationally representative, individual level, longitudinal data to examine rural-urban migration in 19th century Great Britain for the first time. To do this, I have constructed a dataset of approximately 28,000 individuals matched between the 1851 and 1881 Censuses of the Population of England and Wales. Three questions provide central motivation: (1) Who were the rural-urban migrants, (2) what forces drove them, both from an individual and a locational perspective, and (3) what was the effect of moving to a city on the attainment of socioeconomic status? In order to answer these questions, I have developed a structural econometric model with which to analyze the data on matched individuals. I find that the urban migrants of Victorian Britain were not the “wretched strugglers,” those with poor to non-existent rural opportunities; rather, they came from the middle of the social and economic ladder. They in fact had better rural prospects than did those who remained behind, and they were positively selected for migration: they performed better in the urban labor markets than the rural persisters would have had they instead chosen to move to an urban place. Economic forces, specifically the lure of higher quality jobs, were chief among the determinants of migration. Finally, the effect of moving to a city, the returns to urban migration, was positive and large across all socioeconomic strata. 8 For evidence supporting this assertion, see Pooley and Turnbull, Migration, p. 152. Crouzet, Victorian Economy, pp. 33, 37. 10 Black and Henderson, “Urban Growth,” pp. 252-53. Also, Lucas, “Mechanics.” 9 3 Previous Research Internal migration in Great Britain after the Industrial Revolution has been the subject of a literature dating back to the 19th century itself. 11 E. G. Ravenstein wrote the earliest important work on internal migration in 19th century Britain in 1885. He used birthplace data from the 1881 census to draw up seven “laws of migration,” which have served to frame much of the subsequent work on internal migration within Britain and elsewhere. The first and most important of Ravenstein’s laws is that most moves covered only a short distance, and that exceptions to this rule generally involved Britain’s “great centres of commerce or industry.” He also posited that urban natives were “less migratory” than their rural counterparts, and that “females are more migratory than males.” Ravenstein’s laws have largely stood the tests of time and modern scholarship. Another major early work is Redford’s Labour Migration in England, 1800-1850, which confirms the short-distance nature of 19th century British internal migration and refutes any notion of large-scale movement from the overpopulated South to the recently industrialized North. Redford also highlights the attractive power of higher wages and better opportunities for employment in the towns and cities as the primary force behind the rural-urban moves. Perhaps the most comprehensive study of rates of migration within and emigration from Victorian Britain is Baines’ Migration in a Mature Economy. He uses census birthplace information along with mortality statistics from the Annual Reports and Decennial Supplements of the Registrar General to estimate for every county in England and Wales, for every decade from 1860 to 1900, the rate of outward-bound overseas emigration and the rate of outward-bound migrants who remained within England and Wales. Measuring internal migration this way, Baines found that about 8.4 percent of the population of England and Wales undertook an internal, intercounty move between 1861 and 1880.12 His numbers also indicate that outward-bound inter-county migration rates were higher for rural counties than for urban: roughly 11 versus 7 percent.13 Finally, Baines finds that most intercounty migrants were young: 81 percent were aged 15-34, and the rest were children.14 Though still significant, the literature that aims specifically to analyze the causes of migration in 19th century Britain is smaller than that which simply describes it. Hunt is an early example.15 He finds a moderately significant positive relationship between county-level earnings and net gain from migration for 1871-1891 and 1891-1911. Many other studies similarly find a relationship between internal migration levels and economic indicators. Boyer and Hatton verify the positive relationship between earnings (and expected earnings, taking unemployment into account) and migration, and add a consideration of the so-called “friends and relatives” effect. 11 See Boyer and Hatton, “Migration” for a recent, thorough survey of the field. No doubt Baines found this out; he did not, however, report it. His book contains many detailed decompositions of migration numbers, but it nowhere gives an overall rate of internal migration. This figure was calculated using Baines’ raw migration data, reported in Appendix 1 of the book. 13 Baines defines rural counties as those in which fewer than 35% of the population lived in towns of more than 20,000 occupants. So defined, there were 34 rural counties and 18 urban. 14 Others have verified this result. See, for example, Friedlander and Roshier, “Internal Migration,” and Williamson Coping, pp. 40-42. 15 Hunt, Regional Wage Variations. 12 4 They find internal migration flows to be positively correlated with the stock of previous migrants, which indicates that the migration decision depends upon the availability of information as well as anticipated earnings gains. All of the studies above—16 books and papers on the subject are surveyed by Boyer and Hatton—are concerned with internal migration within Great Britain. In all of them, migrants from the rural areas of the land to the cities play a central role. In none of them are any such migrants actually observed. All of these studies analyze migration at some aggregate level, most often that of the county.16 It is sensible to study internal migration at the county level; counties gain and lose population, and county-level indicators certainly affect migration flows. On the other hand, it is people who do the moving. If we cannot see these people, who they are and what they do, then we are missing much of the story of why migration occurs. We also miss the entire story of the extent to which moving to a city in Victorian Britain “paid off.” Supposedly, people moved in response to average wage gaps and unemployment rates. But did they receive those higher wages? Did they get better jobs? If we don’t see the people who moved, as well as those who did not, then we cannot know. Two studies do consider individuals, but not in such a way as to answer these questions. Williamson uses 1851 census data to compare the (imputed) earnings of those born and living in cities with those born outside the cities but living in them in 1851.17 He finds that those born elsewhere earn about as much as natives, and that both groups earn more than did the average fully employed farm worker. Though these data contain information on individuals, it is only at a point in time; nothing is known of the people before 1851, so the migration decision itself cannot be analyzed, nor can the consequences of the migration decision (i.e. the treatment effect) be assessed. Pooley and Turnbull use an interesting dataset to look at individuals over time: survey responses from genealogists and family historians from all over England and Wales. Their data are chronologically diffuse, containing 16,000 people born between 1750 and 1930, and they use them to examine the history of migration broadly from the 1700s to the present. One finding of their study which is particularly relevant for the work at hand is that, according to the documentary evidence provided by their survey respondents, work-related reasons were the single most important cause of all internal migration among their sample, accounting for approximately 38 percent of all moves.18 The book is unique and important, but it does not ask the questions that economists have asked: there is no formal analysis of the determinants or consequences of the migration decisions of individuals, nor is rural-urban migration specifically addressed. Also, the nature of their dataset calls into question its accuracy and representativeness. It is only accurate insofar as the family historians’ information is accurate, and it is in no way a random sample of information from the population of England and Wales. What is needed, and what this study provides, is a large, nationally representative dataset of individuals who are observed both before and after they decide to move to the city (or to remain in the countryside). Only with 16 Friedlander, “Occupational Structure,” analyses migration at the somewhat finer level of the registration district. Williamson, Coping, ch. 5. 18 Pooley and Turnbull, Migration, p. 152. 17 5 such information is it possible to analyze properly the role of both individual and locational factors in the ruralurban migration decision and to gauge the effect this decision had on the ability of people to obtain high quality occupations.19 The Data The data used here come from a new sample of nearly 30,000 males linked from the 1851 Census of the Population to the 1881 census. The population censuses have long been recognized as the most important source of individuallevel data for Great Britain from the 19th century. They are large, and they are nationally representative; in fact, they are (essentially) the whole population. One critical limitation of the censuses, however, is that each covers only a point in time. Successive censuses can be used to examine changes in the nation over time but not changes in the lives of individuals over time. No continuity exists for individuals between the censuses. But continuity can be created. It has recently become possible to search for specific individuals among the vast records of the census enumerators’ books which include information on every person living in England and Wales at the time of enumeration. The data source that makes such a fine search feasible is rather remarkable. The Genealogical Society of Utah in conjunction with the Federation of Family History Societies has computerized the entire 1881 census of the population of England, Wales, and Scotland. The compilers of the data have made the raw data files for the enumerators’ books from England and Wales available through the Data Archive at the University of Essex.20 With these data any individual or group of individuals can quickly and easily be searched for throughout the entire 1881 census. In this case the search was for individuals from another census: that of 1851. The dataset used was a computerized 2 percent sample of the 1851 census, compiled principally by Anderson, Collins, and Stott.21 The information available for each individual relates, of course, to the questions asked in the two censuses: name, address, relationship to head of household, marital status, age, sex, occupation, place of birth, and whether blind or deaf and dumb. Those born and currently residing in England or Wales were asked to give town and county of birth. The same was true of Scotland. Those born in England or Wales currently residing in Scotland and those 19 It is important here to note that such data have been assembled and analyzed for the U.S. in the 19th century by Ferrie. See “New Sample,” “Migration,” and “Down on the Farm.” Ferrie has employed data from the 1850-70 U.S. censuses to construct a linked sample of individuals similar to the one constructed for the present study. Eventually it will be possible to compare the causes and consequences of rural-urban migration in Britain directly to those in the U.S. A presentation of this joint research will be given at the at the 2002 International Economic History Congress in Buenos Aires, for the session titled “Making a Career: Individual Work-Life Histories and Labour Market Structures”. 20 Study number 3643 21 It is a stratified two percent systematic cluster sample from the enumerators’ books. For England and Wales, settlements with fewer than 2,000 inhabitants are sampled in their entirety, on the basis of one settlement in fifty. For the remainder of these countries, and for all of Scotland, the sampling unit is the enumeration district, every fiftieth successive enumerator’s book being selected. In its entirety the sample contains 945 clusters and 415,000 individuals. Regarding the construction of the sample, it is noteworthy that the clustering procedure ensures that family units remain intact; thus, for every individual contained in the sample, we have the complete census information for each member of that individual’s household, including immediate family members and anyone else residing in the same dwelling place (servants, lodgers, visitors, etc.). For a full description see Anderson,National Sample. This dataset is also available through the Data Archive, as study number 1316. 6 born in Scotland currently residing in England or Wales were asked to give country of birth only. Name, age, and birthplace information was used to link individuals between the two censuses. In order to be considered a true match for an individual from 1851, an individual from 1881 had to have either the same name or a close phonetic variation thereof (for example, Aitken and Aitkin were considered to be equivalent), a year of birth different by no more than five years, and the same county and town of birth. Also, none of this information could be missing from an individual’s record. The variation in birth year was allowed in order to account for age misreporting, and the town of birth, like the name, was allowed to vary slightly (Bowden was considered equivalent to Gt. Bowden, for example).22 If by these criteria an individual from the 1851 sample had more than one match from the 1881 census, then that individual was discarded. Applying this matching process to an initial pool of 187,117 English and Welsh males from the 1851 2 percent sample yielded a set of 28,474 men observed both in 1851 and 1881, a success rate of approximately 15 percent.23 The data come from two nationally representative sources, so as long as the matching process does not skew the sample, the set of matched individuals should also be representative of the population of England and Wales. Table 1 shows a comparison between the sample of matched individuals and the entire group of males from the 1851 2 percent sample.24 There are differences between the groups throughout the categories, but they all stem from one important factor: the matched men are younger on average. The average age for the matched men is 18, while for the entire group it is 24. This should be expected. Life expectancy at birth was only 39.5 years in England in 1851; younger men were more likely still to be alive in 1881. From this one difference follow the others: matched men were more likely to be sons rather than household heads; they were less likely to be married; more likely to be students and weavers and less likely to be anything else; and more likely to still be living within their county of birth (having had less time to move away). The only noticeable difference that probably is not due to age is an under representation of people living in London in 1851. The most likely explanation for this anomaly is that the concept of a “town of birth” is a bit more slippery for those born in London. There is such a multitude of place names within the city that those born in London were probably more likely to report something different as their town of birth between the two censuses, particularly if they lived in London in 1851 but had moved by 1881.25 For the most part, the sample is a good representation of the population of Victorian England and Wales, skewed somewhat toward the young. 22 Age misreporting was fairly common in 19 th century Britain. A discussion of this issue is included below, in the “Estimation” section. 38,504 Scottish men were also included in the search, but only a handful was matched. Non-migrant Scots could not be matched, as the 1881 data cover only England and Wales, and even migrants could not be matched with much success in the 1881 census since they were not specifically required to give more than simply their country of birth. A small minority did give their full birth information, were able to be matched, and are included in the sample of 28,000. 24 It also includes the smaller subsample of individuals to be used in the econometric analysis. This group is discussed below. 25 Much as a modern suburbanite would respond with her specific town name if asked about her residence within the metropolitan area, but in another part of the country might well simply give the city name as her home. 23 7 TABLE 1 COMPARISON OF ALL MALES, MATCHED SAMPLE, AND ESTIMATION SUBSAMPLE All males (%) 24.36 Matched sample (%) 18.09 Estimation subsample (%) 16.02 15.84 11.22 3.01 2.91 2.58 2.11 1.68 1.61 1.29 1.12 22.30 11.96 2.20 2.30 2.11 2.66 1.59 1.68 1.68 1.27 27.61 18.76 0.45 2.81 3.05 3.26 1.51 1.80 0 1.75 44.56 33.80 5.69 1.89 1.86 59.86 25.24 3.46 3.69 1.32 100 0 0 0 0 Married Unmarried Widowed 47.72 47.66 4.18 42.31 55.61 1.45 4.12 95.21 0.22 Region East Lancashire-Cheshire London London Environs Midlands North South Wales York 6.58 11.31 8.55 12.70 19.89 5.06 18.49 4.93 12.50 7.88 12.43 5.14 13.71 20.94 5.02 20.57 4.61 9.70 9.86 8.77 0 13.99 25.12 5.25 26.50 3.13 7.39 Lived in county of birth in 1851 71.56 88.41 95.12 187,117 28,474 3,774 Age (mean) Occupation Student Ag. Laborer Farmer Laborer Miner Weaver Tailor Carpenter None Shoemaker Relation Son Head Lodger Servant Visitor Marital status N Notes: All values are from 1851. The occupations listed are the 10 most prevalent among the sample of all males. Students were children either attending school or receiving formal instruction at home. Sources: 2 percent sample of 1851 census and new sample of matched individuals. In order to examine the rural-urban migration decision, a smaller subsample must be extracted from the 28,000 matched individuals. If we want to see what caused some to move to the city and others to remain behind, and what happens to those who go compared to those who stay, then we obviously want to limit ourselves to considering individuals who began the period outside of any city. There is no hard definition for what constitutes a “town” or a “city.” The United Nations has recommended that all places with more than 20,000 inhabitants living close together be considered as “urban.” Not all countries use this definition. In fact, the U.S. Census defines an “urban place” as any locality with more than 2,500 people. Within the literature these two numbers are often considered to be meaningful cut-off points. In 19th century Britain, both places of 2,500-20,000 and those of more than 20,000 were on the rise. The population of the former quadrupled from 1801-1901, and that of the later 8 increased by a factor of eleven; for the years 1841-1901, the multiples were two and four, respectively. So while both were growing, it was the cities of more than 20,000 that were experiencing the most rapid expansion, and it will be these that constitute “urban” for the purposes of this study.26,27 In order to judge the effects of moving, or not moving, to the city, the sample must also be limited to those individuals for whom there is solid economic information from both censuses. Unlike their U.S. counterparts, the Victorian censuses do not provide any quantitative economic variables such as personal or real estate wealth. What they do include is fairly detailed occupational information. A distinction was made between employer and employee and master and apprentice, and the number of persons employed was recorded. The instructions given to the enumerators in 1851 are as follows: In trades the master is to be distinguished from the Journeyman and Apprentice, thus- ‘(Carpenter, master employing [6] men)’; inserting always the number of persons of the trade in his employment on March 31st. In trades where women or boys and girls are employed, the number or each class should be separately given. Where the master is one of a manufacturing or mercantile firm, the entry should be after this form: – ‘Cotton manufacturer – firm of 3, employ 512 men, 273 women, 35 boys, and 272 girls.’ 28 The occupation variable contains similarly detailed information for farmers and agricultural workers. The enumerators were told that The term FARMER is to be applied only to the occupier of land, who is to be returned – ‘Farmer of [317] acres employing [12] labourers,’ the number of acres, and of in and out-door labourers, on March 31st, being in all cases inserted. Sons or daughters employed at home or on the farm, may be returned – ‘Farmer’s Son,’ ‘Farmer’s Daughter.’ 29 There were literally thousands of occupations listed in the enumerators’ books; there are in the neighborhood of 8,000 unique occupations given in the set of 28,000 matched individuals.30 Making sense of all of this occupational information requires some work. In the absence of wage information, some system of ranking jobs according to their desirability must be employed. Clearly nothing like the 65-tier, thoroughly modern Hope-Goldthorpe scale will be applicable to eighteenth century occupational data.31 Fortunately, fairly strong consensus has arisen in support of a ranking scheme proposed by Armstrong.32 He argues that only the Registrar General’s social classification schemes of 1921 and 1951 satisfy the dual requirements of being not too refined for the rough (by modern standards) Victorian census data and at the same time including exhaustive published lists whereby the 26 “Urban” being so defined, the terms “rural” and “countryside” will be used throughout the paper to refer to all other places; hence, the terms are used rather loosely, as towns of, say, 15,000 hardly could be considered truly “rural” in mid-19th century Britain. 27 Town and city populations from the 1881 census were used both for places in 1851 and in 1881, to ensure that the nationwide effects of increasing population between the two dates did not cause people to appear to urbanize simply by living in a growing town. 28 Qtd. in Higgs, Clearer Sense, p. 108. Instructions for 1881 were not substantially different. 29 Ibid, p. 104. 30 This after substantial cleaning of the data. A first pass reveals 19,139 non-identical occupation responses from the 28,000 men. 31 For a description of the Hope-Goldthorpe scale and its application to modern British occupational data, see Mayhew and Rosewell, “Occupational Mobility.” 32 Armstrong, “Information.” 9 multitude of occupations listed in the census can be classified into established categories. Armstrong recommends the 1951 scheme on the grounds that it is simpler to implement, but for the present study the 1921 scheme was chosen for its earlier date. It is, after all, thirty years closer to the Victorians. This scheme consists of five ranked classes of occupation: Class I. Class II. Class III. Class IV. Class V. Professional etc., occupations Intermediate occupations Skilled occupations Partly skilled occupations Unskilled occupations This system of classification is clearly aligned to capture job desirability. It makes a normative statement: higher-class jobs were better than lower class. Two sources, both published by the General Register Office of Great Britain, are needed to classify each occupation fully. The Classification of Occupations, 1921, lists approximately 16,000 different occupations and gives a three-digit code number for each. The Decennial Supplement for 1921 then provides the appropriate class rank for every code number.33 Armstrong provides several simple, straightforward, and most importantly, objective modifications to this system to bring it into somewhat better harmony with the nature of nineteenth century occupational and class structure. Probably most important is his use of the employee information given under the occupational field in the census. Regardless of job title, all employers of 25 or more are to be placed in Class I, and all people with Class III or IV occupations employing at least one person other than a family member is to be placed in Class II. Since Armstrong first proposed this scheme in 1966 and further developed it in 1972, it has become the standard for ranking occupations in Victorian Britain.34 Armstrong’s basic scheme has been modified in one way, in order to take full advantage of the information offered by the census. Since entire households are observed both in 1851 and 1881, including not only family members but servants as well, the ratio of servants to household members can be calculated for each household. The job class ranking of some individuals was upgraded (never downgraded) according to the following scheme, proposed by Royle: all heads whose households contained at least one servant per household member were placed in Class I, all others with one servant per three household members in Class II, and any others that employed at least one servant in Class III.35 Formulated in this way, the job class variable may be thought of as a representation of socioeconomic status, and the terms occupational class and socioeconomic status are used interchangeably throughout the study. 33 Warm thanks to Dr. Humphrey Southall for providing me with a computerized dictionary with which I was able to code approximately one third of the occupations. This dictionary, along with many new entries which I have provided, is available through the Great Britain Historical Database. See http://www.geog.qmw.ac.uk/gbhgis/database/db_index.html. Armstrong recommends using the 1951 scheme for the sole reason that both sets of information needed to code the occupations are contained within one book, the Classification of Occupations, 1951. Furthermore, this book is much more readily available in libraries throughout the country than are the two 1921 volumes. 34 Armstrong demonstrates that job class, defined according to this system, is positively correlated with the employment of servants and negatively correlated with the incidence of shared accommodation. “Information,” p. 212. 35 Royle, “Social Stratification.” 10 The final cut taken from the data also has to do with the amount of information available for individuals. In 1851 the men in the sample were either heads of a household (25%), sons of the head (60%), grandsons, nephews, or brothers of the head (4%), or non-family members (servants, lodgers, and visitors being the most common, in that order). Sons are by far the largest group, which is good, because they offer the richest source of information. For sons (if they are old enough) we can observe not only their class and industry of occupation, but also those of their fathers. Then as now, the concept of intergenerational occupational mobility was an important one. The occupation and class of the father were powerful influences on the occupation and class of the son. We want to know the extent to which moving to the city allowed people to get better jobs—better jobs than they themselves used to have, and better jobs than their fathers before them had. So the analysis will focus on the largest group within the sample: the sons. As for considerations of age, the minimum legal working age was nine years, so only those sons who were nine years old or older in 1851 will be included in the sample. Only a small minority of sons in the sample were in their thirties or older. These were excluded, as the forces governing their migration decision might be significantly different from those of young men. Out of the 28,474 matched individuals, 3,774 are sons living with their father in a rural area in 1851, with both son and father reporting solid occupational information. They are shown in the third column of Table 1 for comparative purposes, though of course this subsample is unrepresentative of the male population of England and Wales by construction. It is this group which will be used to analyze rural-urban migration. A Model of Rural-urban Migration Migration to the city is not, of course, a randomly assigned treatment; an individual’s expectation of his labor market prospects will influence his decision of whether or not to move. The model used to analyze the migration decision, then, is one of “regime switching with endogenous switching,” to paraphrase Maddala’s terminology.36 The basic model is Regime 0: y 0i = β 0′ X 0i + ε 0 i if Di = 0 (1) Regime 1: y1i = β 1′ X 1i + ε 1i if Di = 1 (2) Di = 1 if γ ′ Z i + u i ≥ 0 (3) Di = 0 otherwise (4) where y (generally a continuous variable) represents the labor market outcome and D the decision of the regime in which to participate. X and Z are the factors that influence an individual’s labor market outcome and participation 36 Maddala, Limited-Dependent, pp. 223-28. Also referred to as a Type 5 Tobit Model; see Amemiya, Advanced Econometrics, pp. 399-402. For a good overview of the identification and parametric and semiparametric estimation issues associated with this class of model, see Heckman, “Varieties.” 11 decision, respectively, β and γ are vectors of coefficients, and ε and u are unobservable factors. A wide range of empirical questions has been examined with this model, including union-nonunion wage analysis, housing-demand, and migration.37 A modification of this model will be employed here. The model cannot be adopted wholesale because in the present case the labor market outcome variable yi is not a continuous variable like wage or wealth, but rather it is the discrete variable job class (or socioeconomic status). The standard model involves the estimation of two linear outcome equations, (1) and (2), and one nonlinear decision equation represented by (3) and (4). The model developed here allows for equations (1) and (2) to be nonlinear as well. The model is defined as follows. There exists a continuum of job quality, Y * ∈ (− ∞, +∞ ) . Individuals have ( ) a utility function U y * , Θ , where y* is a realization of Y* and Θ is a vector of other inputs. All individuals prefer higher quality jobs: ∂ U ∂y * > 0 . The maximum quality of job that an individual can attain, y i* , is defined to be a linear function of that person’s observable traits and skills, Xi , and unobservable characteristics, εi . The function may be different in the urban regime than in the rural, though the relevant elements of Xi are assumed to be the same in each: Rural: y 0*i = β 0′ X i + ε 0 i (5) Urban: y1*i = β 1′ X i + ε 1i (6) The decision facing each individual in this model is whether to migrate to a city or remain in a rural setting. The net benefit of moving, Di* , is defined to be a linear function of (1) Zi , a vector of observable individual and location specific characteristics, (2) the difference between the individual’s maximum attainable job quality in the urban ( ) regime versus that in the rural, y1*i − y 0*i , and (3) ui , unobservable characteristics: ( ) Di* = γ 1′ Z i + γ 2 y1*i − y 0*i + u i (7) Only the outcome of the choice, Di , is observed: Di = 1 if Di* ≥ 0 , Di = 0 otherwise.38 To this point, the model is identical to the base model specified in (1) – (4). The difference is that in the base model, either y0i or y1i is observed for each individual, depending on the value of Di , whereas in the current model neither y 0i* nor y1i* is observed for any individual. What is observed is the job class, yi , of every individual: 37 See Trost, “Demand”; Lee, “Unionism”; Robinson & Tomes, “Self-Selection”; and Ferrie, Yankeys. Note that the decision is not whether to move or stay, as is typical in most migration studies. The decision is whether to move to an urban area or not. So an individual who moves from one rural area to another would have Di = 0. 38 12 y 0i ∈ [1,2,3,4,5] if Di = 0 y1i ∈ [1,2,3,4,5] if Di = 1, (8) The following relationship is assumed to exist between job class, yi , and job quality, y i* : yri = 5 if -∞ < y ri* ≤ k1 4 if k1 < y ri* ≤ k2 3 if k2 < y ri* ≤ k3 2 if k3 < y ri* ≤ k4 1 if k4 < y ri* ≤ +∞ (9) r = 0,1 where k1 – k4 are constants. Class I jobs are the most desirable, and Class V the least. Under this formulation, all jobs within a class are not equivalent; indeed, the best job in any class is only marginally inferior to the worst job in the next highest class. Also, the quality/class structure is identical in the cities and the countryside, since the four threshold levels k1 – k4 do not vary between the two regimes. This is as it should be; otherwise, an individual could change job class without changing jobs simply by moving from one regime to the other. The three-equation system defined by (7) and (9) is estimated by Full Information Maximum Likelihood (FIML), which proceeds in two stages. First, by substituting (5) and (6) into equation (7), (7) can be written in reduced form as Di* = γ ′Wi + v i (10) where W contains all the elements of X and Z. A standard assumption is made regarding the disturbance terms: ε1 , ε2 , and v are assumed to be i.i.d. draws from a trivariate normal distribution with mean vector zero and covariance matrix σ 12 Σ= σ 12 σ 1v σ 22 σ 2 v 1 (11) 13 With this distributional assumption, (9) becomes a standard ordered probit model with five outcomes and four threshold levels, with the outcome depending on the latent index variable y*.39 The model, therefore, is one of “switching ordered probits,” rather than the common switching regression model. The three equations in (9) and (10) are jointly estimated by maximum likelihood. The likelihood function is L( β 0 ,β1 , γ, k1 , k 2 ,k 3 ,k 4 ,ρ0 , ρ1 ) = F(k − β ′ X ,γ ′W, − ρ ) ′ ′ ′ ′ ′ ′ 1 0 0i 0 ∏ F( k j − β 0 X 0 i ,γ W, − ρ0 ) − F( k j −1 − β 0 X 0 i ,γ W, − ρ0 ) ∏ F( β 0 X 0 i − k 4 ,γ W,ρ 0 ) ∏ yi =5 yi = j yi =1 j = 2 ,3 ,4 [ 1− Di ] G( k − β ′ X , − γ′W,ρ ) ′ ′ ′ ′ ′ ′ 1 1 1i 1 ∏ G( k j − β1 X 1i , − γ W,ρ1 ) − G( k j −1 − β1 X 1i , − γ W,ρ1 ) ∏ G( β1 X 1i − k 4 , − γ W, − ρ1 ) ∏ yi = j yi =1 yi =5 j = 2 ,3 ,4 [ Di ] where F and G are, respectively, the bivariate normal distribution functions of (ε0i , vi) and (ε1i , vi) and ρ0 and ρ1 are the correlation coefficients of the two distributions. β0 and β1 are assumed to include a constant term, so the four threshold levels, k, are not all identified, and k1 is normalized to 0. This estimation procedure produces consistent and asymptotically efficient estimates of the parameters of interest, β0 and β1 .40 The second stage is the structural estimation of equation (7). With estimates of β0 and β1 in hand, predicted values of both y 0i* and y1i* are obtained for each individual as yˆ ri* = βˆ r′ X i , r = 0,1. These predicted values are substituted into (7) in place of the unobservable y 0i* and y1i* . (7) is then estimated by probit maximum likelihood to obtain estimates of the structural parameters γ1 and γ2. The estimates of β0 , β1, γ1, and γ2 reveal the determinants of job class attainment and urban migration. To get at the central issues of the selection of urban migrants and rural persisters and especially the treatment effect of migration, it is necessary to define several more parameters to be estimated. The selection of urban migrants (ϕ1) and the selection of rural persisters (ϕ0) are given by 39 The distributional assumption is standard, but it is not innocuous, as the growing literature on semi- and non-parametric estimation of selectivity and treatment effects demonstrates. See Heckman, “Varieties,” Manski, “Nonparametric,” and Newey et al. “Semiparametric.” 40 Standard “two-step” estimates of the parameters in a self-selection model are consistent but not efficient. Their advantage lies in ease of computation; indeed, in econometrics texts one often encounters phrases like “maximization of the likelihood function…can be cumbersome” (Maddala, Limited-Dependent, p. 224). No such problems were encountered with the likelihood function specified above, however. Given appropriate starting values and the technical trick of maximizing the inverse hyperbolic tangent of ρ instead of ρ itself (in order to constrain it between its logical bounds of -1 and 1 and for numerical stability during maximization), convergence was not problematic. At any rate, the nonlinearity of the second stage job class equations precludes standard two-step methods for this model. 14 ( = E (y ) ( | D = 0 ) − E (y ) | D = 1) ϕ 1 = E y1* | D = 1 − E y1* | D = 0 ϕ0 * 0 * 0 (12) They are calculated conditional on the observed variables using the estimated β coefficients and conditional means of the X variables: ~ ~ ϕ 1 = X 1 βˆ1 − X 0 βˆ1 ~ ~ ~ ~ ϕ 0 = X 0 βˆ 0 − X 1 βˆ 0 , where X 1 = E (X | D = 1) and X 0 = E (X | D = 0) (13) If ϕ1 > 0 then, conditional on the observables, urban migrants were positively selected; i.e., they achieved higher job quality in the urban labor market than the rural persisters would have had they chosen to move to the city. The analogous characterization holds for ϕ0 and the selection of rural persisters. The treatment effect of rural-urban migration, τ, is defined to be ( ) τ = E y1* − y 0* | D = 1 (14) ~ ~ τ = X 1 βˆ1 − X 1 βˆ 0 (15) and is calculated as Like the selection parameters, τ is calculated conditional on the observables of the model. The treatment effect is ~ considered in terms of the average urban migrant: X 1 represents the endowments of the skills and attributes available to the average migrant. If the treatment effect is positive, τ > 0, then migration to the city yielded a positive return in terms of job quality for the average rural individual in England and Wales of the type who typically chose to move to the city from the countryside.41 Estimation of the Model The determinants of maximum attainable job quality in 1881—the elements of X in (5) and (6)—include class rank in 1851, father’s class rank, age, age squared, “age discrepancy,” whether the individual lived in a town in 1851, whether the individual was an eldest son, whether the father was a farmer or employer, an interaction between the 41 Conditioning the selection and treatment effect parameters on the observable variables of the model entails the following assumptions: (1) E(ε1 | D = 1) - E(ε1 | D = 0) = 0, (2) E(ε0 | D = 1) - E(ε0 | D = 0) = 0, and (3) E(ε1 - ε0 | D = 1) = 0. See Heckman, “Varieties,” for a theoretical and empirical discussion of these assumptions. 15 previous two terms, the industrial classification of the individual’s occupation in 1851, the degree of age-heaping for the county of residence in 1851, and region in 1851. Summary statistics for these variables, and for those in the migration equation, are presented in Tables 2 and 3. The first table summarizes the attributes of all of the individuals in the estimation sample in 1851 and 1881. The second compares the attributes of those who decided to move to a city by 1881 with those who remained in a rural place, also giving values for both 1851 and 1881. This table will be discussed in more detail in a subsequent section. Several of the variables merit some discussion. The inclusion of the individual’s class rank in 1851 allows us to measure the effect of moving to the city on the change in an individual’s job class rank; it controls for individual-specific effects on the level of job quality. This is analogous to running a regression on change in wage or wealth without constraining the coefficient on the first time period value to be equal to one. Formulating the model in this way offers several advantages. First, if change in job class were the independent variable in the model, the set of possible realizations of that variable would not be the same for all individuals. For example, only people who started in Class V in 1851 would not have the possibility of a downward move. Second, it is interesting in its own right to measure the effect of class in 1851 on class in 1881 as an indicator of the rigidity of occupational class structure in Victorian Britain. Finally, we might expect that the influence of class in 1851 on class attainment in 1881 would be different for someone who switched regimes than for someone who made no switch. A priori we might expect that the relationship would be weaker for those individuals who moved to the city, given that they would be entering a somewhat different job market and would be sure to experience some sort of change in occupation (even if the change were as small as being a spinner in a different factory or a carpenter in a different town). The job class of the father is also included. Just as estimating the effect of the individual’s class in 1851 on his class in 1881 allows us to measure the degree of occupational mobility, so including father’s class in 1851 reveals the degree of intergenerational mobility, an important question itself. So the model is really one of change in job class, where part of the change is from the individual’s own class in 1851 to his class in 1881, and part is an intergenerational class change from father’s class in 1851 to son’s class in 1881. One “class” included in 1851 is not a proper occupational classification at all: that of “student.” It is the second most common class in 1851, and the single most commonly reported occupation, in this sample as well as in the census as a whole. It does not fit into the Registrar General’s occupational classification system, but it may be regarded as a class of its own. For the 1851 census, parents were instructed to report their children as students (“scholars” in the language of the Victorian censuses) if they were older than five and were “daily attending school, or receiving regular tuition under a master or governess at home.” The instructions for what constituted a student were more stringent for the 1851 census than for any other of the nineteenth century: the requirement of being under a master or governess if at home was dropped from the 1861—1881 censuses, and no instructions at all were given on the subject in 1891. Furthermore, compulsory education did not become law until the Education Act of 1870, so there was no legal incentive for parents to lie about their children’s schooling. Though no substantive 16 TABLE 2 SUMMARY STATISTICS Means INDIVIDUAL CHARACTERISTICS: Job class 1 - Professional 2 - Intermediate 3 - Skilled 4 - Partly skilled 5 - Unskilled Student Father's class 1 - Professional 2 - Intermediate 3 - Skilled 4 - Partly skilled 5 - Unskilled Age discrepancy (years) 0 1 2 3 4 5 Age Eldest Inheritance Married Not in town of birth Industry Agriculture Building Distributive Mining Textiles Iron & Steel Other Manuf. Other LOCATION CHARACTERISTICS: Living in city Living in town Region Lancashire London London Environs Wales Yorkshire Others Age-heaping Distance to city Wage gap Previous migrants Unemployment Nearby cities Pct. in agriculture Pct. in manuf. Notes: N = 3,774. Variables defined in text. Sources : See text. 1851 Frequency Percent 14 63 1144 1046 436 1071 0.37 1.67 30.31 27.72 11.55 28.38 61 621 1445 1255 392 1.62 16.45 38.29 33.25 10.39 1881 Frequency Percent 89 581 1713 944 447 --- 2.36 15.39 45.39 25.01 11.84 --- 1853 1095 386 205 116 119 49.10 29.01 10.23 5.43 3.07 3.15 16.02 years 2166 453 93 980 57.39 12.00 2.46 25.97 907 123 71 145 292 84 560 1592 24.03 3.26 1.88 3.84 7.74 2.23 14.84 42.18 906 304 330 182 168 146 764 974 24.01 8.06 8.74 4.82 4.45 3.87 20.24 25.81 0 1918 0 50.82 896 1524 23.74 40.38 210 0 528 118 279 2639 5.56 0 13.99 3.13 7.39 69.93 307 156 511 121 286 2393 8.13 4.13 13.54 3.21 7.58 63.41 61.0 (index) 23.35 km 4.79 sh/wk 13,716 7.97% 32.56 29.29% 23.58% 17 TABLE 3 SUMMARY STATISTICS, URBAN vs. RURAL Mean, 1851 Moved to city Remained rural INDIVIDUAL CHARACTERISTICS: Job class 1 - Professional 0.007 0.003 2 - Intermediate 0.017 0.017 3 - Skilled 0.324 0.297 4 - Partly skilled 0.195 0.303 5 - Unskilled 0.102 0.120 Student 0.356 0.261 Father's class 1 - Professional 0.026 0.013 2 - Intermediate 0.170 0.163 3 - Skilled 0.453 0.361 4 - Partly skilled 0.267 0.353 5 - Unskilled 0.085 0.110 Age discrepancy (years) 0 0.486 0.493 1 0.284 0.292 2 0.085 0.108 3 0.067 0.050 4 0.044 0.027 5 0.036 0.030 Age 15.3 16.3 Eldest 0.566 0.576 Inheritance 0.107 0.124 Married 0.020 0.026 Not in town of birth 0.325 0.239 Industry Agriculture 0.146 0.270 Building 0.031 0.033 Distributive 0.019 0.019 Mining 0.021 0.044 Textiles 0.123 0.063 Iron & Steel 0.027 0.021 Other Manuf. 0.151 0.148 Other 0.482 0.403 LOCATION CHARACTERISTICS: Living in town 0.626 0.472 Region Lancashire 0.111 0.039 London 0 0 London Environs 0.150 0.137 Wales 0.018 0.035 Yorkshire 0.083 0.071 Others 0.640 0.718 Age-heaping 61.6 60.8 Distance to city 19.6 24.5 Wage gap 4.98 4.73 Previous migrants 15,339 13,211 Unemployment 8.84 7.70 Nearby cities 36.8 31.2 Pct. in agriculture 0.264 0.302 Pct. in manuf. 0.259 0.229 Notes: N = 3,774. 896 moved to an urban area; 2878 did not. Variables defined in text. Sources : See text. Mean, 1881 Moved to city Remained rural 0.031 0.141 0.564 0.116 0.148 --- 0.021 0.158 0.420 0.292 0.109 --- 0.036 0.099 0.127 0.027 0.061 0.059 0.229 0.362 0.304 0.075 0.075 0.055 0.039 0.032 0.194 0.226 0 0.530 0.199 0.174 0.135 0.017 0.088 0.387 0.045 0 0.136 0.037 0.072 0.711 18 details of their instruction can be known, children older than nine who were listed as students were receiving an education.42 Many of their peers were not; they were working.43 By including the students as a separate class, we can gauge in a very rough fashion its efficacy for future job quality attainment as compared to being employed at a young age. Literacy and specific educational information are not observed, but an interesting proxy is available. “Age discrepancy” is defined to be Reported Age 1881 − (Reported Age 1851 + 30) As discussed above, the age matching criterion is that an individual’s reported age in 1881 cannot be more than five years different from what it “should” be (the reported age in 1851 plus 30).44 Roughly half of all individuals in the sample reported consistent values for age in the two censuses. Another quarter was off only by a year, and the remaining quarter was off by two to five years. This was a time before systematic record keeping, and many people had only an approximate idea of their age.45 The observed age discrepancy for each individual gives an indication of that individual’s familiarity with arithmetic (“numeracy”) and the precision of his concept of time. In general it may be regarded as a proxy for literacy, education, intelligence, and the like, and therefore be expected to have a positive impact on job quality attainment. 46,47 For the population as a whole, this tendency of people not to know their age precisely manifests itself as age heaping—the tendency of any population to over-report rounded ages. Hence, the census will record many more 30-year olds than 29- or 31-year olds, for example. The precise degree of age heaping can be measured for each county in England and Wales by comparing the observed age distribution (using the entire population of each county from the 1881 census) to an approximation of the true distribution obtained by applying any of a number of 42 See note on following page regarding the correlation between student status and inconsistency in age reporting in the census. See Tuttle, “Role of Children,” for a discussion of child labor in 19th century Britain and the high incidence of gainful employment for young children. 44 In most cases for this sample, the age in 1851 would have been reported by the parents. It could be possible, therefore, for an individual to report his correct age and still have a non-zero value of age discrepancy, if his parents had incorrectly reported his age in 1851. This seems unlikely, though, as most people would learn their age from their parents. 45 For a discussion of age-enumeration in the Victorian censuses, see Higgs, Clearer Sense, ch. 7. 46 There is, in fact, a correlation between age discrepancy and being a student in 1851. Students were 8% more likely to know their exact age than non-students; among those whose age reporting was not consistent between the two censuses, students were 4% more likely to be off only by a year, rather than by 2–5 years. 47 Under reporting was more common at every level than over-reporting, by about 23% overall. There may be reason to think that those who under-report their age are less likely truly to be ignorant of their exact date of birth and/or of arithmetic than those who over-report; it may be simple vanity at work. To check this, the model was re-estimated with age discrepancy defined as the actual discrepancy, rather than the absolute value, with the following formulation: 43 … + β-2(discrep ≤ -2) + β-1(discrep = -1) + β1(discrep = 1) + β2(discrep ≥ 2) + … Neither of the hypotheses that β-2 = β2 and that β-1 = β1 could be rejected with any confidence. 19 smoothing techniques to the observed distribution. The technique used here is Graybill’s weighted moving average of the Sprague coefficients, which produces drastic smoothing of the heaped distribution.48 An index of age heaping is then calculated for each county relative to the county with the highest degree of heaping (the Isle of Man, followed by Jersey and Radnorshire; lowest were Sussex, Westmoreland, and Yorkshire). Age heaping has been shown to be correlated with important measures of economic development like per capita income, so in this context it is designed to capture otherwise unmeasured community effects that might influence the labor market performance of individuals.49 The remaining variables are more straightforward. People living in a town in 1851, as opposed to those in the truly rural countryside, may have possessed a form of “locational” human capital that would ease their transition into the big-city labor market. This effect is captured by the inclusion of a dummy variable which equals one if the individual lived in a town of between 2,500 and 19,999 inhabitants. The dummy variables indicating whether a person in the sample was an eldest son and whether his father was a farmer or an employer are designed to capture an inheritance effect. Fathers who owned land were likely to leave it to their eldest son, as the system of primogeniture was the norm with regard to property inheritance in most of England. Likewise, sons of fathers who owned any sort of business were likely heirs to an occupational inheritance, though in this case the inheritor well could have been any son rather than just the eldest. Finally, industrial and regional dummy variables are included to test whether certain industries and locations offered particularly advantageous or disadvantageous starting points for job quality attainment. Industrial classification is according to the well-known scheme developed by Charles Booth for his survey Life and Labour of the People in London, written in nine volumes from 1892 to 1897. Booth’s system consists of 10 divisions and 49 subdivisions, the subdivisions generally being aggregated in some fashion to facilitate analysis. The aggregation used in this case, along with the relative frequency of each group in the data, is given in Table 2. In addition to his expectation of his future job prospects in both regimes, the factors that influence an individual’s decision whether to move to the city—the components of Z in (7)—include both individual- and location-specific elements. The individual-specific characteristics include age and age squared, marital status, age discrepancy, whether the person lived in a town in 1851, whether the person was living in the town of his birth in 1851, and industrial classification. The human capital interpretation of migration (that people migrate in order to maximize lifetime net benefit from moving) would suggest that the likelihood of migrating will decrease with age, as individuals have a shorter time span over which to reap the gains from moving and as they make more locationspecific investments in their place of residence. More educated, knowledgeable people would have a lower tendency to report their age inconsistently; these same people should be better able to gather information on 48 For a complete description of this and other smoothing methods employed by demographers, see Shryock and Siegel, Methods and Materials, pp. 696–702, 878. 49 For a more thorough discussion of the economic usefulness of age heaping information, particularly in a migration analysis setting, see Mokyr, Ireland. 20 potential urban moves. Moving to a big city might have been a less drastic change, and so have carried a lower psychic cost, for those already living in towns in 1851. There are only two new variables in this group. The first is marital status, the expected effect of which is not clear a priori. While studies of overseas migration generally find the typical migrant to be a young, single male,50 this may not be the case for shorter-distance (indeed often local) moves from countryside to city. Indeed, during the second half of the nineteenth century, the decline in rural employment affected the job prospects of women more than those of men, so women became more likely than men to leave the countryside in order to find work in the city, most often as domestic servants.51 The pull of the urban labor market being strong for women, a married young man whose wife was looking for work outside the home might have been more likely to move to the city than an unmarried man. The second new variable is an indicator of whether an individual was living in his town of birth in 1851. Approximately 26% of the individuals in the sample were not; the household had therefore already made a move prior to 1851. These individuals (or at least the heads of their households) had already demonstrated some willingness to move, and their ties to their community in 1851 might not have been as strong as those who were still living in their town of birth. For both reasons, these individuals are expected to be more likely to decide to move to the city by 1881. The elements of Z that depend on the individual’s location in 1851 include the distance to the nearest city, the number of large cities within 100 km, a proxy for the total number of previous migrants in each nearby city, the average rural-urban wage gap, the urban unemployment rate, an interaction between the unemployment rate and distance to nearest city, the percent of the male workforce engaged in agriculture and in manufacturing in the home county in 1851, the degree of age heaping of the home county in 1851, and the regional dummies. Of these variables only age heaping and the regional dummies are in X. Distance to the nearest city and number of large cities nearby were calculated using modern grid point references for place names from the 1991 British Census.52 Distance should be negatively correlated with tendency to migrate, as it would be both more costly to move to a distant city and more difficult to gather pertinent information; conversely, number of nearby cities should exhibit a positive correlation. Knowing people who had already moved to a particular city was another important source of information for the potential migrant. A proxy was constructed for this “friends and family” effect. First, the county of birth of all urban residents in England and Wales in 1881 was tabulated by city using the 1881 census data. Second, the set of most likely destination cities for each individual in the sample was defined to be the nine closest cities to his town of residence in 1851 plus London.53 Finally, for each individual i with likely destination cities 50 For example, see Hatton and Williamson, “Mass Migrations.” Baines, “Population.” 52 These data are readily available only to academics working within the U.K. I am very grateful to Justin Hayes with MIMAS at the University of Manchester for personally providing me with the geographic data from the 1991 census (after receiving all proper permissions, of course). Karen Dennison with the U.K. Data Archive at Essex University and Ann Janda with the Social Science Data Services at the Northwestern University Library were also very helpful. 53 Hence, all individuals from the same town will have the same set of likely destinations, which is a desirabl e property for the set. A direct result of this property is that it is possible that an individual’s actual choice of destination city will not be among his likely choices. London 51 21 j = 1,…,10, the proxy for the stock of previous migrants is defined to be previous migrants i = j =1 10 ∑ m −1 ij d j ij d ij−1 ∑ (16) where mij is the number of people living in city j in 1881 born in the county of residence of individual i, and dij is the distance between city j and the place of residence of individual i. This is a proximity-weighted average of the m values of all likely destination cities. The average urban-rural wage gap and the urban unemployment rate were calculated along similar lines. The wage gap is defined as the average wage of laborers in the building trade in nearby cities minus the average wage of agricultural laborers in the origin county.54 The urban unemployment rate was calculated using data on joblessness among members of the Amalgamated Society of Engineers, available for 56 cities in England and Wales from 1858 through 1909.55 It is calculated as the proximity-weighted average—as in (16)—of the rates from 1862, 1868, and 1879 for the two nearest cities and London. This is the urban unemployment rate, so its expected effect on tendency to move to the city should be negative; people would be less likely to move to the city if there were high unemployment there. However, we do not observe the relevant local unemployment rate and so cannot control for it. Therefore, since the urban unemployment rate is taken from the cities nearest to each individual’s place of origin, it will proxy, to an extent, for the local unemployment rate facing each individual. Insofar as this is true, the effect of unemployment would be positive.56 To test for this effect, an interaction term between the unemployment rate and distance to nearest city is included. We expect this term to have a negative coefficient: for any given unemployment rate, an increase in the value of the interaction term indicates that the unemployment is “farther away,” thus more truly the unemployment of the destination city and less that of the place of origin. Finally, the fraction of the male workforce in the county of origin engaged in agriculture and in manufacturing is taken from the occupational tables from the 1871 census of the population. 57 The three parameter vectors β1, β2, and γ are identified under the assumptions of the model; however, if the variables in X and Z are identical, this holds only if the structure and normality assumptions of the model are attracted migrants from a great distance much more than did any other city in Great Britain, so it is considered to be a feasible option for all potential migrants. 54 Agricultural laborers’ wages are from Hunt, “Industrialization,” Table 6. They cover the years 1867-1870. Building laborers’ wages are from Hunt, Regional Wage Variation, Table 1.5, for the year 1886. As this series includes only 36 cities in England and Wales, the set of likely destination cities was limited to London plus the closest two, rather than the closest nine. City wages were proximity-weighted as in (16). 55 These data are from Southall, “Regional Unemployment,” Table 3. For a discussion of its appropriateness as a proxy for the unemployment experiences of other groups, see Southall and Gilbert, “Good Time to Wed.” 56 Unfortunately, the occupational information from the 1851 census cannot be used to determine individual unemployment within the sample. Specific instructions regarding the unemployed were not given to enumerators until 1861; many people out of work in 1851 reported the fact, but many simply reported the job they most recently held. 57 Lee, British Regional Employment Statistics. 22 exactly correct, “almost certainly too thin a reed upon which to base inference.”58 Fortunately, in this case there are some reasonable exclusion restrictions. Z includes quite a few variables not present in X: marital status, whether the person was living in the town of his birth in 1851, and all of the location-specific variables save age heaping and the regional dummies. X does not include quite so many unique variables: the eldest and inheritance dummies, father’s job class, and own job class in 1851. These, of course, do influence the individual’s migration decision, but only through their effect on (y * 1i ) − y 0*i . With these exclusion restrictions, the parameters of the model are identified, even if the assumptions of the model do not hold exactly. Empirical Results Occupational Class Attainment Results from estimating the two job class equations, (9), as well as from what may be considered the “baseline” model, an ordered probit estimation of job class in 1881 on X plus a dummy variable for moving to a city by 1881, are presented in Table 4. The columns labeled “Urban” show the estimates of β1 , the coefficients for those who chose to move to an urban area by 1881, and the columns labeled “Rural” show the estimates of β0 , the coefficients for those who remained in a rural place throughout the time period.59 The ancillary parameters ρ and k do not vary between the two groups. Interpretation of ordered probit results is not exactly straightforward. The coefficient estimates represent the effect of a change in the explanatory variable on the unobserved latent variable y*, job quality in this case. Job quality is a theoretical construct of the model; it has no units or ready interpretation. The coefficient estimates reveal the sign of the overall effect of the explanatory variable, the precision with which the effect is estimated, and the statistical significance level of the effect. For an understanding of the magnitude of the effect on the “real world variable,” here job class, the marginal effect of each dependent variable must be calculated. These are shown in Table 5, with the “Urban” and “Rural” columns as in Table 4. Also shown are the baseline probabilities of attaining each job class conditional on moving to a city or remaining in a rural place. These are calculated at the mean values of all explanatory variables. Each explanatory variable has a different marginal effect for each possible outcome of the ordered probit, so there will be a different effect for each of the five possible job classes that an individual could attain in 1881. Note that the sign of the overall effect of each explanatory variable is unambiguous (though perhaps not statistically significant from zero), but the sign will vary across 1881 job classes. A variable which positively influences job quality will positively influence the probability of being in a high job class but will negatively influence the probability of being in a low job class. As expected, the 1851 class 58 59 Johnston and Dinardo, Econometric Methods, p. 450. Again, these individuals did not necessarily remain in the same place; they could have moved from one rural area to another. 23 TABLE 4 MAXIMUM LIKELIHOOD RESULTS: DETERMINANTS OF JOB CLASS ATTAINMENT Job class, 1881 Variable Constant Job class Job class, 1881 Urban Rural (1) (2) (3) 0.449 -0.131 0.699 (.319) (0.758) (0.359) (†††) (†††) (†††) 1,2 - Prof., Intermed. 1.371*** 1.958*** 1.156*** (0.147) (0.289) (0.172) 3 - Skilled 0.354*** 0.403** 0.317*** (0.089) (0.193) (0.101) 4 - Partly Skilled 0.136 0.311 0.054 (0.109) (0.227) (0.126) 0.515*** 0.785*** 0.384*** (0.074) (0.162) (0.086) (†††) (†††) (†††) 1 - Professional 1.427*** 0.930*** 1.630*** (0.159) (0.283) (0.203) 2 - Intermediate 0.463*** 0.444** 0.366*** (0.101) (0.198) (0.121) 0.342*** 0.213 0.348*** (0.064) (0.142) (0.074) -0.134** -0.238 -0.117* (0.065) (0.148) (0.072) 0.201*** 0.133 0.224*** (0.046) (0.095) (0.053) -0.008 -0.028 0.004 (0.05) (0.103) (0.057) 0.052** 0.167*** 0.009 Student Father's class 3 - Skilled 4 - Partly Skilled Age discrep. (years) (†††) 0 1 Age Age 2 / 100 Eldest Inheritance Eldest*Inheritance Age heaping Living in town Job class, 1881 Variable (1) Industry (0.027) (0.059) (0.031) -0.505*** -0.023 (0.072) (0.162) (0.081) 0.03 0.111 0.002 (0.039) (0.081) (0.045) 0.335*** -0.102 0.552*** (0.11) (0.213) (0.13) -0.039 -0.252 -0.012 (0.108) (0.237) (0.121) Urban Rural (2) (3) -0.128 (†††) (†††) Agriculture -0.181* -0.199 (0.106) (0.234) (0.122) Building 0.280** 0.305 0.301** (0.128) (0.273) (0.144) Distributive 0.319** 0.126 0.361** (0.148) (0.308) (0.168) Mining Textiles Iron & Steel Other manuf. Region Lancashire London Environs Wales (†††) -0.152** Job class, 1881 Yorkshire Moved to city 0.065 0.167 0.083 (0.122) (0.309) (0.136) -0.114 0.033 -0.200* (0.102) (0.211) (0.119) -0.022 -0.049 -0.016 (0.144) (0.289) (0.166) 0.204** 0.271 0.192 (0.089) (0.189) (0.101) (†) (†††) 0.041 -0.015 -0.037 (0.089) (0.155) (0.12) -0.075 -0.353*** 0.001 (0.053) (0.109) (0.061) 0.166 0.113 0.219* (0.116) (0.313) (0.126) 0.126* 0.282* 0.045 (0.075) (0.154) (0.086) -0.005 (0.043) ρ1 -0.053 (0.100) ρ2 -0.324*** (0.088) k2 0.959*** (0.028) (0.031) k 3 2.429*** 2.408*** (0.038) (0.050) k 4 3.665*** 3.642*** (0.060) (0.076) 0.948*** -0.003 -0.005 -0.003 (0.003) (0.007) (0.004) 0.051 -0.084 0.050 Pseudo R2 0.092 (0.038) (0.083) (0.045) Log likelihood LR χ2 -4572.715 -6457.798 925.530 1284.230 Prob > χ2 0.000 0.090 0.000 * = Significant at the 10 percent level ** = Significant at the 5 percent level *** = Significant at the 1 percent level † = Jointly significant at the 10 percent level †† = Jointly significant at the 5 percent level ††† = Jointly significant at the 1 percent level Notes: N = 3,774. The two columns of equation (1) represent all 3,774, (2) represent the 896 who moved to a city, and (3) the 2,878 who remained in a rural place. 2 Standard errors are in parentheses. All explanatory variables use 1851 values. Pseudo R ≡ 1 - L1/L0, where L1 is the log likelihood of the model and L0 is the log likelihood of the “constant only" model. The LR χ statistic represents a test of all parameters of the model being equal to 0. Omitted dummies are Class, Father's class: 5, Age discrepancy: 2-5, Industry: Others, Region: Others. k1 is normalized to 0. 2 Source : Sample of matched individuals, 1851-1881. 24 TABLE 5 MAXIMUM LIKELIHOOD RESULTS: DETERMINANTS OF JOB CLASS ATTAINMENT Marginal effects of explanatory variables d[Pr(y=1)] / dx d[Pr(y=2)] / dx d[Pr(y=3)] / dx d[Pr(y=4)] / dx d[Pr(y=5)] / dx Urban Rural Urban Rural Urban Rural Urban Rural Urban Rural (1) (2) (1) (2) (1) (2) (1) (2) (1) (2) Job class 1,2 - Prof., Intermed. 0.3639 0.0921 0.3075 0.2739 -0.3423 -0.0452 -0.2437 -0.2151 -0.0855 -0.1057 3 - Skilled 0.0161 0.0081 0.0888 0.0596 0.0335 0.0515 -0.0847 -0.0636 -0.0538 -0.0555 4 - Partly Skilled 0.0120 0.0012 0.0682 0.0097 0.0273 0.0098 -0.0656 -0.0106 -0.0419 -0.0100 Student 0.0402 0.0103 0.1797 0.0734 0.0334 0.0591 -0.1594 -0.0774 -0.0939 -0.0654 Father's class 1 - Professional 0.0811 0.2007 0.2297 0.3490 -0.0668 -0.1784 -0.1720 -0.2581 -0.0720 -0.1131 2 - Intermediate 0.0206 0.0108 0.1022 0.0726 0.0237 0.0510 -0.0930 -0.0751 -0.0535 -0.0593 3 - Skilled 0.0074 0.0084 0.0453 0.0640 0.0228 0.0591 -0.0450 -0.0690 -0.0306 -0.0626 4 - Partly Skilled -0.0072 -0.0024 -0.0480 -0.0202 -0.0316 -0.0224 0.0494 0.0225 0.0374 0.0225 0 0.0044 0.0049 0.0277 0.0396 0.0155 0.0413 -0.0279 -0.0436 -0.0197 -0.0421 1 -0.0009 0.0001 -0.0059 0.0007 -0.0034 0.0007 0.0060 -0.0008 0.0042 -0.0007 0.0055 0.0002 0.0349 0.0016 0.0197 0.0017 -0.0352 -0.0018 -0.0248 -0.0017 -0.0166 -0.0005 -0.1055 -0.0040 -0.0595 -0.0042 0.1065 0.0044 0.0750 0.0043 0.0036 0.0000 0.0229 0.0004 0.0134 0.0004 -0.0233 -0.0004 -0.0166 -0.0004 Inheritance -0.0031 0.0202 -0.0206 0.1167 -0.0136 0.0568 0.0213 -0.1139 0.0160 -0.0798 Eldest*Inheritance -0.0065 -0.0002 -0.0477 -0.0021 -0.0402 -0.0022 0.0509 0.0023 0.0435 0.0022 Age heaping -0.0002 -0.0001 -0.0010 -0.0005 -0.0007 -0.0006 0.0011 0.0006 0.0008 0.0006 Living in town -0.0028 0.0011 -0.0176 0.0088 -0.0099 0.0093 0.0177 -0.0098 0.0125 -0.0094 Agriculture -0.0059 -0.0025 -0.0398 -0.0218 -0.0275 -0.0251 0.0413 0.0244 0.0318 0.0251 Building 0.0138 0.0092 0.0704 0.0607 0.0173 0.0400 -0.0646 -0.0624 -0.0369 -0.0475 Distributive 0.0047 0.0119 0.0276 0.0745 0.0117 0.0431 -0.0268 -0.0750 -0.0172 -0.0544 Mining 0.0065 0.0020 0.0368 0.0153 0.0142 0.0143 -0.0354 -0.0166 -0.0222 -0.0149 Age discrep. (years) Age Age 2 / 100 Eldest Industry Textiles 0.0011 -0.0035 0.0069 -0.0322 0.0037 -0.0425 -0.0069 0.0365 -0.0048 0.0417 Iron & Steel -0.0015 -0.0003 -0.0100 -0.0028 -0.0062 -0.0030 0.0102 0.0031 0.0075 0.0031 Other manuf. 0.0110 0.0049 0.0605 0.0362 0.0209 0.0310 -0.0573 -0.0388 -0.0351 -0.0333 Region Lancashire -0.0005 -0.0008 -0.0032 -0.0064 -0.0018 -0.0070 0.0032 0.0071 0.0023 0.0071 London Environs -0.0089 0.0000 -0.0658 0.0001 -0.0582 0.0001 0.0704 -0.0001 0.0625 -0.0001 Wales 0.0042 0.0060 0.0246 0.0427 0.0108 0.0324 -0.0240 -0.0449 -0.0156 -0.0363 Yorkshire 0.0122 0.0010 0.0642 0.0082 0.0188 0.0081 -0.0598 -0.0090 -0.0353 -0.0083 0.0127 0.0078 0.1456 0.1105 0.5183 0.4905 0.2436 0.2807 0.0799 0.1104 Pr[y =class | X =E(X )] Notes: Dependent variable y is job class in 1881. Effects are calculated at mean of x for continuous variables; for discrete variables, effect is Pr(y=c|x=1) - Pr(y=c|x=0), c=1,…,5. variables, both own and father’s, are strong influences on subsequent job quality attainment. In each case save one, the effects are as anticipated: being in a higher class in 1851 and being the son of a father in a higher class strongly predict ending up with a higher quality job in 1881. For example, Table 5 indicates that people who began in Class I or II and moved to the city were 31 percentage points more likely to end up in Class II than all others who moved to the city, and they were 36 percentage points more likely to end up in Class I. Given that the two baseline attainment probabilities are 15 and 1 percent, respectively, it was very difficult indeed to move up into a Class I or II job from a lower initial class. The only anomalous class is Class IV, for which the own-class effect is not significant and the father’s class effect is negative. This has a simple explanation: by far the most common Class IV job outside of cities was agricultural laborer. For agricultural laborers (or their sons) who moved to a city, the most commonly attainable job was that of “general laborer,” which could be all sorts of things (very often simply unskilled factory work) and which carried with it a class ranking of five. As a result many of the Class IV workers who migrated to the city ended up with a lower class job. It should be noted that they did not end up with lower paying jobs, on average. The average farm laborer in England and Wales earned ₤29.04 per year, while the average general, non-agricultural laborer earned nearly two thirds more, ₤44.83.60 It could be argued on this ground that general laborers should be ranked at least as high as agricultural laborers. Perhaps they should. There are other factors to consider, however, such as difficult to measure payment-in-kind to farm laborers, arguably better job quality of agricultural work, and perhaps occupying a higher, somewhat more dignified position on the social ladder (though still quite low). For the present purposes, the rankings will not be modified from Armstrong’s standard scheme. Perhaps most interesting with regard to the class variables is to compare their effect on those who moved to urban areas versus the effect on those who remained in rural places. A chi-square test reveals no significant difference between the combined effect of both own and father’s class on those who moved versus those who remained. However as Table 4 shows, for those who moved to a city, the effect of their own class in 1851 was stronger than the effect of their father’s class. The reverse was true for those who remained behind, the job class of their father being a stronger influence on their 1881 class attainment than was their own class in 1851. Also, the effect of own class was stronger for those who moved than those who did not, and the effect of father’s class was weaker, though only the former effect is statistically significant (the two χ2 statistics are different from zero with 97 and 80 percent confidence, respectively). It appears that leaving the countryside and moving to the city offered migrants a better chance to escape the intergenerational career trajectory begun with their father’s job. For those who moved, their own position mattered more than that of their father. It is also interesting to note that the benefit of being a student in 1851 was nearly twice as great for those who moved to a city as it was for those who remained 60 Tuttle, “Children,” p. 173. 26 rural. Students who moved to the city were 18 percentage points more likely to end up with a Class II job than nonstudent urban migrants; for those who remained in a rural place, the advantage was only 7 percentage points. As for other variables of interest, people who reported their age with consistency between the two censuses performed better in the labor market. The effect was particularly strong, and statistically significant, for nonmigrants, who were 8 percentage points more likely to end up with a Class II or III job and 4 percentage points less likely to find themselves in a Class V job than non-migrants who inconsistently reported their age by more than one year. The benefit did not apply to those who were off by only a year; they did not do appreciably better than those off by more than a year. Age positively influenced job quality attainment for urban migrants, though at a declining rate—17 years of age in 1851 was optimal. Having a potential inheritance was a strongly positive force, but only for non-migrants, as was expected. The effect was particularly positive for the attainment of Class II jobs, where it increased the probability by 12 percentage points. The industrial classification of the individual’s job in 1851 was also significantly influential only for persisters; starting out in the building and distributive trades (the latter being “dealers” of one sort or another) was beneficial to eventually getting a high quality job, while beginning in the textile industry was detrimental. For those who moved to an urban place, having an initial job in the agricultural sector was poor preparation for eventual success and starting out in any branch of manufacturing other than steel, iron, and textiles was helpful, though neither effect was statistically significant. Conversely, region of origin was an important factor only for those who moved. York seemed to be a good place from which to hail, while for whatever reason the immediate environs of London were disadvantageous. Self-Selection and the Determinants of Migration Who were the urban migrants? Table 3 gives an overview of their characteristics. 896 of the 3,774 individuals in the estimation sample moved from a rural to an urban area between 1851 and 1881, for an overall urban migration rate of 24 percent. Out of the entire sample of 28,474 matched individuals (which is younger than the population as a whole because of the survival effect in the matching process), 18,740 (66 percent) were living in a non-urban area in 1851. Of these, 4,387 moved to a city by 1881, for a rate of 23 percent over the thirty-year period. As for the characteristics of these migrants, most strikingly, they were not those clinging to the bottom of the economic and social ladder, desperate for any sort of a change. We see relatively more people with Class III jobs in 1851 and sons of fathers with Class III jobs among the pool of urban migrants than among the non-migrants. Just the reverse is true for Classes IV and V, which are represented more heavily among the rural persisters. Sons who were described as students in 1851 were also more prevalent within the group of urban migrants than the group who remained in rural places. It was decidedly not the case that the truly destitute and poverty stricken, those occupying the society’s lowest class, were pouring from the countryside into the cities. Rather, it was those who were middle-of-the-road— children who had received schooling, sons of fathers with skilled occupations, young men who themselves were employed in a skilled occupation—who left the rural areas of England and Wales and headed for the cities. 27 Table 6 presents results from the probit estimation of the urban migration decision, which yields a more ( accurate and detailed picture of the characteristics of urban migrants. 61 The large, positive coefficient on yˆ 1*i − yˆ 0*i ) indicates that in fact people were moving to the cities in order to improve their socioeconomic status and that the pull of this factor was strong relative to other factors. Interpreting the magnitude of the effect is not straightforward, as y* , job quality, is not a “real” variable. Every unit increase in ant icipated job quality difference between the rural and urban regimes increased the odds of an individual moving to a city by 7 percentage points. The estimated cut points from Table 6 shed a little light on this figure. The range of y* between different job classes varies from 0.95 for Class IV to 1.46 for Class III. So the prospect of being one class higher in the city than in the countryside (roughly an increase in y* of 1 to 1.5) increases the odds of migration by about 7-10 percentage points—a large increase considering that the baseline predicted probability of moving is 22 percent. Understanding the other determinants of migration is simpler. As expected, people who were not living in their place of birth and those living in towns were more likely to move to a city by 1881, by 5 and 8 percentage points, respectively. Taken together, this suggests at least the possibility of a sort of intergenerational step -wise migration, with fathers moving from the truly rural areas to small towns, and their sons subsequently moving to the bigger cities. This is purely speculative but not inconsistent with the data. 62 Age did not yield the usual negative effect, almost certainly because the sample is already restricted to the young. Nor did consistent age reporting yield the expected positive informational effect. Those working in agriculture and mining in 1851 were considerably less likely to move to a city by 1881. The location-level variables were also important factors in the urban migration decision. The economic incentive to move was not entirely captured by the effect of job quality; the average wage gap was also a significant influence. For every shilling per week of expected wage difference between the city and the countryside, an individual was about 1 percentage point more likely to move. As anticipated, the effect of the unemployment rate was not clear-cut. The effect of local urban unemployment by itself was significantly positive, indicating that it may have been proxying for local unemployment. The test for this is to interact unemployment with distance to nearest city. The coefficient is negative, indicating that the farther away was the nearest city, the more the effect of high urban unemployment was indeed negative. Surprisingly, the number of nearby cities exerted a significant, though small, negative influence. Those living in more agricultural counties in 1851 were slightly less likely to move to a city, and those living in the vicinity of London in 1851 were about 6 percentage points more likely to be living in a 61 The standard errors in this table are calculated by bootstrapping. Standard analytical estimates are not appropriate due to the fact that yˆ − yˆ 0*i is estimated rather than observed. * 1i 62 In future work I will look into this more closely, since the father’s birthplace is observed as well. 28 TABLE 6 PROBIT RESULTS: DETERMINANTS OF URBAN MIGRATION Move to city Variable coefficient (individual-specific) Move to city dF/dz -1.296*** coefficient (location-specific) (1) Constant Variable --- Distance to city (0.501) y 1i * – y 0i * 0.239* Wage gap 0.035** Previous migrants -0.016 -0.044 -0.013 (0.073) Age 0.020 -0.093 0.092 0.157*** 0.281*** -0.270*** 0.025 0.010 Mining -0.493*** 0.005 -0.002 0.002 -0.685* -0.204 (0.370) 0.508 0.151 (0.553) Region (†) Lancashire -0.076 0.108 0.033 (0.189) London Environs 0.007 0.194** 0.061 (0.095) Wales 0.003 (0.209) -0.0004 (0.005) 0.083 (0.155) Distributive 0.013 (0.002) Pct. in manuf. (0.087) Building -1.32E-06 (0.000) Pct. in agriculture (†††) Agriculture -0.006*** 0.048 (0.056) Industry Nearby cities Age heaping (0.055) Living in town -0.001*** 0.028 (0.163) Not in town of birth 0.043*** Unemp*Distance -0.028 (0.109) Married -4.43E-06* (0.013) 0.006 (0.038) Age 2 / 100 0.011 (-3.12E-06) Unemployment (0.065) 1 0.0003 (0.016) Age discrep. (years) -0.055 0.001 (0.004) 0.071 (0.127) 0 dF/dz (1) -0.230 -0.063 (0.184) Yorkshire -0.12 0.156 0.049 (0.139) (0.170) Textiles 0.056 0.017 Pseudo R2 0.065 -1933.311 0.025 Log likelihood LR χ2(94) Prob > χ2 0.000 Pr[D =1 | Z =E(Z )] 0.222 (0.107) Iron & Steel 0.083 (0.170) Other Manuf. -0.040 (0.082) 270.290 -0.012 * = Significant at the 10 percent level ** = Significant at the 5 percent level *** = Significant at the 1 percent level † = Jointly significant at the 10 percent level †† = Jointly significant at the 5 percent level ††† = Jointly significant at the 1 percent level Notes: N = 3,774, of which 896 moved to a city by 1881 . Standard errors are in parentheses, calculated by bootstrapping via data resampling with 250 repetitions. All dependent variables are 1851 values. Omitted dummies are Age discrepancy: 2-5, Industry: Others, Region: Others. dF/dz are marginal effects, F is Pr(D=1) evaluated at the mean of z, except for discrete variables, where the marginal effect is Pr( D=1|z=1) - Pr(D=1|z=0). Sources : see text. 29 city (not necessarily London) in 1881. Distance to nearest city had a positive effect, but it was very small and not statistically significant. The “family and friends” effect of previous migrants also was not significantly different from zero. We have seen that the urban migrants did not begin the time period as the dregs of the labor market. We turn now to the question of selection: whether the urban migrants and rural persisters were positively or negatively selected. Table 7 presents estimates for the three migration effect parameters defined in (12) and (14) .63 The first two summarize the selection process. TABLE 7 MEASURES OF THE MIGRATION EFFECT (Dependent variable is latent job quality in 1881, y*) estimate standard error 90% Confidence Interval 80% Confidence Interval (ϕ 1 ) 0.1337 0.0400 0.0676 0.1998 Selection of Urban Migrants 0.0824 0.1850 Selection of Rural Persisters (ϕ 0 ) -0.0982 0.0308 -0.1490 -0.1377 -0.0474 -0.0587 Treatment Effect (τ ) 0.2087 0.1563 -0.0494 0.0082 0.4668 0.4092 Notes: Standard errors and confidence intervals are calculated by bootstrapping via data resampling with 250 repetitions. Urban migrants were positively selected while rural persisters were negatively selected. Both results are statistically significant at the 1 percent significance level. Not only did the migrants perform better in the urban labor market than the persisters would have, they also would have outperformed the persisters in the rural labor market had they chosen not to migrate. In this sense, then, urban migrants were the “cream of the crop.” They were those whose labor market prospects were brightest.64 Treatment Effect The central question of the present study concerns the effect of moving to a city on the ability of migrants to attain high quality jobs; i.e. the treatment effect of moving to a city. The estimate of the treatment effect, along with standard error and confidence intervals, are shown in Table 7. The treatment effect is positive. Though the estimate of τ is not statistically significant according to the standard two-tailed test, the null hypothesis that τ1 ≤ 0 can be 63 As before, the standard errors are calculated by bootstrapping. Typical analytical standard errors are inaccurate since the X values used to calculate the migration effect parameters are conditional means rather than observed values. This distortion is small, however, and the reported bootstrapped standard errors are very close to the analytical ones. 64 In this aspect they appear to be different from rural-urban migrants in the 19 th century U.S. Ferrie reports that urban migrants in the U.S. between 1850 and 1860 were negatively, rather than positively, selected. They fared worse in the cities than rural persisters would have had they chosen to move. See Ferrie, “Down on the Farm,” p. 11. 30 rejected at the 10 percent significance level. Moving to the city allowed the average individual to obtain a better job than he would have been able to get had he remained in a rural place. Magnitude is best interpreted by considering job class transitions rather than effects on the latent variable * y . Also, it is informative to examine exactly for whom the treatment effects were largest. This information is presented in Table 8. Here the treatment effect of moving to a city is considered in terms of the effect of a move on occupational mobility, both own mobility and intergenerational. The table gives the probabilities that an individual will attain either a higher or lower occupational class (respectively, “up” and “down” in the column headings), conditional either on moving to an urban place or remaining in a rural area.65 In the first five rows, “higher” and “lower” are interpreted in terms of the individual’s own class in 1851, and in the second five in terms of father’s class. Class in 1851 = 1 or 2 Class in 1851 = 3 Class in 1851 = 4 Class in 1851 = 5 Class in 1851 = student Father's class = Father's class = Father's class = Father's class = Father's class = TABLE 8 TREATMENT EFFECT BY CLASS (Dependent variable is job class in 1881) Pr(up|urban) Pr(up|rural) Pr(down|urban) Pr(down|rural) 0 0 0.1976 0.3874 0.2131 0.1641 0.2533 0.3148 0.5188 0.4282 0.1598 0.2215 0.8374 0.8260 0 0 0.2510 0.1793 0.2153 0.2941 1 2 3 4 5 0 0.0377 0.2225 0.5326 0.8940 0 0.0341 0.1602 0.4219 0.8140 0.8376 0.7067 0.2432 0.1516 0 0.7432 0.7224 0.3206 0.2264 0 Net Gain 0.1898 0.1105 0.1523 0.0115 0.1505 -0.0944 0.0193 0.1398 0.1855 0.0800 Net gain (of moving to a city) is defined as [Pr(up | urban) - Pr(up | rural)] + [Pr(down | rural) - Pr(down | urban)] (17) For example, the average individual who held a Class IV job in 1851 had a 49.12 percent chance of moving up to a Class I, II, or III job if they moved to a city and a 41.18 percent chance if they chose to remain in a rural place. That same person would have a 17.73 percent chance of falling to a Class V job in the city and a 23.42 percent chance of falling in the countryside. So moving to the city confers a 7.94 percentage point boost in the probability of making 65 For the purpose of this exercise, individuals’ classes as students in 1851 were considered to have moved up if they obtained a Class I or II job by 1881, and they were considered to have moved down if they held Class IV or V jobs in 1881. 31 an upward move and offers a 5.69 percentage point lower likelihood of falling to a lower class, for a net gain of 13.63 percentage points. The first important thing to note from Table 8 is that the gains from urban migration were realized across all socioeconomic strata. The only measure of net gain that is not positive is that of intergenerational mobility for sons of fathers with Class I jobs (less than 2 percent of the sample); they were less likely to experience downward mobility if they remained in a rural area. All individuals were more likely to improve their own socioeconomic status if they migrated to a city, and sons of fathers belonging to any class other than I were more likely to end up in a higher strata by 1881 than their fathers occupied in 1851 if they moved. The second feature to note from the table is those groups for whom the treatment effect of migration was the largest. The highest gains in occupational mobility are realized by those who hold Class I or II jobs in 1851; they are much less likely to fall to a lower class if they move to a city than if they do not. As Table 2 shows, however, this group represents a small minority (about 2 percent) of the sample. By far the largest groups are those who begin with a Class III or IV job or as a student. The net gain of moving to a city was large for all these individuals. For the average student, Class IV worker, and Class III worker, the net gain of moving to a city was 15.05, 13.23, and 11.05 percentage points percent, respectively.66 About 70% of the sample had fathers with Class III or IV jobs. These men realized net gains to urban migration, in terms of intergenerational occupational mobility, of 13.98 and 18.55 percentage points, respectively. Consider, then, the average person who held a Class IV, partly skilled job in 1851, lived in the countryside, and subsequently decided to leave and move to a city. Such an individual was 21 percent more likely to improve the quality of his occupation than if he had remained in a rural area (52 versus 43 percent), and he was 39 percent less likely to fall into a Class V, unskilled occupation (16 versus 22 percent). The average rural son of a Class IV worker was 26 percent more likely to attain a higher status than his father if he moved to an urban place than if he remained in the countryside, and he was 49 percent less likely to find himself with a Class V occupation. These are substantial gains. For people like this, moving to the city was an important way to improve their standing in the labor market. Conclusions Large-scale rural-urban migration was one of the most prevalent demographic features of 19th century Britain. Cities grew more than twice as fast as the nation as a whole during this time, and in the process Britain became the world’s first predominantly urban society. Of the young men moving from the countryside to the cities, the skilled and educated were the most prevalent. They were neither the poorest nor the richest rural residents. In one 66 As a group, individuals with Class 4 jobs in 1851 realized strong net gains from moving to urban areas even though many of them made the downward move from agricultural laborer to general laborer. But as has been noted, these people actually received 65 percent greater annual earnings, on average. If agricultural and general laborers were put into the same class, or if general laborers were ranked above agricultural, then the treatment effects would be even stronger. 32 important sense, they were the best that the pool of rural labor had to offer. Urban migrants were positively selected; they were those whose labor market prospects were brightest, whether in the cities or in rural areas. In this aspect, the nature of rural-urban migration in Victorian Britain is very different from what it was during much of the 20th century, when the urban migrants typically were those with the poorest prospects. In 19th century England and Wales, these migrants left for the cities for a variety of reasons, but chief among them was a desire to improve their economic condition. Expected increases in socioeconomic status over what could be obtained in rural England and Wales and higher average wages in urban than in rural areas drove people from the countryside and small towns into the cities. People left to find better jobs, and they found them. Urban migrants were more likely to improve their socioeconomic status (occupational class) than they would have been had they remained in a rural area, and they were more likely to experience upward intergenerational occupational mobility. The effect was strong for the average rural resident of Great Britain, and it was even stronger for the sort of person who was in fact inclined to move. Individuals from all socioeconomic strata realized these positive returns to migration. In this aspect as well then, the experience of the urban migrant in Victorian Britain was very different from that of the migrant in the modern developing world, who is quite likely to experience negative returns to urban migration. It is beyond the scope of the current study to attempt to explain these differences, though it is possible that British industry and the British economy as a whole was in many ways more developed in the mid to late 19th century than were those of many Third World countries in the late 20th century, and that this difference is germane in some way to the effect of urban migration. Perhaps even more importantly, the institutional structure of 19th century Britain surely was very different from that of the modern Third World. Victorian Britain represented the height of laissez faire political economy. The same certainly cannot be said of most of today’s Third World, particularly in Africa and Latin America. The free market economy of Britain from 1851 to 1881 may have provided more stable urban employment opportunities and more potential for advancement across socioeconomic strata than is true in the modern developing world, where a small minority of migrants experience tremendous gains and the large majority suffer loss. References Amemiya, T. Advanced Econometrics. Oxford: Basil Blackwell, 1986. Anderson, Michael. National Sample from the 1851 Census of Great Britain: Introductory User Guide. University of Edinburgh, Dept. Of Economic and Social History, September 1987. Armstrong, W. A. “The Use of Information about Occupation,” in E. A. Wrigley, ed., Nineteenth-century Society. Cambridge: Cambridge University Press, 1972. Baines, D. Migration In A Mature Economy: Emigration and Internal Migration in England and Wales, 1861-1900. Cambridge: Cambridge University Press, 1985. _______ “Population, Migration and Regional Development, 1870-1939” in R. Floud and D. McCloskey, The Economic History of Britain Since 1700, vol 2. Cambridge: Cambridge University Press, 1994. Bédarida, F. A Social History of England, 1851-1975. London, 1979. 33 Black, D., and V. Henderson. “A Theory of Urban Growth.” Journal of Political Economy 107, no. 2 (1999): 25284. Booth, C. (ed.) Life and Labour of the People in London, 9 vols. London, 1892-97. Boswell, J. Life Of Samuel Johnson. New Haven: Yale University Press, 1994. Boyer, G. R., and T. J. Hatton. “Migration and Labour Market Integration in Late Nineteenth-Century England and Wales.” Economic History Review 50, no. 4 (1997): 697-734. Crouzet, F. The Victorian Economy. New York: Columbia University Press, 1982. de Vries, J. and A. van der Woude. The First Modern Economy : Success, Failure, and Perseverance of the Dutch Economy, 1500-1815. New York : Cambridge University Press, 1997. Encyclopædia Britannica Online, “Population.” [http://www.eb.com:180/bol/topic?eu=115183&sctn=23] [Accessed 3 Sep 2001]. Ferrie, J. P. “A New Sample of Americans Linked from the 1850 Public Use Sample of the Federal Census of Population to the 1860 Federal Census Manuscripts.” Historical Methods 29 (Summer 1996). _______. “Migration To The Frontier In Mid-Nineteenth Century America: A Re-Examination Of Turner’s ‘Safety Valve.’” prepared for the World Congress of Cliometrics, July 1997. [http://www.faculty.econ.nwu.edu/faculty/ferrie/papers/munich.pdf] _______. “How Ya Gonna Keep ‘Em Down on the Farm [When They've Seen Schenectady]? Rural to Urban Migration in the U.S., 1850-70,” prepared for the Cliometrics Conference, May 1999. [http://www.faculty.econ.nwu.edu/faculty/ferrie/papers/urban.pdf] _______. Yankeys Now: Immigrants in the Antebellum United States, 1840-1860. New York: Oxford University Press, 1999. Floud, R. and D. McCloskey. The Economic History of Britain Since 1700, 2nd ed., 3 vols. Cambridge: Cambridge University Press, 1994. Friedlander, D. “Occupational Structure, Wages, and Migration in Late Nineteenth-Century England and Wales.” Economic Development and Cultural Change 40 (1992): 295-318. Friedlander, D., and R. J. Roshier. “A Study of Internal Migration in England and Wales: Part I.” Population Studies 19 (1966): 239-79. Great Britain, General Register Office. The Registrar-General's Decennial Supplement. England and Wales. London: H.M.S.O., 1921. _______. Census of England and Wales, 1921. Classification of Occupations. London: H.M.S.O., 1924. _______. Census of England and Wales, 1951. Classification of Occupations. London: H.M.S.O., 1956. Harris, J. R., and M. P. Todaro. “Migration, Unemployment, and Development: A Two-Sector Analysis.” The American Economic Review 60, no. 1 (1970): 126-142. Hatton, T. J., and J. G. Williamson. “What Drove the Mass Migrations from Europe in the Late Nineteenth Century?” Population and Development Review 20 (September 1994): 533-59. Heckman, J. “Varieties of Selection Bias.” The American Economic Review 80, no. 2 (1990): 313-18. Higgs, E. A Clearer Sense of the Census. London: HMSO, 1986. Hunt, E. H. Regional Wage Variations in Britain, 1850-1914. Oxford: Oxford University Press, 1973. _______. “Industrialization and Regional Inequality: Wages in Britain, 1760-1914.” Journal of Economic History 46, no. 4 (1986): 935-966. Johnston, J., and J. DiNardo. Econometric Methods, 4th ed. New York: McGraw-Hill. Krugman, P. “The Current Case for Industrial Policy,” in Salvatore, D., ed., Protectionism and World Welfare. Cambridge: Cambridge University Press, 1993. Lee, C.H. British Regional Employment Statistics, 1841-1971. Cambridge: Cambridge University Press, 1971. Lee, L. F. “Unionism and Wage Rates: A Simultaneous Equation Model with Qualitative and Limited Dependent Variables.” International Economic Review 19: 415-33. Lucas, R. E., Jr. ‘‘On the Mechanics of Economic Development.’’ Journal of Monetary Economics 22 (July 1988): 3–42. Maddala, G. S. Limited-Dependent and Qualitative Variables in Econometrics. Cambridge: Cambridge University Press, 1983. 34 Manski, C. F. “Nonparametric Bounds on Treatment Effects.” The American Economic Review 80, no. 2 (1990): 319-23. Mayhew, K., and B. Rosewell. “Occupational Mobility in Britain.” Oxford Bulletin of Economics and Statistics 43 (1980): 225-255. Mokyr, J. Why Ireland Starved. London: George Allen & Unwin, 1983. Newey, W. K., J. L. Powell, and J. R. Walker. “Semiparametric Estimation of Selection Models”Some Empirical Results.” The American Economic Review 80, no. 2 (1990): 324-28. Pooley, C. and J. Turnbull. Migration and Mobility in Britain Since the 18th Century. London: UCL Press, 1998. Ravenstein, E. G. “The Laws of Migration.” Journal of the Royal Statistical Society 48 (1885): 167-227. Redford, A. Labour Migration in England, 1800-1850, 2nd ed. Manchester: Manchester University Press, 1964 (1926). Robinson, C., and N. Tomes. “Self-Selection and Interprovincial Migration in Canada.” Canadian Journal of Economics 15, no. 3: 474-502. Royle, S. A. “Social Stratification from Early Census Returns: A New Approach.” AREA 9 (1977): 215-219. Shryock, H., and J. Siegel. The Methods and Materials of Demography, 2 vols. Washington: U.S. Dept. of Commerce, Bureau of the Census, 1973. Southall, H. “Regional Unemployment Patterns Among Skilled Engineers in Britain, 1851-1914.” Journal of Historical Geography 12, no. 3 (1986): 268-286. Southall, H., and D. M. Gilbert. “A Good Time to Wed?: Marriage and Economic Distress in England and Wales, 1839 to 1914.” Economic History Review 49 (1996): 35-57. Todaro, M. P. Economic Development in the Third World. New York : Longman, 1989. Trost, R. P. “ Demand for Housing: A Model Based on Inter-Related Choices Between Owning and Renting.” Unpublished Ph.D. dissertation, University of Florida. Tunali, I. “Rationality of Migration.” International Economic Review 41, no. 4 (2000): 893-920. Tuttle, Carolyn. “The Role of Children in the Industrial Revolution.” Unpublished Ph.D. diss., Northwestern University, 1985. United Nations Centre for Human Settlements (Habitat), “General Assembly, Special Session for an Overall Review and Appraisal of the Implementation of the Habitat Agenda.” New York, June 2001. [http://www.unchs.org/Istanbul+5/bg10.htm] Williamson, J. G. Coping with City Growth During the British Industrial Revolution. Cambridge: Cambridge University Press, 1990. 35
© Copyright 2026 Paperzz