New Media, Institutions, and Economic Change: Evidence from the

New Media, Institutions, and Economic Change:
Evidence from the Protestant Reformation
Jeremiah Dittmar and Skipper Seabold∗
Version 0.1 – Preliminary & Incomplete Draft
Please Do Not Circulate
September 27, 2013
Abstract
The printing press was the great information technology revolution of the European Renaissance. During the Protestant Reformation of the early 1500s, religious
reformers employed print media to disseminate their ideas. We construct a new
measure of religious content in the media using data on all known books and pamphlets printed in German-speaking Europe 1450-1600 (100,000+ publications). We
apply the measure to study the diffusion of the ideas of the Reformation across cities
and time. We document the relationship between the diffusion of religious ideas
and the characteristics of local media markets and variations in diffusion induced
by geography. At the city level, we find that Protestant content was associated
with increases in educational and non-religious print media in high frequency data.
We document that cities with high levels of exposure to Protestant ideas were more
likely to adopt the institutions of the Reformation using new data on municipal laws
of the 1500s that established Europe’s first state-run education system. We show
that these legal institutions explain subsequent cross-county variations in literacy.
∗
Dittmar: London School of Economics & American University. Email: [email protected].
Seabold: American University. Email: [email protected]. We thank Russ Gasdia for research
assistance, discussion, and suggestions. Dittmar acknowledges support through the Deutsche Bank
Membership at the Institute for Advanced Study (Princeton, NJ) and the National Science Foundation.
1
1
Introduction
Sometimes people develop ideas and social movements that radically alter economic institutions, culture, and beliefs. The use of internet-based communications technologies
in contemporary social movements such as the so-called Color Revolutions, Arab Spring,
and Occupy Wall Street has raised questions about the role of new media in the diffusion
of beliefs, the impact of exposure to new ideas, and institutional change. This research
examines the diffusion of ideas in new media and institutional change over one of the
most significant upheavals in European history – the Protestant Reformation.
The new media used in the Protestant Reformation were print media. Before the invention of the movable type printing press the production of texts had been concentrated
in religious establishments often located outside cities. The printing press was invented
in Mainz, Germany around 1454. Between 1454 and the early 1500s the technology
spread to cities across Europe.1 The price of text data used to store and disseminate
ideas declined. Book prices fell by over 80% between the 1450s and the early 1500s.
The Protestant Reformation was the first mass movement to exploit the new information technology and the first religious movement to successfully breach the monopoly
of the Catholic church in Western Europe (Edwards 1994, Brady 1999). In 1517, Martin Luther posted a set of theses calling for a reform of Church practices in the city of
Wittenberg, in Northeast Germany. In a matter of weeks, Luther’s ideas were being disseminated in print across German-speaking Europe. Over the coming years, Protestant
and Catholic thinkers disseminated their ideas in the new media.
This research addresses three principal questions: What factors determined the diffusion of Protestant ideas in the media? How did the diffusion of Protestant ideas change
media markets more broadly? And what were key impacts of religious and institutional
reform? These two questions reflect religion’s “two-way interaction with political economy” (McCleary and Barro 2006).
Historians argue that print media were critical to the success of the Reformist challenge to the religious monopoly of the Catholic church (Brady 1999, Ozment 1981).2
However, previous social science research has not systematically documented the diffusion of Protestant ideas in print in quantitative terms and the existing data on historic
1
See Dittmar (2011) for a discussion of the diffusion process, including evidence that the technology
diffused from Mainz rather than from other centers.
2
Print media was important despite the fact that literacy levels were low (Edwards 1994). Print media
was read aloud and the ideas transmitted in print were further circulated in sermons and conversations.
On literacy and the impact of print media see Brady (1999) and Scribner (1982) and discussion below.
2
print media do not systematically categorize books or authors by religion.3
A substantial social science literature argues that Protestantism transformed Europe’s
economic landscape, but there is mixed evidence on the economic impact of Protestantism
in historic Germany. Arguments running back to Max Weber (1930) have suggested that
Protestantism fostered a work ethic and social norms conducive to commerce and growth.
Economists have suggested that Protestant beliefs generated a demand shift for literacy
that explains variations in technology adoption and economic outcomes observed across
Prussian counties by the mid-1800s (Becker and Woessman 2009; Becker, Hornung, and
Woessman 2011). However, city growth was historically associated with economic dynamism and there is no evidence that cities in territories where rulers adopted Protestantism in the 1600s enjoyed any subsequent growth advantage (Cantoni 2010).
The first contribution of this research is to construct a new, high frequency measure of
religious content in print media at the city and firm levels. We assemble new data on all
known books and pamphlets printed in German-speaking Europe (100,000+ publications
printed in 200+ cities). We use historical sources to identify 500+ leading Protestant
and Catholic authors – authors responsible for 18% of the books printed in Germany
1450-1600. We identify the language characteristic of Protestant and Catholic authors in
long, historical book titles using statistical models from natural language processing.4 We
then generate a measure of the religious content of print media at the firm- and the citylevel that captures how similar the language of their output is to the language of known
Protestant and Catholic writers.5 In contrast to our high frequency measure of content
3
For example, the Universal Short Title Catalogue (USTC 2012) which we discuss below simply classifies book subject matter as “religious” or not. The work that comes closest to the spirit of this paper
is Edwards (1994), which examines a sample of pamphlets from the first half of the 1500s that comprised 3,183 pamphlets authored by Martin Luther and 1,763 pamphlets authored by Catholic activists.
Consistent with the evidence presented below, Rubin (2011) finds that Protestantism was more likely
to be adopted in cities that had a printing press in 1500. However, the data in Rubin (2011) do not
document the spread of Protestant media and do not untangle the supply and demand puzzle around
whether Protestantism was adopted due to the supply of religious media content or whether printing
may have been associated with growth and the emergence of social groups that had a higher demand
for (were more receptive to) religious innovation. Similarly, the existing data and previous work do not
identify the firms producing content.
4
In the data section below, we describe these titles. The median title is 21 words long (the mean title
is 20.4 words). The median title has 153 characters (mean 149.6). For comparison, twitter messages are
no longer than 140 characters. As discussed below, estimation strategies similar to ours are widely used
to classify the content of tweets, spam email, and other short texts.
5
For similar estimation strategies see Taddy (2012) and Gentzkow and Shapiro (2010), which measures
the “slant” (ideology) of US newspapers by determining whether they employ language similar to the
language used in speeches by Democratic or Republican representatives in the US Congress. Details of
the estimation strategy are discussed below. The data we construct identify the author, city and year of
publication, and the printing firm for each book. We identify the firm producing each individual book
from the printer information contained on the front pages of historical books (see below and Dittmar
2012 for details on the construction of firm-level data).
3
in media, previous work has relied on measures of religious belief observed centuries after
the Reformation (Becker and Woessman 2009) or on the binary Protestant-or-Catholic
religious choice made by rulers in the 1600s (Cantoni 2010).6
The second contribution of this paper is to document (1) the relationship between
the diffusion of Protestant print media and the characteristics of local media markets
and (2) variations in diffusion induced by geography. We show that within cities the
firms that were more likely to become producers of Protestant content were younger,
larger, and had prior specialization in producing humanistic content. Across cities, we
find locations where the vernacular (German) share of output was relatively large were
more likely to adopt Protestant content than locations specializing in Latin print media.
We document that distance from Martin Luther’s base in Wittenberg was significantly
and negatively associated with the diffusion of Protestant content. We further estimate
the cost of inter-city trade using new data on 2,000+ individual level book and pamphlet
purchases made in 42 cities over the 1500s.
The third contribution of this paper is to present new evidence on the relationship
between exposure to Protestant media, media markets, institutions, and economic outcomes at the city level. We document the positive relationship between exposure to
Protestant media and the diffusion of educational and non-religious vernacular print media 1517-1600. This finding provides quantitative support for narrative evidence on the
relationship between media, reform, and lay culture previously examined by historians in
select cities (Chrisman 1981). We also document the relationship between local diffusion
of religious content in media and the adoption at the city and territory level of the legal
institutions of the reformation. The legal institutions we consider are 2,000+ church ordinances (evangelische kirchenordenung) that both stipulated correct religious doctrine
in law and made provision for the organization, funding, and oversight of Europe’s first
public education system. We show that these laws explain cross-county variations in
literacy and development in the mid-1800s.
Section 2 provides a distilled overview of the history of the Reformation. Section
3 describes the data. Section 4 presents the estimator used to measure the religious
content of print media. Section 5 documents the diffusion of religious content in the
media. Section 6 examines the relationship between the diffusion of Protestantism ideas,
institutions, and economic outcomes.
6
Basten and Baetz (2013) examine differences in contemporary preferences using a fuzzy regression
discontinuity along a border between Protestant and Catholic cantons in Switzerland.
4
2
History
In this section we present a highly distilled history of the Reformation.7 The Reformation
began as a protest of Catholic clerics and scholars against church institutions and their
superiors. It quickly became a mass movement.
Two key features distinguish the Protestant Reformation from previous attempts to
challenge church institutions and practices. First, when the Catholic church attacked
the protesting clergy, the reformers responded developing and dissemminating their arguments in print media. Second, politically active laymen adopted and adapted these
calls for reform – and pressed them on their governors. “This blending coalition – of
reformers’ protests and laymen’s political ambitions – is the essence of the Reformation”
(Cameron 1991, p. 2).
The Reformation was based on a critique of existing church institutions and practices.
Reformers called for moral renewal within cities (Moeller 1962), emphasized their belief
that biblical authority was paramount over and above the authority of existing church
institutions (Brady 1975), and were typically though not always anti-clerical (Goertz
1975). The reforms that were at the heart of the Reformation included the abolition of
the Catholic rite mass, rejection of the argument and rule that clergy should be celebate,
and moves to eliminate church corruption.
The reformers produced and set-up new legal institutions governing religious practice and education. The legal institutions of the Reformation were church ordinances
(evangelische kirchenordnungen and schulordnungen) passed by magistrates, municipal
councils, and princes at the city and territory level. These ordinances provided for the
first state-run public education system in Europe. The ordinances established schools,
sources of funding for these schools, and oversight mechanisms.8 The oversight mechanisms included regular and formal “visitations” in which the physical assets of local
schools would be inspected and both student and teacher performance would be assessed
and recorded.9 As discussed below, we examine data on some 2,000+ ordinances.
7
There is a very rich literature on the history of the Reformation to which we only allude here. We
ask historically sensitive readers to make allowances for the fact the current research is directed at an
audience of economists. Similarly, our use of techniques from economics and machine learning should be
seen as complementing – not substituting for – research based on primary sources and local case studies.
8
A small subset of ordinances provide explicit terms of teacher pay.
9
Formal proclamations were printed and sent to each local authority before each visitation. Typically,
visitations occurred once per year. Local church officials were required to submit a tabulation of revenues
and inventory of relics. Preachers and towns and village people were formally questioned on doctrine
and the adequacy of funding for local schools by inspectors who then submitted written reports. See
Strauss (1981).
5
The Reformation is usually dated to October 1517, when Martin Luther posted a
set of hand-written theses criticizing church corruption in Wittenberg. Luther was a
Dominican monk and professor of theology at the University of Wittenberg. Luther’s
theses notably criticized the Catholic church’s practice of selling indulgences which were
believed to secure the early release of dead relatives from purgatory in the afterlife. The
proceeds from the sale of indulgences were used to finance church investments (e.g. the
basilica of St. Peter in Rome) and consumption.10 Luther circulated his theses in letters
to three correspondents. Within weeks they were being printed throughout Germanspeaking Europe.
Cities and urban actors played a central role in the development and diffusion of
Reformation ideas and institutions (Moeller 1962).11 Cities had the concentrations of
people and levels of literacy and cultural sophistication to put the ideas of religious
reformers, preachers, and pamphleteers at the top of the political agenda in the 1520s
(Brady 1998, Ozment 1975). The ideas of the Reformation diffused within and across
cities and were first formalized at the city level in municipal ordinances in the 1520s and
1530s. The adoption of the Reformation at the territory or principality level came later,
after city-level adoption.12
Historians argue that print media played a central role in the diffusion of the Protestant Reformation (Edwards 1994, Brady 1999). In the early years of the movement,
Reformist ideas were innovations with very uncertain prospects that had not yet diffused
through the population. As Chrisman (1980, p. 29) observes, “The importance of the
printers lay neither in birth, wealth, property, nor political power...It lay in their control
of the printed word. Ultimately, their decision to print or not to print a particular book
or tract could have an immediate effect on political and religious events and, in a time of
rapid change, on institutions. The most striking example of their influence can be seen
in the religious publication of the pivotal years of the Reformation.” Beyond the impact
of the particular book or tract, the overall shift in ideas and increase in quantity of print
media influenced events. “It was the superabundance, the cascade of titles, that created
the impression of an overwhelming tide, an unstoppable movement of opinion...Pamphlets
and their purchasers had together created the impression of irresistible force.” (Pettegree
2005, p. 163). Tens and even hundreds of thousands of copies of individual Lutheran
pamphlets were printed and Martin Luther became the first best-selling author. As a
10
In his 86th thesis, Luther asked: “Why does the pope, whose wealth today is greater than the wealth
of the richest Crassus, build the basilica of Saint Peter with the money of poor believers rather than
with his own money?”
11
The Reformation was by no means restricted to cities. Religious ideas were notably central to the
so-called peasants war of 1525 (Blickle 1979).
12
Cantoni (2010) examines the diffusion of the Reformation at the territorial level.
6
counterfactual to the success of Protestantism in the print age, historians have examined
the failure of pre-print heresies such as Lollardism in England and the Hussite movement
in Bohemia in the late 1300s and early 1400s. The suppression of the Hussite movement
notably involved the burning of some 200 Hussite manuscripts in Prague in 1410 (Cheney
1936).
The decentralization of the Holy Roman Empire limited the emperor’s capacity to
regulate religious and political speech and was an important feature of the environment
in which the Reformation developed and spread. The Holy Roman Empire was composed
of a large number of semi-autonomous principalities, as well as 85 so-called free imperial
cities. The free imperial cities were jurisdictionally free from the control of local territorial
lords and princes and in theory answerable only to the emperor.13 In practice, these
cities had considerable autonomy in governance.14 In this context, suppression of dissent
was relatively costly and in fact slow to be tried. Printers producing Protestant media
were not censored in the early years of the Reformation and the Edict of Worms (1521)
banning Luther’s work was itself not rushed into print (Brady 1985, p. 153). By the
mid-1520s, magistrates and urban governments – notably in the free cities – begin to
defy the emperor and institute ordinances institutionalizing the Reformation. In 1526, a
formal magesterial right to reform (ius reformandi) was passed into law by the Imperial
Diet or parliament of the Holy Roman Empire (Brady 1995, p. 55).
The ideas of the Reformation spread through a two-part process and were followed
by staggered institutional change (Edwards 1994). First, print media impacted “opinion
leaders”, notably clergy and educated laymen.15 Opinion leaders then transmitted ideas
orally to the broad public and developed a militant popular movement. At the city level,
the Reformation typically reflected a basic conflict within cities between lower and middle
burghers on the one hand and increasingly plutocratic and oligarchic elites on the other
(Ozment 1975, p. 121-3).
Institutional change came after the diffusion of the ideas of the Reformation as the
last and decisive steps in formalizing the Reformation (Ozment 1975). However, even
the institutional changes were gradual. In Wittenberg, Catholic mass was not abolished
until 1525. In Strasbourg, Catholic mass was restricted to specific churches in 1525 and
13
The number of cities with the legal status of free imperial cities fluctuated over time. There were
85 such cities listed in the imperial budget (Reichsmatrikel ) of 1521.
14
The empire as a whole had a representative assembly called the Diet. The Diet provided representation for the different “estates” involved in governance. These included: seven electors (seven princes),
the college of imperial princes, and the college of imperial cities.
15
A number of significant lay proponents of the Reformation were city clerks. For example, Lazarus
Spengler was the town clerk of Nürnberg and published a defence of Luther that was printed in Basel in
1520. Jörg Vogeli became city clerk of Konstanz in 1524, and defended three local Reformist preachers
in a series of publications over the 1520s.
7
abolished only in 1529. In Nürnberg, the legal institutions of the Reformation were
instituted between 1520-1533. In Osnabrück, the process lasted from 1521 to 1542. By
the middle 1500s, Lutheran Protestantism in historic Germany acquired the geographic
distribution it would maintain for several centuries (Brady 1998, p. 373). The Peace of
Augsburg (1555) established the rule of cuius regio, euis religio (whose rule, his religion)
with exceptions for mixed cities where Protestants and Catholics were to share churches
and magistracies (Brady 1998, p. 375).
3
Data
We focus attention in this research on new media and institutional change in the Germanspeaking regions of the Holy Roman Empire.
The primary source for data on print media is the Universal Short Title Catalogue
(USTC 2012) database. The USTC is designed as a universal catalogue of all known
books printed in Europe 1450-1600.16 We rely on Reske (2007), which identifies 384
German-speaking cities with printing.17 Figure 1 shows the boundaries of the principalities comprising the Holy Roman Empire and the German-speaking cities where print
media was produced 1450-1600. Our database comprises 101,042 titles that were printed
1457-1600 in these cities.
Figure 2 shows the number of book titles printed each year in German-speaking
Europe. The unit of analysis in this paper is the book title (edition), which can be
thought of as a variety.18 Figure 3 shows the dynamics of book production across Europe
by subject.
We construct data on the religious affiliation of authors from several sources. Klaiber
(1978) provides data on 564 Catholic authors and “controversialists” working in German16
The closest competitor dataset is the Consortium of European Research Libraries’ Heritage of the
Printed Book (HPB) database. The HPB is essentially a master catalog of the major national research
library catalogs. However, since these catalogs are themselves incomplete for several countries the USTC
provides much more comprehensive coverage. In the Appendix, we present evidence that rules out the
leading concern around survivorship bias due to historical events that might distort the data on historical
books.
17
The set of cities in our data includes cities now in Austria, France (e.g. Metz and Strasbourg),
Switzerland (e.g. Zürich and Basel), Poland (e.g. Gdansk, historically known as Danzig, and Szcezecin,
historically known as Stettin), the Czech Republic (e.g. Brno or Brunn), and Russia (Kaliningrad
or Königsberg). These cities should be considered German-speaking in a qualified sense. Many were
characterized by linguistic diversity. Within German-speaking Europe High and Low German co-existed.
18
For a subset of several hundred books we have data on the number of copies printed per edition.
These data and evidence from book contracts and printer’s correspondence indicate that the typical
print run rose from 400-800 copies around 1500 to 1,000-1,400 copies in the later 1500s. We present this
evidence and further discussion of print runs in the appendix.
8
Figure 1: Cities Producing Books in German-Speaking Europe. Principality boundaries
are as of 1500. Cities are German-speaking printing centers from Reske (2007).
2000
Book Titles Printed
1500
1000
500
0
1475
1500
1525
Annual
1550
1575
1600
10 Year MA
Figure 2: Book and Pamphlet Titles Printed in German-Speaking Europe
9
.2
Share of Output
.15
.1
.05
0
1475
1500
Merchants' Manuals
1525
Science
1550
1575
Education
1600
Bibles
Figure 3: The Subject Matter of Print Media
speaking Europe. For each author, the Klaiber data provide a standardized name, a
list of name variants (alternate spellings and aliases), and a list of publications. For
Protestant authors we rely on Mullett (2010), Carey (2000), and Wikipedia’s list of over
900 Protestant Reformers.19 Based on these sources, known Catholic authors account for
2,937 titles (3% of books) and known Protestants account for 15,142 (15% of books).20
The research uses the text of long book and pamphlet titles to identify the language
most characteristic of Protestant and Catholic print media. Historical titles typically
provide an extended gloss on content. Table 1 summarizes the distribution of the word
lengths of these titles. The first column of Table 1 shows that the median title in our data
is 21 words long (the mean title is 23.2 words); at the 5th percentile titles are 3 words long;
at the 95th percentile titles are 51 words.21 To estimate the language characteristic of
religious authors we construct a vocabulary that excludes common “stop words” (words
such as and and the) and extremely rare words. The second column of Table 1 provides
19
See www.wikipedia.org/wiki/List of Protestant Reformers.
The print media dominance of Protestant reformers has been observed by social historians and in
earlier research based on small samples of historical print media. For instance, Edwards (1994, p. 29)
finds in a sample of vernacular (German-language) pamphlets that the ratio of works by Martin Luther
to works by Catholic publicists was approximately 5 to 1.
21
The median title has 153 characters (mean 171.6). For comparison, twitter messages are no longer
than 140 characters. A substantial literature employs statistical models to identify sentiment in twitter
messages. For example, Go, Bhayani, and Huang (2009); Pak and Paroubek (2010); Bollen, Mao, and
pepe (2011); Bifet and Frank (2010).
20
10
Table 1: Number of Words in Titles
Total Words
Words in Estimating
including Stop Words
Vocabulary
5th Percentile
3.0
2.0
25th Percentile
11.0
7.0
Median
21.0
13.0
75th Percentile
33.0
20.0
95th Percentile
51.0
32.0
Mean
23.2
14.4
summary statistics on titles restricting to the words in the estimating vocabulary.
To document the diffusion of religious content produced at the firm level we construct
annual panel data on the output of 1,577 printing firms operating in historic Germany
1450-1600. We identify the individual firms that produced each publication from the
printer information recorded on the front pages of historic publications. Information
identifying the printing firm is available on 99% of historic books and pamphlets.22
To identify the legal institutions of the Reformation we code over 2,000 reformation ordinances (kirchenordennung) passed at the municipal and provincial levels. These
ordinances were laws written and passed by local magistrates to institutionalize the reformation. The ordinances have five key dimensions embodied in provisions governing (1)
religious dogma, (2) public morality, (3) sex, family, and marriage, (4) social welfare and
poor relief, and (5) education (Witte 2002). A key innovation in the new legal framework
was provision for state-run education in the form of city and territorial schools, together
with a system of funding and oversight. The principal source for the data on reformation
ordinances is Sehling, Die evangelischen Kirchenordnungen des 16. Jahrhunderts (21
volumes 1902-1910).23 We supplement Sehling with historical sources described in the
appendix to identify the dates of ordinances in cities outside contemporary Germany.
To document the costs of the inter-city book trade we construct new data on the price
of books and trade routes. The price data are from the purchasing notebooks of Hernando
Colón (Christopher Columbus’ son).24 Over the period 1509-1540, Colón purchased
several thousand books in 42 different cities. Colón’s notebooks record the price paid
for individual books in the local currency and the current exchange rate (Martı́nez,
22
Printers are identified in multiple languages (e.g. Latinized and German variants of the same name),
with non-standard spelling, abbreviations, and in some instances aliases. We construct standardized
firm-level identifiers for each printer and book. For details see Dittmar (2012).
23
We refer to these volumes collectively as Sehling (1910).
24
The data in the notebooks are to my knowledge (1) our best and most comprehensive currently
available source of data on book prices in the 16th century and (2) have not previously been used in
economic research other than Dittmar (2012). The data are discussed in further detail in Dittmar (2012).
11
Asencio, Wagner 1993; Biblioteca de Huelva 2012). We extract data on book prices in
local historical currencies, the contemporaneous exchange rate, and book characteristics
from the Colon notebooks and the archive catalogue.25 We match these price data to
information on book characteristics. The characteristics include the city and date where
the book was purchased, the city and date where it was printed, the length in pages,
the physical size of the pages, the format (octavo, quarto, folio, etc.), and whether there
are illuminations. The catalogue also provides a subject classification we use to identify
book subject matter.26 We construct a digitized GIS network of historic land and river
trade routes as described below.
We rely on city population data from Bairoch, Batou, and Chèvre (1988) and data
from the Prussian Censuses of the 1800s (Becker, Cinnirella, Hornung, and Woessmann
2012). These and other data are described as introduced below and in the appendix.
4
4.1
Measuring Religious Ideas in Media
Overview
We construct an index of the religious content of print media as follows. We first use
historical sources to identify the religious denomination – Protestant or Catholic – of
a subset of 500+ authors. We then document which language (words and phrases) is
important in identifying religious denomination for authors with known religious affiliation. The information on partisan language is used to predict the religious content of
media where authors’ religious affiliations are not known and to construct an index of
content. In the economics literature, Gentzkow and Shapiro (2010) use a similar estimation strategy to identify the dimensions of political ideology in Congressional speeches
and to measure the political “slant” of US newspapers by documenting the extent to
which these media use language characteristic of Democrats or Republicans.27
The distinction between Catholic and Protestant authors is stark and revealing. We
argue that this distinction provides a fruitful starting point and powerful first model for
25
A typical example of how the notebooks record prices is as follows: “Este libro costó 8 negmit en
Anvers a 29 de julio de 1531 y el ducado de oro vale 320 negmit.” In my translation: “This book cost 8
negmit in Anvers [Antwerp] on July 29, 1531 when the gold ducat was worth 320 negmit.”
26
The classification includes bibles, jurisprudence, philosophy, literature, orations, poetry, theology,
medicine, languages, religion, and history and legislation.
27
Taddy (2012) extends the Gentzkow and Shapiro (henceforth “GS”) approach by introducing a
model-based framework to estimate such indices. Taddy shows that the GS index is a special case in the
Inverse Regression (IR) framework which seeks to reduce high-dimensional text data to low-dimensional
sufficient statistics.
12
thinking about media markets and religious change in historic Germany. However, the
distinction between Catholics and Protestants clearly does not exhaust the distinctions
one could draw among authors and literatures.28
Our estimation strategy has traction because historical book titles provide extensive
glosses on the content in books. To fix ideas and get a “feel” for our data examples are
useful. The overwhelming majority of books in our database were written in German and
Latin. Here we provide two examples of English-language books printed in 16th century
Germany that may be useful for English language readers. An example of a Protestant
title is a book written by Martin Luther and produced in Wesel (North Rhine-Westfalia):
The last wil and last Confeßion of martyn Luthers faith concerming the principal articles of religion which are in controversy, which he wil defend &
maiteine until his death, agaynst the pope and the gates of hell.
An example of a Catholic title is a book written by John Old and produced in Emden
(East Frisia):
A Confeßion of the most auncient and true christen catholike olde belefe
accordyng to the ordre of the .XII. Articles of our co[m]mon crede, set furthe
in Englishe to the glory of almightye God, and to the confirmacion of Christes
people in Christes catholike olde faith.
These examples illustrate the fact that historical titles convey significant information
about content. They also suggest potential challenges in analyzing historical text data in
which spellings are not yet standardized. The results described below suggest that these
difficulties do not preclude obtaining useful estimates.29
Below we document how our estimation strategy accurately predicts the religious
affiliation of known religious authors in out of sample tests. We also document that our
measures of the local production of religious ideas capture information not present in
the existing bibliographic meta-data – which simply designate book subjects as religious
28
Extensions could distinguish between Lutheran, Calvinist, and Zwinglian ideas in Protestant media,
between different types of Catholic authors, and between time varying features of religious language.
29
According to one ideal, we would want to identify spelling variants as the same word. However, it is
not clear that introducing standardizations or model features to capture orthographic variations might
improve the precision of our estimates. Because spelling conventions reflect both regional and ideological
influences, orthographic standardization could entail the loss of potentially valuable information. For
this reason, and because standardizing orthography is a difficult computational problem exacerbated by
the fact that we have nearly 70 different languages and language combinations in the texts, we analyze
texts “as is”.
13
or non-religious, without capturing the denominational nature of the content. Similarly,
we are able to estimate the religious content of print media in 136 cities while books
produced by our set of known religious authors appeared in only 104 cities.
4.2
Estimator
After we code a subset of the titles to document whether the author is a Protestant or a
Catholic, we use supervised learning techniques to extract low-dimensional features of the
text that indicate whether the author is likely Catholic or Protestant. We then project
these features onto texts for which the author’s religious affiliation is not known to get a
measure of similarity to the texts of known Protestants and Catholics. The estimation
strategy used is multinomial inverse regression (MNIR) introduced by Taddy (2012).
The idea of MNIR is to reduce the dimensionality of the documents to a summary
statistic while preserving meaningful sentiment and/or content information. The key
assumption for MNIR and in the text analysis literature more broadly is that the order
of the words or phrases is relatively unimportant in classifying the content of a text. Following the exchangeability assumption a text is treated as a “bag” of words or phrases
(Blei et al. 2005; Taddy 2012).30 The exchangeability assumption allows the econometrician to treat a document as a multinomial random variable in which the words or
phrases are the categories. Let Xi be a document. Any such document may be denoted
Xi = [xi1 , xi2 , . . . , xiW ] where xiw represents the number of times token w appears in
document i for each of the W words in a given vocabulary V . The vocabulary V is
analogous to the possible categories of a multinomial variable and must be chosen before estimation. We discuss how we construct our vocabulary below. In this setting,
we are interested in ascribing to these summary statistics a religious affiliation r, for
Protestant P and Catholic C. Assuming that Protestant and Catholic documents are
P
independent, we may collapse the counts for each document such that Xr = i Xi |ri = r
for r ∈ {Protestant, Catholic}. Following Taddy (2012), the basic MNIR model of each
document is given:
Xr ∼ M N (qr , mr ) where
(1)
exp [αw + rϕw ]
qrw = PW
for w = 1, . . . , W, r ∈ (P, C)
l=1 exp [αl + rϕl ]
where each Xr is a W −dimensional multinomial variable with size mr =
30
P
i
mi , mi =
One might choose to use the word token in place of word. The unit of analysis can be a word or a
phrase as we show below.
14
PW
xiw , and probabilities qr = [qr1 , . . . , qrW ] that are a linear function of r transformed
by the logistic link function. The sufficient reduction (SR) score for the token frequencies
xi
fi = m
is31 :
i
zi = ϕ0 → ri ⊥
⊥xi , mi |xi
(2)
w
Given the sufficient reduction projection of ϕ0 onto the frequencies, identifying the
relationship between documents and religious affiliation reduces to a separate univariate
logistic regression, called the forward regression:
P r(ri = r|zi ) =
1
1 + exp [β0 + β1 zi ]
(3)
Since both fat-tailed and sparse multinomials arise naturally in the context of text
data analysis, independent Laplace priors with unknown variance are placed on the individual regression coefficients ϕw . The unknown rate parameter λw accounts for our
uncertainty as to how much variable-specific regularization is appropriate. The rate
parameter is given a gamma hyperprior Γ(α, β) such that:
λw −λw |ϕw | β α α−1 −βλw
Pr(ϕw , λw ) =
e
λ e
, α, β, λw > 0.
2
Γ(α) w
(4)
Estimation of the likelihood implied by the multinomial distribution in (1) and the
prior (4) takes place via the gamma-lasso algorithm to maximize the joint posterior over
coefficients and their prior scale (Taddy 2012).32
4.3
Identifying Partisan Vocabulary
To prepare the titles for use in the classifier described above, it is necessary to identify
the words which serve as the support for the multinomial distribution. This support will
be refered to as the vocabulary.
We construct the vocabulary as follows. First all generic titles are removed.33 We
clean the remaining text by making the titles lower-case and removing punctuation,
numbers and roman numerals, words that occur fewer than four times, and German and
31
The conditions under which the sufficient reduction holds are detailed in Taddy
This algorithm outperforms similar algorithms in precision and computational
mented in Taddy (2012).
33
Titles that do not contain words illuminating what the works are about such as
spelling of the German teil meaning part of a whole, and Samml, an abbreviation
collection, are removed.
32
15
(2012).
efficiency as docuTheil, an alternate
of the German for
Table 2: Distribution of Known Catholic Title Lengths
Total Words
Words in Estimating
including Stop Words
Vocabulary
5th Percentile
6.0
4.0
25th Percentile
14.0
9.0
Median
23.0
15.0
75th Percentile
34.0
21.0
95th Percentile
50.0
33.0
Mean
25.1
16.0
Table 3: Distribution of Known Protestant Title Lengths
Total Words
Words in Estimating
including Stop Words
Vocabulary
5th Percentile
4.0
2.0
25th Percentile
10.0
6.0
Median
17.0
10.0
75th Percentile
27.0
15.0
95th Percentile
43.0
26.0
Mean
25.1
16.0
Latin stop words.34 The impact of removing stop words on the title length distributions can be seen in the second column of table (1). As can be seen in tables (2) and
(3) the sample-wide distribution is indicative of the sub-populations of Protestants and
Catholics.
We identify “Protestant phrases” and “Catholic phrases” as those phrases that are
used significantly more by one group than the other. To do this, we obtain counts of the
individual words, bigrams, and trigrams used in each unique cleaned title.35 We then
place the word and phrase counts into a contingency table and compare the incidence of
usage by the identified Protestants against that of the identified Catholics – normalizing
the marginal totals for overall phrase usage within the religious camps. We use the
Pearson’s χ2 statistic as in Gentzkow and Shapiro (2010). The χ2 measure is:
χ2wn =
(fwnc f∼wnp − fwnp f∼wnc )2
(fwnc + fwnp ) (fwnc + f∼wnc ) (fwnp + f∼wnp ) (f∼wnc + f∼wnp )
34
(5)
German stop words include words such as da and dass, der and die, and und and unde. These are
words for this, the, and and in both modern and Old High German.
35
Bigrams and trigrams are two- and three-word phrases, respectively. We do not stem words to
reduce them to a common root due to the lack of orthographic standardization demonstrated above.
As shown below, we obtain meaningful results without stemming. For the computation of counts used
to construct measures of denominational language, we temporarily drop titles that may be reprints and
would therefore skew the usage of certain words simply by the number of times a given title appeared
in print.
16
Table 4: Most Protestant Phrases
single words
translation
christi
Christ
sermon
sermon
christen
Christian
wider
against
gottes
God
two-word phrases
translation
jesu christi
Jesus Christ
martini lutheri
Martin Luter
theologiae doctore
Doctor of Theology
philippi melanthonis
Philip Melanthonis
doctore professore
Doctor and Professor
three-word phrases
translation
theologiae doctore professore
Professor and Doctor of Theology
praeside jacobo heerbrando
President Jacob Heerbrand
unsers jesu christi
of our Lord Jesus Christ
sacrosanctae theologiae doctore Doctor of Sacred Theology
unsers jhesu christi
of our Lord Jesus Christ
where fwnc and fwnp denotes the total number of times phrase w of length n is used
in a title by a Catholic or Protestant writer, respectively. Whereas, f∼wnc and f∼wnp
denotes the total number of times a length n phrase that is not w is used by Catholics
and Protestants, respectively. After obtaining the measure for each phrase for both
Protestants and Catholics, we sort each list of the n-length phrases for each group from
largest to smallest and select out the smaller of 2,000 or the number of total phrases of
length n from both groups to obtain our vocabulary. This gives 1,335 unigrams, 2,000
bigrams, and 686 trigrams for each religious group. Tables 4 and 5 list the top n-length
phrases for Catholics and Protestants.
When predicting, we collapse all of the titles for one author to obtain a single oeuvre.
We do this for two reasons. First, it yields a single predicted probability for each author,
Catholic or Protestant, rather than a prediction for each title. Second, it makes the
multinomial observations less sparse, improving accuracy of classification. After eliminating words that are not in the vocabulary and collapsing across authors, we have on
average 366 words per author in the training set in which we know the author’s religious
affiliation and 30 words on average for the data set used for prediction in which the
religious affiliation of the author is unknown ex ante.
17
Table 5: Most Catholic Phrases
single words
fidei
petri
georgii
adversus
missae
two-word phrases
sacrament altars
petri mosellani
martinum lutherum
sanctorum patrum
ursachen warumb
three-word phrases
hochwirdigen sacrament altars
rhetorica philippi melanchthonis
paedologia petri mosellani
laurentium surium carthusianum
petri mosellani protegensis
4.4
translation
faith
Peter
George
against
posted
translation
Sacrament altars
Petrus Mosellanus (Peter Schade)
Martin Luther
Holy Father
Causes why
translation
Blessed Sacrament of the Altars
In the Rhetoric of Philip Melanchthonis
Paedologia, by Petrus Mosellanus
Carthusium Laurentius Surius (Lorenz Sauer)
Petrus Mosellanus Protegensis
In and Out of Sample Tests
To assess the performance of the estimator we present within-sample predictions and the
document the estimator’s performance under different prior specifications on held-out
data from our training sample (i.e. the data coded with authors’ religious information).
Figure 4 presents in-sample estimates and shows that the estimator predicts correctly
in-sample approximately 90% of the time.36
To document the out-of-sample performance, we estimate the model 100 times using
different random samples containing 60% of the available training data, then predict over
the remaining data not used for estimation. In the process, we vary the shape parameter
α of the gamma hyperprior which controls the sparsity of the estimates. Increasing α
can reduce problems with overfitting. Figure 5 presents the results of this exercise and
shows that α = 1 has the best out-of-sample performance.37
36
Our estimates have a correlation of XYZ with true religious affiliation. For comparison, Gentzkow
and Shapiro (2010) use a similar strategy to predict party affiliation of members of Congress based on
the text of their speeches and obtain a correlation of 0.61 between true and predicted affiliation.
37
One of the attractive properties of MNIR is its relative speed compared to other estimators, which
can take orders of magnitude longer to converge. Wall times for the estimator are given in the Appendix.
18
1.0
Pr(Catholic==1)
0.8
0.6
0.4
0.2
0.0
Protestant
Catholic
1.0
60
0.8
50
Wall Time (seconds)
% Misclassified
Figure 4: In-sample performance for MNIR estimator over all known Protestant and
Catholic authors. The estimator correctly predicts 89% of the authors’ religious affiliations. Incorrect predictions are in red.
0.6
0.4
0.2
0.0
s = 0.01
s = 0.10
s = 1.00
Prior for shape
40
30
20
10
0
s = 10.00
s = 0.01
s = 0.10
s = 1.00
Prior for shape
s = 10.00
Figure 5: Out-of-sample performance and wall times under various specifications for
hyperprior shape parameter s. λj prior if Γ(s, .5). 100 runs of the model trained on 60%
of the data and predicted on the held-out 40%.
5
The Diffusion of Religious Ideas
In this section we first present our measures characterizing the diffusion of Protestant
content. We then document the characteristics of local media markets associated with
diffusion and variations induced by geography.
19
5.1
The Pattern of Diffusion
Our estimation strategy delivers a measure of religious content. In this section we first
present our results in graphical form. We aggregate book-level estimates to obtain summary measures of religious content at the firm, city, and regional level.
Figure 6 presents the estimated index of religious content in print media for all of
Germany 1475-1600. Figure 6 shows that the relative intensity of Catholic-type speech
was approximately stable in the run up to the Reformation, at which point there is a
discontinous shift towards Protestant-type speech. The first panel shows our estimates
for all religious books. The second panel examines just religious publications in German.
In German language media, there is a positive pre-1517 trend away from Catholic-type
speech. In German language media, there is also a relatively much larger increase in the
number of publications printed in the post-1517 period as indicated by the scale of the
annual markers. This evidence of a move away from Catholic-type media in the vernacular
before Luther’s 1517 intervention is consistent with the observation that Protestantism
was in part a response to underlying cultural trends.38 The third panel documents that
the discontinuous shift towards Protestant-type speech also characterizes religious media
published in Latin.
Religious Books in German
Religious Books in Latin
1
.75
.75
.75
.5
.25
0
1475
Religion Index
1
Religion Index
Religion Index
All Religious Books
1
.5
.25
1500
1525
1550
1575
1600
0
1475
.5
.25
1500
1525
1550
1575
1600
0
1475
1500
1525
1550
1575
Figure 6: Religion index in historic Germany. Protestant = 1. Catholic = 0. Markers
present the annual mean of the religion index across all publications. Marker sizes are
scaled to represent the relative number of publications in each year.
Figure 7 maps the evolution of media output at the city level. In Figure 7, city
markers are scaled to reflect the number of titles produced and shaded to reflect the
38
Thus Cameron (1991, p. 175) observes that the majority of first-generation reformers had been
influenced by Renaissance humanism before they became advocates of religious reform.
20
1600
average value of the Religion Index for city-level output. Cities with lighter markers
produce Catholic-type print media. Cities with darker markers produce Protestant-type
print media. Panel A presents data for the period up to 1517, when Martin Luther posted
his Theses. Panel B presents data for the period 1518-1539, at which point Duke Georg
of Saxony died. Panel C presents data for 1540-1555, the period between Duke Georg’s
death and the Peace of Augsburg. Panel D presents data from 1556-1600, the period
following the Peace of Augsburg.
Figure 8 compares predicted and observed Protestant output across 226 firms and 127
cities producing 1517-1600.39 Four key facts stand out. First, while there is considerable
variation, the majority of cities and firms were net producers of Protestant-type print
media. Second, there is a set of smaller firms that did not print any known religious
authors for which we find variation in predicted Protestantism These are firms for which
the observed Protestant Index is 0.5. Third, a number of firms with observed protestant
index P ∈ (0.5, 0.6) have predicted content P̂ < 0.5. Fourth, for many firms with
observed Protestant share P ∈ (0.7, 0.8), the predicted Protestant index exceeds what
is observed on the basis of known authors (i.e. P̂ > P ). These facts raise the question:
on average, does the estimator predict more or less Protestant media than is directly
observed on the basis of the circumscribed set of known religious authors?
Figure 9 shows that we systematically estimate more Protestant print media than is
directly observed based on coded authors. The ratio of predicted to observed Protestantism provides a parsimonious measure of the relative magnitude of predicted and
observed religious output. Figure 9 plots the distribution of this ratio at the city- and
firm-level. The fact that our estimation strategy captures otherwise unobserved Protestant output is evident in the fact that the distributions have their mass concentrated in
values greater than one.
5.2
Determinants of Diffusion
In this section we document the determinants of the diffusion of religious print media.
We first document city-level variations in media markets and geography associated with
Protestant media production post-1517. We then document the association between firm
characteristics prior to 1517 and the Protestant share of output post 1517.
39
The observed output shares are calculated by indexing Catholic authors with 0, Protestant authors
with 1, and authors with unknown religious affilation with 0.5.
21
Page 1 of 1
A: Media Output 1501-1517
Page 1 of 1
B: Media Output 1518-1539
Page 1 of 1
C: Media Output 1540-1555
file:///E:/Dropbox/share_maps_mean_period1.svg
Page 1 of 1
D: Media Output 1556-1600
file:///E:/Dropbox/share_maps_mean_period2.svg
2/21/2013
2/21/2013
Figure 7: City-level print media output. City markers are scaled to reflect the number
of book titles produced. Markers are shaded to reflect the average value of the Religion
Index for city-level output. Cities with lighter (whiter) cities produce Catholic-type print
media. Cities with darker (blacker) cities produce Protestant-type print media.
5.2.1
Cities
At the city level, historians and economists observe that Protestant ideas diffused from
Wittenberg (Pettegree 2010; Becker and Woessman 2009). Figure 10 plots predicted
religious content at the city level against distance from Luther’s base at Wittenberg.
Figure 10 is consistent with the diffusion argument in Becker and Woessman (2009),
file:///E:/Dropbox/share_maps_mean_period3.svg
file:///E:/Dropbox/share_maps_mean_period4.svg
2/21/2013
2/21/2013
which argues that Protestantism diffused from Wittenberg and exploits this distance as
an instrument for Protestantism.40
It is natural to wonder how non-geographic characteristics of local media markets
40
This argument is also supported by historical evidence on Wittenberg’s transformation from a minor
provincial town to a hub in the media network. See for instance Pettegree (2010).
22
Firm-Level
1
.75
.75
Predicted Protestant
Predicted Protestant
City-Level
1
.5
.5
.25
.25
0
0
0
.2
.4
.6
Observed Protestant Index
Predicted Protestant
.8
1
0
.2
Fitted values
.4
.6
Observed Protestant Index
Predicted Protestant
.8
1
Fitted values
Figure 8: Index of predicted Protestant print media versus index of observed Protestant
output. At left, city-level output. At right, firm-level output. Markers are scaled by
total output at the city or firm level. The index is constructed with Protestantism = 1
and Catholicism = 0. See text for details.
City-Level
Firm-Level
1.5
1
1
Density
Density
1.5
.5
.5
0
0
0
.5
1
1.5
Ratio of Predicted to Observed Protestant
2
0
kernel = epanechnikov, bandwidth = 0.1100
.5
1
1.5
Ratio of Predicted to Observed Protestant
2
kernel = epanechnikov, bandwidth = 0.0941
Figure 9: Ratio of predicted Protestant print media to index of observed Protestant
output. At left, city-level output. At right, firm-level output.
varied with religious ideas, and whether cities specialized in particular types of print
media were more receptive to religious innovation. A key dimension for variation in
local media markets concerned the extent to which they produced German-language
(as opposed to Latin) content before the Reformation. In general, Latin output served
scholarly and religious consumers who were arguably less receptive to religious innovation.
23
1
Magdeburg
Erfurt
Wittenberg
Predicted Religion
.8
Nuernberg
Augsburg
Frankfurt am Main
Leipzig
Strassburg
.6
Basel
.4
Koln
.2
0
0
100
200
300
400
500
Distance to Wittenberg in Km
600
700
Figure 10: Predicted Protestant print media at the city level. Marker scales reflect overall
number of titles produced in each city.
In addition, historians observe that the Protestant Reformation was characterized by an
increase in the use of German vernacular.
Figure 11 examines the data on Protestant print media and the intensity of vernacular
(German-language) printing before and after the Reformation. Figure 11 shows that
cities that became centers of Protestant printing post-1517 were places in which the
share of print media in German vernacular was high before 1517. This finding qualifies
an influential claim in the literature, namely that Protestantism in Germany was drove
growth in access to vernacular print media (Brady 1999, Becker and Woessman 2009,
Rubin 2010). As with our evidence on the pre-1517 trend in the Religion Index away
from Catholic-type content, this finding indicates that pre-Reformation characteristics
were associated with subsequent adoption of religious content.
While the characteristics of pre-1517 media markets were associated with subsequent
diffusion of Protestant ideas, Figure 12 shows that the relationship between distance
to Wittenberg and Protestantism is significant even conditional on pre-1517 vernacular
printing. Figure 12 plots residual Protestantism (1517-1600) controlling for the share
of pre-1517 books printed in German against distance from Wittenberg to document
that Protestant content was relatively high in cities close to Wittenberg and declining in
24
distance.
Protestant 1517-1600 vs. Pre-German
Protestant 1517-1600 vs. Post-German
1
1
Predicted Protestant 1517-1600
.6
Nuernberg
Basel
.4
Wittenberg
.8
Predicted Protestant 1517-1600
Wittenberg
.8
Koeln
.2
.6
Nuernberg
Basel
.4
Koeln
.2
0
0
0
.2
.4
.6
.8
Share German Language 1450-1517
1
0
.2
.4
.6
.8
Share German Language 1517-1600
1
-.4
Residual Protestantism
0
-.2
.2
.4
Figure 11: Share of books printed in German and post-1517 Protestantism.
0
100
200
300
400
500
Distance to Wittenberg in Km
600
700
Figure 12: Distance to Wittenberg and residual Protestant print media 1517-1600 conditional on share of pre-1517 printing in German. Marker scales reflect overall number
of titles produced in each city.
25
Table 13 shows that the intensity of Protestant print media 1517-1600 and city-level
pre-characteristics is predicted in the cross section by pre-1517 printing of vernacular
texts, the presence of a university and distance to Wittenberg. In all specifications we
also control for the number of monasteries and convents at the city level as well as
indicators capturing whether each city was an Imperial Free City and/or a member of
the Hanseatic league.41Protestant 1517-1600 on Covariates as of 1517 (Cross Section)
Variable
(1)
Books pre-1517
Ln Protestant
(2)
0.07
(0.05)
Books in German pre-1517
0.39**
(0.16)
University in 1517
1.65**
(0.80)
Distance to Wittenberg
Ln Protestant
(3)
0.02
(0.04)
0.59***
(0.13)
2.47***
(0.89)
Ln Protestant
(4)
0.05
(0.05)
0.50***
(0.14)
2.14***
(0.58)
-0.28**
(0.13)
Ln Protestant
(5)
0.03
(0.04)
0.55***
(0.12)
2.22***
(0.79)
-0.66***
(0.24)
Ln Distance to Wittenberg
Principality Fixed Effects
Observations
Yes
118
Ln Protestant
(6)
0.05
(0.04)
0.51***
(0.14)
1.86***
(0.53)
Ln Protestant
(7)
0.03
(0.04)
0.55***
(0.11)
1.76***
(0.58)
-0.66***
(0.16)
-1.24***
(0.24)
Yes
118
118
Yes
118
118
118
Figure 13: Cross sectional determinants of Protestant print media 1517-1600. Books are
measured in hundreds. Specifications with principality fixed effects cluster standard errors at the principality level. All specifications control for imperial free city and Hanseatic
League status and the number of monasteries and convents at the city level in each of
the Augustinian, Benedictine, Carmelite, Cistercian, Dominican, and the Brethren of the
Common Life religious orders.
Figure 14 exploits data from the pre-1517 period to estimate the difference-in-differences
impact of distance from Wittenberg on Protestant-type print media. 14 treats all texts
using the characteristic language of Protestantism as “like Protestant” to test whether
this language was more or less frequently observed closer to or farther from Wittenberg
before and after 1517.
5.2.2
Firms
Figure 15 shows that firms that were younger in 1517, and firms that were larger in
1517, produced more intensively Protestant-type print media 1518-1528. The dependent
variable for the regressions in Figure 15 is the firm-level religion index for output produced
41
We control separately for the number of monasteries and convents in each of the Augustinian,
Benedictine, Carmelite, Cistercian, Dominican, and the Brethren of the Common Life religious orders.
26
Protestant Difference in Differences
Variable
(1)
Ln Distance to Wittenberg
Ln Distance to Wittenberg x Post-1517
Post-1517
City Fixed Effects
Observations
Share Protestant
(2)
0.00
(0.01)
-0.06***
(0.02)
0.0%
0.55***
(0.11)
100
Share Protestant
(3)
.
.
-0.06**
(0.03)
0.0%
0.55***
(0.16)
Yes
100
Ln Protestant
(4)
-0.14
(0.22)
-0.61***
(0.19)
6.59***
(1.05)
94
Ln Protestant
(5)
.
.
-0.53**
(0.25)
6.22***
(1.31)
Yes
94
Figure 14: Distance to Wittenberg and Protestant-Type Media Before and After 1517.
The sample comprises the balanced panel of German-speaking cities with print media
published before 1450-1516 and 1517-1600. Standard errors clustered at the city level.
1518-1528.42 Columns (2) to (4) show the correlations between firm age and firm size, on
the one hand, and religious output, on the other, where the Religion Index is constructed
using only known authors. In columns (5) to (7) the dependent variable is the predicted
Religion Index estimated above, and the correlations are larger in magnitude but less
precisely estimated. Columns (3), (4), (6), and (7) introduce city fixed effects, so that
the identifying variation is across firms in a given city. The finding that firm age is
negative correlated with adoption of innovations in religious thought is consistent with
historians’ observations on the relationship between youth and adoption of innovations.43
42
Values close to 0 tell us that at the firm level, content is characterized by (most closely resembles)
the language of Catholic activitsts. Values close to 1 tell use that firm output uses the language of
Protestant Reformers.
43
Cameron (1991, p. 186) observes that Reformation ideas were typically rejected by the scholars and
thinkers who had taught the reformers, and that almost all academics in Wittenberg who were older than
Luther resisted the reformers’ innovations. Cole (1981, p. 141) similarly notes: “During the early years
of the sixteenth century, the readiness of many people to accept and use the new medium of printing
depended in part on one’s chronological age.”
27
Regression Variables
(1)
Firm Age at 1517
Ln Books Lagged
City FE
Firm Product Mix
Observations
Observed Religion Index
Protestant Protestant Protestant
Share
Share
Share
(2)
(3)
(4)
-0.004*
-0.006**
-0.006**
(0.002)
(0.002)
(0.002)
0.027*
0.038**
0.042
(0.014)
(0.015)
(0.025)
Yes
Yes
Yes
88
88
88
Predicted Religion Index
Protestant Protestant Protestant
Share
Share
Share
(5)
(6)
(7)
-0.005
-0.008*
-0.010
(0.003)
(0.004)
(0.008)
0.025*
0.058***
0.036
(0.014)
(0.020)
(0.065)
Yes
Yes
Yes
88
88
88
Figure 15: Firm-level regressions documenting correlates of Religion Index 1518-1528.
The dependent variable is the firm-level average of the Religion Index (1= Protestant,
0=Catholic). Analysis restricted to firms that produced in the 10 year period before
Luther posted his theses (i.e. 1507-1517) and in the 10 year period afterwards (15181528). Standard errors are clustered at the city level. “Firm Product Mix” controls for
the share of output 1507-1517 in each of 37 subject categories.
6
Outcomes
In this section we document how the diffusion of Protestant ideas was associated with
economic outcomes. Specifically, we document the diffusion of non-religious ideas in print
media, institutional change, and religious and economic outcomes observed in the 1800s.
6.1
Protestant Media and Institutional Change
In this section we present evidence on the print media and the laws of the Protestant
Reformation and document the variation in laws and print media induced by distance
to Wittenberg. Figure 16 shows the number of cities passing their first Reformation law
and the total number of Reformation laws passed each year.
Figure 17 exploits data from the pre-1517 period to estimate the cross-sectional precharacteristic determinants of the adoption of the legal institutions of the reformation.
Consistent with Becker and Woessman (2009), distance to Luther is a significant correlate
of adoption – even with polity fixed-effects. Other pre-characteristics have little if any
significant relationship to whether or not cities adopted the laws of the Reformation.
Finally, in Figure 18 we also show the relative intensity of Protestant-type language in
cities that did and did not adopt city-level Reformation laws. Particularly notable is the
fact that until the mid-1520s, and thus including the first years of the Reformation, the
cities that did and did not adopt the legal institutions of the Reformation were producing
28
Laws Passed Per Year
8
6
4
2
0
1500
1520
1540
1560
1580
1600
1620
1640
Cities Passing First Reformation Law
Total Reformation Laws (5 Year Moving Average)
Figure
16: 1517-1600
The city-level
lawsas of
the (Cross
Reformation.
Protestant
on Covariates
of 1517
Section)
Variable
(1)
Books pre-1517
Books in German pre-1517
University in 1517
Legal
Reform
(2)
0.03**
(0.01)
-0.00
(0.03)
-0.10
(0.16)
Legal
Reform
(3)
0.00
(0.01)
0.04**
(0.02)
0.05
(0.13)
Legal
Reform
(4)
0.02
(0.01)
0.01
(0.03)
-0.05
(0.13)
-0.15***
(0.03)
Distance to Wittenberg
Legal
Reform
(5)
0.01
(0.01)
0.04
(0.03)
0.02
(0.13)
-0.16**
(0.07)
Ln Distance to Wittenberg
Principality Fixed Effects
Observations
Yes
137
Legal
Reform
(6)
0.02*
(0.01)
0.01
(0.03)
-0.17
(0.16)
Legal
Reform
(7)
0.00
(0.01)
0.04*
(0.02)
-0.04
(0.12)
-0.27***
(0.07)
-0.17
(0.11)
Yes
137
137
Yes
137
137
137
Figure 17: City-level adoption of Reformation laws. Linear probability model with binary dependent variable capturing whether or not a given city adopted an evangelische
kirchenordnungen. Books are measured in hundreds. Specifications with principality
fixed effects cluster standard errors at the principality level. All specifications control
for imperial free city and Hanseatic League status and the number of monasteries and
convents at the city level in each of the Augustinian, Benedictine, Carmelite, Cistercian,
Dominican, and the Brethren of the Common Life religious orders.
29
relatively similar print media as measured by our religion index. From the 1520s, a gap
opens and stabilizes. Impressionistically, this is suggestive of a story in which relatively
modest differences in exposure were associated with sorting into different equilibria.
Religion Index (5 Year MA)
1
.8
.6
.4
.2
0
1475
1500
1525
1550
Cities with Laws
1575
1600
Cities without Laws
Figure 18: The relative intensity of Protestant-type language in religious media.
6.2
Protestant and Non-Religious Media
Within cities, exposure to Protestant media was associated with increased production
of the specific literatures historians and economists have identified as crucial to the acquisition of human capital and city growth in early modern Europe. In particular, we
show that increases in Protestant media were associated with increases in the production
of educational books designed to impart basic literacy and of the merchant’s manuals
that were the key texts in Renaissance business education and which Dittmar (2013)
were uniquely associated with city growth in early modern Europe. We also show that
increases in Protestant media were associated with increases in non-religious titles published in German vernacular. Historical evidence indicates that the increase in books in
the vernacular was critical to the emergence of a practical, literate, bourgeois lay culture
that supported relatively open access to ideas and innovation in business and industry
(Chrisman 1981).
30
Our findings both support and contrast with the central findings in the economics
literature relating religion, and in particular Protestantism, to economic outcomes. The
economics literature has documented that Protestantism is associated with economic
attitudes (Guiso et al. 2003), literacy (Becker and Woessman 2009), and political institutions (Woodberry 2010). This literature typically takes religion as effectively fixed or
sufficiently slow moving to identify variations in economic outcomes.44 In contrast, our
results characterize a setting in which the signal in religious media and religious beliefs
were changing dramatically. We show that during this period of relatively rapid religious
change, variations in the exposure to religious media within and across cities were associated with variations in the diffusion of the content historians have tied to economic
growth and development in early modern Europe.
In this section we present our findings on the incidence of Protestant media production
and the production of non-religious media. Motivated, by the historical literature on
economic development in early modern Europe, we present evidence for four key outcome
variables: the production of education titles, the production of legal texts on the theory
and study of jurisprudence, the production merchant manuals, and the production of all
non-religious titles published in German vernacular.45 In the data, our outcome variables
are all positive integer counts. We describe the estimator chosen for this setting and then
present the results of the estimation.
6.2.1
Estimation
The counts of types of media output across cities i over time t are modeled as Poisson
distributed random variables.46 The following exponential model specifies the conditional
mean of the Poisson model:
E(yit |xit ) = exp(x0it β)
Unobserved heterogeneity is modelled by specifying multiplicative individual effects:
E(yit |xit , αi ) = exp(x0it β + δi )
= λit αi
44
Gentzkow and Shapiro (2010) notably argue that religiosity is effectively exogenous with respect to
contemporary media markets in the contemporary USA.
45
In our data, ordinances themselves are classified separately from law books. The law books classification we employ comprises jurisprudence including legal texts books, handbooks, and commentaries,
but excludes edicts. In our data, printed copies of laws are classified as ordinances, together with edicts
and proclamations.
46
It is not uncommon to use a continuous approximation in place of the Poisson for applications in
which the counts take on large values. However, when the counts are close to and include zero, the
Poisson is better able to capture the qualities of the data.
31
Where we let:
λit = exp(x0it β)
and
αi = exp(δi )
Because unobserved individual effects αi may be correlated with observed city-level data
xit such that E(xit αi ) 6= 0, we choose to employ the standard fixed-effect Poisson model
of Hausman et al. (1984) (“HHG”) with an exponential conditional mean function and
multiplicative individual specific effects as described above.
The conditional probability of observing a count yit for a city i in a given period t
may be written
e−αi λit (αi λit )yit
P r(yit |xit , αi ) =
yit !
This leads to the following log likelihood function:
ln L = −αi
X
λit + ln αi
X
t
yit +
t
X
yit ln λit −
t
X
ln Γ(yit + 1)
t
where ln Γ is the log gamma function.
Following HHG, start from the assumptions that yit is conditionally Poisson distributed and that yit and yis are independent conditional on xit and αi . This implies
strict exogeneity:
E[yit |xi1 , . . . , xiT , αi ] = αi exp(x0it β)
HHG develop a conditional maximum likelihood estimator using the approach of Andersen (1970) to eliminate αi from the likelihood, conditioning on its sufficient statistic
P
t = yit in order to obtain a consistent estimator for β.
It has since been shown that the Poisson estimator does not suffer from the incidental
parameters problem so that the HHG estimator is equivalent to a maximum likelihood
estimator with individual specific effects. This estimator can also be obtained under
a single first moment condition by replacing the fixed effects with a ratio of within
group means (Blundell, Griffith, and Windmeijer, 2002; Cameron and Trivedi, 1998;
Wooldridge, 1999) such that
XX
i
xit (yit − λit
t
ȳi
)=0
λ̄i
These results imply that the only requirement for the consistency of our estimator is that
of strict exogeneity. This alone is enough to ensure a robust, consistent estimator of β
32
(Wooldridge 1999). We do not need any assumptions about the distribution or about
temporal dependence of yit . We only require that the conditional mean be correctly
specified. This means that our estimator is robust to departures from the Poisson distribution, including its requirement that there is no underdispersion or overdispersion. If
the distributional and temporal independence assumptions do not hold, it is necessary
to use a robust estimator for the variance of β. All results contained in the next section
use the robust covariance estimate of Wooldridge (1999).
In addition to the conditional likelihood model for panel data, we use as a baseline
model a pooled Poisson quasi-maximum likelihood estimator (QMLE). We do this for the
following reasons. First, the QMLE estimator is also consistent under the assumption
that the conditional first moment is correctly specified. This follows from Gourieroux et
al.’s (1984) result on robustness of QMLE estimators in the linear exponential family.
Second, it does not require an assumption of strict exogeneity. Third, there is no restriction on the time dependence of the observations. Finally, we can include time-invariant
regressors and regressors with a small variance relative to that of yit in xit – which allows
us additional insight into what drove the locational effects characterizing the diffusion of
ideas in print in the 1500s.
6.2.2
Results
Here we present the results of our specifications for the pooled Poisson without city fixed
effects and the fixed effects Poisson estimators. Two models are fit for each dependent
variable within each specification—one without time dummies and one with time fixed
effects, where time is measured in five year periods starting in 1517.47 In each case we
prefer the one with time dummies on the basis a Wald test on the joint hypothesis that
the dummy variable coefficients are zero. Since the conditional mean of the Poisson
is an exponential, the coefficient on the log transformed variables can be interpreted
as elasticities. Standard errors are reported in parantheses. All standard errors are
corrected for clustering at the city-level and are robust to serial correlation following
Wooldridge (1999). The results documenting the relationship between Protestant media
and the production of educational and non-religious vernacular media are robust over
the specifications.
The results in Table 6 are for the baseline pooled Poisson model without city fixed
effects. Both Catholic and Protestant media are associated with a positive increase in
the production of education works. The coefficients on both are not statistically different
47
We collapse our annual-level data to 5 year periods to reduce sparseness.
33
Table 6: Baseline Poisson Model
lnprot
lncath
time dummies
N
χ2lnprot=lncath
ρ
educ
0.58** 0.63**
(0.07) (0.08)
law
0.45** 0.50**
(0.06) (0.07)
merchant
0.92** 0.95**
(0.09) (0.09)
vernac
0.76** 0.78**
(0.11) (0.11)
0.61**
(0.08)
no
2176
0.06
0.21
0.84**
(0.04)
no
2176
17.05
0.24
0.13
(0.12)
no
2176
17.89
0.19
0.14
(0.10)
no
2176
9.51
0.55
0.63**
(0.07)
yes
2176
0.00
0.20
0.81**
(0.04)
yes
2176
10.37
0.22
0.19
(0.11)
yes
2176
19.30
0.18
0.16*
(0.09)
yes
2176
10.27
0.58
Robust standard errors in parentheses
* p < 0.10, ** p < 0.05, ** p < 0.01
from one another as indicated by the χ2 statistic resulting from Wald test that coefficients
on lnprot and lncath are equal. This test statistic is distributed χ21 with a critical value
of 3.84 at the 95% confidence level. In the case of law books, Catholic media is associated
with a bigger effect than Protestant though both are statistically significant. The presence
of Protestant works is strongly associated with an increase in the number of merchant
manuals produced. A doubling of Protestant works leads to nearly a doubling of merchant
manuals. Catholic works do not have a statistically significant effect here. The same can
be said for the production of all non-religious works written in the vernacular German.
The production of Protestant works is overwhelming associated with a higher count of
these vernacular works.
Table 7 presents results from a pooled Poisson specification that includes covariates
that account for city-level institutional factors that could plausibly affect the production
of other media and might otherwise be conflated with the effects of Catholicism and
Protestant media. The institutions we control for are the presence of a university in the
city and the enactment of the first of Protestant reform laws, which marked the first
official approval of the Reformation and thus inaugurated the “decisive, final step in the
Protestant movement” (Ozment 1975, p. 125). These are two absorbing state variables
that become and stay equal to one for a given city when that city obtains the relevant
institutions.
The point estimates with the addition of these two variables are broadly similar to
our baseline results. However, there are some notable differences. For education works,
we still cannot statistically discern the effects due to Protestant and Catholic media. It
is evident, however, that the enactment of the Protestant reform law is associated with
an environment that leads to more education books being published. In fact, this effect
34
Table 7: Pooled Poisson Model
educ
0.51** 0.55**
(0.06) (0.06)
law
0.39** 0.45**
(0.07) (0.06)
merchant
0.73** 0.74**
(0.10) (0.10)
vernac
0.66** 0.70**
(0.10) (0.09)
lncath
0.67**
(0.06)
0.67**
(0.06)
0.73**
(0.07)
0.73**
(0.06)
0.26**
(0.08)
0.30**
(0.09)
0.32**
(0.08)
0.34**
(0.08)
reform law
0.48**
(0.13)
0.49**
(0.12)
0.37*
(0.19)
0.15
(0.15)
1.61**
(0.34)
1.56**
(0.35)
0.66**
(0.21)
0.56**
(0.22)
university
0.01
(0.17)
no
2176
2.16
0.18
0.05
(0.19)
yes
2176
1.29
0.17
0.92**
(0.24)
no
2176
7.04
0.22
0.61**
(0.22)
yes
2176
7.73
0.20
0.16
(0.38)
no
2176
7.84
0.14
0.12
(0.40)
yes
2176
6.21
0.14
-0.51
(0.34)
no
2176
4.35
0.48
-0.59
(0.37)
yes
2176
5.10
0.49
lnprot
time dummies
N
χ2lnprot=lncath
ρ
Robust standard errors in parentheses
* p < 0.10, ** p < 0.05, ** p < 0.01
is present across all outcome variables. The only exception being the specification for
law books with time fixed effects included. The production of legal texts is strongly
associated with the presence of a university in a city, consistent with the fact that universities housed law faculties and produced jurists. Legal output is strongly correlated
with Catholic media production as most universities in 16th century Germany were and
remained predominantly Catholic.48
The row labeled ρ in Tables 6 and 7 reports the mean of the unique off-diagonal
estimates of serial correlation developed in HHG. If the correct specification is one in
which city-level effects are indeed important, then the residuals will exhibit serial correlation. To compute ρ, we take the generalized (Pearson) residuals of the pooled model,
p
it = (yit − λPit )/ (λit ) and obtain the correlation coefficient between it and is , esti mated as √P i2 it√isP 2 . Under the null of no individual effects, we expect the correlation
i it
is
coefficient to be zero for t 6= s. These estimates are all positive, indicating that some
small degree of serial correlation is likely present. However, if the serial correlation is due
to unchanging unobserved city-level effects αi , we would expect that ρ is fairly constant
(t − s). This is a slighty tougher case to make. The standard deviation of the correlations
is about 0.2 for each dependent variable.
Table 8 contains the results for the specification with city fixed effects. The point
48
While the universities at Wittenberg and Basel became centers of reformist thought, the universities
at Köln and Louvain developed as centers of Catholic counter-reformation thinking.
35
Table 8: Fixed effects Poisson model
educ
0.32** 0.44**
(0.07) (0.08)
law
0.40** 0.42**
(0.07) (0.09)
merchant
0.58** 0.62**
(0.11) (0.10)
vernac
0.41** 0.49**
(0.06) (0.06)
lncath
0.24**
(0.06)
0.26**
(0.06)
0.74**
(0.09)
0.60**
(0.07)
-0.00
(0.09)
0.07
(0.12)
0.08**
(0.04)
0.11**
(0.03)
reform law
-0.00
(0.17)
no
1232
0.63
158.93
0.08
(0.21)
yes
1232
2.41
49.90
1.40**
(0.37)
no
1200
6.69
84.55
0.53
(0.36)
yes
1200
1.85
67.38
1.09**
(0.43)
no
512
11.67
118.66
1.13**
(0.49)
yes
512
11.24
23.24
0.26*
(0.14)
no
1712
14.36
92.76
-0.19
(0.15)
yes
1712
29.39
105.68
lnprot
time dummies
N
χ2lnprot=lncath
q
Robust standard errors in parentheses
* p < 0.10, ** p < 0.05, ** p < 0.01
estimates are stable with the exception of the education specification. Here, the interpretation is reversed with higher Protestant media output associated with the publication
of more educational books. However, there is still not enough evidence to say that the
elasticities of educational print media with respect to Protestant and Catholic media
are statistically different from one another. It should be pointed out that we now have
varying sample sizes for each model. This is due to the fact that the fixed effects Poisson
P
drops cases for which i yit = 0 as there is no sufficient statistic for αi in this case.
To help determine the preferred model specification, we check for serial correlation.
Since the conditional MLE does not provide consistent individual effects αi , we cannot
easily compute residuals yit − αi λit . HHG (1984) develop a test of serial correlation for
P
the fixed effects Poisson model. Based on the fact that yi1 , . . . , yiT | t yit is multinimialdistributed with probability pit = Pλitλis , they introduce a conditional moments test based
s
on the usual moment conditions of a multinomial-distributed random variable. Here the
test statistic is q ∼ χ2105 with critical value 129.92 at a confidence-level of 95%. The only
specification that exhibits significant serial correlation is that of the educational books in
the case without time dummies. However, with the addition of time dummies this ceases
to be the case.
Finally, we consider the elasticity of media production with respect to distance to
Wittenberg over the period before and after the Reformation. We do this by estimating our baseline Poisson model with interactions that capture the effect of distance to
Wittenberg on media production in each five year time period. Figure 19 documents
that distance to Wittenberg was not associated with any significant difference in book
36
production pre-1517 and was systematically post-1517. It also shows that places farther
from Wittenberg produced systematically less educational books from the mid-1500s onwards. These facts are documented in Figure 19 by plotting the estimated elasticity of
print media production with respect to distance to Wittenberg as it evolves over time.
Educational Books
Vernacular Books
2.5
2
2
2
1.5
1.5
1.5
1
.5
0
-.5
-1
Distance to Wittenberg Elasticity
2.5
Distance to Wittenberg Elasticity
Distance to Wittenberg Elasticity
Total Books
2.5
1
.5
0
-.5
-1
-1.5
1520
1540
1560
1580
1600
.5
0
-.5
-1
-1.5
1500
1
-1.5
1500
1520
1540
1560
1580
1600
1500
1520
1540
1560
1580
Figure 19: Elasticity of print media with respect to distance to Wittenberg.
6.3
Institutions, Media, and Long-Run Outcomes
This section documents the relationship between educational outcomes in the 1800s and
exposure to the legal institutions of the Protestant Reformation. The key new fact we
present is that populations in counties where Reformation ordinances were passed in
the 1500s were more literate in the 1800s. This holds conditional on variations in the
religious composition of the population, which is not a significant correlate of literacy
once we control for legal institutions. We also show that while variations in both religious
composition and literacy in the 1800s are explained by exposure to the institutions of
the Reformation, they are not explained by our measure exposure to the message of
the Reformation in historical print media. These findings provide new evidence on the
channels through which the Reformation may have impacted development, and suggest
an institutional underpinning for the human capital impacts of religious change.49
49
An extensive literature ties religion to economic behavior and outcomes (e.g. Barro and McCleary
2005). A number of recent economic analyses examine Protestantism and economic outcomes in Germanspeaking Europe. Davide Cantoni (2009) finds that Protestant cities enjoyed no advantage in population
growth in the Holy Roman Empire. Becker and Woessman (2009) find that Protestantism was associated
with higher levels of literacy in 19th century Prussia and argue that Protestantism operated as a demand
shift for human capital that subsequently facilitated the adoption of new technology and economic
growth. Basten and Betz (2013) find exposure to Calvinist Protestantism reduces preferences for leisure,
redistribution, and government intervention along the border between Calvinist Protestant and Catholic
regions of Switzerland.
37
1600
Our findings suggest an institutional interpretation of the key fact that Protestantism
was associated with higher literacy in the 1800s across Prussian (German) counties.
Becker and Woessman (2009) propose a causal interpretation of the observed relationship
between Protestantism and literacy, using distance to Wittenberg as a source exogenous
variations in Protestantism. Our evidence qualifies the relationship between religioun
and education observed in the 1800s. We document that specific historical institutions
established by the Reformation explain variations in literacy observed in the 1800s, while
exposure to Reformation ideas in the 1500s and religious denomination in the 1800s itself
do not.
We measure exposure to religious laws as follows. For all city or municipal level
Reformation ordinances in Sehling (1910) we identify the subsequent Prussian county
locations of the city or municipality.50 This identifies a set of counties with urban or
city-level Reformation ordinances. For all territorial Reformation ordinances in Sehling
(1910) we proceed in two steps. We identify and geocode the extent of each polity
(principality) with a territorial law. We then overlay the map of Prussian counties from
the 1800s on the historic map of territorial ordinances to determine which counties were
treated by which ordinances. In cases, where the territory of an ordinance covers part
but not all of a particular county, we assign treatment on a proportional basis. For
example, we observe a territorial ordinance in Electoral Saxony. The 1849 boundaries of
the county (kreis) of Sagan show that 72% of the county is inside Electoral Saxony, so we
assign a treatment value of 0.72 to Sagan for any ordinance passed in Electoral Saxony.
We map city-level print media exposure to Prussian counties in the 1800s in the same
manner.
Table 20 documents the positive relationship between the legal institutions of the
reformation and county-level literacy data from 1871 (Becker, Woessman, and Hornung
2006).51 Column (2) shows the baseline correlation between Protestantism and literacy
across 334 Prussian counties. Column (3) introduces our indicator for city-level laws
institutionalizing the Reformation. In column (3) we show that when we also include our
measure of territory-level exposure to the legal institutions of the Reformation, Protestantism in the 1800s ceases to be a significant predictor of literacy in the 1800s. Column
(5) shows that exposure to Protestant print media in the 1500s does not positively predict literacy in the 1800s. Column (6) shows that non-religious, German language print
50
For cities that produced print media 1450-1600, we similarly associated them with the counties they
were subsequently located in.
51
We use the set of counties with consistent data available in all periods. Thus we exclude counties
that appear in the data in the 1870s due to Prussian expansion. Following Becker and Woessman (2006),
we use religion as of 1811 in order to take an observation on religion before the industrial revolution of
the 1800s took off.
38
media of the 1500s does predict literacy in the 1800s. Finally, column (7) shows that the
relationship between historic exposure to German-language non-religious media and subsequent literacy is much larger and is only statistically significant in the post-reformation
(1517-1600) era. Literacy, Religion, Institutions, Print Media -- Cross Section Prussian Counties
Variable
(1)
Share Protestant 1811
Literacy
in 1871
(2)
0.06***
(0.02)
0.0%
0.0%
Literacy
in 1871
(3)
0.05***
(0.02)
0.0%
0.07***
(0.02)
0.0%
0.0%
0.0%
Literacy
in 1871
(4)
0.03
(0.02)
0.0%
0.06***
(0.01)
0.0%
0.05***
(0.02)
0.0%
0.0%
0.0%
0.0%
Literacy
in 1871
(5)
0.03
(0.02)
0.0%
0.06***
(0.02)
0.0%
0.05***
(0.02)
0.0%
0.01
(0.01)
0.0%
0.0%
0.0%
0.0%
0.0%
Literacy
in 1871
(6)
0.03
(0.02)
0.0%
0.05***
(0.02)
0.0%
0.05***
(0.02)
0.0%
-0.01**
(0.00)
0.0%
0.17***
(0.05)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
334
0.0%
334
0.0%
334
0.0%
334
0.0%
334
City Reformation Law
Territorial Reformation Law
Protestant Books 1517-1600
Non-Religious German Books 1454-1600
Non-Religious German Books 1454-1516
Non-Religious German Books 1517-1600
Observations
Literacy
in 1871
(7)
0.03
(0.02)
0.0%
0.05***
(0.02)
0.0%
0.05***
(0.02)
0.0%
-0.01*
(0.00)
0.0%
0.0%
0.11
(0.53)
0.0%
0.17**
(0.08)
0.0%
334
Figure 20: Literacy across Prussian counties in 1871.
Religion, Distance to Wittenberg, Institutions, Print Media -- Cross Section Prussian Counties
% Protestant % Protestant % Protestant % Protestant % Protestant % Protestant
in 1811
in 1811
in 1811
in 1811
in 1811
in 1811
(2)
(3)
(4)
(5)
(6)
(7)
-0.79***
-0.75***
-0.12
-0.12
-0.13
-0.11
(0.15)
(0.15)
(0.19)
(0.19)
(0.19)
(0.19)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
City Reformation Law
0.18***
0.15***
0.19***
0.20***
0.18***
(0.06)
(0.06)
(0.05)
(0.05)
(0.05)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
Territorial Reformation Law
0.39***
0.39***
0.39***
0.39***
(0.06)
(0.06)
(0.06)
(0.06)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
Protestant Books 1517-1600
-0.04
-0.02
-0.05***
(0.03)
(0.03)
(0.01)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
Non-Religious German Books 1454-1600
-0.22
(0.31)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
Non-Religious German Books 1454-1516
-10.28***
(1.29)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
Non-Religious German Books 1517-1600
0.76***
(0.18)
0.0%
0.0%
0.0%
0.0%
0.0%
0.0%
Observations
334
334
334
334
334
334
Variable
(1)
Distance to Wittenberg
Figure 21: The share of Protestants across Prussian counties in 1811.
Our interpretation of these facts is that the religious denomination of citizens recorded
by census enumerators in the 1800s tells us much less about variations in literacy than the
long effects of laws that formalized the institutions of the reformation.52 Our finding is
52
The census classes people as Protestant, Catholic, Jewish, or “other christian religion.” See iPEHD
(2013).
39
thus consistent with the argument in Becker and Woessman (2009) that “Protestantism”
was associated with variations in educational attainment.
Because distance from Wittenberg provides a source of plausibly exogenous variation in exposure to Protestantism, we examine the importance of distance in predicting
Protestantism in the early 1800s. Table 21 shows that distance to Wittenberg is a highly
significant predictor of variations in county-level Protestantism – before we control for
exposure to city and territorial laws. Columns (4) to (7) show that once we control for
exposure to Reformation ordinances, distance to Wittenberg ceases to be a significant
predictor of variations in religion at the county level. This finding does not mean that
distance to Wittenberg only impacted subsequent religion through the institutional channel. But it provides prima facie evidence consistent with narrative evidence documenting
the importance of Protestant educational institutions. A caveat is in order on these results: We are currently exploring the spatial structure of the data to determine whether
there is robust evidence of spatial spillovers. The current estimates abstract from spatial
questions except insofar as distance from Wittenberg is concerned.
7
Conclusion
The print media revolution dramatically lowered the cost of transmitting ideas. The
Protestant Reformation was the first mass movement to exploit the new media. Previous
research has not documented the diffusion of Protestantism in quantitative terms.
We use new data on the universe of known books and pamphlets printed in Germany
to document the diffusion of Protestant ideas across time and space. We find that the
diffusion of Protestant ideas in the 1500s was associated with increased access to educational and non-religious vernacular in the 1500s, institutional change, and ultimately
higher rates of literacy by the 1800s.
References
[1] Erling Bernhard Andersen. Asymptotic properties of conditional maximumlikelihood estimators. Journal of the Royal Statistical Society. Series B (Methodological), pages 283–301, 1970.
[2] Christoph Basten and Frank Betz. Beyond work ethic: Religion, individual, and
political preferences. American Economic Journal: Economic Policy, 5(3):67–91,
40
2013.
[3] Sascha O. Becker, Francesco Cinnirella, Erik Hornung, and Ludger Woessmann.
ipehd - the ifo prussian economic history database, cesifo working paper no. 3904.
August 2012.
[4] S.O. Becker, E. Hornung, and L. Woessmann. Education and catch-up in the industrial revolution. American Economic Journal: Macroeconomics, 3(3):92–126, 2011.
[5] S.O. Becker and L. Woessmann. Was weber wrong? a human capital theory of
protestant economic history. The Quarterly Journal of Economics, 124(2):531–596,
2009.
[6] S.O. Becker and L. Woessmann. The effect of protestantism on education before the
industrialization: Evidence from 1816 prussia. Economics Letters, 107(2):224–228,
2010.
[7] Albert Bifet and Eibe Frank. Sentiment knowledge discovery in twitter streaming
data. In Discovery Science, pages 1–15. Springer, 2010.
[8] Richard Blundell, Rachel Griffith, and Frank Windmeijer. Individual effects and
dynamics in count data models. Journal of Econometrics, 108(1):113–131, 2002.
[9] Johan Bollen, Huina Mao, and Alberto Pepe. Modeling public mood and emotion:
Twitter sentiment and socio-economic phenomena. In ICWSM, 2011.
[10] A Colin Cameron and Pravin Trivedi. Regression analysis of count data, volume 53.
Cambridge University Press, 2013.
[11] E. Cameron. The European Reformation. Clarendon Press, 1991.
[12] Davide Cantoni. The economic effects of the protestant reformation: Testing the
weber hypothesis in the german lands. 2010.
[13] Davide Cantoni. Adopting a new religion: The case of protestantism in 16th century
germany. Economic Journal, 122(560):502–531, 2012.
[14] P.W. Carey and J.T. Lienhard. Biographical Dictionary of Christian Theologians.
ABC-Clio ebook. Greenwood Press, 2000.
[15] R.A. Crofts. Printing, reform, and the catholic reformation in germany (1521-1545).
The Sixteenth Century Journal, 16(3):369–381, 1985.
[16] J. De Vries. European Urbanization: 1500-1800. Methuen, 1984.
41
[17] J. Dittmar. New media, firms, ideas, and growth: European cities after gutenberg.
[18] J.E. Dittmar. Information technology and economic change: the impact of the
printing press. The Quarterly Journal of Economics, 126(3):1133–1172, 2011.
[19] Jr. Edwards, Mark U. Lutheran pedagogy in reformation germany. History of
Education Quarterly, 21(4):pp. 471–477, 1981.
[20] M.U. Edwards Jr. Catholic controversial literature, 1518-1555: Some statistics.
Archiv für Reformationsgeschichte, 79, 1988.
[21] Robert B Ekelund Jr, Robert F Hébert, and Robert D Tollison. An economic analysis
of the protestant reformation. Journal of Political Economy, 110(3):646–671, 2002.
[22] Richard Gawthrop and Gerald Strauss. Protestantism and literacy in early modern
germany. Past & Present, (104):pp. 31–55, 1984.
[23] Matthew Gentzkow and Jesse M. Shapiro. What drives media slant? evidence from
u.s. daily newspapers. Econometrica, 78(1):35–71, 2010.
[24] Alec Go, Richa Bhayani, and Lei Huang. Twitter sentiment classification using
distant supervision. CS224N Project Report, Stanford, pages 1–12, 2009.
[25] Christian Gourieroux, Alain Monfort, and Alain Trognon. Pseudo maximum likelihood methods: applications to poisson models. Econometrica: Journal of the
Econometric Society, pages 701–720, 1984.
[26] Christian Gourieroux, Alain Monfort, and Alain Trognon. Pseudo maximum likelihood methods: Theory. Econometrica: Journal of the Econometric Society, pages
681–700, 1984.
[27] M. Greengrass. The Longman companion to the European Reformation, c. 15001618. Longman companions to history. Longman, 1998.
[28] Luigi Guiso, Paola Sapienza, and Luigi Zingales. People’s opium? religion and
economic attitudes. Journal of monetary economics, 50(1):225–282, 2003.
[29] Jerry Hausman, Bronwyn H Hall, and Zvi Griliches. Econometric models for count
data with an application to the patents-r & d relationship. Econometrica, 52(4):909–
938, 1984.
[30] W. Klaiber. Katholische kontroverstheologen und reformer des 16. Jahrhunderts
(Reformationsgeschichtliche Studien und Texte 116), 136, 1978.
42
[31] H.J. KÖhler. Flugschriften als Massenmedium der Reformationszeit: Beitrage zum
Tubinger Symposion 1980. Klett-Cotta, 1981.
[32] Jimmy Lin and Alek Kolcz. Large-scale machine learning at twitter. In Proceedings
of the 2012 ACM SIGMOD International Conference on Management of Data, pages
793–804. ACM, 2012.
[33] J. Mark U. Edwards. Printing, Propaganda And Martin Luther. Augsburg Fortress
Pub, 2004.
[34] Rachel M McCleary and Robert J Barro. Religion and economy. The Journal of
Economic Perspectives, 20(2):49–72, 2006.
[35] M. Mullett. Historical Dictionary of the Reformation and Counter-Reformation.
Historical Dictionaries of Religions, Philosophies, and Movements Series. Scarecrow
Press, 2010.
[36] Steven E Ozmint. The Reformation in the Cities: The Appael of Protestantism to
Sixteenth Century Germany and Switzerland, volume 357. Yale University Press,
1975.
[37] Alexander Pak and Patrick Paroubek. Twitter as a corpus for sentiment analysis
and opinion mining. In LREC, 2010.
[38] Marco Pennacchiotti and Ana-Maria Popescu. A machine learning approach to
twitter user classification. In ICWSM, 2011.
[39] A. Pettegree. Reformation and the Culture of Persuasion. Cambridge University
Press, 2005.
[40] A. Pettegree. The Book in the Renaissance. Yale University Press, 2010.
[41] Jared Rubin. Printing and protestants: reforming the economics of the reformation.
Available at SSRN 1742523, 2011.
[42] R.W. Scribner. Popular Culture and Popular Movements in Reformation Germany.
History series. Hambledon Press, 1987.
[43] M. Taddy. Multinomial inverse regression for text analysis. Journal of the American
Statistical Association, (just-accepted), 2012.
[44] M.A. Taddy. Inverse regression for analysis of sentiment in text. arXiv preprint
arXiv:1012.2098, 2010.
43
[45] J. Witte and M.E. Marty. Law and Protestantism: The Legal Teachings of the
Lutheran Reformation. Cambridge University Press, 2002.
[46] Robert D Woodberry. Religion and the spread of human capital and political institutions christian missions as a quasi-natural experiment, 2011.
[47] Jeffrey M Wooldridge. Distribution-free estimation of some nonlinear panel data
models. Journal of Econometrics, 90(1):77–97, 1999.
44