Evgeniya Goryacheva

Interconnections of the Industries in the
Economy: a Network Approach
Evgeniya Goryacheva a,∗
a Economics Discipline Group, Business School, University of Technology Sydney,
Australia
Abstract
This paper uses a network approach to investigate the interrelation between industries in the economy. I consider the economy as a network of
the industries connected with each other through the product flows. I propose dynamic directed weighted network formation model. According to
the model industry’s position in a network and productivity level affect its
probability of a new link adoption. I apply a empirical methodology to
the data obtained from the input-output tables in order to identify the key
factors affecting the changes of the technological network.
Keywords: Network; Input-Output Analysis; Intersectoral Linkages.
∗
E-mail:
[email protected].
1
1
Introduction
I use a network approach to describe U.S. economy. I build a network using U.S.
input-output tables. Observing the U.S. input-output tables in different years, we
can conclude that this network is not stable. The product flows and inputs’ shares
are changing over time. New links are created between industries, some old links
disappear, and the weights of the links are also changing over time. The main
research question of this paper is how the industries’ network changes over time.
Which factors affect the network formation process?
I propose the rules of the industries’ network evolution that have economic
interpretation and test their validity empirically. One of these rules is that industries’ positions in a network are closely related to economies of scale. Industries
with high centrality have many consumers. They have cost advantages compared
to other industries, because their fixed costs are spread out over more product
flows. Lower costs lead to the lower prices of the outputs, this attracts new consumers and also increases existing product flows. As a result, these industries
become even more influential over time.
Obtained empirical results support this idea. The U.S. panel data estimation shows that the industry’s eigenvector centrality affects its probability of a new
link creation and its share in intermediate inputs of other connected industries.
I propose the hypothesis that the industry’s productivity influences the
industry’s position in the network. The network evolves in the following way. More
productive industries have low costs and as a result low prices of their outputs.
Many other industries adopt these industries as suppliers for this reason. At the
same time industries increase the product flows from the productive industries
because of the low costs. This makes productive industries more influential over
time. I test this hypothesis using empirical data and obtained results show that
more productive industries become more central over time.
The growth of the industry’s productivity also increases the industry’s
indegree in the period between 1972 and 1992. Because of the competition, industries try to improve their products. As a result innovations lead to the changes in
their production functions and to the adoption of new suppliers. For example, at
some point the car manufacturing industry started using plastic instead of steel
for some car parts. The car manufacturing adopted plastic manufacturing as its
new supplier because of the technological innovation.
I propose a dynamic weighted network formation model which is a modification of the model introduced in Barrat, Barth´elemy, and Vespignani (2004a).
This model captures above evolution rules and leads to the power-law distribution
2
of the industries’ weighted outdegree.
It is important to investigate the process of the network formation, because the structure of the network affects stability of the whole system. As was
shown in Acemoglu, Carvalho, Ozdaglar, and Tahbaz-Salehi (2012), weighted outdegree of the U.S. industries have a power-law distribution.12 This means that
even small shocks can lead to the large system volatility.
The structure of the network can help predict how shocks will propagate in
the economy. Many models try to predict the shocks’ propagation in the network.
It is worth mentioning that the input-output analysis was popular for
investigating effects of different kinds of shocks on the economy in the previous
century. The main advantage of the input-output analysis is its simplicity. It is a
clear mathematical procedure to calculate multipliers and then to use them to get
an impact of the change in one industry on the economic activity of another industry. But at the same time there is a limitation of this approach, because it is valid
only for short-term prediction. Because it makes prediction under an assumption
that the network is static, which is not true in the long run. Many models that use
a network approach to describe shocks’ propagation in the economy also assume
that industries’ network is fixed.
In reality, the structure of the network is changing over time, and if these
changes are large the models with network stability assumption can not be used for
long-term predictions. In this case it is necessary to take into account the evolution
of the network’s structure; otherwise, the prediction of the shocks’ influence on
the economy would be incorrect. Therefore I investigate how industries’ network
is changing over time and what causes these changes.
Using data of the U.S. input-output tables I build a network, where nodes
are industries and links represent product flows between them. Each industry uses
a part of other industries’ outputs as inputs for its own production. The weights
of the links represent the size of the product flows.
For example, the aircraft manufacturing industry uses inputs from different industries: rubber, iron and steel, electronics, metalworking machinery, space
vehicles, motor vehicle parts manufacturing industries. At the same time some of
these industries such as electronics, metalworking machinery, space vehicles and
motor vehicle parts manufacturing industries also use part of the output produced
by the aircraft manufacturing as inputs in their production. This interconnection
ck− β , where P (k)
- the empirical counter-cumulative distribution function, k - a weighted outdegree, c, β - the
constants and β > 1.
2Industry’s weighted outdegree is a sum of the industry’s shares in intermediate inputs of
other industries.
1The network has a power-law weighted outdegree distribution if P (k) ∼
3
Figure 1: Industries-suppliers and industries-consumers of the aircraft manufacturing industry
is illustrated in Figure 1.
Figure 2 illustrates the U.S. economy in 2014 as a network. This directed
graph includes the most important links (with large product flows) between aggregated sectors. The size of the node reflects the importance of the industry’s
position in the network. The sectors with a higher eigenvector centrality have a
larger size of the nodes that represent them.
In reality this network is not fixed over time. Industries start to use inputs
from new industries-suppliers, while some industries may lose their industriesconsumers. These changes can be described as new links’ adoption and links’
destruction in the network. At the same time the size of the product flows between
industries is also changing, which again can be described as a change of the links’
weights in the network. I describe the rules of the industries’ network evolution
and their economic interpretation.
4
Figure 2: Aggregated Input-Output Network of the U.S. industries in 2014
5
1.1
Related literature
This paper relates to the two branches of literature, namely, the network formation
models and work that combines the network approach and input-output tables.
Acemoglu, Carvalho, Ozdaglar, and Tahbaz-Salehi (2012) relate the idea
of the effect of the power-law distribution on the aggregate fluctuations in a system with a network approach. They show that if nodes’ degrees in a network
have a power-law distribution then a small idiosyncratic shock can create a large
system volatility. This means that the structure of the industries’ network influences aggregate fluctuations. In asymmetric system, where intersectoral linkages
are not uniformly distributed the rate at which aggregate volatility decays is quite
low. Using the U.S. input-output tables they show that the weighted outdegree
of the industries follow the power-law distribution. For this reason I propose theoretical model with evolution rule that leads to the power-law weighted outdegree
distribution.
The networks in which degrees of the nodes behave as a power-law are
called scale-free networks. There are theoretical models of the network formation
that explain how a scale-free network may occur. All of them are based on a
preferential attachment mechanism, which I also use in my model. According to
this mechanism the probability that existing node will be connected to a new node
is proportional to a number of its links. Nodes with many connections become
even more influential over time. The first such model was introduced in Baraba´si
and Albert (1999).
A further extension of this original model is a fitness model introduced
in Bianconi and Baraba´si (2001). Fitness network formation model uses the same
preferential attachment mechanism, but also includes a node-specific parameter
that affects the probability of adoption. I also include a industry-specific parameter related to industry’s productivity level.
Barrat, Barth´elemy, and Vespignani (2004b) build a undirected weighted
network formation model that considers the basic evolution of the system as driven
by the weight properties of the links and nodes. They allow a dynamical evolution
of weights during the growth of the system. A new link creation affects the weights
of existing links. I use their model as a basis for my model and modify it to consider
a directed weighted network formation process.
Atalay, Hortacsu, Roberts, and Syverson (2011) use a scale-free framework
to model indegree distribution of the connected firms in an economy. According
to this model, firms are born, create links with each other and die with certain
probability. Using the U.S. firm level data they estimated parameters of the model.
Their model is close to my model, but the main difference is that they consider
6
unweighted network’s evolution, while I focus on weighted network’s formation
process.
There is some recent literature that also considers the input-output tables
of the industries as networks. For example, Fadinger, Ghiglino, Teteryatnikova,
et al. (2015) build a multi-sector general equilibrium model with IO linkages,
sector-specific productivities and tax rates. Using a statistical approach they show
that an aggregated income depends on the distribution of IO multipliers, sectoral
productivities and sectoral tax rates. They also find the difference in distributions
of IO multipliers for rich and poor countries. Rich countries have more sectors
with intermediate multipliers, while poor countries have many sectors with low
multipliers and just a few sectors with high multipliers.
Magalha˜es, Oso´rio, and Ant´onio (2016) use the U.S. input-output tables
to study the influence of the changes in the inter-sector network structure on
the process of technology adoption. According to the obtained results there are
global (but not local) network structure effects in the intensity of the technology
adoption.
Acemoglu, Akcigit, and Kerr (2015) build a model of the shocks’ propagation through an input-output network under an assumption that this network
is fixed. They show that in the economy with the Cobb-Douglas production function and perfect competition supply-side shocks (productivity shocks) propagate
downstream and affect the consumers but not the suppliers. Demand-side shocks
propagate upstream and affect only industry’s suppliers, but not the consumers.
They assume that the inputs’ shares are stable over time and only industries’ outputs are changed. I argue that productivity shocks also propagate through the
system by creating new links.
The empirical literature that considers the factors that affect the network
formation process is very poor. Carvalho and Voigtla¨nder (2014) use the dynamic
network formation model described in Jackson and Rogers (2007) as a base for
the hypotheses. According to the model each new firm in first period randomly
chooses its suppliers, but in the next period it forms the links with the suppliers
of its suppliers. Using the U.S. input-output data for the period 1967-2002 they
show that a technological distance between industries affects the probability of
a new link creation between them. My paper is closely related to this paper. I
use the same data, but test a different hypotheses. I test whether the industry’s
position in the network affects its probability of new link adoption and weights of
the links to the other connected industries.
7
Figure 3: The U.S. industries’ indegree and outdegree in different years
Number of industries
150
1972
1977
1982
1987
1992
100
50
0
0
50
100
150
200
250
Outdegree
300
350
400
450
Number of industries
40
1972
1977
1982
1987
1992
30
20
10
0
20
40
60
80
100
120
Indegree
140
160
180
200
Number of industries
60
1997
2002
2007
40
20
0
0
50
100
150
200
250
300
350
Outdegree
Number of industries
30
1997
2002
2007
20
10
0
80
100
120
140
Indegree
8
160
180
200
Table 1: Data on the numbers of formed and disappeared links in the U.S. industries’ network
Y ear
Number of new links
Number of disappeared links
1977
1982
1987
1992
5896
5459
2614
2987
4123
3387
4162
7649
Number of industries
318
318
2002
2007
6108
1746
6487
2376
Number of industries
219
219
1.2
Empirical observations
The network of the U.S. industries is changing over time. In each period some
new links are formed, while others disappear. Table 1 provides data on changes
in numbers of formed and disappeared links in the U.S. industries’ network. In
some years the number of new links is greater than the number of the disappeared
links, in other years it is right the opposite. The size of the gap between these two
numbers is changing over time.
I consider two periods from 1972 to 1992 and from 1997 to 2007 separately, because there were two different industries’ classifications during these
periods (SIC and NAICS). These classifications differ in the levels of aggregation;
therefore, they have different numbers of manufacturing industries for these two
periods.
The number of new links halved in both 1987 and 1992 compared to 1977
and 1982 respectively. There was a large drop in the number of new links between
2002 and 2007, this number decreased threefold.
One may think that these changes in the number of new links are related
to the fact that industries buy some inputs every few years. For example, an
airline does not buy new airplanes every year. However, we consider the links not
between the firms but between the industries. Each industry consists of many
9
firms and all these firms buy their inputs in different periods. Therefore, it is
unlikely that no firm buys the input from the regular industry-supplier. Let’s
consider the same example of the aircraft manufacturing and airlines. There are
more than 120 airlines in the U.S. and at least one of them buys a new airplane
every year.
Table 2 provides data on an average indegree, outdegree and weighted
outdegree of the U.S. industries in different years. An average indegree shows
an average number of suppliers that the U.S. industries have. An average outdegree is an average number of industries-consumers that industry has. An average
weighted outdegree is an average sum of the industry’s shares in intermediate
inputs of other industries.
Figure 3 illustrates the outdegree and indegree of the U.S. industries in
different years. Most of the U.S. industries have a low outdegree. The mode for
the indegree is always higher than for the outdegree. This means that there is a
small number of industries that have many consumers. I define these industries as
”universal” suppliers, because they provide inputs to many other industries. The
indegree and outdegree distributions are changing each year. These changes are
related to differences in the numbers of formed and disappeared links in each year.
Acemoglu, Carvalho, Ozdaglar, and Tahbaz-Salehi (2012) using the U.S.
data showed that the industries’ weighted outdegrees have a power-law distribution
and got an estimation of the power. Figure 4 illustrates the empirical distribution
function of the weighted outdegree.
The U.S. industries’ network is not fixed and it is constantly changing.
Moreover, the structure of the network demonstrates the power-law properties. I
propose a model that can properly describe the evolution of the industries’ network
below.
2 Theoretical framework
I propose the model of the weighted network formation that is a modification of
the model introduced in Barrat, Barth´elemy, and Vespignani (2004a). The main
difference between their model and my model is that I consider a directed network,
while they consider an undirected network.
First I introduce notation.
wij - weight of the link between industry i and industry j, an amount of
industry i’s output used by industry j in its production.
ki - an industry i’s outdegree, number of its industries-consumers.
The weighted outdegree of the industry i shows the amount of the industry
10
Table 2: The average indegree, outdegree and weighted outdegree of the U.S.
industries
Y ear
Average
indegree
Average
outdegree
Average
weighted outdegree
1972
1977
1982
1987
1992
97.7
104.3
112.6
106
89.2
80.2
87.6
94
89.9
73
0.63
0.63
0.59
0.58
0.58
Number of industries
318
318
318
1997
2002
2007
137.6
132.8
131.4
110.1
105.6
101.4
0.55
0.61
0.68
Number of industries
219
219
219
i’s output which is used as an intermediate input by other industries:
si =
,
wij
(1)
j
An initial network consists of N0 industries connected by links with weights
w0. In each period of time a new industry t is born and it creates m links.
The industry i is chosen as a supplier by a new industry with probability
sis . The probability of adoption as a supplier depends on the industry’s weighted
j
j
outdegree. I define industries with many consumers (with a high weighted outdegree) as the ”universal” industries, which means that they have a high probability
to be adopted by new industries in the future. The weight of each new link is w0.
The new link adoption changes the weights of other links in the following
way. Suppose that an industry t adopts an industry i as a new supplier, then
weights of the links for industry i’s suppliers increase in the following way:
wki → wki + ∆wki,
11
(2)
Figure 4: Empirical counter-cumulative distribution function of the U.S. industries’ weighted outdegree
Source: Acemoglu, Carvalho, Ozdaglar, and Tahbaz-Salehi (2012)
∆wki = δk
wki
,
si
(3)
where δk - a productivity level of the industry k. We assume that δk is an independent random variable taken from a given distribution ρ(δ). A higher level of
productivity induces a higher weight growth. Because a high productivity level
leads to a low price of the output.
At the same time the weights of the links for industry i’s consumers increase in the following way:
wik → wik + ∆wik,
∆wik = δi
wik
si
(4)
(5)
As a result we get the following evolution equation for the industry i’s
weighted outdegree:
dsi = m(w + δ ) si (t) + , mδ sk (t) wik (t)
i ), s (t)
i ), s (t) s (t)
0
k
j j
j j
k∈ V
dt
(6)
(i)
The first term shows the case when industry i is chosen as a new supplier with
some probability, which changes its outdegree by 1 and its weighted outdegree
by (w0 + δi). Further we set w0 = 1. The second term shows how the weighted
outdegree may change if the industry i’s consumer was adopted as a new supplier.
12
Where V (i) is a set of industry i’s consumers.
The evolution equation for industry i’s outdegree is as follows:
si (t)
dki
= m ),
dt
sj (t)
(7)
j
In comparison with the usual degree preferential attachment models nodes
connect more likely to the nodes with the higher weighted outdegree.
),
1
Each new link from industry i to industry t increases j sj by (1+δi +δi ),
),
1
where δi = k∈
δk wskit , H(i) - set of industry i’s suppliers. On the average the
H(i)
total weighted outdegree is increasing in the following way:
t
,
1
where δ is a solution of
[
si(t) c:: m(1 + δ )t,
1
(8)
i=1
dδρ(δ)
= 1 (we assume that ρ(δ) is bounded).
Using the fact, that
k∈ V (i) δiwik = δisi and Eq.(8), I obtain the following
expressions for Eqs.(6) and (7):
(δ1),
− 2δ)
(1 + 2δi)si(t)
dsi
=
,
dt
(1 + δ1 )t
si (t)
dki
=
dt
(1 + δ1 )t
(9)
Using initial conditions ki(t = i) = si(t = i) = m I solve these differential equations:
t
1+2δi
1
1+δ
si(t) = m( )
i
,
ki(t) = si(t) + 2δim
(1 + 2δi)
(10)
(11)
According to Eq.(10) a productivity level of an industry increases its weighted
outdegree, that in turn increases the industry’s adoption probability.
The weighted outdegree of the industry and the number of its consumers
are proportional, with a coefficient depending on δi:
si(t) = (1 + 2δi)ki(t) − 2δim
(12)
The generated networks display power-law behavior for the weight, degree, and weighted outdegree distributions. The obtained results suggest that the
inclusion of weights in the networks modeling naturally explains the diversity of
13
power-law distributions empirically observed in real industries’ networks.
3 Hypotheses
Hypothesis 1. More productive industries become more central over time. The
logic of this hypothesis comes from the following reasoning. The industries that
experience growth in their productivity have a lower cost and can be adopted as
suppliers by new industries. A creation of new links makes these industries more
influential. Productivity shock shifts the industry’s supply curve, which leads to
a decrease in equilibrium price and to an increase in equilibrium quantity. This
increase in quantity may occur due to the growth of the existing product flows
and by attracting new consumers.
Hypothesis 2. The higher the industry’s centrality the higher the probability of a new link adoption is. More influential industries have a higher chance to
grow further, because they have more contracts with different industries, through
which they may find new suppliers and consumers. Additionally, industries with
high centrality supply big product flows to many other industries. Their costs per
unit of output decrease with increasing outgoing product flows. This leads to the
lower prices of their outputs. These industries become more attractive suppliers,
which increases their chance of finding a new industry-consumer.
Hypothesis 3. The centrality of the industry increases its product flows
to other industries (the weights of the links). Economies of scale and low prices
of the industries with high centrality may also lead to increased product flows to
existing industries-consumers.
4 Data
I use the panel data of the U.S. 5-year period annual input-output benchmark
tables for 1972-2007 provided by the Bureau of Economic Analysis. The Use table
shows the inputs to industry production and the amounts that are consumed
by final users. The Industry-by-Industry Total Requirements Tables show the
production that is required from each industry to deliver a dollar of output to
final users.
I derived the Direct Requirements tables (DR) from the Total Requirements tables (TR) using the following formula:
DR = (TR − I) ∗ TR − 1
14
(13)
The Direct Requirements table shows the amount of a commodity that is
required by an industry to produce a dollar of the industry’s output.
To get the matrices with shares of the intermediate inputs (W), I divided
each element of the matrix DR by the appropriate column sum.
I also use the NBER-CES Manufacturing Industry Database, which provides information on the manufacturing industries’ total factor productivity (TFP)
and total employment.
I use a matrix W , where wij - a share of the industry i’s input in a
total intermediate input of the industry j. And from the matrix W I derived an
”indicator” matrix A, where each element is determined as follows:
Aij,t =
1,
if wij,t > 0 and wij,t− 5 =
0,
otherwise
0
(14)
Final uses growth rate for each industry is determined as follows:
∆i,t =
Di,t − Di,t− 1
,
Di,t−
(15)
1
where ∆i,t - industry i’s final uses growth rate, Di,t - industry i’s final uses in
period t.
A technological distance between industries i and j in defined in the following way:
dij =
1
w ij
(16)
The higher the weight of industry i’s input in intermediate input of industry j, the shorter technological distance between these two industries is. If
industries i and j are connected indirectly through other industries, then the
technological distance is a minimal sum of the distances between all nodes on
the path. To calculate technological distances between industries I use Dijkstra’s
shortest path algorithm for the weighted graphs.
I consider two datasets for 1972-1992 (Panel 1) and for 1997-2007 (Panel
2) separately, because of a significant change in classification (from SIC to NAICS)
in 1997. There are some changes in classifications even within these two periods.
I transform all the matrices to the unified versions of classification for these two
periods.
Centrality measures identify the most important nodes within graph. The
simplest centrality measure is a degree centrality. Because I work with the directed
15
graph I derive both an indegree and an outdegree centrality for each industry by
16
using the Direct requirement matrix W . An indegree is a number of links directed
to the node and an outdegree is a number of links that originate from the node.
The degree centrality counts every link a node receives equally. But not
all of the links are equivalent, some of them represent large product flows between
industries, other represent small flows.
I also use an extended version of the degree centrality, an eigenvector
centrality. This centrality measure also takes into consideration an importance
of the other nodes linked to the node. The main difference between the degree
centrality and the eigenvector centrality is that a node with a high degree centrality
does not necessarily have a high eigenvector centrality. Because it can be linked
to many nodes, but all of them are not so important. Reverse is also true, a node
with high eigenvector centrality is not necessarily highly linked.
To provide an example I consider centrality measures for the U.S. industries in 2007. The computer terminals manufacturing has a relatively low degree
but a very high eigenvector centrality. Which means this industry does not have
so many consumers, but it is an important supplier for influential industries such
as the telephone apparatus manufacturing, the semiconductor and related device
manufacturing. These industries also have important positions in the network
according to the eigenvector centrality.
An example for the second case is the plastics packaging materials manufacturing. It has a high degree, but a low eigenvector centrality. Many other
industries use product of this industry as inputs, but these inputs do not play
important roles in the production process of the industries.
I derive a eigenvector centrality from the Direct requirements coefficients
matrix W . The eigenvector centrality of a node i is determined as follows:
Cent =
i
1,
λ
wij Centj
(17)
j
Or in matrix form:
λCent = CentW,
(18)
where Cent - an eigenvector of the matrix W related to the largest eigenvalue
λ /= 0.
A higher centrality measure means that an industry has more connections
with other industries. A possible explanation of the outdegree measure’s influence
on a probability of a link creation is that if an outdegree of an industry is high
enough, then it is an evidence that this industry is an universal supplier to other
industries. This means that there is a high probability that new industries will
also choose this industry as their supplier. The second centrality measure is the
17
eigenvector centrality that takes into account not only links that an industry sends
but also receiving industries, which captures the whole spread of information about
this industry.
5
Methodology
To test Hypothesis 1 and to check whether the productivity level of the industry
affects its centrality in the future I use U.S. industries’ panel data to estimate the
following equation:
Vi,t = αi + β1Vi,t− 1 + β2∆TFP i,t− 1 + ui,t,
(1)
where Vi,t - different centrality measures (indegree, outdegree, eigenvector centrality) for the industry i in period t and ∆TFP i,t− 1 - a lag of the industry
i’s productivity growth rate. I use different centrality measures, because the
produc- tivity growth may affect in several ways. More productive industries
may have more consumers over time, because the cost of their outputs
decreases. Then I expect a positive and significant coefficient in a regression
where the dependent variable is outdegree. At the same time, an industry that
experience its productiv- ity growth may introduce some innovations. And for a
future development of these innovations it needs new materials and it will start
adopting new suppliers for this reason. Then I can also expect a positive and
significant coefficient in a regression with indegree. While the eigenvector
centrality may capture both of these effects and also an increase of product flows
of the more productive industries.
To test Hypotheses 2 and to verify whether the industry’s centrality and
growth in final uses affect the industry’s probability of a link creation I use the
following fixed effect panel data linear probability model:
Aij,t = αij + β1Centi,t− 5 + β2Centj,t− 5 + β3Empi,t + β4Empj,t + β5∆TFP i,t− 1
+β6 ∆TFP j,t− 1 + β7Distanceij,t− 5 + β8∆FUsei,t− 5 + β9∆FUsej,t− 5 + uij,t, (2)
where Aij,t is a dummy variable, which is equal to 100, if a new link between industries i and j was formed in period t and 0 otherwise; Centi,t− 5 is a 5-years lag of
the industry i’s eigenvector centrality; Empi,t is the industry i’s total employment
in period t in 1000s; ∆F inalU sei,t− 5 is a 5-years lag of the industry i’s growth in
final uses; Distanceij,t− 5 is a 5-years lag of technological distance between industries i and j; ∆T F Pi,t− 1 is a one-year lag of the industry i’s productivity growth
rate; uij,t is an error term.
Similar equation is used in Carvalho and Voigtla¨nder (2014). But they do
not include industries’ centralities, which is important for testing my hypothesis.
18
I use the total employment variable to control for the size of the industries.
Larger industries may have more suppliers and consumers and more possibilities
to create new links. I also include lags of ∆TFP to control for productivity levels.
The estimated coefficients β1 and β2 will clarify how centrality measures
of both industries affect the probability of new link creation between them. Additionally, I include lags for the growth of final uses and expect coefficients β8 and
β9 to be positive and significant. An increase in final uses of one industry may
affect the process of its link creation. For example, if in one industry final uses
are growing then it may attract new suppliers, because now it has a higher ability
to optimize its production.
To test Hypothesis 3 and to check whether the industry’s centrality affects its share in intermediate inputs of other industries I estimate the following
equation:
Weightij,t = αij + β1Centi,t− 5 + β2Centj,t− 5 + β3Empi,t + β4Empj,t + β 5 ∆TFP i,t− 1
+β6 ∆TFP j,t− 1 + β7Distanceij,t− 5 + β8∆FUsei,t− 5 + β9∆FUsej,t− 5 + uij,t, (3)
where Weightij,t - a share (%) of an input from the industry i in a total intermediate input of the industry j in period t, all other variables are the same as in
Equation (2). I also control for the industry’s size and productivity growth. I add
lags of the growth in final uses for both industries to check whether they affect
the link’s weight.
6 Results
The results for Regression 1 for Panel 1 and Panel 2 are provided in Table 2.
According to the results for Panel 1, an industry’s TFP growth rate affects all
three centrality measures. Coefficients of ∆TFP t− 1 are positive and significant
for all three variables. The change in the industry’s growth rate of TFP by 1%
in previous year increases the industry’s indegree by 25.79 (p-value=0.001), the
industry’s outdegree by 18.6 (p-value=0.0065) and the eigenvector centrality by
0.003 (p-value=0.028) in current year. A change for the eigenvector centrality is
small, because the eigenvector centrality itself takes values from 0 to 0.2531. This
means that industry’s productivity growth affects the process of a new link formation. More productive industries have more industries-suppliers and industriesconsumers over time.It is also possible that the industry’s productivity growth
increases its product flows to other industries.
More productive industries have low costs and low prices for their inputs.
As a result, other industries start to buy inputs from these industries; this means
19
Table 3: Results of Regression 1
Dependent variable
Eigenvector centrality
Centt− 1
0.16
(0.11)
∆TFP t− 1
0.003∗ ∗
(0.001)
Indegree Outdegree
Panel 1
0.22∗ ∗ ∗
(0.26)
25.79∗ ∗
∗
0.125∗ ∗
(0.056)
18.6∗
(10.05)
(7.39)
1268
1268
1268
Centt− 1
0.083
(0.067)
Panel 2
− 0.14∗ ∗
∗(0.043)
− 0.05∗ ∗
∗(0.018)
∆TFP t− 1
0.016∗ ∗ ∗
(0.006)
2.14
(10.96)
− 0.69
(27.66)
Number of observations
436
436
436
Number of observations
an increase in their outdegree. High productivity growth means that an industry
makes some innovations in its technological process. Further innovations need new
inputs and the industry adopts new suppliers. This is reflected in the growth of the
industry’s indegree. The influence of the productivity growth on the eigenvector
centrality captures both of these effects. It also may capture an increase in the
size of product flows within existing connections.
Results for Panel 2 show that the coefficient for ∆TFP t− 1 is only significant in the regression where the dependent variable is the eigenvector centrality.
This means that during the period 1997-2007 the productivity growth only affected the industry’s eigenvector centrality and did not affect its indegree and
outdegree. During this period more productive industries did not have higher
chance to be adopted as new suppliers or consumers. But because there is an
influence of the productivity growth on the eigenvector centrality we can conclude
that the weights of the industry’s links to other connected industries increased.
The industry’s productivity growth increased the size of industry’s product flows.
Results for both panels support our Hypothesis 1 that more productive
industries become more central over time.
20
Table 4: Baseline results for Panel 1 (1972-1992)
Regression 2
Dummy for adoption∗
Regression 3
Weight∗ ∗
Centi,t− 5
27∗ ∗ ∗
(4)
Centj,t− 5
35∗ ∗ ∗
(5.5)
Empi,t
− 0.002∗ ∗ ∗
(0.003)
− 0.007∗ ∗
(0.002)
2.2∗ ∗ ∗
(0.4)
− 0.05
(0.04)
0.0007∗ ∗ ∗
(0.0002)
− 0.00005
(0.00005)
0.05∗ ∗
(0.02)
0.014
(0.015)
− 0.0000006∗ ∗
Dependent variable
Empj,t
∆TFP i,t− 1
∆TFP j,t− 1
Distanceij,t− 5
∆FinalUsei,t− 5
∆FinalUsej,t− 5
Observations
5∗ ∗ ∗
(0.7)
1.2∗
(0.7)
− 0.00007∗ ∗ ∗
(0.00001)
− 0.017∗ ∗ ∗
(0.003)
− 0.014∗ ∗ ∗
(0.003)
291957
∗
(0.00000006)
− 0.00017∗
(0.00007)
0.00005
(0.00006)
291957
The estimated coefficients for Regression 2 for Panel 1 are provided in
Table 4. The estimated coefficients for lags of both industries’ eigenvector centralities are positive and significant. This means that there is a high probability
that an industry with a high eigenvector centrality will be adopted as a consumer
or as a supplier in the future. The coefficient for the industry-consumer is greater
than the coefficient for the industry-supplier. The eigenvector centrality of the
industry-consumer affects the probability of a new link creation stronger than the
centrality of the industry-supplier.
These results support Hypothesis 2. More central industries have more
connections to other industries; and through these connections they find new suppliers and new consumers. Also, I consider more central industries as ”universal”
suppliers to many other industries. When a new industry is born it adopts this
”universal” industry as supplier with a high probability.
21
Table 5: Baseline results for Panel 2 (1997-2007)
Regression 2
Dummy for adoption
Regression 3
Weight
Centi,t− 5
32∗ ∗ ∗
(4.6)
24∗ ∗ ∗
(4)
Centj,t− 5
− 4.1
(4.6)
− 0.0045∗ ∗ ∗
(0.0012)
0.0047∗ ∗ ∗
(0.0012)
− 0.9∗ ∗
(0.4)
− 0.00019∗
Dependent variable
Empi,t
Empj,t
∆TFP i,t− 1
∆TFP j,t− 1
Distanceij,t− 5
∆FinalUsei,t− 5
∆FinalUsej,t− 5
Observations
∗
(0.000096)
0.00007
(0.000096)
11∗ ∗ ∗
(1.6)
2.2
(1.6)
− 0.00026∗ ∗
(0.0001)
0.0094
(0.03)
− 0.08∗ ∗ ∗
(0.03)
1∗ ∗ ∗
(0.1)
− 0.065
(0.12)
− 0.43∗ ∗ ∗
(0.009)
0.0059∗ ∗
(0.0024)
0.0009
(0.00244)
47088
47088
I control for the technological distance between industries (to avoid the
effect that industries with higher eigenvector centrality have short distances to
many other industries). The estimated coefficient for the distance is negative and
significant, but very small. This means that the shorter the technological distance
between industries, the higher the probability that these industries will be linked
in the future is. This result is consistent with the results obtained in Carvalho
and Voigtla¨nder (2014). They show that if there is a short technological distance
from the industry i to the industry j then the industry j will adopt the industry
i as a new supplier with a high probability.
The estimated coefficients for the growth in final uses for both industries
are negative and significant, but very small. As a result, I cannot argue that the
growth in final uses increases the probability of a new link adoption.
The estimated coefficients for Panel 1 and Panel 2 support the Hypothesis
22
3. The industry i’s eigenvector centrality increases its input’s share in intermediate inputs of other industries (Table 2). This effect is big enough in Panel 2, but
small in Panel 1. The industry i with the high eigenvector centrality has many
industries-consumers that leads to the cost advantage and low price of the output.
For this reason industries-consumers increase their product flows from this industry. ”Universal” industries have high shares in intermediate inputs of connected
industries.
According to the results for both Panels the shorter the distance between
two industries 5 years ago, the higher the input share of industry i’s product in
total intermediate input of the industry j, but again this effect is very small.
The estimated coefficients for the lags of the industry i’s productivity
growth for both Panels are significant and positive. This means that the higher
the industry’s productivity growth in previous year the higher its input share in
an intermediate input of other industries this year.
7
Conclusion
I consider the U.S. industries as a network, which is changing over time. I propose
a theoretical model that describes a industries’ network formation process and an
evolution of links and their weights.
According to the model, the most productive and central industries have
a higher chance to be adopted as suppliers by new industries. The position of the
industry in the network and its productivity affect the weights of the links. This
means that productive industries with many connections will have large product
flows to other industries-partners. As a result, the evolution rule leads to the
power-law distribution of the industries’ weighted outdegree. This is exactly what
is observed in the U.S. data.
I use the U.S. input-output tables for 1972-2007 to verify whether these
rules of the network formation are implementable for real industries’ network formation process. The obtained empirical results support the hypotheses.
Industries with a higher productivity growth become more central over
time. Three different centrality measures are used for the estimation: the indegree,
the outdegree and the eigenvector centrality. The productivity growth of the
industry affects all three centralities in period 1972-1992. This means that if
an industry experiences a productivity growth and introduces innovations then
it starts to adopt new suppliers. An industry may also attract new industriesconsumers, because of a reduction in its costs that is related to the productivity
growth.
23
I also empirically verified that the industry’s position in the network affects its future expansion. More influential industries in terms of the eigenvector
centrality have a higher probability to create new links with other industries in the
future. Moreover, it also increases the size of its product flows to other connected
industries.
One possible direction for the future research is to build a model that
combines a network formation process and microfoundations, where the evolution
of a network is not simply determined by the proposed rule, but is a result of
firms’ economic decisions. A firm belonging to a particular industry makes a
choice of suppliers and the size of the inputs. Decisions of all the firms determine
the network evolution over time.
23
References
Acemoglu, D., U. Akcigit, and W. Kerr (2015): “Networks and the Macroeconomy: An Empirical Exploration,” NBER Chapters.
Acemoglu, D., V. M. Carvalho, A. Ozdaglar, and A. Tahbaz-Salehi
(2012): “The Network Origins of Aggregate Fluctuations,” Econometrica, 80(5),
1977–2016.
Atalay, E., A. Hortacsu, J. Roberts, and C. Syverson (2011): “Network structure of production,” Proceedings of the National Academy of Sciences,
108(13), 5199–5202.
Baraba´si, A.-L., and R. Albert (1999): “Emergence of scaling in random
networks,” science, 286(5439), 509–512.
Barrat, A., M. Barthe´lemy, and A. Vespignani (2004a): “Modeling the
evolution of weighted networks,” Physical review E, 70(6), 066149.
(2004b): “Weighted evolving networks: coupling topology and weight
dynamics,” Physical review letters, 92(22), 228701.
Bianconi, G., and A.-L. Baraba´si (2001): “Competition and multiscaling in
evolving networks,” EPL (Europhysics Letters), 54(4), 436.
Carvalho, V. M., and N. Voigtla¨nder (2014): “Input diffusion and the evolution of production networks,” Discussion paper, National Bureau of Economic
Research.
Fadinger, H., C. Ghiglino, M. Teteryatnikova, et al. (2015): “Productivity, Networks and Input-Output Structure,” in 2015 Meeting Papers, no. 624.
Society for Economic Dynamics.
Jackson, M. O., and B. W. Rogers (2007): “Meeting strangers and friends
of friends: How random are social networks?,” The American economic review,
pp. 890–915.
Magalha˜ es, M. F., C. Oso´rio, and M. Anto´nio (2016): “The US InterSector Network Dynamics and Its Effects on Technology Adoption,” Available
at SSRN 2805394.
24