What Causes Business Cycles?
Analysis of the Japanese Industrial Production Data
arXiv:0912.0857v2 [q-fin.GN] 22 Nov 2010
Hiroshi Iyetomi, Yasuhiro Nakayama
Department of Physics, Niigata University, Niigata 950-2181, Japan
Hiroshi Yoshikawa
Faculty of Economics, University of Tokyo, Tokyo 113-0033, Japan
Hideaki Aoyama
Department of Physics, Kyoto University, Kyoto 606-8501, Japan
Yoshi Fujiwara
ATR Laboratories, Kyoto 619-0288, Japan
Yuichi Ikeda
Hitachi Ltd, Hitachi Research Laboratory, Ibaraki 319-1221, Japan
Wataru Souma
College of Science and Technology, Nihon University, Chiba 274-8501, Japan
Abstract
We explore what causes business cycles by analyzing the Japanese industrial production data. The methods are spectral analysis and factor analysis.
Using the random matrix theory, we show that two largest eigenvalues are
significant. Taking advantage of the information revealed by disaggregated
data, we identify the first dominant factor as the aggregate demand, and
the second factor as inventory adjustment. They cannot be reasonably interpreted as technological shocks. We also demonstrate that in terms of two
dominant factors, shipments lead production by four months. Furthermore,
out-of-sample test demonstrates that the model holds up even under the
2008-09 recession. Because a fall of output during 2008-09 was caused by an
exogenous drop in exports, it provides another justification for identifying
Preprint submitted to Journal of Japanese and International EconomyNovember 23, 2010
the first dominant factor as the aggregate demand. All the findings suggest
that the major cause of business cycles is real demand shocks.
Keywords:
JEL No: E23,, E32,
1. Introduction
What causes business cycles? A glance at Haberler (1964) reveals that
all kinds of theories had already been advanced by the end of the 1950’s. In
recent years, macroeconomists had been so confident of their skillful policy
management as to hail the “Great Moderation”. Lucas (2003) in his Presidential Address to the meeting of the American Economic Association made
the following declaration:
Macroeconomics was born as a distinct field in the 1940’s, as a
part of the intellectual response to the Great Depression. The
term then referred to the body of knowledge and expertise that
we hoped would prevent the recurrence of that economic disaster.
My thesis in this lecture is that macroeconomics in this original
sense has succeeded: Its central problem of depression prevention
has been solved, for all practical purposes, and has in fact been
solved for many decades (Lucas, 2003, p.1).
Ironically, when Lucas delivered this address, the American economy had
been on the steady road to the post-war worst recession, perhaps even depression. Business cycles are still with us, and await further investigation.
The real business cycle (RBC) theory pioneered by Kydland and Prescott
(1982), arguably the most dominant school today, regards changes of total
factor productivity as the primary cause of business cycles. This theory is
basically the neoclassical equilibrium theory, and minimizes the role of the
government’s stabilization policy. The Keynesian theory, in contrast, attributes business cycles to fluctuations of real aggregate demand (Summers,
1986; Mankiw, 1989; Tobin, 1993), and assigns the government and the central bank the substantial role of stabilization. The fundamental problem,
namely what causes business cycles, remains unresolved.
The standard approach to study this problem is to explore the relative importance of technology shocks on one hand and demand shocks on the other
2
as the sources of business cycles by making various identifying restrictions on
macro time-series models: See Shapiro and Watson (1988), Blanchard and
Quah (1989), Francis and Ramey (2005) and works cited therein. Though
variables used and methods differ in details, they all share the same key identifying restriction. Namely, the level of output (or real GDP) is determined
in the long-run only by supply shocks such as technology shocks. Conversely,
the long-run effect of demand shocks on output is assumed to be nil. On this
assumption, not directly observable disturbances are sorted into technology
shocks and demand shocks.1
A majority of works routinely make this assumption without questioning its validity. However, we maintain that the assumption is not actually
so tenable as is commonly believed. The problem is that in the study of
business cycles, technology shocks are usually taken exogenous as typified by
real business cycle theory (Kydland and Prescott, 1982). Here, we must observe that modern micro-founded macroeconomics is curiously inconsistent
in that while new growth theory puts so much emphasis on the endogeneity
of technical progress, business cycle theory is naively content with taking advances in technology as exogenous. New technology or technical progress is
certainly not a free lunch! First of all, it must be generated by R&D investment. Once new technology is found out, it can augment production only by
way of investment which embodies such new technology. Now, investment
including R&D investment, is, of course, much affected by cyclical fluctuations. Monetary factors as well as real factors affect investment. Thus, to the
extent that new technology is embodied in investment, demand shocks are
bound to affect the pace of technical progress. Therefore, we can not naively
accept the standard identifying restriction that demand shocks do not leave
the permanent effects on the level of output.
One way to proceed without this questionable assumption is to take advantage of disaggregated data. The representative works on business cycles
cited earlier all use the macro variables. However, business cycles as the
conspicuous macro phenomena are characterized by the fact that output
movements across broadly defined sectors and types of goods move together
(Mitchell, 1951). In Mitchell’s terminology, they exhibit high conformity. It
is then most natural to seek possibly unobserved common factors underlining
sectoral comovements as the basic factors causing business cycles. Moreover,
the separate effects of such unobserved factors on production, shipments and
inventory are expected to provide us with valuable information on the causes
of business cycles.
3
Factor models precisely fit out purpose (Sargent and Sims (1977), Stock
and Watson (1998, 2002)). Stock and Watson (1998, 2002) based on this
method develop the Diffusion Index to be used for improving forecasting the
cyclical fluctuations. In this paper, we take the same approach for a different
purpose. By examining the relationship between production, shipments, and
inventory of twenty-one types of goods, we explore what causes business
cycles. We also show that the model estimated based on the pre-Lehman
crisis (September 15th, 2008) can reasonably explain the post-Lehman (200809) recession of the Japanese economy. Our findings are in favor of the old
Keynesian view (Tobin, 1993).
2. Indices of Industrial Production
We use the monthly data of the Indices of Industrial Production (IIP)
compiled by the Ministry of Economy, Trade and Industry (METI) of Japan.
They are indices of industrial production, producer’s shipments, and producer’s inventory of finished goods. Details of IIP are in the web site of
METI, Japan (2008). Here, we simply note that the indices are compiled
based on nearly 500 items (496 for production and shipment indices; 358
for inventory). We use the data for the 240 months from January 1988 to
December 2007. (The data itself is available starting on January 1977, but
some items are missing before January 1988.) We use IIP classified by types
of goods, which is listed in Table 1.
We denote the IIP by Sα,g (tj ), where α = 1, 2, and 3 denotes production
(value added), shipments, and inventory, respectively. Also, g = 1, 2, . . . , 21
denotes 21 categories of goods shown in Table 1, and tj = j∆t with ∆t = 1
month, and j = 1, 2, . . . , N for N (= 240) months starting in 1/1988 and
ending in 12/2007. A typical Sα,1 (tj ) is shown in Figure 1.
The logarithmic growth-rate rα,g (tj ) is defined by the following:
Sα,g (tj+1 )
,
(1)
rα,g (tj ) := log10
Sα,g (tj )
where j runs from 1 to N − 1 = 239 (:= N 0 ). The time-series rα,1 (tj ) is
plotted in the bottom panel of Figure 1.
The normalized growth rate wα,g is then defined by
wα,g (tj ) :=
rα,g (tj ) − hrα,g it
σα,g
4
(2)
where h·it denotes average over time t1 , . . . , tN 0 and σα,g is the standard deviation of rα,g over time. Thus the set {wα,g (t1 ), wα,g (t2 ), . . . , wα,g (tN 0 )} has
the average equal to zero, and the standard deviation equal to one. The
normalization is done because the normalized data are to be used in the subsequent analysis of random matrix theory. We also note that the information
revealed by correlation rather than covariance is sufficient for our study of
lead-lag relationships among production, shipments and inventories. In the
next section, we will examine the cyclical behavior of the normalized growth
rate (2).
3. The Power Spectrum
There are two basic problems in the study of business cycles. First is to
identify the periodicity. Secondly, we must identify common factors causing
sectoral comovements.
In this section, we identify the periodicity. Some economists are skeptical
of the stable periodicity of business cycles (Diebold and Rudebusch, 1990). It
is certainly true that when we examine the sample path property of stochastic
process, we face difficulties in finding or even clearly defining the periodicity.
However, it does not necessarily mean that we cannot usefully explore the
periodicity of business cycles. We resort to the standard spectral analysis
(see Granger (1966), for example).
The Fourier decomposition is done as follows:
0
N −1
1 X
√
w
eα,g (ωk )e−iωk tj ,
wα,g (tj ) =
0
N k=0
(3)
where the Fourier frequency ωk := 2πk/(N 0 ∆t), so that ωk tj = 2πkj/N 0 .
Because wα,g (tj ) are real-valued, we have2 ,
∗
w
eα,g
(ωk ) = w
eα,g (ωN 0 −k ).
(4)
The normality of wα,g (tj ) implies that
w
eα,g (0) = 0,
0 −1
N
X
|w
eα,g (ωk )|2 = N 0 .
(5)
k=0
We define the averaged power spectrum p(ωk ) as follows:
3
21
1 XX
p(ωk ) =
|w
eα,g (ωk )|2 ,
M α=1 g=1
5
(6)
where M = 63, the total number of the time series. Recall that we have 21
categories of goods each for which there are three different variables, namely
production, shipments, and inventory. We note that the following relation is
satisfied:
0 −1
N
X
p(ωk ) = N 0 ,
(7)
k=0
due to Eq. (5).
Figure 2 shows the behavior of p(ω) of the seasonally-adjusted data3
and the non-adjusted original data as a function of the period T := 2π/ω
[months]. Hereafter, we will use units of months for T unless otherwise noted.
Since the Fourier components of j = 1, ..., (N 0 − 1)/2 are actually identical
to the components of j = N 0 − 1, ..., (N 0 + 1)/2 because of Eq. (4), we plot
only the former half. Therefore, the lower limit of the data in this plot is
2 ∗ N 0 /(N 0 − 1) ≈ 2.008. Also, the non-adjusted data is multiplied by a factor
7 in this plot so that the comparison becomes easier.
The original data has high peaks4 at T = 1 year, 6 months, and 3 months
showing the presence of significant seasonal fluctuations. We observe that
they are well-eliminated in the adjusted data.5 We will use the seasonallyadjusted data for the following analysis.
We observe the significant power at low frequencies, T = 60 months (5
years) and T = 40 months (3 years and 6 months). One might be tempted
to argue that strengths of the cycles T = 120 and T = 239 are comparable to
them. However, for the 239 months our data covers, the T = 120 oscillation
occurs only twice, and the T = 239 only once. Therefore, we disregard them
as “one-time” events. Also, one might argue that T = 24 appears to be
a possible cycle. But so are many peaks for T < 24. In general, peaks for
shorter periods tend to be the results of noise, and one needs to discard them.
The criteria for selection of significant periods will be discussed in relation
to dominant eigenmodes in Section 5.
We can demonstrate the robustness of the T = 60 and T = 40 cycles.
These two cycles have the wavenumber (number of periods in the whole region
of 239 months) k = 4 and k = 6 respectively, so that more accurate values
of the periods are N 0 /k ≈ 59.75 and N 0 /k ≈ 39.83. Therefore, if the number
of observations, N 0 is different from the present value, there is actually no
Fourier-coefficient at these periods. Because of this, one may cast a doubt
on the robustness of the two cycles. In what follows, we will examine this
point to demonstrate that the two cycles are quite robust.
6
Specifically, we have done the following two analyses.
Discrete Fourier transformation for chopped data: We take the first S months
and carry out the discrete Fourier transformation as in the manner of
Eq. (6).
Continuous Fourier transformation: We allow ω to take any value in Eq. (6),
and obtain the power-spectrum as a continuous function of T = 2π/ω.
This is an over-complete analysis, but should be sensitive to existence
of periods away from the discrete values allowed in the original analysis.
The results are shown in Figure 3. The dots connected with thick lines
are the same discrete power-spectrum as in Figure 2 while the small dots
connected by thin lines are the discrete Fourier transform of the chopped
data for S = 234, 229, · · · , 164. The solid curve with gray shades shows the
result of the continuous Fourier transformation on which the original powerspectrum data-points lie. We find that these results are basically same. For
the short periods, say less than T < 24, all the analyses give essentially identical results, and for the longer periods, the discrete spectrum of the chopped
data shows behavior similar to the averages of the continuous analysis.
In Figure 3, we observe three cycles A, B, and C for the continuous plot,
and also for some of the chopped data. The peak A of the continuous plot is
at the period T ≈ 36.96 and the peak B is at the period T ≈ 59.62. These
are both very close to the peaks we have found by our discrete analysis. This
confirms the robustness of our basic finding.
The third peak C is at T ≈ 100.16. The periods allowed in our original
power-spectrum in Eq. (6) are for T = N 0 /2 ≈ 119.5 and T = N 0 /3 ≈ 79.67,
both of which are in the outskirts of this peak as is seen in this plot. However,
the T ' 100 cycle is also a “one-time” event; it only occurred two and a half
times during our observation of 239 months.
We conclude that the T = 40 and T = 60 cycles are significant and robust.
The peaks and troughs of Japan’s business cycles are officially determined
by the Cabinet Office (prior to 2000, the Economic Planning Agency). In
the post-war period (October, 1951 – January, 2002), the Japanese economy
had experienced twelve cycles. The average length of twelve cycles is 50.3
months with the maximum 83 months and the minimum 31 months. The
periodicity we have identified, T = 40 and T = 60 months is, therefore, quite
reasonable.
7
4. The Dominant Factors Identified by the Random Matrix Theory
Next, we must identify the significant common factors underlying sectoral
comovements. By the common factors, we mean a small number of factors
that explain the correlation of the original data as much as possible. We
resort to the principal component analysis.
First, we obtain the correlation matrix C as follows;
Cα,g;β,` := hwα,g wβ,` it
(8)
whose diagonal elements are 1 by definition of the normalized growth rate
wα,g . Because α and β run from 1 to 3 and g and ` run from 1 to 21, the
matrix C has M × M (M = 63) components.
To uncover the unobservable common factors, we examine the eigenvalues
(n)
λ and the corresponding eigenvectors V (n) of the matrix C;
C V (n) = λ(n) V (n) .
(9)
The eigenvalues λ(n) satisfy the trace-constraint;
M
X
λ(n) = M.
(10)
n=1
The correlation matrix C is written in terms of the eigenvalues and eigenvectors as follows:
M
X
λ(n) V (n) V (n)T
(11)
C=
n=1
The distribution of the eigenvalues λ(n) is plotted in Figure 4. We see that
there are a few isolated large eigenvalues, and that the rest concentrates in the
smaller region. The large eigenvalues and associated eigenvectors correspond
to the dominant factors which explain comovements of the fluctuations of
different goods.
The eigenvector corresponding to the largest eigenvalue (which we call
“the first eigenvector” hereafter) is the first principal component (PC) that
accounts for as much of the correlation as possible in the M -dimensional
space. The eigenvector for the second largest eigenvalue (“the second eigenvector”) is the second principal component that accounts for as much of the
correlation as possible in the subspace orthogonal to the first eigenvector,
and so on.
8
In the principal component analysis (PCA), it remains an unresolved
problem how to choose significant factors. To solve this problem, many
procedures and guidelines have been proposed (Jackson, 1991; Jolliffe, 2002;
Peres-Neto et al., 2005). For example, the Kaiser-Guttman rule (Guttman,
1954; Kaiser, 1960) which is very popular in practice suggests that only
factors having eigenvalues greater than one are to be retained. The scree
graph named by Cattell (1966) is also widely used. The scree graph is a kind
of rank size plot of the eigenvalues of a correlation matrix. In this graph, the
abscissa is the rank and the ordinate is the eigenvalue. The typical shape of
the scree graph is a downward-sloping curve, and the slope changes from the
“steep” section to the “shallow” section. Accordingly, we can obtain PCs by
looking for the point that separates the “steep” and the “shallow”. Although
the scree graph is used mainly in PCA, Cattell proposes the same method
for the factor analysis as well.
The problem of testing the significance of a few dominant factors is also
a challenge to exploratory factor analysis (FA), or the so-called approximate
dynamic factor model such as Stock and Watson (1998, 2002). Determining the number of unknown dominant factors has been, in fact, studied in
the rapidly growing literature. Bai and Ng (2002), for example, propose a
criteria for the selection of dominant factors in large dimensions of N and
T . They basically use the information criteria in a framework of model selection. Their analyses are, however, done on the asymptotic assumption of
large dimensional panels.
Random Matrix Theory
Though how to determine the number of significant components is still
an open issue in PCA and FA, the parallel analysis (PA) proposed by Horn
(1965) is considered very likely to be the best procedure (Zwick and Velicer,
1986). In PA, the scree graph derived from the actual correlation matrix
is compared with that from the random matrix that parallels the original
data set.6 Because the multivariate time-series of IIP contain uncorrelated
shocks and “measurement” errors, the empirical correlation matrix is to some
extent random. It is, therefore, reasonable to compare the properties of the
empirical correlation matrix C to a null hypothesis on purely random matrix
which one would obtain with time-series of totally uncorrelated fluctuations.
This is the basic idea of the PA.
We apply the method of random matrix theory (RMT) to the problem of
comparing the properties of the empirical correlation matrix C with the null
9
hypothesis based on the purely random matrix. The RMT has made great
success in physical sciences, and has quickly found applications to various
disciplines. We note that the RMT has been applied to finance for a decade
(see the textbook by Bouchaud and Potters (2000) and references therein)
and macroeconomics (Ormerod, 2008).
PA is widely applied to various disciplines, but it requires the Monte-Carlo
simulation to obtain the scree graph for the random matrix. The drawback
of the Monte-Carlo simulation is cost of computation time. Several methods
have been proposed to overcome this problem. For example, the tables of
simulated eigenvalues of the random matrix with different sample size and
number of variables are provided by Buja and Eyuboglu (1992). However,
if we can obtain the theoretical formula for the probability density function
(PDF) for the eigenvalue of the random matrix with different sample size
and number of variables, we can overcome this drawback of the Monte-Carlo
simulation and PA. That is what we will do in what follows.
Let us consider an ensemble of random matrices in which all the elements
of a square matrix A of size M × M are iid (independent and identicallydistributed) random variables with the constraint that Aij = Aji . As a
consequence of the central limit theorem, in the limit of large matrices, the
distribution of eigenvalues has universal properties, which are mostly independent of the distribution of the elements Aij . In particular, it can be
shown that if the Aij ’s are iid random variables of zero mean and variance
hA2ij i = σ 2 /M , the probability density function (PDF) of eigenvalues of A is
given by the so-called “semi-circle law” (Wigner, 1955):
q
1
4σ 2 − λ2A ,
(12)
ρ(λA ) =
2πσ 2
for |λA | ≤ 2σ and zero elsewhere (see Bouchaud and Potters (2000)).
Its relevance to our study of correlation is obvious when sample correlation matrix is written as C = AT A where A is a rectangular matrix of size
M × N (M and N respectively the number and length of time-series), and
AT is the transpose of A. If M = N , the eigenvalue λ of C can be obtained
from the obvious relation λ = λ2A . If it is assumed that the elements of A are
iid random variables, the PDF of eigenvalues of C can be easily calculated
from the semi-circle law (12) by using the relation ρ(λ)dλ = 2ρ(λA )dλA (for
λA > 0) as
r
1
4σ 2 − λ
ρ(λ) =
,
(13)
2πσ 2
λ
10
for 0 ≤ λ ≤ 4σ 2 and zero elsewhere.
If M 6= N , one can derive a similar PDF (Marčenko and Pastur (1967); see
Bouchaud and Potters (2000) for a simple derivation). Using our notations in
this paper of M and N 0 , the PDF for the eigenvalue of the M ×M correlation
matrix C for M stochastic variable of length N 0 with standard deviation σ
is given by
p
(λ+ − λ)(λ − λ− )
Q
for λ− < λ < λ+ ,
2
λ
(14)
ρ(λ) = 2πσ
0
otherwise,
in the limit of
M → ∞,
N0 → ∞
(15)
with Q := N 0 /M fixed. The bounds λ± for λ are defined as follows:
√ 2
Q)
2 (1 ±
.
λ± := σ
Q
(16)
Note that (14) is reduced to (13) when M = N 0 .
Though the above results (Eqs.(14)–(16)) hold exactly true only asymptotically in the limit (15), we emphasize that the RMT is actually robust
with respect to sample size; it holds up not only asymptotically but also for
finite sample with nontrivial autocorrelation. We demonstrate it in Appendix
A with the help of a Monte-Carlo analysis of randomized data. Given this
result, in what follows, we perform significance test of eigenvalues based on
the RMT advanced in physics.
The dashed line in Figure 4 shows the prediction of RMT, Eq. (14) and
(16), where in the present case,
σ = 1,
Q=
239
≈ 3.79,
63
λ+ ≈ 2.29,
λ− ≈ 0.24.
(17)
We find that two largest eigenvalues
λ(1) ≈ 9.95,
λ(2) ≈ 3.82,
(18)
are significant in comparison with the RMT prediction. That is, they are significantly above the upper bound λ+ and, therefore, reflect the non-random
structure of the data.
11
Identification of Two Dominant Factors
We have demonstrated that two largest eigenvalues are significant. In
Figure 5, the components of the eigenvectors V (1,2) are shown for production, shipments, and inventory by type of goods. Taking advantage of this
information revealed by disaggregated data, we can identify two eigenvectors
as follows.
We observe that the first eigenvector V (1) captures the comovements of
production and shipments. Thus, the first dominant factor corresponding
to the largest eigenvalue is the “aggregate demand” which moves both shipments and production in all the sectors. This is nothing but Keynes’ principle
of effective demand.
The central Keynesian proposition is not nominal price rigidity
but the principle of effective demand (Keynes, 1936, Ch.3). In the
absence of instantaneous and complete market clearing, output
and employment are frequently constrained by aggregate demand.
In these excess-supply regimes, agents’ demands are limited by
their inability to sell as much as they would like at prevailing
prices. Any failure of price adjustments to keep markets cleared
opens the door for quantities to determine quantities, for example
real national income to determine consumption demand, as described in Keynes’ multiplier calculus. · · · · · · In Keynesian business cycle theory, the shocks generating fluctuations are generally
shifts in real aggregate demand for goods and services, notably
in capital investment. (Tobin, 1993)
The textbook multiplier analysis depends, of course, on the assumption that
changes in shipments induce almost equal changes in production.
It is noteworthy that in 15 out of 21 cases, responses of inventory stock
is negative, suggesting depletion of inventory stock in response to the first
dominant factor V (1) which we interpret as the aggregate demand. When a
firm’s production is subject to increasing marginal cost and/or production
takes time, there is a buffer-stock motive for inventory. The buffer stock
motive produces the negative correlation between inventory investment and
sales when demand exogenously changes (Hornstein, 1998).
The second dominant factor V (2) , on the other hand, captures the comovements of production and inventory. Its effect on shipments is relatively
small. Metzler (1941)’s classical work demonstrates that production is made
12
not only for the purpose of meeting the current sales or shipments but also
for maintaining the firm’s desired level of inventory stocks. The firm’s desired
level of inventory stocks depends on anticipations of future demand or sales.
That is, inventory investment becomes, in part, “speculative”. In fact, based
on a robust stylized fact that the variance of production is greater than that
of shipments, Blinder and Maccini (1991) and others have demonstrated that
the role of inventory stock is not confined to production smoothing. Rather,
they show that inventories apparently destabilize output at the macro level.
Whether originated from the Metzlerian inventory accelerator (the idea that
desired inventory stocks rise with sales) or from the (S, s) inventory strategy
at the micro level, inventory and production tend to move together. Our second dominant factor V (2) appears to capture this channel. For convenience,
we call it “inventory adjustments”.
Whether direct or indirect, V (1) and V (2) are basically demand factors. It
is difficult to interpret V (1) and V (2) as “productivity shocks” for the following reason. Changes in total factor productivity (TFP) emphasized by Real
Business Cycle theory such as Kydland and Prescott (1982) bring changes in
marginal cost in production. In order to minimize costs, the firm increases
(decreases) production and accumulates (reduces) inventories during times
when marginal cost is low (high). Thus, not only final demand (consumption
/ investment) usually analyzed by RBC but inventory investment also moves
together with production when productivity changes (Hornstein, 1998). In
other words, productivity shocks, temporary shocks in particular, must move
all the variables, namely production, shipments, and inventory in tandem.
Figure 5 clearly shows that V (1) captures the comovements of production and
shipments while V (2) the comovements of production and inventory, leaving
the third variable relatively unaffected in both cases. The two dominant
factors, therefore, do not represent productivity shocks.
5. Dominant Eigenmodes and the Business Cycles
So far, we have found that (1) the T = 60 (months) and the T = 40
fluctuations are significant, and also that (2) two largest eigenvalues, λ(1,2)
of the correlation matrix, are significant based on the RMT.
These two findings suggest that the contribution of two largest eigenvalue components at T = 60 and T = 40 periods are properly defined as the
“business cycles”. In this section, we measure the contribution of this “business cycle” factor in economic fluctuations. Toward this goal, we express the
13
normalized growth rate in Fourier modes of the eigenvectors.
First, we decompose the normalized growth rate wα,g (tj ) on the basis of
the eigenvectors V (n) ;
wα,g (tj ) =
M
X
(n)
an (tj ) Vα,g
.
(19)
n=1
By substituting this into Eq. (8) and comparing it with Eq. (11), we find
that
han an0 it = δnn0 λ(n) .
(20)
The Fourier decomposition of the coefficients an (tj ) is
0
N −1
1 X
e
an (ωk ) e−iωk tj ,
an (tj ) = √
0
N k=0
(21)
similar to Eq. (3). This leads to
0
(n)
λ
N −1
1 X (n)
λ (ωk ),
= 0
N k=0
λ(n) (ωk ) := |e
an (ωk )|2 .
(22)
From Eq. (3) and Eq. (22), we obtain
w
eα,g (ωk ) =
M
X
(n)
e
an (ωk ) Vα,g
.
(23)
n=1
Substituting this into Eq. (6), we obtain that the following decomposition of
the power spectrum
M
1 X (n)
p(ωk ) =
λ (ωk ).
(24)
M n=1
By summing the both-hand sides over k, we recover Eq. (10):
0
N −1
M
1 X
1 X (n)
1= 0
p(ωk ) =
λ .
N k=1
M n=1
(25)
This shows that the eigenvectors corresponding to the larger eigenvalues have
larger contribution to the power spectrum on average: For example, the
14
relative contribution of the first eigenvector is 16% as λ(1) /M ≈ 0.16 (M =
63). If the contributions of all the eigenvectors were the same, it would be
only 1.6% (1/M ≈ 0.016). The sum of the relative contribution of the first
and the second eigenvector components amounts to 21%. Dominance of two
large eigenvalues is clear.
So far, we have discussed the average contribution. The important question is in what part of the power spectrum the large eigenvalues have significant contributions. Figures 6 and 7 answer this question. In Figure 6,
the dark gray spectrum at the bottom is the contribution of the first eigenvector, λ(1) (ωk )/M , whereas the spectrum with lighter shading is that of the
second eigenvector, λ(2) (ωk )/M . The solid black line is the whole spectrum,
p(ωk ), which is the same as the spectrum with gray shading in Figure 2. We
observe that the eigenmodes of the larger eigenvalues tends to have greater
contributions as T gets larger.
This is better illustrated in Figure 7 which shows the relative contribution
in each bin of period with the same notations as used in Figure 6. We observe
here that the relative contribution of the first eigenmode is about 20% for
T & 40 whereas it is only 10% or less for T . 40. The same is true for
the second eigenmode. The sum of the relative contribution of the first and
second eigenmodes is about 40% for T & 38 but is only 20% for T . 38.
(The boundary T ≈ 38 is denoted by the vertical dashed line and the mark
“A” on the horizontal axis.) This is another reason why we rejected smaller
periods like T = 24 as insignificant in Section 3; The relative contributions
of the dominant two eigenmodes for T . 38 are much smaller than those for
T = 40 and T = 60. It suggests that fluctuations in shorter periods T < 38
are basically due to noises, and do not reflect of any systematic comovements.
We have identified the two dominant factors V (1) and V (2) as demand
factors, not productivity shocks. It is important to recognize that the fact
that the relative contribution of V (1) and V (2) is roughly 40% does not mean
that the residual 60% is to be explained by other systematic factors such as
“productivity shocks” emphasized by RBC. The residual 60% is to be explained by V (j) (j = 3, 4, . . .), but the significance of V (j) (j = 3, 4, . . .) is
rejected by the RMT (Section 4). The residual 60% is, therefore, not really
a part of systematic “business cycles”, but rather caused by various unidentified random factors. In fact, Slutzky (1937) suggested that the summation
of pure random shocks might generate apparently “cyclical” fluctuations; In
effect, he rejected the existence of systematic “business cycles”. Contrary to
his assertion, our findings confirm the presence of business “cycles” which
15
systematically account for 40% of economic fluctuations. Such “business cycles” defined based on V (1) and V (2) at T = 40 and 60 are basically caused by
demand factors. In the next section, we will investigate the cyclical behavior
of production, shipments and inventory in greater detail.
6. Production and Shipments
In the previous sections, we have shown the presence of the two eigenmodes of the correlation matrix for the T = 40 and 60 Fourier modes. Let
us now recover the behavior of production and shipments in these dominant
modes. In order to do so, we first express the decomposition of the IIP data
to each Fourier and eigenmode by combining Eqs. (19) and (21):
0
M N −1
1 XX
(n)
e
an (ωk ) e−iωk tj Vα,g
wα,g (tj ) = √
0
N n=1 k=0
(26)
0
M (N −1)/2
(n)
2 X X
=√
< e
an (ωk )e−iωk tj Vα,g
.
N 0 n=1 k=1
(27)
Here we used the facts that ãn (0) = 0 due to w
eα,g (0) = 0, ãn (ωk ) = ã∗n (ωN 0 −k )
due to an (tj ) being real, and that N 0 − 1 = 238 is even. The “business
cycle component” is obtained by limiting the above sum to over only k = 4
(T = 60) and k = 6 (T = 40) and n = 1, 2.
Figure 8 shows the cyclical movements of production, shipments, and
inventory. As explained in the previous section, they are defined as cycles
produced by two dominant factors at T = 40 and T = 60. The
P21 cycles shown
in Figure 8 are the average values over the goods, wα (tj ) := g=1 wα,g (tj )/21
for T = 60 (the left panel) and for T = 40 (the right panel). The top row
in each panel is for the first dominant factor, the middle row for the second
dominant factor, and the bottom row for their sum.
In the first eigenmode (the top row), production (the solid line) and
shipments (the dashed line) move together with equal amplitude. In contrast,
in the second eigenmode (middle) the movements of shipments are almost
flat. This is consistent with the results in Figure 5.
Recall that we identify the first eigenmode with the aggregate demand
while the second eigenmode with inventory adjustments. It is notable that
although in each eigenmode, production and shipments move with the same
(or opposite) phase as is guaranteed by our expansion, they are shifted in
16
time once the sum over two dominant eigenmodes is done. This is because
of the difference in their amplitudes.
For the T = 60 oscillations, production is delayed by the second eigenmode as shown by two black arrows in Figure 8. In contrast, shipments
receive almost no contribution from the second eigenvector. As a result, production is delayed a little after shipments. We can exactly derive this delaytime. From decomposition (26), we see that the average of
P the normalized
growth rates wα,g over the goods (g = 1, . . . 21), w̄α (tj ) := 21
g=1 wα,g (tj )/21,
contains the following T = 60 (k = 4) part:
1 e
a1 (ω4 )V̄α(1) + e
a2 (ω4 )V̄α(2) e−iω4 tj + · · · .
(28)
w̄α (tj ) = √
N0
P
(n)
(n)
where V̄α := 21
g=1 Vα,g /21. Therefore, denoting the absolute value and the
phase of the coefficients as,
e
a1 (ω4 )V̄α(1) + e
a2 (ω4 )V̄α(2) := Aα eiφα ,
(29)
where Aα > 0, we find that production (α = 1) and shipment (α = 2) obey
the following relations:
w̄1 (tj ) = A1 e−i(ω4 tj −φ1 ) + · · · ,
w̄2 (tj ) = A2 e
−i(ω4 tj −φ2 )
+ ··· .
(30)
(31)
This means that the delay-time of production behind shipments for the period
(SP)
T = 60, ∆60 , is given by (φ1 − φ2 )/ω4 . Because ωj = 2πj/N 0 [1/month]
(SP)
with N 0 = 239, we obtain the delay-time ∆60 in units of months as follows:
"
#
(1)
(2)
239
1
e
a
(ω
)
V̄
+
e
a
(ω
)
V̄
1
4 1
2
4 1
(SP)
∆60 :=
Arg
≈ 4.28.
(32)
(1)
(2)
4 2π
e
a1 (ω4 )V̄2 + e
a2 (ω4 )V̄2
It is clear in this expression that the lead-lag between shipments and
production is caused by the combination of two eigenmodes. For inventory,
the shift occurs in the opposite direction, and as a result, it lags by
(PI)
∆60 ≈ 10.95,
behind production.
17
(33)
The basically same story holds for the T = 40 oscillations (Figure 8). We
thus obtain7
(SP)
(PI)
∆40 ≈ 1.93, ∆40 ≈ 13.62.
(34)
(SP)
(SP)
We can formally test whether the time-delays ∆60 and ∆40 are statistically significant or not. Toward this goal, we randomize the rest of the
eigenmodes and evaluate those values. The simulation is done in following
steps:
1. Subtract the eigenmodes of the first two eigenvalues from the actual
data to define:
0
wα,g
(tj )
:= wα,g (tj ) −
2
X
(n)
an (tj ) Vα,g
.
(35)
n=1
2.
3.
4.
5.
(see Eq. (19)).
0
(tj ) separately, i.e., with ranRandomly reorder the time-series wα,g
domization different for each α and g.
Combine the reshuffled part with the first two eigenmodes to create a
simulated data.
Calculate the eigenvalue distribution and see whether the largest two
eigenvalues still remain. If they are, we proceed to the next step. (Note
that the above randomization process changes the eigenvalues.)
(SP)
(SP)
Calculate ∆60 and ∆40 for the resulting simulated data.
In this manner, the noise part (the eigenmodes except for the first and second
eigenmodes) are reshuffled. We then check whether the values we obtain
(SP)
(PI)
for the time-delays, ∆60,40 and ∆60,40 , stand out against the noises. We
carried out this simulation 105 times. The result obtained is that the first
two eigenvalues always exist as seen in Figure 9. The distribution of the
time-delays are shown in Figure 10 with the summary statistics of simulation
results in Table 2. We observe that the delay-time obtained in Eqs. (32) and
(34) are statistically significant.
In Appendix B, we perform cross-spectrum analysis and estimate the
phase in the spectrum to obtain the lead-lag relationships among production, shipments and inventory. For the series of the sum of the first and
second eigenmodes, the results obtained by the cross-spectrum analysis are
consistent with, and thereby confirm from a different angle, those presented
in this section. In addition, by comparing them with the cross-spectrum for
18
the original data, we can say that the extraction of the two dominant eigenmodes or our definition of “business cycles” is essential for understanding the
lead-lag relationship between shipments and production.
The sum of the components of the first and the second eigenvectors at
the T = 40 and T = 60 periods is compared with the original data for
production, shipments and inventory in Figure 11. To make comparison
easier, we applied a simple moving average operation to the original data
over one-year period. For the original data so smoothed out, we can hardly
detect any lead-lag relation between production and shipments. In contrast,
in the extracted “business cycles”, shipments clearly precede production with
the average lead time of four months.
In Figure 11, we observe a close one-to-one correspondence between the
original data and the extracted business cycles. The peaks and troughs of
two cycles coincide very well although their amplitudes slightly differ due
possibly to economic fluctuations with longer cycles (T > 60) and external shocks. A linear combination of two sinusoidal functions oscillating at
different periods gives rise to an extended cyclic motion with the period of
T = 120. Figure 11 thus spans two cycles of the endogenous business fluctuations. The characteristic behavior of the original data shown in the bottom
panel is well captured by the “business cycles” defined here.
Japan had experienced a long stagnation beginning 1990 through 2002.
One might consider a possible “structural break” during the “lost decade”.
However, the finding of such steady cycles shown in Figure 11 suggests that
the Japanese economy has had actually a fairly stable structure over the last
two decades. Besides, our sample period (January, 1988–December, 2007)
basically covers the whole period of the long stagnation. Thus, rather than
chopping our data into sub-sample periods, we resort to the out-of-sample
examination of the 2008-09 economic crisis in the next section.
We have found that production lags behind shipments over the extracted
“business cycles”. It does so inventory adjustments tend to lag behind the
initial changes in shipments. Suppose that shipments or final demand increased in the economy as a whole. It induces a contemporaneous increase
of production, and at the same time, depletion of inventory stock in many
sectors and also for many types of goods; This is well captured by the first
eigenmode which we interpret the “aggregate demand” (the top panel of Figure 8). Now, an increase of final demand and depletion of inventory stock
make firms revise their expectations on future demand, and accordingly the
level of desired inventory stock. If demand does not increase further, firms
19
produce more for the purpose of inventory accumulation. This is captured by
the second eigenmode which we interpret the “inventory adjustments” (the
middle row of Figure 8). As Metzler (1941) shows, inventory adjustments
take time. This makes production lag behind shipments. Our analysis shows
that the average lag is about 4 months. In the case of inventory stock, an
increase of shipments initially causes depletion of inventory, and, therefore,
the lag becomes even longer than in the case of production. We find that
the average lag of inventory behind shipments is about 11 months. In this
way, we can easily understand that shipments lead production and inventory
within the framework in which changes in real demand or shipments are the
major shocks. Note that productivity shocks emphasized by RBC, by definition, instantaneously change production. There is no room for production
to lag behind shipments in such framework.
7. The 2008-09 Economic Crisis
The 2008-09 financial crisis triggered by the bursting of the housing bubbles in the U.S. immediately spread all over the world, and caused great
damage on the global economy, reminiscent of the Great Depression. Based
on the analyses in the previous sections, we update the IIP data to cover
the period from January 2008 (j = 241) through June 2009 (j = 258), and
perform an out-of-sample test.
The extended data shown in Figure 12 demonstrates a dramatic drop of
production in Japan. Panel (a) of Figure 13 shows the volatility P (t) of the
IIP defined by
3 X
21
X
|wα,g (t)|2 .
(36)
P (t) :=
α=1 g=1
We observe the exceptionally large amplification of P (t) during the 2008-09
economic crisis.
This situation, therefore, provides us with a perfect opportunity for outof-sample test for validity of the dominant eigenmodes that we have identified
using the random matrix theory. To this end, we decompose the extended
IIP data based on the exactly same eigenvectors as in Eq. (19). Then P (t)
is expressed in terms of the mode amplitude coefficients an (t) as follows:
P (t) =
63
X
n=1
20
|an (t)|2 .
(37)
Panels (b) and (c) of Figure 13 show the partial contributions of the largest
and the second largest modes to P (t), respectively. Plainly, an unprecedented
rise of volatility of the first dominant factor, namely |a1 (t)|2 accounts largely
for a rise of P (t) during the 2008-09 crisis. The contribution of the second
dominant factor is not so great as that of the first dominant factor.
The relative contributions of the first and second eigenmodes defined by
πn (t) :=
|an (t)|2
.
P (t)
(38)
are shown in Figure 14. The total sum of πn (t) with respect to n is normalized
to unity. The first dominant eigenmode accounts for almost a half of the total
power of fluctuations during the 2008-09 recession. Note that the relative
contribution of the first eigenmode, namely about one half, is comparable to
that for the in-sample period. This result demonstrates that our factor model
holds up over the 2008-09 crisis despite of the fact that a fall of industrial
production during the period as shown in Figure 12 was truly unprecedented.
We have seen that the first eigenmode basically explains a fall of industrial production during 2008-09. Now, Figure 15 shows IIP and the index
of exports during the period under investigation. Plainly, production fell in
response to a sudden decline in exports. We emphasize that a fall of exports
was caused by a sudden shrinkage of the world trade due to the financial
crisis accompanied by the global recession, and that it was basically exogenous to the Japanese economy. In Section 4, by analyzing the components
of eigenvectors (Figure 5), we identified the first dominant factor as the aggregate demand. To the extent that we identify a sudden fall of exports, an
exogenous negative aggregate demand shock, as the basic cause of the 200809 recession, the dominance of the first eigenmode provides us with another
justification for our interpreting it as the aggregate demand factor. This
confirms once again the validity of the Old Keynesian View (Tobin, 1993).
8. Concluding Remarks
In his comments on Shapiro and Watson (1988) entitled “Sources of Business Cycle Fluctuations,” Hall (1988) made the following remark:
All told, I find the results of this paper harmonious with the
emerging middle ground of macroeconomic thinking. There are
21
major unobserved determinants of output and employment operating at business-cycle and lower frequencies. Some are technological, some are financial, some are monetary (Hall, 1988, p.151).
For finding out such unobserved determinants of business cycles, the standard identifying restriction is that only technology shocks affect output permanently. As discussed in Introduction, we maintain that this assumption
is untenable. Using production, shipments, and inventory data, we analyze
the causes of business cycles more straightforwardly than in the standard
approach.
We have identified (i) two periods T = 60 and 40 months for the Japanese
business cycles, and also (ii) the two significant factors causing comovements
of IIP across 21 goods. Taking advantage of the information revealed by disaggregated data, we can reasonably interpret the first factor as the aggregate
demand, and the second factor as intentional inventory adjustments. It is difficult to interpret the two dominant factors as productivity shocks. Within
the framework of the standard RBC, productivity shocks are expected to
affect production, shipments, and inventory together. The two dominant
factors we have identified do not fulfill this condition (Figure 5).
We have also found that, for the part of shipments and production explained by two factors, shipments lead production by a few months. This
lead-lag relationship between shipments and production caused by inventory
adjustment is not clearly seen in the original data. However, inventory adjustments which generate this lead-lag relationship between shipments and
production are an important part of “business cycles” defined by two dominant factors.
Furthermore, out-of-sample analysis of the 2008–09 recession demonstrates that the first dominant factor largely explains this major episode. We
know that it was basically caused by a sudden fall of exports beginning the
fourth quarter of the year 2008 (Figure 15). Therefore, to the extent that exports can be reasonably regarded as exogenous real aggregate demand shocks
to the Japanese economy, we can identify the first dominant factor as the
real aggregate demand. All these findings suggest that the major cause of
business cycles is real demand shocks accompanied by significant inventory
adjustments.
It is important to recognize that this conclusion of ours does not necessarily mean that technology shocks are unimportant. On the contrary, it is very
plausible that a major driving force of fixed investment is technology shocks.
22
Furthermore, one might argue that technology shocks is also a driving force
of consumption by way of introduction of new products (Aoki and Yoshikawa,
2002). The point is that technology shocks affect the macroeconomy by way
of changes in aggregate demand, notably investment, in the short-run. For
the purpose of studying business cycles, it is wrong to consider technology
shocks as direct shifts of production function.
The conclusion we drew from our analyses of the Japanese industrial
production data is clear. To study whether the same results hold up for
other economies is obviously an important research agenda. We believe that
the methods we employed in the present paper can be usefully applied to the
study of business cycles in other economies.
Finally, we have identified 40 to 60 months “cycles” of aggregate demand.
The theory of such “cycles” of aggregate demand is beyond the scope of
the present paper. After all, the multiplier-accelerator model of Samuelson
(1939) and Hicks (1950) may be an answer. The problem awaits further
investigation.
Acknowledgments
We gratefully acknowledge detailed comments made by two anonymous
referees and the editor’s kind suggestions. This work was supported in part
by the Program for Promoting Methodological Innovation in Humanities and
Social Sciences by Cross-Disciplinary Fusing of the Japan Society for the
Promotion of Science.
23
Appendix A. Significance of Dominant Factors in Finite Sample
We have found that two large eigenvalues exist outside the range of the
RMT prediction for the covariance matrix C of IIP. This result, however,
holds exactly true only asymptotically in the limit (15), while we have only
finite sample, N 0 = 239 and M = 63. Also the RMT applies theoretically
to iid data as noted in Section 4. In this appendix, we will first show that
nontrivial autocorrelation exists for the growth rate, and then present a simulation result which shows that in spite of the “finite-size” effect and the
nontrivial autocorrelation, the dominant factors we have found are still significant.
The autocorrelation R(t) of the normalized growth rate wα,g is defined
by the following:
0
NX
−m
1
wα,g (tj )wα,g (tj+m ).
Rα,g (tm ) := 0
N − m j=1
(A.1)
By definition, Rα,g (0) = 1, and if there are no autocorrelations, Rα,g (tm ) = 0
for m ≥ 1. Figure 16 shows the autocorrelation averaged over the goods;
21
1 X
Rα,g (tm ).
R̄α (tm ) :=
21 g=1
(A.2)
We observe that for both production (α = 1) and shipments (α = 2), the
autocorrelations have nontrivial values for t = 1 month: R1 (1) ≈ −0.31 and
R2 (1) ≈ −0.39, while we have R3 (1) ≈ 0.007 for inventory.
Since this nontrivial autocorrelation, in addition to the finiteness of the
data size, may bring corrections to the RMT results, we have done a simulation in the following steps:
(1) Random “rotation” of each time-series,
wα,g (tj ) → wα,g (tMod(j−τ,N 0 ) )
(A.3)
where τ ∈ [0, N 0 − 1] is a (pseudo-)random integer, different for each α
and g.
(2) Then we have calculated the eigenvalues λ(n) for the resulting set of the
randomized data.
24
The data randomized in the above manner has the same autocorrelation
and the same finite-size with the original data, but correlations between
production, shipments, and inventory of different goods are destroyed.
We have run this calculation 105 times and have obtained PDFs shown
in Figure 17. In the plot (a) of Figure 17, the gray area shows the RMT
prediction identical to that of Figure 4 and the dots connected by solid lines
show the result of the simulation. We observe here that the simulation result
differs from that of the RMT prediction, but not to the extent that the values
we have found in actual IIP data, λ(1) ≈ 9.95 and λ(2) ≈ 3.82 are nontrivial.
The same can be said from the plot (b), where the PDFs of the first (largest)
eigenvalue λ(1) and the second (next-largest) eigenvalue λ(2) are given. The
average of λ(1) obtained from this simulation is λ̄(1) ≈ 2.47 with standard
deviation σλ(1) ≈ 0.10 and λ̄(2) ≈ 2.29, σλ(2) ≈ 0.07.
Therefore, we conclude that the two largest eigenvalues λ(1) and λ(2) we
have found in the actual IIP data are statistically significant factors, which
represent comovements of production, shipments, and inventory of different
goods.
Appendix B. Lead-Lag Relationships by Cross-Spectrum
In Section 6, we evaluate the lead-lag relationships among production,
shipments and inventories by defining ∆(SP) and ∆(PI) for two cycles T = 60
and 40. In this appendix, we resort to the method of cross-spectrum (see
e.g. Jenkins and Watts (1968); Bloomfield (2000)) to identify the lead-lag
relationships in terms of coherency and phase.
For a pair of time-series of length N 0 , x(tj ) and y(t
√ the
Pj ), we consider
Fourier transforms x̃(ωk ) and ỹ(ωk ) defined by x̃(ωk ) = j x(tj )eiωk tj / N 0 as
Eq.(3). Their periodograms are given by Ixx (ωk ) := |x̃(ωk )|2 and Iyy (ωk ) :=
|ỹ(ωk )|2 , respectively. The cross-periodogram for the pair is defined by
Ixy (ωk ) := x̃(ωk )∗ ỹ(ωk ) which is a complex number (except ω = 0, π).
Just as the periodogram is smoothed to obtain an estimate of spectrum,
the cross-periodogram is smoothed to obtain estimate of cross-spectrum, i.e.
X
ŝxy (ωk ) =
γ` Ixy (ωk − ω` )
(B.1)
`
where γ` ’s are the weights for smoothing. Similarly, we can obtain the auto-
25
spectra, ŝxx and ŝyy . Then, the ratio defined as
κ̂2xy (ωk ) =
|ŝxy (ωk )|2
ŝxx (ωk ) ŝyy (ωk )
(B.2)
is the estimate of squared coherency. It satisfies the following inequality:
0 ≤ κ̂2xy (ωk ) ≤ 1,
(B.3)
with 0 corresponding to no linear dependence and 1 corresponding to exact
linear dependence of the two time-series at frequency ωk .
The cross-spectrum can be expressed in terms of its amplitude and phase
as follows:
ŝxy (ωk ) = |ŝxy (ωk )| e2πiφ̂xy (ωk ) .
(B.4)
The phase φ̂xy (ωk ) estimates the lead-lag relationship between the series.
Note that the estimate of the phase becomes meaningless when the crossspectrum is small in its amplitude.
In the following estimation, we consider the multiple
consisting
P time-series
(n)
of the first two components, namely wα,g (tj ) = 2n=1 an (tj ) Vα,g , averaged
over the index g of goods for α = 1, 2, 3. We take two pairs of shipments
and production (α = (2, 1)), and production and inventory (α = (1, 3))
for x(tj ) and y(tj ). For the smoothing of the cross-periodograms, we used
a modified Daniell kernel for the weight γ` of length 11 (corresponding to
the bandwidth ∆ω/2π = 0.01226 cycles per month) with no tapering. The
standard formulae of significance levels and confidence intervals are used in
the calculation (Bloomfield, 2000).
Figure 18 shows the estimates of κ̂2xy (ω) and φ̂xy (ω) for the pair (SP) of
shipments and production (panel (a)) and for the pair (PI) of production
and inventory (panel (b)). For κ̂2xy (ω), the confidence levels (90% and 99%)
are depicted by dotted horizontal lines, which are calculated under the null
hypothesis of zero coherency at each frequency. The estimate of κ̂2xy and its
95% confidence interval are shown by a curve and the shaded region around
it. One can then see that κ̂2xy is significant at all frequencies for the pair
(SP), while it is significant only at low frequencies corresponding to T & 20
months for the pair (PI).
In the frequency regions where the squared coherency κ̂2xy (ω) is significant, one can evaluate the phase and its confidence interval in each plot of
φ̂xy (ω). We examine the sign and magnitude of the phase to evaluate the
26
lead-lag relationship and its magnitude. For the pair of (SP), at the two
frequencies ωk=6 , ωk=4 corresponding to T = 40, 60 months, respectively, the
phase is significantly positive meaning that shipments lead production at
these frequencies. Its magnitude can be evaluated in months by calculating
φ̂xy (ω)/ω and its confidence intervals. The results are
(SP)
∆40
= 2.26 [1.43, 3.09],
(SP)
∆60
= 2.74 [0.973, 4.49],
(B.5)
where the square brackets represent 95% confidence intervals. For the sake
of comparison, in Figure 18, the results presented in Section 6 are depicted
as two points with error bars at T = 40, 60 in the plot of φ̂xy (ω).
For the pair of (PI), the technique of alignment (Jenkins and Watts, 1968)
was used to weaken the biasing effect due to rapidly changing phase in the
presence of large lead-lag. The two series of production and inventory are
aligned by translating one series a distance of 8 months so that the peak in
the cross-correlation function of the aligned series is close to zero lag. Then
the results are
(PI)
(PI)
∆40 = 9.04 [7.86, 10.2],
∆60 = 9.32 [7.56, 11.1].
(B.6)
Just at in panel (a) of Figure 18, in panel (b) for the pair of (PI), the
results presented in Section 6 are depicted as two points with error bars in the
plot of φ̂xy (ω). The confidence intervals do not overlap with the error bars,
presumably due to the alignment problem, but we found no large differences
from the results.
It is interesting to compare the above results based on two extracted eigenmodes, namely our “business cycles”, with the cross-spectrum for the original
data, that is, by retaining all the eigenmodes. Figure 19 shows the estimates
of the cross-spectrum for the original data: (a) for the pair of (SP) and (b)
for that of (PI). One can make the following observations for T = 40, 60.
• The squared coherency κ̂2xy (ω) is significant for both pairs. However,
the phase φ̂xy (ω) is significant for the pair (PI) with a similar magnitude
of phase, whereas for the pair (SP), it is not significantly different from
zero at those periods T .
This observation demonstrates that our extraction of two dominant eigenmodes by the method of RMT is essential to reveal the lead-lag relationship
between shipments and production which is not clearly present in the original
27
data. We also note that the extraction does not deform the other results of
significant coherency and the lead-lag relationship between production and
inventory.
We conclude that the method of cross-spectrum and that used in Section 6
are consistent in the evaluation of lead-lag relationships among production,
shipments and inventory, and also that the extraction of two dominant eigenmodes or our definition of “business cycles” is essential for understanding the
lead-lag relationship between shipments and production.
28
Figure 1: A sample of the IIP data for g = 1 (Manufacturing Equipment).
Note: The upper panel shows the indices Sα,1 (t) while the lower panel shows the corresponding growth-rates rα,1 (tj ) defined by Eq.(1). As explained in the legend, α = 1
(Production) is shown by open circles connected by solid lines, α = 2 (Shipment) is shown
by open squares connected by dashed lines, and α = 3 (Inventory) is shown by rectangles
connected by dotted lines. The horizontal axis shows the months beginning January 1988
through December 2007.
29
Figure 2: The average power spectrum p(ω) of the normalized growth rate.
Note: The horizontal axes is chosen to be the period, T := 2π/ω, which is shown in units
of months at the bottom and in units of years at the top. The points connected by solid
lines (with darker shading) are the spectrum for the seasonally-adjusted data, while the
points connected with dashed lines (with lighter shading) are for the non-adjusted original
data.
30
Figure 3: Demonstration of robustness of the T = 60 and T = 40 cycles.
Note: The big circles connected with solid lines (denoted as “Original FT” on the top
line of the legend) are the spectrum for the seasonally-adjusted data, the same as in
Figure 2. The small circles connected with dashed lines (denoted as “Chopped FT” on
the second line of the legend) are the results of the discrete Fourier transformation of the
first S months as in the manner of Eq. (6). The dark shaded area shows the result of the
continuous Fourier transform allowing ω to take any value in Eq. (6).
31
Figure 4: PDF of the eigenvalues (bars) and the RMT prediction ρ(λ) (dashed curve).
32
Figure 5: The components of the two dominant eigenvectors, V (1,2) .
Note: The horizontal axis shows the 21 goods (g = 1, 2, . . . , 21). The gray bars are the
production (α = 1), the white bars the shipments (α = 2), and the black bars the inventory
(α = 3).
33
Figure 6: Power spectrum for the total and the two largest eigenmodes.
Note: The solid lines at the top show the power spectrum, which is the same as the solid
lines with darker shading in Figure A.2. (Note that the vertical axis is in linear scale in
this plot.) The darker shading show the power spectrum of the first eigenmode, while the
lighter shading show that of the second eigenmode.
34
Figure 7: The relative contribution of the two largest eigenmodes to the power spectrum.
Note: The relative contributions are the values averaged for each bin. The shading is the
same as in Figure 6. The point A on the horizontal is at T ≈ 38 months.
35
Figure 8: The T = 60 (left) and T = 40 (right) oscillations due to the first eigenvector
(top), the second eigenvector (middle), and their sum (bottom).
Note: The solid line shows the production averaged over 21 goods, the dashed line the
average shipments, and the dotted line the average inventory.
36
Figure 9: PDF of the three largest eigenvalues obtained by the simulation.
Note: The three vertical lines show the RMT boundary, the value of the true λ(2) , and
the value of the true λ(1) in the ascending order.
37
Figure 10: PDF of the time delays obtained by the simulation of 105 times.
(SP)
(SP)
(PI)
Note: (a) ∆60 (solid lines) and ∆40 (dashed lines with gray shading). (b) ∆60 (solid
(PI)
lines) and ∆40 (dashed lines with gray shading). The vertical line in each plot shows the
observed values. The fact that the simulation results are distributed around the observed
values is an evidence that the observed values are independent from the noise part and
are statistically significant. The statistical measures obtained in this simulation are given
in Table 2.
38
Figure 11: Comparison between the extracted business cycles and the original data.
Note: (a) Sum of the T = 60, 40 components of the first and the second eigenvectors
for production, shipments and inventory, each of which is averaged over 21 goods. (b)
The original data shown in the bottom panel are smoothed out by taking one-year simple
moving average. We observe that the systematic “business cycles” are well extracted from
the original data. (Since the Fourier components of T = 120 and 240 are NOT included in
the plot (a), that part of the the behavior of the real data shown in (b) is not reproduced
in the plot (a).)
39
Figure 12: The 2008–09 recession.
Note: The IIP data for g = 1 (Manufacturing Equipment) shown in Figure 1 is extended
to encompass the 2008–09 economic crisis. The gray shaded zone depicts the in-sample
period used in the analyses in Sections 3–6. Production and shipments almost overlap
with each other and are hard to distinguish.
40
Figure 13: Decomposition of volatility of the extended IIP data.
Note: (a) The total volatility. (b) The partial contribution of the first eigenmode to the
total volatility. (c) The partial contribution of the second eigenmode. The gray shaded
zone depicts the in-sample period used for determining the dominant modes.
41
Figure 14: Relative contributions of (a) the largest mode and (b) the second largest mode.
Note: Panels (a) and (b) correspond to panels (b) and (c) in Figure 13, respectively.
42
Figure 15: Indices of exports (solid line) and the Industrial Production (dashed line),
normalized to 100 for 2005.
43
Figure 16: Autocorrelation of the IIP growth rate, R̄α , defined by Eqs.(A.1) and (A.2).
44
Figure 17: Results of the random-ordering simulation.
Note: Because of the fact that the IIP data is of finite size and have nontrivial autocorrelation, the distribution of eigenvalues could differ from that of RMT prediction, even if
the IIP data lacks any comovements. The plot (a) shows that the PDF of the eigenvalues
λ, where the gray-shaded area is the RMT result and the dots connected by lines show
the result of the simulation. The plot (b) shows the PDF of the first eigenvalue λ(1) and
the second eigenvalue λ(2) obtained by simulation. From these, we conclude that λ(1) and
λ(2) found in the actual data are in fact principal factors that correspond to comovements.
45
(a)
(b)
Figure 18: Cross-spectrum for the sum of two dominant eigenmodes; (a) for production
and shipments and (b) for production and inventory.
Note: In each plot, κ̂2xy (ω) is the squared coherency, and φ̂xy (ω) is the phase of the crossspectrum. In both panels, the dotted lines are 90% and 95% confidence levels for κ̂2xy (ω)
obtained by the null hypothesis of κ̂2xy (ω) = 0. The gray areas represent 95% confidence
intervals in each plot.
46
(a)
(b)
Figure 19: Cross-spectrum for the original time-series; (a) for production and shipments
and (b) for production and inventory.
Note: φ̂xy for the production and shipments is not significantly different from zero at
T = 40, 60 in the plot of of upper panel. On the other hand, in the lower panel, production
leads inventory by a significant difference in the phase.
47
Table 1: Classifications of 21 goods in IIP.
Final Demand Goods
Investment Goods
Capital Goods
1 Manufacturing Equipment
2 Electricity
3 Communication and Broadcasting
4 Agriculture
5 Construction
6 Transport
7 Offices
8 Other Capital Goods
Construction Goods 9 Construction
10 Engineering
Consumer Goods
Durable Consumer 11 House Work
Goods
12 Heating/Cooling Equipment
13 Furniture & Furnishings
14 Education & Amusement
15 Motor Vehicles
Non-durable
16 House Work
Consumer Goods
17 Education & Amusement
18 Clothing & Footwear
19 Food & Beverage
Producer Goods
20 Mining & Manufacturing
21 Others
Note: The center column is the identification number, g.
48
Table 2: Simulation results on the time delays in comparison with estimates based on the
original data.
Note: σ is the standard deviation, and “95% CI” is the Confidence Interval at the 95% of
the distribution of the simulation results shown in Figure 10.
(SP)
∆60
(SP)
∆40
(PI)
∆60
(PI)
∆40
Original data
Average
σ
95% CI
4.28
1.93
4.16
1.87
1.22
0.77
[ 2.17, 6.91 ]
[ 0.57, 3.57 ]
10.95
13.62
11.07
13.67
2.25
2.00
[ 6.92, 15.77 ]
[ 9.94, 17.78 ]
49
Footnotes
1. Shapiro and Watson (1988) consider labor supply shocks as well as
technology shocks, and obtain the “surprising” result that the labor
supply shocks account for at least 40 percent of output variation at
all horizons. However, Hall (1988) shows that this finding is neither
surprising nor plausible. Specifically, as Hall criticizes, Shapiro and
Watson (1988) make the implausible identifying assumption that all
the contemporaneous comovements of output and work hours be attributed to shifts in labor supply, and rule out the alternative assumption, perhaps more plausible to many, that the moving force operates
from output to hours in the short-run. This paper indeed demonstrates
that output is basically determined by real demand in business cycles.
2. This reduces the number of independent real components in the complex Fourier coefficients w’s
e from 2N 0 − 1 down to N 0 , guaranteeing
that the Fourier coefficients w’s
e are the faithful representation of the
original series w.
3. The seasonal adjustment is done by the U.S. Census Bureau’s X-12
ARIMA method. The details are given in the resources available from
METI, Japan (2008).
4. Note that the vertical scale in Figure 2 is logarithmic.
5. There are, in fact, slight dips at those frequencies, which means that
the seasonal adjustments are somewhat excessively done. However, we
have also looked at the auto-correlation of the data and have found
that the adjusted data is much better than that of the non-adjusted
data. The relevant results are available from authors upon request.
6. These variables must be normally distributed and uncorrelated.
7. We note that our extraction of the first two dominant eigenmodes is
essential for obtaining the lead-lag relationship between shipments and
production. In fact, if we use all the eigenvectors, which is equivalent
to the usual Fourier decomposition, we obtain −0.10 months instead
of 4.28 months in Eq.(32) and 0.34 months instead of 1.93 months in
Eq.(34). They are much smaller than our time-increment of 1 month,
and therefore, are too small to be taken as being significantly different
from zero.
50
References
Aoki, M., Yoshikawa, H., 2002. Demand saturation-creation and economic
growth. Journal of Economic Behavior & Organization 48, 127–154.
Bai, J., Ng, S., 2002. Determining the number of factors in approximate
factor models. Econometrica 70, 191–221.
Blanchard, O., Quah, D., 1989. The dynamic effects of aggregate demand
and supply disturbances. American Economic Review 79, 655–673.
Blinder, A., Maccini, L., 1991. Taking stock: a critical assessment of recent
research on inventories. Journal of Economic Perspectives 5, 73–96.
Bloomfield, P., 2000. Fourier Analysis of Time Series: An Introduction. John
Wiley & Sons. second edition.
Bouchaud, J., Potters, M., 2000. Theory of financial risks: from statistical
physics to risk management. Cambridge University Press.
Buja, A., Eyuboglu, N., 1992. Remarks on parallel analysis. Multivariate
Behavioral Research 27, 509–540.
Cattell, R., 1966. The scree test for the number of factors. Multivariate
behavioral research 1, 245–276.
Diebold, F., Rudebusch, G., 1990. A nonparametric investigation of duration
dependence in the American business cycle. Journal of Political Economy
98, 596–616.
Francis, N., Ramey, V., 2005. Is the technology-driven real business cycle
hypothesis dead? Shocks and aggregate fluctuations revisited. Journal of
Monetary Economics 52, 1379–1399.
Granger, C., 1966. The typical spectral shape of an economic variable. Econometrica 34, 150–161.
Guttman, L., 1954. Some necessary conditions for common-factor analysis.
Psychometrika 19, 149–161.
Haberler, G., 1964. Prosperity and Depression. New edition. First published
by the League of Nations. Cambridge, Mass. Harvard University Press
[1937] 408, 167–87.
51
Hall, R., 1988. Comment on Shapiro and Watson. NBER Macroeconomics
Annual 3, 148–151.
Hicks, J., 1950. A Contribution to the Theory of Trade Cycle. Oxford
University Press.
Horn, J., 1965. A rationale and test for the number of factors in factor
analysis. Psychometrika 30, 179–185.
Hornstein, A., 1998. Inventory Investment and the Business Cycle. Federal
Reserve Bank of Richmond Economic Quarterly 84, 49–71.
Jackson, J., 1991. A user’s guide to principal components. Wiley-Interscience.
Jenkins, G., Watts, D., 1968. Spectral Analysis and its Applications. HodenDay, San Francisco.
Jolliffe, I., 2002. Principal component analysis. Springer verlag.
Kaiser, H., 1960. The application of electronic computers to factor analysis.
Educational and psychological measurement 20, 141.
Kydland, F., Prescott, E., 1982. Time to build and aggregate fluctuations.
Econometrica 50, 1345–1370.
Lucas, J.R., 2003. Macroeconomic priorities. American Economic Review
93, 1–14.
Mankiw, N., 1989. Real business cycles: A new Keynesian perspective. Journal of Economic Perspectives , 79–90.
Marčenko, V., Pastur, L., 1967. Distribution of eigenvalues for some sets of
random matrices. Mathematics of the USSR-Sbornik 1, 457.
METI, Japan, 2008. Indices of Industrial Production, Producer’s Shipments,
Producer’s Inventory of Finished Goods and Producer’s Inventory Ratio
of Finished. http://www.meti.go.jp/english/statistics/tyo/iip/index.html.
Metzler, L., 1941. The nature and stability of inventory cycles. The Review
of Economics and Statistics 23, 113–129.
Mitchell, W., 1951. What happens during Business Cycles: A Progress Report. National Bureau of Economic Research.
52
Ormerod, P., 2008. Random Matrix Theory and the Evolution of Business Cycle Synchronisation, 1886–2006. Economics: The Open-Access,
Open-Assessment E-Journal 2. http://www.economics-ejournal.org/
economics/discussionpapers/2008-11.
Peres-Neto, P., Jackson, D., Somers, K., 2005. How many principal components? Stopping rules for determining the number of non-trivial axes
revisited. Computational Statistics and Data Analysis 49, 974–997.
Samuelson, P., 1939. Interaction between the multiplier analysis and the
principle of acceleration. The Review of Economics and Statistics 21, 75–
78.
Sargent, T., Sims, C., 1977. Business Cycle Modeling without pretending to
have too much a priori Economic Theory. New Methods in Business Cycle
Research 1, 145–168.
Shapiro, M., Watson, M., 1988. Sources of business cycle fluctuations. NBER
Macroeconomics annual 3, 111–148.
Slutzky, E., 1937. The summation of random causes as the source of cyclic
processes. Econometrica 5, 105–146.
Stock, J., Watson, M., 1998. Diffusion indices. NBER working paper 6702.
Stock, J., Watson, M., 2002. Forecasting using principal components from a
large number of predictors. Journal of the American Statistical Association
97, 1167–1179.
Summers, L., 1986. Some skeptical observations on RBC theory. Federal
Reserve Bank of Minneapolis Quarterly Review 10, 23–27.
Tobin, J., 1993. Price flexibility and output stability: an old Keynesian view.
Journal of Economic Perspectives 7, 45–65.
Wigner, E., 1955. Characteristic vectors of bordered matrices with infinite
dimensions. Annals of Mathematics 62, 548–564.
Zwick, W., Velicer, W., 1986. Comparison of five rules for determining the
number of components to retain. Psychological Bulletin 99, 432–442.
53
© Copyright 2026 Paperzz