Discussion Paper Deutsche Bundesbank No 43/2013 Disentangling economic recessions and depressions Bertrand Candelon (Maastricht University) Norbert Metiu (Deutsche Bundesbank) Stefan Straetmans (Maastricht University) Discussion Papers represent the authors‘ personal opinions and do not necessarily reflect the views of the Deutsche Bundesbank or its staff. Editorial Board: Klaus Düllmann Heinz Herrmann Mathias Hoffmann Christoph Memmel Deutsche Bundesbank, Wilhelm-Epstein-Straße 14, 60431 Frankfurt am Main, Postfach 10 06 02, 60006 Frankfurt am Main Tel +49 69 9566-0 Please address all orders in writing to: Deutsche Bundesbank, Press and Public Relations Division, at the above address or via fax +49 69 9566-3077 Internet http://www.bundesbank.de Reproduction permitted only if source is stated. ISBN 978–3–86558–971–2 (Printversion) ISBN 978–3–86558–972–9 (Internetversion) Non-technical summary The business cycle is traditionally described as a sequence of recessions and expansions in aggregate economic activity. However, from a historical perspective it is well-known that some recessions are much longer, deeper or more abrupt than others. Thus, the question arises whether the binary characterization of the cyclical economic development is not overly simplistic. For example, a semantic distinction between economic recessions and “depressions” is commonly made in the economic policy discourse. Indeed, vast contractions of real output have stimulated interest in rare “economic disasters” among academic economists. The binary framework does not allow for considering such rare and severe events by construction. Questioning the validity of the binary approach constitutes the starting point of this paper. We distinguish economic depressions and booms from ordinary recessions and expansions using a novel nonparametric test. Given the peak and trough dates of the U.S. business cycle between 1919 and 2009 provided by the National Bureau of Economic Research (NBER), we compute characteristics which capture the length, depth and shape of each recession and expansion phase. Business cycle phases are then classified into four regimes according to the magnitude of these characteristics, such that we can also distinguish between recessions vs. depressions and expansions vs. booms. Our method identifies three economic depressions and one economic boom in the U.S. business cycle, which coincide with important historical episodes, including the Great Depression and the industrial production boom during the Second World War. Interestingly, despite all comparisons between the Great Depression and the recent financial and economic crisis, the so-called “Great Recession”, dated between December 2007 and June 2009 by the NBER, does not qualify as extraordinary in comparison with previous economic downturns. Even though it is the most severe postwar recession, it is dwarfed along every dimension by the prewar depressions. In the second part of the paper, we analyze by means of logistic regressions whether the four business cycle regimes can be predicted using macroeconomic and financial variables. Numerous studies have shown that the slope of the yield curve exhibits good predictive power for future recessions. The related literature also attributes some predictive power to stock market returns, the growth rate of real output and the inflation rate. Is it possible to tell whether the economy is heading toward a recession or a depression using these leading indicators? Our results show that although the slope of the yield curve outperforms other leading indicators in predicting recessions, it does not have anything to say about future economic depressions. In contrast, stock market returns, real output growth, and the inflation rate convey statistically relevant information for predicting economic depressions: a drop in these variables signals a higher likelihood of a future depression. Nicht-technische Zusammenfassung Der Konjunkturzyklus wird traditionell als Abfolge von Rezessionen und Expansionen der gesamtwirtschaftlichen Aktivität beschrieben. Aus der historischen Perspektive gemeinhin bekannt ist jedoch die Tatsache, dass einige Rezessionen viel länger, tiefgreifender und abrupter sind als andere. Daraus erwächst die Frage, ob die binäre Charakterisierung der Schwankungen in der Wirtschaftsentwicklung nicht allzu simplistisch ist. Semantisch wird beispielsweise im wirtschaftspolitischen Diskurs gewöhnlich zwischen wirtschaftlicher Rezession und Depression“ unterschieden. In der Tat haben schwerwiegende Kontraktionen ” der realen gesamtwirtschaftlichen Produktion das Interesse der Wirtschaftswissenschaftler an seltenen wirtschaftlichen Desastern“ erregt. Der binäre Rahmen lässt es aufgrund ” seiner Konstruktionsweise nicht zu, solch seltene und einschneidende Ereignisse gesondert unter die Lupe zu nehmen. Die Hinterfragung der Validität des binären Ansatzes stellt den Ausgangspunkt dieses Forschungspapiers dar. Wir unterscheiden große wirtschaftliche Depressionen und Boomphasen von gewöhnlichen Rezessionen und Aufschwüngen im Konjunkturzyklus mithilfe eines neuen nichtparametrischen Testverfahrens. Anhand von Daten des National Bureau of Economic Research (NBER) zu den Höhe- und Tiefpunkten der Wirtschaftszyklen in den Vereinigten Staaten zwischen 1919 und 2009 berechnen wir Kenngrößen, welche Länge, Tiefe und Ausformung jeder Rezessions- und Aufschwungsphase erfassen. Die Phasen im Wirtschaftszyklus werden dann entsprechend der Dimension dieser Kenngrößen in vier Ausprägungen unterteilt, um zwischen Rezession und Depression bzw. Aufschwung und Boom zu differenzieren. Mit unserer Verfahrensweise identifizieren wir drei wirtschaftliche Depressionen und einen Boom im US-Konjunkturzyklus, die mit wichtigen historischen Zeitabschnitten, so der Großen Depression und dem Industrieboom während des zweiten Weltkriegs, zusammenfallen. Interessant dabei ist die Beobachtung, dass sich ungeachtet aller Vergleiche der Großen Depression mit der letzten Finanz- und Wirtschaftskrise die sogenannte Große Rezession“, die das NBER auf die Zeitspanne von Dezember ” 2007 bis Juni 2009 datiert, in der Gegenüberstellung mit vorangegangen Konjunkturabschwüngen nicht als außergewöhnlich beweist. Obgleich es sich dabei um die schwerste Rezession der Nachkriegszeit handelt, wird sie von den Depressionen der Vorkriegszeit in jeder Dimension um Längen überragt. Im zweiten Teil des Aufsatzes analysieren wir mittels logistischer Regressionen, ob die bei einer solchen Charakterisierung der wirtschaftlichen Schwankungen identifizierten vier Regime unter Zuhilfenahme von Makroökonomischen- und Finanzvariablen vorhersagbar sind. Zahlreiche Studien haben gezeigt, dass das Gefälle der Zinsstrukturkurve gute Prognoseeigenschaften für kommende Rezessionen aufweist. Eine gewisse Prognosekraft misst die einschlägige Fachliteratur auch den Aktienmarktrenditen, der Wachstumsrate der realen Wirtschaftsleistung und der Inflationsrate bei. Ist es mithilfe dieser Frühindikatoren möglich zu erkennen, ob die Wirtschaft auf eine Rezession oder eine Depression zusteuert? Gemäß unseren Ergebnissen besitzt der Verlauf der Zinsstrukturkurve - trotz deren Überlegenheit gegenüber anderen Frühindikatoren bei der Vorhersage von Rezessionen - keinerlei Aussagekraft mit Blick auf heraufziehende Wirtschaftsdepressionen. Demgegenüber liefern Aktienmarktrenditen, das reale Wirtschaftswachstum und die Inflationsrate statistisch relevante Informationen für die Prognose wirtschaftlicher Depressionen: ein Abfall dieser Variablen signalisiert eine höhere Wahrscheinlichkeit, dass in näherer Zukunft eine schwerwiegende Depression im Anmarsch ist. Bundesbank Discussion Paper No 43/2013 Disentangling Economic Recessions and Depressions∗ Bertrand Candelon Maastricht University Norbert Metiu Deutsche Bundesbank Stefan Straetmans Maastricht University Abstract We propose a nonparametric test that distinguishes “depressions” and “booms” from ordinary recessions and expansions. Depressions and booms are defined as coming from another underlying process than recessions and expansions. We find four depressions and booms in the NBER business cycle between 1919 and 2009, including the Great Depression and the World War II boom. Our results suggest that the recent Great Recession does not qualify as a depression. Multinomial logistic regressions show that stock returns, output growth, and inflation exhibit predictive power for depressions. Surprisingly, the term spread is not a leading indicator of depressions, in contrast to recessions. Keywords: Business cycles, Depression, Leading indicators, Multinomial logistic regression, Nonparametric statistics, Outlier. JEL classification: C14, C35, E32. Contact addresses: Bertrand Candelon: Department of Economics, School of Business and Economics, Maastricht University, PO Box 616, 6200 MD, Maastricht, The Netherlands. E-Mail: [email protected]. Norbert Metiu: Corresponding author. Research Centre, Deutsche Bundesbank, Wilhelm-Epstein-Straße 14, 60431 Frankfurt am Main, Germany. Tel.: +49 69 9566 8513. E-Mail: [email protected]. Stefan Straetmans: Department of Finance, School of Business and Economics, Maastricht University. E-Mail: [email protected]. The authors thank Ben Craig, Heinz Herrmann, Malte Knueppel, Christian Matthes, Guido Schultefrankenfeld, and seminar participants at the Deutsche Bundesbank for helpful comments. Discussion Papers represent the authors’ personal opinions and do not necessarily reflect the views of the Deutsche Bundesbank or its staff. ∗ 1 Introduction The business cycle is traditionally modeled as a sequence of recessions and expansions in aggregate economic activity. This binary approach characterizes the regime-switching models of, inter alia, Hamilton (1989) and Potter (1995), as well as the turning-points methodology of Bry and Boschan (1971), King and Plosser (1994) and Harding and Pagan (2002). Moreover, the prediction of business cycles has typically also been approached via binary models, see, e.g., Estrella and Mishkin (1998) and Kauppi and Saikkonen (2008). However, some recessions and expansions have been much stronger than others – e.g., in terms of duration or output growth – throughout the history of the U.S. business cycle. This notion is also reflected by a semantic difference between recessions vs. “depressions” and expansions vs. “booms” often made by policymakers and the popular press. One possibility is that such extreme episodes are tail realizations from the same data generating process (DGP) as ordinary recessions or expansions, which would imply that a two-regime approach delivers correctly specified business cycle models. Alternatively, depressions and booms may be considered as “outliers” arising from a different DGP than recessions and expansions. Neglecting this multi-phase character of the business cycle may distort model specifications of the true DGP. The objective of this paper is therefore to investigate the possibility that the DGP of the business cycle is characterized by more than two regimes. We propose a novel nonparametric outlier detection framework to distinguish economic depressions and booms from ordinary recessions and expansions based on the occurrence of outliers in some popular business cycle characteristics.1 The considered characteristics include the duration, amplitude, cumulated movements, and excess cumulated movements suggested by Harding and Pagan (2002). We classify a phase as depression or boom if at least one of its characteristics exhibits an outlier for that particular time frame relative to other historical episodes. Our outlier detection algorithm results in a four-regime classification of the U.S. business cycle. This enables us to refine some of the conclusions drawn from previous recession prediction exercises conducted within the two-regime framework. The amount of U.S. economic recessions and expansions over the past century is relatively small and the closed form of the statistical distribution of business cycle characteristics is unknown. We therefore propose a distribution-free outlier detection test that combines two results from nonparametric statistics and that performs well in small samples. Our method involves bootstrapping the empirical distribution of the business cycle characteristics and computing the difference between the arithmetic mean and the trimmed mean of each bootstrap sample. This yields a measure of central tendency termed “mean-trimmed mean”. Singh and Xie (2003) show that the mean-trimmed mean displays a multimodal histogram if the original sample contains one or more outliers. We employ the Silverman (1981) test to assess the null hypothesis of the histogram’s unimodality. If the histogram is multimodal, we sequentially remove the most extreme observations from the sample until we end up with a unimodal histogram that is free of outliers. The omitted observations correspond to the outlying business cycle characteristics. Applying this statistical procedure, we identify four depressions and booms of the U.S. business 1 A multitude of outlier definitions co-exist. We adopt the definition that an outlier constitutes an observation that does not arise from the same DGP as the majority of observations in the data set, conform to Barnett and Lewis (1994). 1 cycle over the time horizon 1919 to 2009. These coincide with economically meaningful episodes, like the Great Depression and the industrial production boom during the Second World War. However, despite all comparisons between the Great Depression and the recent financial and economic crisis, our results suggest that the 2007-2009 Great Recession does not qualify as a depression compared to previous historical episodes. There is a growing interest in severe economic contractions in the macroeconomic literature (see, e.g., Kehoe and Prescott, 2002). Large and infrequent economic slumps may require a bolder set of policy interventions than ordinary recessions (see Eggertsson and Krugman, 2012). At the same time, rare “economic disasters”, such as depressions or wars, have been shown to play a role in determining asset risk premia, see, e.g., Barro (2006, 2009), Gabaix (2012), and Wachter (2013). A handful of papers have tried to incorporate such severe episodes into non-linear regime-switching time series models with more than two possible business cycle states, including Tiao and Tsay (1994), Sichel (1994), and Cakmakli, Paap, and van Dijk (2013). Meanwhile, substantial effort has been invested in testing for the actual number of regimes within Markov-switching models, see, e.g., Cho and White (2007), Carter and Steigerwald (2012), and Carrasco, Hu, and Ploberger (2013). To the best of our knowledge, we are the first to identify rare and severe recessions by means of nonparametric outlier detection techniques. Previous studies that deal with outliers in macroeconomic time series include, e.g., Balke and Fomby (1994) and Giordani, Kohn, and van Dijk (2007), but these are parametric in nature. From the preceding literature it is well-known that financial and macroeconomic variables, such as the slope of the yield curve, stock market returns, or real output growth, exhibit some predictive power for future recessions and expansions (see, e.g., Harvey, 1988; Estrella and Hardouvelis, 1991; Estrella and Mishkin, 1998; Birchenhall, Jessen, Osborn, and Simpson, 1999; Hamilton and Kim, 2002; Kauppi and Saikkonen, 2008; Rudebusch and Williams, 2009; Christiansen, 2013). The question arises to what extent these leading indicator properties carry over to the four-regime business cycle classification that emerges from applying our outlier detection test. The predictive ability of financial and macroeconomic variables is of potential importance for policymakers who would like to distinguish between an impending recession and a rare economic disaster. Thus, one would like to determine whether traditional leading indicators of recessions and expansions exhibit a different information content for depressions and booms. In fact, upon applying a multinomial logistic regression model to the four-regime business cycle, we find that the slope of the yield curve preserves its predictive power towards recessions but its leading indicator property vanishes for economic depressions. However, we are able to show that variables like past real output growth, inflation, and stock market returns help to predict extraordinary business cycle fluctuations.2 The remainder of the paper is organized as follows. Section 2 starts with a definition of business cycles and their characteristics. Subsequently, we introduce the nonparametric outlier test. Finally, we report estimated cycle characteristics across the NBER business cycle between 1919 and 2009, and we present the results from the outlier detection procedure. Section 3 provides a short theoretical digression on multinomial logit regressions. Next, the section compares empirical results of binomial with multinomial logit specifica2 Throughout the paper “prediction” refers to in-sample predictive ability, since out-of-sample forecasts of depressions and booms would not make sense given the scant amount of genuine depression and boom phases within the historical sample. 2 tions and univariate with multivariate logit specifications, respectively. Finally, Section 4 provides a summary and conclusions. 2 2.1 The four-regime business cycle Characterizing business cycles Classical studies define business cycle fluctuations by means of turning points (peaks and troughs) in the level of real economic activity (see, e.g., Burns and Mitchell, 1946; Bry and Boschan, 1971; King and Plosser, 1994; Harding and Pagan, 2002). A complete cycle in logarithmic real output yt consists of a recession phase from a peak to the subsequent trough and an expansion phase from a trough to the subsequent peak. We use the National Bureau of Economic Research (NBER) peaks and troughs in economic activity to construct a binary variable Stbin that either reflects a recession phase (Stbin = 1) or an expansion phase (Stbin = 0). The turning point dates published by the NBER represent a consensus chronology of the U.S. business cycle. To assess the relative strength of recessions and expansions, we focus on the unconditional frequency distribution of four commonly used business cycle characteristics: the duration (D), amplitude (A), cumulative movements (C) and excess cumulative movements (E) computed for each business cycle phase. Formal definitions of these characteristics are presented in the appendix. Previous work that characterizes the cyclical behavior of real economic activity by means of these business cycle characteristics include, e.g., Harding and Pagan (2002) and Camacho, Perez-Quiros, and Saiz (2008). Bordo and Haubrich (2010) have employed these characteristics to analyze cycles in money, credit, and output, while others have used the same measures to investigate financial cycles (see, e.g., Pagan and Sossounov, 2003; Claessens, Kose, and Terrones, 2012). 2.2 Testing for depressions and booms In order to distinguish depressions and booms from ordinary business cycle phases, we apply an outlier testing procedure to the previously described business cycle characteristics. A given recession (expansion) episode is classified as an economic depression (boom) if at least one of its four characteristics is an outlier for that period. More precisely, a depression (boom) is identified as follows. Provided that n recessions (expansions) are observed, we have a sample for each recession (expansion) characteristic denoted by X1 , ..., Xn (Xi stands for either Di , Ai , Ci , or Ei ). The observation Xi that corresponds to phase i = 1, ..., n can be seen as a random draw from an unknown cumulative distribution function FX (.). We define phase i as a depression (boom) if Xi is an outlier with respect to the distribution FX (.), i.e., Xi is not distributed according to FX (.). We propose a distribution-free test for the null hypothesis that the sample X1 , ..., Xn is free of outliers. The test combines two established results from nonparametric statistics. First, Singh and Xie (2003) introduce a “Bootlier Plot” to graphically detect the presence of outliers in a data set. The Bootlier Plot is the bootstrap density plot of the sample mean-trimmed mean statistic and its multimodality reflects the presence of outliers in the sample. Second, Silverman (1981) proposes a distribution-free test for the unimodality of a probability density function. Hence, we apply Silverman’s test to Bootlier Plots of phase 3 characteristics in order to detect outliers in these characteristics. If the null hypothesis of unimodality is rejected for the full sample, we proceed with ordering the observations X1 , ..., Xn into ascending order, sequentially dropping observations from the tails of the ordered sample, and repeating Silverman’s test on Bootlier Plots of these shrinking subsamples. We continue the iterative omission of the most extreme observations until the (subsample) Bootlier Plot becomes unimodal. The deleted observations can be identified as outliers.3 Let us now discuss this nonparametric outlier detection procedure in somewhat more detail. The Bootlier Plot is obtained as follows. Let X1b , ..., Xnb (b = 1, 2, ..., B) denote bootstrapped samples of size n based on the original sample X1 , ..., Xn of a particular business cycle characteristic. The resampling is repeated B = 10, 000 times. The meantrimmed mean (MTM) statistic is defined as the difference between the arithmetic mean and the k-trimmed mean of the bth bootstrap sample: MT M b = n n−k X 1X b 1 b X(i) , Xi − n i=1 n − 2k i=k+1 (1) b where X(i) are the ascending order statistics and k is a trimming value. Upon assuming that FX (.) exhibits finite first and second moments, the central limit theorem applies and the pdf of the mean-trimmed mean fM T M (.) converges asymptotically to a standard normal distribution in the absence of outliers. Singh and Xie (2003) show that fM T M (.) can be expressed as a multimodal mixture of normal densities if the original sample X1 , ..., Xn contains at least one outlier. The separation between the normal mixing components arises because only some of the bootstrap samples contain the outliers. As a result, fM T M (.) exhibits one mode associated with the distribution FX (.) and at least another mode corresponding to one or more outliers.4 In order to test the null hypothesis of unimodality of fM T M (.) (absence of outliers in X1 , ..., Xn ) against the alternative hypothesis of multimodality (presence of one or more outliers), we apply the test proposed by Silverman (1981). The test uses as an input the kernel density estimate of the density function fM T M (.). The kernel density estimator of the MTM statistic at any point x can be expressed as: B 1 X x − MT M b f (x, h) = K( ), Bh b=1 h (2) where h is a bandwidth and K(.) is a kernel function chosen to be the standard normal density function following Silverman (1981). For a large class of kernel functions including the standard normal, the number of modes of f (x, h) decreases as the bandwidth h is increased. Thus, a sufficiently large h exists, for which the kernel density f (x, h) has a single mode in the interior of a given closed interval ℑ. The narrowest bandwidth for which 3 By definition, outliers must be located in the upper or lower tails of the ascending order statistics X(1) , X(2) , ...,X(n−1) , X(n) . We sequentially cancel the most extreme observations by considering the subsamples: (X(1) , ..., X(n−1) ), (X(2) , ..., X(n) ), (X(1) , ..., X(n−2) ), (X(2) , ..., X(n−1) ), (X(3) , ..., X(n) ), (X(1) , ..., X(n−3) ), etc. We stop this process once unimodality can no longer be rejected. 4 As recommended by Singh and Xie (2003), we compute the MTM statistic with a trimming value of k = 2. Singh and Xie (2003) show that in the presence of an outlier the separation between the modes of the bootstrap density fMT M (.) is approximately proportional to 1/k independent of the sample size. 4 1.4 3.5 1.2 3.0 kernel density kernel density Figure 1: Bootlier Plots of NBER Recession Durations 1.0 0.8 0.6 0.4 0.2 2.5 2.0 1.5 1.0 0.5 0.0 -0.8 -0.4 0.0 0.4 0.8 1.2 1.6 2.0 0.0 -2.0 -1.6 -1.2 -0.8 -0.4 0.0 0.4 0.8 1.2 1.6 2.0 2.4 bootstrap MTM statistics bootstrap MTM statistics (a) Durations of all recessions (b) Durations without Great Depression Note: Bootlier Plots of durations for NBER business cycle recessions between March 1919 and June 2009 (full sample) and for the subsample that excludes the duration of the Great Depression (40 months). the kernel density estimate is unimodal is called the “critical bandwidth”. Intuitively, the critical bandwidth of a multimodal density should be larger than that of a unimodal density because a larger bandwidth is required to smooth out the multiple modes. This provides a rationale for using the critical bandwidth as test statistic. To implement Silverman’s test, we start with estimating the density of the MTM statistic in Equation (1) using the kernel density estimator in Equation (2). Next, we estimate the critical bandwidth ĥcrit . Let us denote the kernel density estimated with this bandwidth as fˆ(., ĥcrit ). In order to determine the small sample distribution of the critical bandwidth ĥcrit , we draw 1,000 bootstrap samples from the kernel density fˆ(., ĥcrit ). For each draw, the density of the bootstrapped MTM statistic is again estimated using the kernel density estimator defined in Equation (2). Let ĥ∗crit denote the critical bandwidth of the kernel density obtainedfor one bootstrap draw. The null hypothesis of unimodality is rejected if P r ĥ∗crit ≤ ĥcrit ≥ 1 − α, where α is the nominal size. If unimodality of the full-sample Bootlier Plot is rejected, we calculate Bootlier Plots and perform Silverman’s test on subsamples by eliminating the most extreme observations from the tails of the full sample X1 , ..., Xn . We continue shrinking the sample until the null of unimodality can no longer be rejected. Further details on the nonparametric outlier detection procedure, including its finite sample behavior, are reported in a companion paper (see Candelon and Metiu, 2013). The latter paper, inter alia, shows that the size and power properties of the outlier test are satisfactory in small samples like the ones encountered in this paper. As a simple example of how our outlier testing procedure works, consider the durations of NBER business cycle recessions. Figure 1 (a) and Figure 1 (b) report the Bootlier Plots for the full sample and the subsample excluding the duration of the Great Depression phase, respectively. The multimodal plot implies that the Great Depression’s duration is an outlier of the duration’s empirical distribution. This suggests that the 1929-33 down5 Figure 2: The U.S. Business Cycle 4.8 4.4 log real output 4.0 3.6 3.2 2.8 2.4 2.0 1.6 1.2 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 Note: Log levels of U.S. real industrial production, March 1919 - December 2010. NBER recessions are shaded in grey. turn was a genuine depression episode. Upon removing this data point from the historical duration sample, the subsample Bootlier Plot becomes unimodal and Silverman’s test does no longer reject the null of unimodality (p-value of 0.22). This indicates that there are no other outliers left in the empirical distribution of recession durations. 2.3 Depressions and booms in the U.S. business cycle We compute the duration, amplitude, cumulated movements, and excess cumulated movements of the U.S. business cycle using the NBER turning points. Figure 2 plots the log of monthly U.S. real industrial production output between March 1919 and December 2010 with NBER recessions shaded in grey. Table 1 reports values of the characteristics computed for each recession and expansion phase. While on average recessions last about 10 months, the longest recession lasted 40 months from the peak in August 1929 till the trough in March 1933. The severity of recessions also varies significantly across different episodes: the average peak-to-trough amplitude of a recession is -17.6%, while the difference between the amplitude of the most severe (August 1929 - March 1933) and the mildest (March 2001 - November 2001) recession is 68.6 percentage points. Similar patterns emerge for expansions: the difference between the shortest and longest expansion is roughly 9 years, while the trough-to-peak output gains vary between 5.9% and 99.5% over the analyzed period. The varying strength of contractions and expansions suggests that one may gain more insight into the nature of the business cycle by further refining its traditional two-regime classification.5 Figure 3 shows the kernel density estimates of the Bootlier Plots corresponding to each 5 The varying strength of U.S. contractions and expansions has also been studied by, e.g., Neftci (1984), DeLong and Summers (1986), McQueen and Thorley (1993), McKay and Reis (2008), and Morley and Piger (2012). 6 Table 1: U.S. Business Cycle Characteristics Turning points Peak date Jan May Oct Aug May Feb Nov Jul Aug Apr Dec Nov Jan Jul Jul Mar Dec 1920 1923 1926 1929 1937 1945 1948 1953 1957 1960 1969 1973 1980 1981 1990 2001 2007 Duration (months) Trough date Mar Jul Jul Nov Mar Jun Oct Oct May Apr Feb Nov Mar Jul Nov Mar Nov Jun 1919 1921 1924 1927 1933 1938 1945 1949 1954 1958 1961 1970 1975 1980 1982 1991 2001 2009 P-value without outliers Amplitude (%) Cumulation (%) Excess (%) T-to-P P-to-T T-to-P P-to-T T-to-P P-to-T T-to-P P-to-T 10 22 27 21 50 80 37 45 39 24 106 36 58 12 92 120 73 15 11 10 40 10 5 8 7 5 7 8 13 3 13 5 5 15 20.07 45.81 23.21 20.87 72.25 99.52 18.38 38.06 19.54 19.61 55.29 21.05 25.95 5.91 29.00 40.58 14.37 -37.95 -18.92 -5.76 -72.11 -37.03 -33.65 -8.04 -8.85 -12.76 -6.25 -4.15 -13.57 -6.75 -8.91 -4.08 -3.47 -18.52 97.90 509.09 410.78 235.09 2023.92 4964.97 407.15 1049.35 519.24 325.38 3518.74 366.10 943.87 47.82 1731.95 2633.43 545.13 -366.46 -98.40 -21.65 -1742.67 -276.69 -105.90 -46.65 -62.08 -52.63 -33.23 -12.86 -61.14 -20.29 -77.69 -15.17 -14.91 -147.67 -1.25 -0.81 3.18 0.26 3.63 11.68 1.56 3.86 3.29 3.34 5.29 -0.65 3.07 0.78 4.17 1.48 0.18 -4.19 1.38 1.00 -6.61 -7.30 -0.99 -1.31 -3.81 -2.87 -1.18 0.73 2.60 -2.26 -1.18 -0.58 -0.90 0.03 0.50 0.22 0.30 0.29 0.18 0.19 0.07 0.24 Note: NBER peak (P) and trough (T) dates of the U.S. business cycle and corresponding business cycle characteristics (duration, amplitude, cumulated movements and excess cumulated movements). The outlier phase characteristics are shaded in grey. We report the p-values of Silverman’s modality test for the subsamples without outliers. of the four business cycle characteristics. Recall that multimodality of the Bootlier Plot indicates the presence of one or more outlying observations. The Bootlier Plots of the excess measure of expansions and of the duration, amplitude and cumulated movements of recessions exhibit more than one mode. Outlier detection tests performed on the basis of these plots lead to p-values smaller than 1%, indicating a rejection of the null hypothesis of unimodality (no outliers). We iteratively run outlier detection tests to determine the subsamples that are free of outliers. Business cycle phases with outlying characteristics are shaded in grey in Table 1. We detect six outliers via our iterative procedure, one in the excess cumulated movements of expansions, one in recession durations, one in recession amplitudes, and three in the cumulated movements of recessions. At the bottom of Table 1 we also report the p-values of the outlier detection test for the subsamples with unimodal Bootlier Plots. As Figure 3 shows, each Bootlier Plot becomes unimodal once the outliers are removed from the samples. The outlying phase characteristics correspond to four extraordinary episodes in the history of the U.S. business cycle. The expansion that corresponds with an outlying excess measure is relabeled as a boom, while the three recessions that exhibit outlying characteristics are classified as depressions. The first depression occurred between January 1920 - July 1921. Friedman and Schwartz (1963) argue that this deflationary downturn may have been triggered by a negative aggregate demand shock partly caused by restrictive 7 Figure 3: Bootlier Plots Recession Full Sample Without Outliers 0.4 kernel density 0.3 0.2 0.1 3.5 1.2 3.0 1.0 0.8 0.6 0.4 0.2 -3 -2 -1 0 1 2 3 4 5 6 kernel density kernel density 0.01 0.02 0.03 0.04 kernel density kernel density 0.5 0.4 0.3 0.2 0.1 0.0 -0.8 -0.4 0.0 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 kernel density kernel density kernel density 1.2 1.6 2.0 0.0 -2.0 -1.6 -1.2 -0.8 -0.4 0.0 0.4 0.8 1.2 1.6 2.0 2.4 bootstrap MTM statistics 120 100 40 30 20 10 0.01 80 60 40 20 0 -0.024 0.02 3.2 2.8 2.4 2.0 1.6 1.2 0.8 0.4 0.0 -1.2 0.007 -0.016 -0.008 0.000 0.008 500 400 300 200 0 -0.003 -0.002 -0.001 0.000 0.001 0.002 0.003 bootstrap MTM statistics 24 -1.0 -0.8 -0.6 -0.4 -0.2 0.0 0.2 20 16 12 8 4 0 -0.10 -0.08 -0.06 -0.04 -0.02 0.00 0.02 bootstrap MTM statistics 400 300 200 100 0 -0.004 -0.003 -0.002 -0.001 0.000 0.001 0.002 0.003 bootstrap MTM statistics Note: Bootlier Plots graphically represent kernel estimates of the bootstrap density of MTM statistics for each business cycle characteristic. If the full-sample Bootlier Plot is multimodal, we iteratively remove observations from the tails of the sample until the subsample Bootlier Plot becomes unimodal. The left (right) panel shows full-sample and subsample Bootlier Plots for expansions (recessions). The four rows correspond with duration, amplitude, cumulated movements, and excess cumulated movements, respectively. 8 0.016 bootstrap MTM statistics 500 600 100 bootstrap MTM statistics 0.8 bootstrap MTM statistics 700 0.005 0.4 50 bootstrap MTM statistics 0.003 1.0 bootstrap MTM statistics 0.6 0.001 0.0 0 -0.05 -0.04 -0.03 -0.02 -0.01 0.00 0.05 0.7 -0.001 1.5 60 bootstrap MTM statistics 320 280 240 200 160 120 80 40 0 -0.003 2.0 bootstrap MTM statistics bootstrap MTM statistics 40 35 30 25 20 15 10 5 0 -0.02 -0.01 0.00 2.5 0.5 0.0 -0.8 -0.4 kernel density 0.0 1.4 kernel density kernel density 0.5 kernel density Expansion Full Sample Without Outliers 0.04 Figure 4: Depressions and Booms of the U.S. Business Cycle (a) The Great Depression (b) The World War II Boom Note: Figure (a): NBER peak-to-trough depression between August 1929 and March 1933. Figure (b): NBER trough-to-peak boom between June 1938 and February 1945. The excess cumulative movements are shaded in grey. monetary policy between 1920 and 1921. The second depression is the August 1929 March 1933 collapse in real economic activity known as the Great Depression. Figure 4 (a) plots log industrial production from peak to trough. Our results reveal that the Great Depression corresponds to the longest, sharpest, and most abrupt decline in real output, with an amplitude of -72.1% and a cumulative loss of 1742.7% over 40 months. These values are between three to four standard deviations larger than for ordinary recessions. The vast literature on the Great Depression typically describes an adverse interplay between a contraction in aggregate production, systemic banking crises, and tight monetary policy (see e.g., Bernanke, 1983). The third depression that we detect is dated between May 1937 and June 1938. Velde (2009) argues that the 1937-38 episode is a prime example of a double-dip recession triggered by premature policy tightening in the aftermath of a severe recession. Despite multiple comparisons with the Great Depression, the 2007-09 Great Recession is not classified as a depression by our statistical procedure. However, it is undoubtedly the longest (15 months) postwar recession, with the largest amplitude (-18.5%) and cumulative output loss (147.7%).6 The June 1938 - February 1945 wartime expansion constitutes the only genuine boom phase. The WWII boom exhibits by far the largest excess cumulative movements: 11.7% compared to a mean of 2.5% and standard deviation of 3.06%. Figure 4 (b) shows log industrial production from trough to peak. The grey area in the plot corresponds to the measure of excess cumulative movements. This measure reflects the departure of the real output series from a triangular path for which the transition between two consecutive turning points would be linear, it thus conveys information about the shape of the busi6 Notice that the outlier detection procedure does not directly compare the Great Recession with the identified depression episodes. Whether the Great Recession is a depression solely depends on its characteristics compared to other recessions, since the sequential omission of the depression characteristics in the testing algorithm is equivalent to ultimately considering a subsample that does no longer contain the three depression periods. Thus, if the Great Recession was a depression, it should lead to a multimodal Bootlier Plot for this latter subsample, and this is not the case. 9 ness cycle phase. The concave shape of the upswing reflects a sharp surge in aggregate production related to the arms industry, the expansion of productive capacity through government-owned, privately operated capital, and an increase in labor force participation in durable goods manufacturing during World War II (see Braun and McGrattan, 1993). As the impetus of the economic boom abates by the end of the war, the economy approaches its peak at a rather subdued pace, giving rise to the strong concavity observed in the figure. 3 Predicting the four-regime business cycle 3.1 Multinomial logistic regression We examine by means of multinomial logistic regressions whether macroeconomic and financial variables reflect leading information on the four-regime U.S. business cycle. Based on the outlier detection results discussed in the preceding sections, we map the binomial cycle variable into a multinomial variable Stmul that corresponds to the expansion (Stmul = 0), recession (Stmul = 1), boom (Stmul = 2), and depression (Stmul = 3) regimes of the business cycle. Let It represent the information set which contains the past history of an L-dimensional vector of exogenous variables, xt . In a four-state multinomial logit model, the probability that the business cycle is in regime j = 1, 2, 3 at time t + m conditional on It obeys a logistic distribution function: ′ Pr mul St+m = j|It exp(βj xt ) = , P3 ′ 1 + h=1 exp(βh xt ) (3) where m = 3, 6, ..., 24 stands for the prediction horizon (expressed in months). The model is identified by imposing the condition that the expansion regime (j = 0) is the reference state and all coefficients are expressed relative to this regime. Hence, using the fact that the state probabilities must sum to unity, the conditional probability of the expansion regime is given by: 1 mul P r St+m = 0|It = . (4) P3 ′ 1 + h=1 exp(βh xt ) The multinomial logit model nests the binomial model which has been used in much of the preceding literature (e.g., Estrella and Hardouvelis, 1991; Estrella and Mishkin, 1998; Kauppi and Saikkonen, 2008). Estimation is done by maximum likelihood optimization.7 The log-likelihood for a 7 In the business cycle literature probit models are more commonly employed than logit models to predict recessions (e.g., Estrella and Hardouvelis, 1991; Estrella and Mishkin, 1998; Kauppi and Saikkonen, 2008; Christiansen, 2013). However, this approach is less preferable in the multinomial context because evaluating the state probabilities involves calculating high-dimensional integrals of the multivariate normal distribution, which makes the computation of the maximum likelihood estimates very complex (see McCulloch and Rossi, 1994). Therefore, we opt for the logistic approach. Nevertheless, binary probit and logit outcomes for our data were found to lie very close to each other, which suggests that it does not matter whether one assumes a normal density or a logistic density for sake of maximum likelihood optimization. 10 time series consisting of T observations is given by: log (L(β)) = T X 3 X mul mul 1(St+m = j) log(P r(St+m = j|It )), (5) t=1 j=0 where 1(.) is the indicator function. We measure the model’s goodness-of-fit using McFadden (1974)’s pseudo-R2 . For time horizons of m > 1, the prediction horizon exceeds the data frequency, which creates serially correlated logit disturbance terms by construction. We remedy for this overlapping data problem using heteroskedasticity and serial correlation robust standard errors (see also Estrella and Mishkin, 1998). We are interested in the effect of a ceteris paribus change in one of the right-hand-side (RHS) variables on the response probability defined in Equation (3), i.e., the so-called marginal effects.8 The lth marginal effect gives the change in the probability that the business cycle is in state j at time t + m in response to a one unit increase of the lth RHS variable relative to its mean value (l = 1, ..., L): ! P3 ′ mul = j|It ) ∂P r(St+m β exp(β x ) h,l t h mul , (6) = P r(St+m = j|It ) βj,l − h=1 P ′ ∂xl,t 1 + 3h=1 exp(βh xt ) where βj,l is the lth element of βh . Within this multinomial framework, we would like to assess to what extent macroeconomic and financial variables that traditionally exhibit leading indicator properties for recessions and expansions have a different information content for depressions and booms. Anderson (1984) defines a pair of regimes as “indistinguishable” if the RHS variables xt deliver the same prediction for both regimes. If so, the two regimes can be merged into a single regime. More precisely, let β1,l denote the coefficients that correspond to the explanatory variables xl,t in the recession regime relative to the reference (expansion) state, and let β3,l stand for the coefficients of the depression regime relative to the reference state. The null hypothesis that recessions and depressions are indistinguishable with respect to xt is given by β1,l = β3,l for l = 1, ..., L, which can be tested using conventional likelihood-ratio (LR) and Wald statistics. Notice that a rejection of Anderson’s indistinguishability hypothesis would provide further justification for a four-regime business cycle classification emerging from our outlier detection procedure. 3.2 Logistic regression results Potential leading indicator variables are selected in line with previous empirical literature on business cycle prediction (see, e.g., the seminal paper by Estrella and Mishkin, 1998). Taking into acount that our RHS variables need to be available from 1919 onwards, we end up with the term spread (SP READ), the S&P 500 stock index returns (∆ log SP 500), the inflation rate (∆ log P P I), and the growth rate of log industrial production (∆yt ), 8 The majority of related business cycle studies only report coefficient levels β in Equation (3) because the sign of the coefficients and the marginal effects are identical in the binary framework. However, in the multinomial model an explanatory variable’s marginal effect depends on all coefficients, which implies that it does not necessarily exhibit the same sign as the coefficient on the corresponding independent variable. 11 Figure 5: Time Series and the U.S. Business Cycle 5 40 4 30 S&P 500 returns (%) term spread (%) 3 2 1 0 -1 20 10 0 -10 -2 -20 -3 -30 -4 1920 1930 1940 1950 1960 1970 1980 1990 -40 1920 2000 1930 (a) Term Spread 12 real output growth (%) 16 7.5 5.0 PPI inflation (%) 1950 1960 1970 1980 1990 2000 1990 2000 (b) S&P 500 Returns 10.0 2.5 0.0 -2.5 -5.0 8 4 0 -4 -8 -7.5 -10.0 1920 1940 1930 1940 1950 1960 1970 1980 1990 -12 1920 2000 (c) Inflation 1930 1940 1950 1960 1970 1980 (d) Real Output Growth Note: Time series are expressed as monthly percentages between March 1919 and June 2009. NBER recessions are shaded in grey. all at the monthly frequency.9 Data are taken from the St. Louis Fed and from Goyal and Welch (2008). In line with the business cycle data, the sample covers the period from March 1919 until June 2009. Figure 5 shows the data together with the NBER recessions. The figure already provides some casual evidence that the business cycle and the macroeconomic and financial variables are related. The term spread seems to display counter-cyclical dynamics. Although stock returns, inflation, and output growth do not exhibit a clear-cut pattern in the wake of recessions, they tend to decline during the three depressions identified. We distinguish between binomial and multinomial models with either a single RHS variable (univariate model) or multiple RHS variables (multivariate model). Tables 2-8 report estimation results for these four possible logit model specifications. The tables have a comparable structure and information content. Each table considers prediction 9 The term spread is defined as the difference between 10-year U.S. Treasury bonds and 3-month Treasury bills. As inflation variable, we consider the growth rate of the producer price index (PPI), which is a potential leading indicator for real economic activity because it measures the change in selling prices received by domestic producers. We also ran a robustness check with CPI inflation and our results were nearly identical. 12 time horizons of m = 3 up to m = 24 months and reports marginal effects together with robust standard errors and accompanying Z-statistics. Since all regressors are expressed in percentages, the marginal effects can be interpreted as changes in the m-month ahead probability of a recession, depression, expansion or boom, in response to a one-percent rise of a RHS variable. The tables also report log-likelihoods and pseudo-R2 values.10 Table 2 reports estimation results for univariate binomial logistic regressions as a benchmark for comparison with the multinomial results, as well as a robustness check of earlier binomial business cycle studies. In line with previous evidence, we find that the slope of the yield curve is an accurate predictor of future recessions for all considered time horizons. Rising term spreads reduce future recession likelihoods in a statistically significant way. Different theoretical explanations have been launched for this empirical observation. The most popular ones are related to the stance of monetary policy and the information content on expectations regarding future economic prospects reflected in the term spread. Current monetary policy can have a simultaneous impact on both the yield curve and future real activity. For example, an expansionary monetary policy can jointly induce a decline in the short rate – leading to a steeper yield curve – and stimulate future economic activity. Furthermore, the Rational Expectations Hypothesis (REH) of the term structure of interest rates can also contribute to understanding the empirical relation between the yield curve and the business cycle. According to the REH, an upward (downward) sloping yield curve indicates that future short-term interest rates are expected to rise (fall). Hence, given that short-term interest rates are typically pro-cyclical, a positive (negative) term spread signals a future business cycle expansion (recession). We also consider S&P 500 stock index returns in the univariate logit model. The Efficient Markets Hypothesis implies that current stock prices equal the present value of the expected future dividend stream, which in turn reflects expectations about future real economic activity. Thus, in line with what one would expect, rising stock index returns significantly reduce recession likelihoods for time horizons up to one year ahead. Turning to the significance of the macroeconomic variables in the univariate logit regressions, rising real output growth significantly reduces the probability of future recessions for time horizons up to three quarters ahead. The significant outcomes reflect the temporal persistence of economic activity. However, the sign of the coefficient reverses for a time horizon of two years ahead, which suggests a boom-bust cycle in the data. Finally, inflation seems to predict the business cycle only up to six months ahead. Interestingly, a marginal rise in inflation leads to a significant drop in the recession probability up to two quarters in the future. This result is in line with, e.g., Estrella and Hardouvelis (1991) but it contradicts certain structural macroeconomic studies, e.g., Smets and Wouters (2007) who find a negative correlation between current inflation and future output growth. Summarizing the univariate binomial model outcomes, both financial and macro variables impact the recession likelihoods, but the term spread is by far the best (in-sample) predictor of future real economic activity. First of all, the term spread is the only variable that exhibits significant predictive power over all time horizons. Moreover, the absolute 10 The considered models are estimated over the entire sample period. We do not perform out-ofsample prediction exercises for the depression and boom phases of the business cycle because the number of in-sample depressions and booms (four in total) is too low to make a credible out-of-sample prediction assessment. 13 Table 2: Univariate Binomial Logistic Regressions ∂P r(St+m =1) ∂SP READt Z-statistic Pseudo R2 Log-likelihood ∂P r(St+m =1) ∂∆ log SP 500t Z-statistic Pseudo R2 Log-likelihood ∂P r(St+m =1) ∂∆ log P P It Z-statistic Pseudo R2 Log-likelihood ∂P r(St+m =1) ∂∆yt Z-statistic Pseudo R2 Log-likelihood P r(St+m = 1|It ) = F (β0 + β1 SP READt ) m = months ahead 3 6 9 12 15 18 21 24 -0.059 (0.010) −5.98a 0.034 541.493 -0.081 (0.010) −8.21a 0.067 509.912 -0.077 (0.010) −8.09a 0.062 508.471 -0.069 (0.009) −7.37a 0.050 509.971 P r(St+m = 1|It ) = F (β0 + β1 ∆ log SP 500t) m = months ahead 3 6 9 12 15 18 21 24 -0.014 (0.003) −4.97a 0.035 541.144 -0.000 (0.002) -0.080 0.000 546.498 0.002 (0.002) 0.970 0.001 541.378 0.005 (0.002) 2.37b 0.005 534.386 P r(St+m = 1|It ) = F (β0 + β1 ∆ log P P It ) m = months ahead 3 6 9 12 15 18 21 24 -0.0623 (0.014) -4.37a 0.030 544.064 0.016 (0.011) 1.43 0.002 550.097 0.010 (0.011) 0.94 0.001 546.068 0.005 (0.010) 0.53 0.000 541.691 0.0001 (0.010) 0.09 0.000 537.053 P r(St+m = 1|It ) = F (β0 + β1 ∆yt ) m = months ahead 3 6 9 12 15 18 21 24 -0.075 (0.011) −6.66a 0.106 500.941 0.004 (0.006) 0.660 0.000 550.963 0.008 (0.006) 1.250 0.001 545.739 0.011 (0.006) 1.86c 0.003 540.248 0.014 (0.006) 2.48b 0.005 534.297 -0.079 (0.010) −7.58a 0.061 525.523 -0.013 (0.002) −4.72a 0.031 542.547 -0.033 (0.013) -2.50b 0.009 555.090 -0.052 (0.009) −6.03a 0.056 528.805 -0.093 (0.010) −8.86a 0.085 511.739 -0.012 (0.002) −4.86a 0.023 546.257 -0.002 (0.012) -0.14 0.000 559.128 -0.023 (0.007) −3.30a 0.012 552.622 -0.094 (0.010) −8.97a 0.088 506.967 -0.008 (0.002) −3.43a 0.010 550.145 0.017 (0.012) 1.46 0.002 554.639 -0.003 (0.006) -0.460 0.000 555.723 -0.089 (0.010) −8.59a 0.079 507.791 -0.003 (0.002) -1.190 0.001 550.493 Note: Estimation results of univariate binomial logistic regressions with SP READt , ∆ log SP 500t , ∆ log P P It , or ∆yt on the right-hand side. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. 14 Table 3: Multivariate Binomial Logistic Regression P r(St+m = 1|It ) = F (β0 + β1 SP READt + β2 SP 500t + β3 IN F Lt + β4 ∆yt ) m = months ahead 3 6 9 12 15 18 21 24 Z-statistic -0.058 (0.009) −6.21a -0.012 (0.003) −4.10a -0.032 (0.014) −2.36b -0.065 (0.011) −5.83a -0.077 (0.010) −7.50a -0.011 (0.003) −4.29a -0.008 (0.013) -0.66 -0.044 (0.008) −5.37a -0.091 (0.010) −8.77a -0.011 (0.002) −4.94a 0.010 (0.011) 0.98 -0.019 (0.007) −2.72a -0.094 (0.011) −8.91a -0.008 (0.002) −3.40a 0.017 (0.011) 1.64 0.001 (0.007) 0.16 -0.090 (0.010) −8.63a -0.003 (0.003) -0.95 0.014 (0.011) 1.31 0.008 (0.007) 1.07 -0.083 (0.010) −8.37a 0.000 (0.003) 0.05 0.007 (0.010) 0.64 0.012 (0.007) 1.68c -0.080 (0.010) −8.41a 0.003 (0.002) 1.08 0.000 (0.010) 0.04 0.016 (0.006) 2.33b -0.074 (0.009) −7.92a 0.006 (0.002) 2.52b -0.005 (0.010) -0.48 0.018 (0.006) 2.80a Pseudo R2 Log-likelihood 0.175 -462.707 0.142 -480.348 0.117 -493.654 0.101 -499.453 0.083 -505.234 0.071 -507.683 0.069 -504.684 0.065 -502.060 ∂P r(St+m =1) ∂SP READt Z-statistic ∂P r(St+m =1) ∂∆ log SP 500t Z-statistic ∂P r(St+m =1) ∂∆ log P P It Z-statistic ∂P r(St+m =1) ∂∆yt Note: Estimation results of the multivariate binomial logistic regression with SP READt , ∆ log SP 500t, ∆ log P P It , and ∆yt on the right-hand side. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. 15 Table 4: Multinomial Logistic Regression with Term Spread P r(St+m = j|It ) = F (β0 + β1 SP READt ) m = months ahead 3 6 9 12 15 18 21 24 Z-statistic 0.031 (0.010) 3.13a -0.055 (0.008) −7.34a 0.026 (0.003) 7.35a -0.002 (0.007) -0.25 0.047 (0.011) 4.49a -0.072 (0.008) −9.39a 0.027 (0.004) 7.42a -0.003 (0.008) -0.32 0.055 (0.011) 4.97a -0.082 (0.007) −11.05a 0.029 (0.004) 7.49a -0.002 (0.009) -0.22 0.056 (0.011) 4.97a -0.082 (0.007) −10.88a 0.030 (0.004) 7.54a -0.004 (0.010) -0.41 0.052 (0.011) 4.65a -0.079 (0.007) −10.58a 0.030 (0.004) 7.58a -0.004 (0.009) -0.40 0.047 (0.011) 4.31a -0.072 (0.007) −10.46a 0.030 (0.004) 7.61a -0.005 (0.010) -0.52 0.046 (0.010) 4.50a -0.062 (0.007) −9.04a 0.031 (0.004) 7.63a -0.015 (0.008) −1.92c 0.039 (0.010) 3.99a -0.049 (0.007) −6.90a 0.031 (0.005) 7.66a -0.021 (0.006) −3.59a Pseudo R2 Log-likelihood LR: Bo ≡ Ex Wald: Bo ≡ Ex LR: De ≡ Re Wald: De ≡ Re 0.033 -938.930 12.520a 12.190a 12.603a 12.256a 0.054 -917.079 12.121a 11.840a 24.255a 22.995a 0.075 -895.691 12.435a 12.167a 38.657a 35.618a 0.076 -890.054 13.462a 13.150a 34.557a 32.033a 0.068 -889.984 14.892a 14.502a 27.860a 26.188a 0.057 -892.180 16.450a 15.950a 18.881a 18.059a 0.047 -894.103 17.596a 16.975a 3.428c 3.378c 0.040 -892.523 19.322a 18.545a 0.216 0.216 ∂P r(St+m =0) ∂SP READt Z-statistic ∂P r(St+m =1) ∂SP READt Z-statistic ∂P r(St+m =2) ∂SP READt Z-statistic ∂P r(St+m =3) ∂SP READt Note: Estimation results of the univariate multinomial logistic regression with SP READt on the right-hand side. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. We also report the LR and Wald test for the null hypothesis that all coefficients (except intercepts) associated with a given pair of business cycle regimes (boom (Bo) and expansion (Ex) or depression (De) and recession (Re)) are equal (indistinguishability). The test statistics are χ2 (1) distributed under the null. 16 Table 5: Multinomial Logistic Regression with Stock Returns P r(St+m = j|It ) = F (β0 + β1 ∆ log SP 500t ) m = months ahead 3 6 9 12 15 18 21 24 Z-statistic 0.014 (0.003) 4.50a -0.007 (0.002) −3.61a -0.000 (0.002) -0.05 -0.006 (0.001) −4.75a 0.014 (0.003) 4.58a -0.007 (0.002) −4.01a -0.001 (0.002) -0.40 -0.006 (0.001) −3.95a 0.013 (0.003) 4.75a -0.006 (0.002) −3.26a -0.002 (0.002) -0.95 -0.006 (0.001) −4.72a 0.010 (0.003) 3.71a -0.004 (0.002) −2.56a -0.002 (0.002) -1.15 -0.004 (0.001) −2.71a 0.005 (0.002) 2.11b 0.000 (0.001) 0.20 -0.003 (0.002) -1.56 -0.003 (0.002) −1.83c 0.002 (0.003) 0.99 0.002 (0.002) 1.02 -0.002 (0.002) -1.35 -0.002 (0.002) -0.94 -0.000 (0.003) -0.02 0.002 (0.001) 1.37 -0.002 (0.002) -1.15 0.000 (0.002) 0.06 -0.003 (0.003) -1.24 0.004 (0.002) 2.80a -0.002 (0.002) -1.20 0.001 (0.002) 0.61 Pseudo R2 Log-likelihood LR: Bo ≡ Ex Wald: Bo ≡ Ex LR: De ≡ Re Wald: De ≡ Re 0.024 -947.329 0.725 0.725 6.787a 6.689a 0.021 -949.444 1.753 1.770 3.745b 3.772b 0.018 -950.831 3.737b 3.846b 5.856b 5.839b 0.009 -954.210 3.852b 4.025b 1.935 1.978 0.005 -950.140 4.244b 4.493b 3.384c 3.49c 0.003 -943.983 2.764c 2.903c 1.940 1.990 0.002 -936.551 1.722 1.793 0.180 0.181 0.004 -925.964 1.341 1.395 0.143 0.142 ∂P r(St+m =0) ∂∆ log SP 500t Z-statistic ∂P r(St+m =1) ∂∆ log SP 500t Z-statistic ∂P r(St+m =2) ∂∆ log SP 500t Z-statistic ∂P r(St+m =3) ∂∆ log SP 500t Note: Estimation results of the univariate multinomial logistic regression with ∆ log SP 500t on the right-hand side. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. We also report the LR and Wald test for the null hypothesis that all coefficients (except intercepts) associated with a given pair of business cycle regimes (boom (Bo) and expansion (Ex) or depression (De) and recession (Re)) are equal (indistinguishability). The test statistics are χ2 (1) distributed under the null. 17 Table 6: Multinomial Logistic Regression with Inflation P r(St+m = j|It ) = F (β0 + β1 ∆ log P P It ) m = months ahead 3 6 9 12 15 18 21 24 Z-statistic 0.039 (0.015) 2.55b -0.006 (0.015) -0.40 0.007 (0.007) 0.99 -0.040 (0.007) -5.82a 0.018 (0.013) 1.36 0.007 (0.012) 0.61 0.004 (0.007) 0.55 -0.029 (0.007) -4.42a -0.003 (0.012) -0.21 0.020 (0.009) 2.20b 0.000 (0.007) 0.06 -0.018 (0.007) -2.63a -0.018 (0.012) -1.44 0.031 (0.010) 3.24a -0.000 (0.007) -0.01 -0.013 (0.007) -1.80c -.020 (0.012) -1.59 0.032 (0.010) 3.31a 0.002 (0.007) 0.28 -0.014 (0.007) -2.09b -0.019 (0.012) -1.52 0.029 (0.009) 3.15a 0.006 (0.008) 0.74 -0.016 (0.006) -2.75a -0.018 (0.013) -1.43 0.029 (0.009) 3.31a 0.008 (0.008) 1.01 -0.018 (0.005) -3.73a -0.014 (0.013) -1.13 0.024 (0.009) 2.65a 0.007 (0.007) 1.03 -0.018 (0.005) -3.89a Pseudo R2 Log-likelihood LR: Bo ≡ Ex Wald: Bo ≡ Ex LR: De ≡ Re Wald: De ≡ Re 0.038 -933.459 0.104 0.107 40.986a 30.370a 0.017 -952.812 0.054 0.054 23.785a 21.312a 0.007 -962.034 0.008 0.008 12.862a 13.446a 0.007 -955.796 0.047 0.047 11.534a 12.197a 0.008 -938.015 0.255 0.254 12.745a 13.599a 0.009 -947.039 0.832 0.839 15.016a 16.118a 0.011 -927.815 1.315 1.345 18.858a 20.050a 0.010 -920.492 1.212 1.242 16.693a 17.792a ∂P r(St+m =0) ∂∆ log P P It Z-statistic ∂P r(St+m =1) ∂∆ log P P It Z-statistic ∂P r(St+m =2) ∂∆ log P P It Z-statistic ∂P r(St+m =3) ∂∆ log P P It Note: Estimation results of the univariate multinomial logistic regression with ∆ log P P It on the right-hand side. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. We also report the LR and Wald test for the null hypothesis that all coefficients (except intercepts) associated with a given pair of business cycle regimes (boom (Bo) and expansion (Ex) or depression (De) and recession (Re)) are equal (indistinguishability). The test statistics are χ2 (1) distributed under the null. 18 Table 7: Multinomial Logistic Regression with Output Growth P r(St+m = j|It ) = F (β0 + β1 ∆yt ) m = months ahead 3 6 9 12 15 18 21 24 Z-statistic 0.055 (0.010) 5.21a -0.046 (0.008) −5.53a 0.018 (0.004) 4.74a -0.027 (0.004) −6.89a 0.031 (0.008) 3.89a -0.025 (0.006) −4.22a 0.017 (0.004) 4.35a -0.023 (0.004) −6.54a 0.010 (0.007) 1.590 -0.007 (0.004) -1.530 0.011 (0.005) 2.37b -0.014 (0.004) −3.71a -0.008 (0.007) -1.180 0.006 (0.004) 1.142 0.010 (0.005) 2.20b -0.008 (0.004) −1.84c -0.015 (0.008) −1.88c 0.008 (0.005) 1.750c 0.010 (0.005) 2.12b -0.004 (0.005) -0.690 -0.018 (0.008) −2.22b 0.010 (0.005) 1.99b 0.010 (0.005) 2.03b -0.001 (0.005) -0.250 -0.022 (0.008) −2.66a 0.012 (0.005) 2.46b 0.010 (0.005) 2.15b 0.000 (0.004) 0.030 -0.027 (0.009) −3.03a 0.015 (0.005) 3.18a 0.011 (0.005) 2.26b 0.001 (0.004) 0.150 Pseudo R2 Log-likelihood LR: Bo ≡ Ex Wald: Bo ≡ Ex LR: De ≡ Re Wald: De ≡ Re 0.074 -898.841 11.425a 12.304a 12.691a 11.881a 0.048 -923.124 12.560a 13.326a 18.241a 16.590a 0.014 -955.277 5.938b 6.405b 7.543a 7.641a 0.007 -956.439 7.378a 7.899a 5.268b 5.435b 0.005 -949.483 7.697a 8.216a 2.224 2.239 0.005 -941.182 7.532a 8.027a 1.293 1.285 0.007 -931.428 8.924a 9.509a 1.022 1.008 0.009 -920.729 10.511a 11.191a 1.356 1.324 ∂P r(St+m =0) ∂∆yt Z-statistic ∂P r(St+m =1) ∂∆yt Z-statistic ∂P r(St+m =2) ∂∆yt Z-statistic ∂P r(St+m =3) ∂∆yt Note: Estimation results of the univariate multinomial logistic regression with ∆yt on the right-hand side. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. We also report the LR and Wald test for the null hypothesis that all coefficients (except intercepts) associated with a given pair of business cycle regimes (boom (Bo) and expansion (Ex) or depression (De) and recession (Re)) are equal (indistinguishability). The test statistics are χ2 (1) distributed under the null. 19 Table 8: Multivariate Multinomial Logistic Regression ∂P r(St+m =0) ∂SP READt Z-statistic ∂P r(St+m =0) ∂∆ log SP 500t Z-statistic ∂P r(St+m =0) ∂∆ log P P It Z-statistic ∂P r(St+m =0) ∂∆yt Z-statistic ∂P r(St+m =1) ∂SP READt Z-statistic ∂P r(St+m =1) ∂∆ log SP 500t Z-statistic ∂P r(St+m =1) ∂∆ log P P It Z-statistic ∂P r(St+m =1) ∂∆yt Z-statistic ∂P r(St+m =2) ∂SP READt Z-statistic P r(St+m = j|It ) = F (β0 + β1 SP READt + β2 ∆ log SP 500t + β3 ∆ log P P It + β4 ∆yt ) m = months ahead 3 6 9 12 15 18 21 24 0.033 (0.009) 3.47a 0.011 (0.003) 3.76a 0.028 (0.023) 2.08b 0.045 (0.010) 4.35a -0.054 (0.007) −7.27a -0.007 (0.002) −3.30a 0.000 (0.011) 0.01 -0.044 (0.008) −5.41a 0.024 (0.004) 6.70a 0.047 (0.010) 4.59a -0.001 (0.003) -0.23 -0.004 (0.012) -0.31 -0.025 (0.008) −2.97a -0.063 (0.007) −9.05a 0.003 (0.002) 1.71c 0.022 (0.008) 2.71a 0.011 (0.005) 2.24b 0.029 (0.004) 6.79a 0.042 (0.010) 4.32a -0.004 (0.003) -1.45 -0.002 (0.012) -0.14 -0.028 (0.009) −3.22a -0.051 (0.007) −7.19a 0.005 (0.002) 2.93a 0.018 (0.009) 1.99b 0.014 (0.005) 2.78a 0.029 (0.004) 6.82a 0.050 (0.010) 4.67a 0.012 (0.003) 4.19a 0.012 (0.012) 0.99 0.024 (0.008) 3.07a -0.070 (0.008) −9.08a -0.007 (0.002) −3.78a 0.008 (0.009) 0.96 -0.021 (0.006) −3.91a 0.025 (0.004) 6.69a 0.057 (0.011) 5.16a 0.012 (0.003) 4.55a -0.000 (0.012) 0.05 0.005 (0.007) 0.74 -0.081 (0.008) −10.82a -0.004 (0.002) −2.70a 0.017 (0.007) 2.40b -0.006 (0.005) -1.16 0.027 (0.004) 6.64a 0.057 (0.011) 5.07a 0.009 (0.002) 3.67a -0.006 (0.012) -0.50 -0.013 (0.007) −1.83c -0.082 (0.008) −10.72a -0.003 (0.002) -2.01b 0.021 (0.008) 2.83a 0.008 (0.004) 1.70c 0.028 (0.004) 6.63a Note: See next page. 20 0.053 (0.011) 4.80a 0.004 (0.002) 1.79c -0.006 (0.012) -0.50 -0.018 (0.008) −2.34b -0.079 (0.008) −10.36a 0.001 (0.002) 0.85 0.024 (0.008) 3.12a 0.008 (0.005) 1.71c 0.028 (0.004) 6.67a 0.048 (0.011) 4.45a 0.002 (0.002) 0.71 -0.004 (0.012) -0.37 -0.021 (0.008) −2.62a -0.073 (0.007) −10.30a 0.003 (0.002) 1.55 0.022 (0.008) 2.94a 0.010 (0.005) 1.87c 0.029 (0.004) 6.77a Table 8: Multivariate Multinomial Logistic Regression - Continued Z-statistic -0.001 (0.002) -0.44 -0.003 (0.007) -0.38 0.016 (0.004) 4.40a -0.003 (0.005) -0.50 -0.004 (0.001) −2.89a -0.025 (0.006) −4.05a -0.018 (0.004) −4.69a -0.001 (0.002) -0.75 -0.006 (0.007) -0.81 0.015 (0.004) 4.24a -0.003 (0.007) -0.45 -0.004 (0.001) −2.58a -0.015 (0.006) −2.46b -0.018 (0.004) −4.77a -0.002 (0.002) -1.29 -0.007 (0.007) -1.04 0.010 (0.004) 2.42b -0.002 (0.009) -0.27 -0.005 (0.001) −3.86a -0.009 (0.007) -1.25 -0.010 (0.004) −2.37b -0.002 (0.002) -1.52 -0.008 (0.007) -1.06 0.009 (0.004) 2.27b -0.003 (0.009) -0.36 -0.004 (0.002) −2.46b -0.008 (0.007) -1.05 -0.004 (0.005) -0.99 -0.003 (0.001) −2.00b -0.005 (0.007) -0.71 0.008 (0.004) 2.16b -0.002 (0.009) -0.27 -0.003 (0.002) −1.84c -0.012 (0.007) -1.77c 0.001 (0.005) 0.12 -0.002 (0.001) −1.76c -0.001 (0.007) -0.16 0.008 (0.004) 1.88c -0.003 (0.010) -0.34 -0.002 (0.002) -1.01 -0.017 (0.006) −2.77a 0.003 (0.005) 0.70 -0.002 (0.001) -1.55 0.001 (0.008) 0.09 0.008 (0.004) 1.90c -0.013 (0.008) −1.65c 0.000 (0.001) 0.04 -0.019 (0.006) −3.39a 0.005 (0.004) 1.24 -0.002 (0.001) -1.59 0.000 (0.007) 0.03 0.009 (0.004) 2.02b -0.020 (0.006) −3.40a 0.001 (0.002) 0.71 -0.017 (0.005) −3.25a 0.005 (0.004) 1.36 Pseudo R2 Log-likelihood LR: Bo ≡ Ex Wald: Bo ≡ Ex LR: De ≡ Re Wald: De ≡ Re 0.140 -834.393 23.996a 23.963a 52.661a 39.953a 0.121 -852.227 26.089a 26.066a 49.638a 41.496a 0.107 -865.332 22.290a 22.318a 53.525a 46.066a 0.097 -869.637 24.612a 24.618a 49.370a 44.043a 0.086 -872.657 26.575a 26.530a 46.060a 42.218a 0.074 -876.007 25.996a 25.780a 37.487a 35.203a 0.066 -876.350 27.228a 26.845a 22.035a 21.731a 0.062 -871.475 29.928a 29.330a 16.251a 16.782a ∂P r(St+m =2) ∂∆ log SP 500t z-statistic ∂P r(St+m =2) ∂∆ log P P It Z-statistic ∂P r(St+m =2) ∂∆yt Z-statistic ∂P r(St+m =3) ∂SP READt Z-statistic ∂P r(St+m =3) ∂∆ log SP 500t Z-statistic ∂P r(St+m =3) ∂∆ log P P It Z-statistic ∂P r(St+m =3) ∂∆yt Note: Estimation results of the multivariate multinomial logistic regression. The marginal effect (∂P r(.)/∂xl,t ) is the partial derivative of the predicted probability with respect to the RHS variable evaluated at its mean. Robust standard errors are in brackets and the superscripts a , b , and c denote significance at the 1%, 5% and 10% level, respectively. Identification is achieved by normalizing with the coefficients of the expansion state. We provide the model log-likelihood, and goodness-of-fit is measured by McFadden’s pseudo R-squared. We also report the LR and Wald test for the null hypothesis that all coefficients (except intercepts) associated with a given pair of business cycle regimes (boom (Bo) and expansion (Ex) or depression (De) and recession (Re)) are equal (indistinguishability). The test statistics are χ2 (4) distributed under the null. 21 values of the marginal effects in the term-spread regressions and the pseudo-R2 exceed the other variables’ marginal effects and pseudo-R2 values for the majority of considered time horizons. Finally, the term spread is the only leading indicator of which (the absolute value of) the marginal effects and the pseudo-R2 exhibit an inverted U-shape (reaching a maximum value for the annual time horizon); all other marginal effects and pseudo-R2 ’s monotonically decline with the time horizon. Hence, the slope of the yield curve seems to have an “optimal” predictive horizon for recessions that lies between three and five quarters. We also run the multivariate version of the binomial logit regression as a robustness check, see Table 3. The outcomes of the univariate regressions are generally confirmed. The term spread continues to have predictive power for future recessions over the whole forecast horizon and its marginal effects are significant up to two years into the future. Similarly, stock market returns and real output growth preserve their predictive power for horizons of three quarters and one year, respectively. However, the marginal inflation effect is no longer significant for the 6-month horizon. The binomial logistic regression results largely confirm the preceding literature on binary business cycle prediction (e.g., Estrella and Hardouvelis, 1991; Estrella and Mishkin, 1998; Hamilton and Kim, 2002; Kauppi and Saikkonen, 2008). Turning to the multinomial logistic regressions, univariate results are reported in Tables 4-7 for the term spread, S&P 500 returns, the inflation rate, and real output growth, respectively, while multivariate results are summarized in Table 8. The marginal effects are reported across the four business cycle phases and add up to 0 for identification purposes. We also report testing outcomes (LR and Wald) for the null hypothesis that the RHS variables exhibit coefficients that are equal across a regime pair (recession vs. depression or expansion vs. boom). If the null hypothesis cannot be rejected, the two regimes are indistinguishable and can be merged into one regime; in case of rejection, the RHS variables provide different predictions across the two regimes (see Anderson, 1984). The marginal effects of ordinary recessions and expansions are somewhat smaller (in absolute value) in the multinomial regressions compared to the binomial results but remain statistically significant for the same variables and time horizons. Hence, the binary results are robust to adding regimes to the business cycle. Strikingly, however, the term spread fails to predict depressions despite the fact that it preserves predictive power for recessions and expansions. At the same time, pseudo-R2 values and marginal recession effects are largest for the term spread across different time horizons. Just as in the binomial case, the term spread is the only early warning indicator whose marginal effects and pseudo-R2 exhibit an inverted U-shape and a comparable optimal predictive horizon. In contrast to the term spread, the depression probability significantly diminishes when output growth, S&P 500 returns or inflation rise. Specifically, the univariate outcomes show that output growth and stock returns exhibit predictive power for depressions up to the 1-year time horizon. Remarkably, inflation seems to be the most important leading indicator for economic depressions in terms of marginal effects and across time horizons (up to 24 months). Moreover, the inflation rate has a significantly different information content for recessions and depressions across all forecast horizons (see the outcomes of the LR and Wald tests). A ceteris paribus one percent rise in inflation leads to a significant increase in the recession probability between 9 and 24 months ahead, while it leads to a significant drop in the depression probability at virtually every forecast horizon. The ap22 Figure 6: Predicted Probability of Recession and Depression (Multivariate) 1.0 .7 predicted probability of depression predicted probability of recession .8 .6 .5 .4 .3 .2 .1 .0 1920 1930 1940 1950 1960 1970 1980 1990 2000 0.8 0.6 0.4 0.2 0.0 1920 2010 (a) Probability of recession 1930 1940 1950 1960 1970 1980 1990 2000 2010 (b) Probability of depression Note: Predicted probabilities of a recession (Figure (a)) and depression (Figure (b)) occurring three months ahead. The probabilities are estimated from the multivariate multinomial logit model including all leading indicators (SP READ, ∆ log SP 500, ∆ log P P I, and ∆y). Recessions are shaded in light grey and depressions are shaded in dark grey. parently different impact of a rise in inflation on the occurrence of recession vs. depression regimes is somewhat puzzling but the existing literature gives some hints to what may be behind this phenomenon. Using a New Keynesian model on postwar macroeconomic data, Smets and Wouters (2007) show that inflation is primarily driven by price and wage mark-up shocks, inducing a negative correlation between current inflation and future output. This is in line with our positive marginal recession effect for inflation. Notice indeed that most recessions in our sample are situated after World War II. On the other hand, the significantly negative effect of a rise in inflation on the depression probability is in line with the observation that all three (prewar) depressions were deflationary episodes, see Figure 5 (c). As a robustness exercise, we report estimates of a multinomial logit regression that includes all four explanatory variables in Table 8. The variables’ marginal effects preserve the same signs and degrees of statistical significance. Moreover, LR and Wald tests that compare coefficients across regimes lead to the same conclusions as the test results obtained from univariate logit regressions. Figure 6 complements the regression results with graphs of the three-month-ahead predicted probabilities of a recession and depression for the multivariate multinomial logit model. These probability plots provide a qualitative complement to goodness-of-fit measures like the pseudo-R2 . The figure reveals that the leading indicators clearly differentiate between ordinary and extraordinary business cycle phases. The peaks in the predicted recession probability coincide with the recessions shaded in light grey, while the peaks of depression probabilities are associated with depressions shaded in dark grey. The central message from the figure is that depression probabilities exceed recession probabilities during depressions: they are higher than 80% in all three cases. Thus, the multinomial 23 model successfully disentangles economic recessions and depressions.11 4 Conclusions The business cycle is traditionally approached as a sequence of recessions and expansions in real economic activity. However, it is well-known that some recessions and expansions have been much longer, deeper or more abrupt than others. Thus, the question arises whether the binary characterization of the business cycle is not overly simplistic. Vast contractions of real output have indeed stimulated interest in rare economic disasters among economists. The binary framework may be overly restrictive for modeling such rare and severe events. Questioning the validity of the binary approach therefore constitutes the starting point of this paper. We distinguish depression and boom episodes from ordinary recessions and expansions by means of a novel nonparametric framework. Using industrial production data and the peak and trough dates of the NBER business cycle between 1919 and 2009, we compute the duration, amplitude, cumulated movements, and excess cumulated movements of each recession and expansion phase. A nonparametric outlier detection test is applied to the empirical distribution of these business cycle characteristics. The outlier test enables one to distinguish tail events in cycle characteristics arising from the same probability distribution from outlying characteristics generated by a different underlying process. We relabel recessions and expansions in U.S. business cycles as depressions and booms provided at least one of the corresponding cycle characteristics exhibits outlying behavior. The presence of outlying business cycle phases justifies the specification of an econometric model that is richer in dynamics than the binary approach. The re-mapping of the binary NBER business cycle into a four-regime cycle constitutes the first contribution of the paper. We identify four extraordinary episodes: three recessions are relabeled as economic depressions and one expansion is classified as an economic boom. Interestingly, the 2007-09 Great Recession is not identified as a depression by our outlier test procedure. In the second part of the paper, we use multinomial logistic regressions to analyze whether the four-regime business cycle can be predicted using macroeconomic and financial variables. Several key results emerge from our logistic regression analysis. First, we find that recessions become less likely when the term spread, stock market returns, and real output growth increase, regardless of whether one considers binomial or multinomial business cycle phases. Stated otherwise, the predictive power that macroeconomic and financial variables exhibit towards recessions in a binary prediction framework carries over to the multinomial framework. Second, using statistical tests within the multinomial logit framework, depressions are found to be significantly different from recessions. This provides further justification for the four-regime business cycle classification emerging from our outlier detection procedure. Finally, although the slope of the yield curve outperforms other leading indicators in predicting recessions, it does not have anything to say about 11 Interestingly, the indicators signal an impending depression with a probability of about 80% in March 2009, which suggests that the U.S. economy was evolving towards a depression regime at that time. However, the depression probability drops to approximately 10% by June 2009; hence, the economy avoided slipping into depression. 24 future economic depressions. In contrast, a decline in stock market returns, real output growth, or inflation increases the likelihood of a depression. A Appendix: Business cycle characteristics Formally, we observe i = 1, ..., n business cycle phases over a given period t = 1, ..., T (n < T ). First, the duration of the ith recession (expansion) measures the number of months between a peak (trough) occurring at t1 and the next trough (peak) occurring at t2 : t2 X Di = Qt , t=t1 Stbin where Qt = in case of a recession and Qt = 1 − Stbin in case of an expansion. Second, the amplitude of the ith recession (expansion) measures the change in real output yt between a peak (trough) at t1 to the next trough (peak) at t2 : Ai = yt2 − yt1 . Third, the cumulative movements of yt within the ith phase measure the overall cost (gain) of the recession (expansion): Ci = t2 X (yj − yt1 ). j=t1 +1 Finally, the excess cumulative movements measure the curvature of yt within the phase defined as: Ei = (Ci − 0.5Ai − 0.5Ai Di )/Di . The excess cumulated movements provide an approximation to the second derivative of the time series. If Ei > 0, the business cycle phase exhibits a concave shape, i.e., the slope of yt changes abruptly at the beginning of the phase but the changes in slope become more gradual as the phase approaches its turning point. Conversely if Ei < 0, the shape is convex, and the slope changes more gradually at early stages of the phase (see Camacho et al., 2008). References Anderson, J. (1984). Regression and ordered categorical variables. Journal of the Royal Statistical Society, Series B 46 (1), 1–30. Balke, N. and T. Fomby (1994). Large shocks, small shocks, and economic fluctuations: Outliers in macroeconomic time series. Journal of Applied Econometrics 9, 181–200. Barnett, V. and T. Lewis (1994). Outliers in Statistical Data (Third ed.). Wiley Series in Probability and Statistics. John Wiley & Sons. Barro, R. (2006). Rare disasters and asset markets in the Twentieth century. Quarterly Journal of Economics 121 (3), 823–866. 25 Barro, R. (2009). Rare disasters, asset prices, and welfare costs. American Economic Review 99 (1), 243–264. Bernanke, B. (1983). Nonmonetary effects of the financial crisis in the propagation of the Geat Depression. American Economic Review 73 (3), 257–276. Birchenhall, C., H. Jessen, D. Osborn, and P. Simpson (1999). Predicting U.S. businesscycle regimes. Journal of Business and Economic Statistics 17 (3), 313–323. Bordo, M. and J. Haubrich (2010). Credit crises, money and contractions: An historical view. Journal of Monetary Economics 57, 1–18. Braun, R. A. and E. R. McGrattan (1993). The macroeconomics of war and peace. In O. J. Blanchard and S. Fisher (Eds.), NBER Macroeconomics Annual, Chapter 2, pp. 197–247. MIT Press, Cambridge. Bry, G. and C. Boschan (1971). Cyclical Analysis of Time Series: Selected Procedures and Computer Programs. NBER, New York. Burns, A. and W. Mitchell (1946). Measuring Business Cycles. NBER, New York. Cakmakli, C., R. Paap, and D. van Dijk (2013). Measuring and predicting heterogeneous recessions. Journal of Economic Dynamics and Control, http://dx.doi.org/10.1016/j.jedc.2013.06.004. Camacho, M., G. Perez-Quiros, and L. Saiz (2008). Do European business cycles look like one? Journal of Economic Dynamics and Control 32, 2165–2190. Candelon, B. and N. Metiu (2013). A distribution-free test for outliers. Deutsche Bundesbank Discussion Paper No. 02/2013. Carrasco, M., L. Hu, and W. Ploberger (2013). Optimal test for Markov switching parameters. conditionally accepted in Econometrica. Carter, A. and D. Steigerwald (2012). Testing for regime switching: A comment. Econometrica 80 (4), 1809–1812. Cho, J. and H. White (2007). Testing for regime switching. Econometrica 75 (6), 1671– 1720. Christiansen, C. (2013). Predicting severe simultaneous recessions using yield spreads as leading indicators. Journal of International Money and Finance 32, 1032–1043. Claessens, S., M. Kose, and M. Terrones (2012). How do business and financial cycles interact? Journal of International Economics 87, 178–190. DeLong, J. and L. Summers (1986). Are business cycles symmetrical? In R. Gordon (Ed.), The American Business Cycle. Continuity and Change, pp. 166–179. University of Chicago Press, Chicago. Eggertsson, G. and P. Krugman (2012). Debt, deleveraging, and the liquidity trap: A Fisher-Minsky-Koo approach. Quarterly Journal of Economics 127 (3), 1469–1513. 26 Estrella, A. and G. Hardouvelis (1991). The term structure as a predictor of real economic activity. Journal of Finance 46, 555–576. Estrella, A. and F. Mishkin (1998). Predicting U.S. recessions: Financial variables as leading indicators. Review of Economics and Statistics 80, 45–61. Friedman, M. and A. Schwartz (1963). A monetary history of the United States, 18671960. Princeton University Press. Gabaix, X. (2012). Variable rare disasters: An exactly solved framework for ten puzzles in macro-finance. Quarterly Journal of Economics 127, 645–700. Giordani, P., R. Kohn, and D. van Dijk (2007). A unified approach to nonlinearity, structural change, and outliers. Journal of Econometrics 137, 112–133. Goyal, I. and A. Welch (2008). A comprehensive look at the empirical performance of equity premium prediction. Review of Financial Studies 21 (4), 1455–1508. Hamilton, J. and D. Kim (2002). A reexamination of the predictability of economic activity using the yield spread. Journal of Money, Credit and Banking 34 (2), 340–360. Hamilton, J. D. (1989). A new approach to the economic analysis of nonstationary time series and the business cycle. Econometrica 57 (2), 357–384. Harding, D. and A. Pagan (2002). Dissecting the cycle: A methodological investigation. Journal of Monetary Economics 49, 365–381. Harvey, C. (1988). The real term structure and consumption growth. Journal of Financial Economics 22, 305–333. Kauppi, H. and P. Saikkonen (2008). Predicting U.S. recessions with dynamic binary response models. Review of Economics and Statistics 90 (4), 777–791. Kehoe, T. and E. Prescott (2002). Great depressions of the 20th century. Review of Economic Dynamics 5, 1–18. King, R. and C. Plosser (1994). Real business cycles and the test of the Adelmans. Journal of Monetary Economics 33, 405–438. McCulloch, R. and P. Rossi (1994). An exact likelihood analysis of the multinomial probit model. Journal of Econometrics 64, 207–240. McFadden, P. (1974). The measurement of urban travel demand. Journal of Public Economics 3, 303–328. McKay, A. and R. Reis (2008). The brevity and violence of contractions and expansions. Journal of Monetary Economics 55 (4), 738–751. McQueen, G. and S. Thorley (1993). Asymmetric business cycle turning points. Journal of Monetary Economics 31, 341–362. 27 Morley, J. and J. Piger (2012). The asymmetric business cycle. Review of Economics and Statistics 94 (1), 208–221. Neftci, S. (1984). Are economic time series asymmetric over the business cycle? Journal of Political Economy 92, 307–328. Pagan, A. and K. A. Sossounov (2003). A simple framework for analysing bull and bear markets. Journal of Applied Econometrics 18, 233–246. Potter, S. (1995). A nonlinear approach to US GNP. Journal of Applied Econometrics 10, 109–125. Rudebusch, G. and J. Williams (2009). Forecasting recessions: The puzzle of the enduring power of the yield curve. Journal of Business and Economic Statistics 27 (4), 492–503. Sichel, D. (1994). Inventories and the three phases of the business cycle. Journal of Business and Economic Statistics 12, 269–277. Silverman, B. W. (1981). Using kernel density estimates to investigate multimodality. Journal of the Royal Statistical Society, Series B 43 (1), 97–99. Singh, K. and M. Xie (2003). Bootlier Plot - Bootstrap based outlier detection plot. Sankhya: The Indian Journal of Statistics 65 (3), 532–559. Smets, F. and R. Wouters (2007). Shocks and frictions in US business cycles: A Bayesian DSGE approach. American Economic Review 97 (3), 586–606. Tiao, G. and R. Tsay (1994). Some advances in non-linear and adaptive modelling in time-series. Journal of Forecasting 13 (2), 109–131. Velde, F. (2009). The recession of 1937 – A cautionary tale. Federal Reserve Bank of Chicago Economic Perspectives, Quarter 4. Wachter, J. (2013). Can time-varying risk of rare disasters explain aggregate stock market volatility? Journal of Finance 68 (3), 987–1035. 28
© Copyright 2026 Paperzz