ISSN: 1525-3066 Center for Policy Research Working Paper No. 75 ON THE ESTIMATION AND INFERENCE OF A PANEL COINTEGRATION MODEL WITH CROSS-SECTIONAL DEPENDENCE Jushan Bai and Chihwa Kao Center for Policy Research Maxwell School of Citizenship and Public Affairs Syracuse University 426 Eggers Hall Syracuse, New York 13244-1020 (315) 443-3114 | Fax (315) 443-1081 e-mail: [email protected] December 2005 $5.00 Up-to-date information about CPR’s research projects and other activities is available from our World Wide Web site at www-cpr.maxwell.syr.edu. All recent working papers and Policy Briefs can be read and/or printed from there as well. CENTER FOR POLICY RESEARCH – Fall 2005 Timothy Smeeding, Director Professor of Economics & Public Administration __________ Associate Directors Margaret Austin Associate Director, Budget and Administration Douglas Holtz-Eakin Professor of Economics Associate Director, Center for Policy Research Douglas Wolf Professor of Public Administration Associate Director, Aging Studies Program John Yinger Professor of Economics and Public Administration Associate Director, Metropolitan Studies Program SENIOR RESEARCH ASSOCIATES Badi Baltagi............................................ Economics Pablo Beramendi............................Political Science Dan Black............................................... Economics Lloyd Blanchard ..................... Public Administration William Duncombe ................. Public Administration Gary Engelhardt ....................................Economics Deborah Freund .................... Public Administration Madonna Harrington Meyer .....................Sociology Christine Himes........................................Sociology William C. Horrace .................................Economics Bernard Jump ........................Public Administration Duke Kao ...............................................Economics Eric Kingson ........................................ Social Work Thomas Kniesner ..................................Economics Tom Krebs ............................................. Economics Jeff Kubik ...............................................Economics Andrew London ......................................... Sociology Len Lopoo ............................... Public Administration Jerry Miner .............................................. Economics John Moran ............................................. Economics Jan Ondrich............................................. Economics John Palmer ............................ Public Administration Lori Ploutz-Snyder.... Health and Physical Education Grant Reeher ..................................Political Science Stuart Rosenthal ..................................... Economics Ross Rubenstein..................... Public Administration Margaret Usdansky…………………………Sociology Michael Wasylenko ................................. Economics Janet Wilmoth ........................................... Sociology GRADUATE ASSOCIATES Javier Baez ............................................Economics Sonali Ballal………………….. Public Administration Dana Balter ............................Public Administration Jason Bladen .........................................Economics Jesse Bricker .........................................Economics Maria Brown .....................................Social Science Yong Chen .............................................Economics Ginnie Cronin ...................................Social Science Ana Dammert .........................................Economics Mike Eriksen ..........................................Economics Katie Fitzpatrick......................................Economics Alexandre Genest .................. Public Administration Julie Anna Golebiewski ..........................Economics Nadia Greenhalgh-Stanley ....................Economics Yue Hu .................................................... Economics Becky Lafrancois..................................... Economics Joseph Marchand.................................... Economics Larry Miller .............................. Public Administration Wendy Parker ........................................... Sociology Emily Pas ................................................ Economics Shawn Rohlin .......................................... Economics Cynthia Searcy........................ Public Administration Talisha Searcy ........................ Public Administration Jeff Thompson ........................................ Economics Wen Wang .............................................. Economics Yoshikun Yamamoto ............... Public Administration STAFF Kelly Bogart ......................Administrative Secretary Martha Bonney..... Publications/Events Coordinator Karen Cimilluca ........... Librarian/Office Coordinator Kim Desmond …………….Administrative Secretary Kati Foley……………...Administrative Assistant, LIS Kitty Nasto……………….…Administrative Secretary Candi Patterson.......................Computer Consultant Mary Santy……...….………Administrative Secretary On the Estimation and Inference of a Panel Cointegration Model with Cross-Sectional Dependence Jushan Bai Department of Economics New York University New York, NY 10003 USA and Department of Economics Tsinghua University Beijing 10084 China [email protected] Chihwa Kao Center for Policy Research and Department of Economics Syracuse University Syracuse, NY 13244-1020 USA [email protected] First Draft: March 2003 This Draft: May 2, 2005 Abstract Most of the existing literature on panel data cointegration assumes crosssectional independence, an assumption that is difficult to satisfy. This paper studies panel cointegration under cross-sectional dependence, which is characterized by a factor structure. We derive the limiting distribution of a fully modified estimator for the panel cointegrating coefficients. We also propose a continuousupdated fully modified (CUP-FM) estimator. Monte Carlo results show that the CUP-FM estimator has better small sample properties than the two-step FM (2S-FM) and OLS estimators. 1 1 Introduction A convenient but difficult to justify assumption in panel cointegration analysis is crosssectional independence. Left untreated, cross-sectional dependence causes bias and inconsistency estimation, as argued by Andrews (2003). In this paper, we use a factor structure to characterize cross-sectional dependence. Factors models are especially suited for this purpose. One major source of cross-section correlation in macroeconomic data is common shocks, e.g., oil price shocks and international financial crises. Common shocks drive the underlying comovement of economic variables. Factor models provide an effective way to extract the comovement and have been used in various studies.1 Cross-sectional correlation exists even in micro level data because of herd behavior (fashions, fads, and imitation cascades) either at firm level or household level. The general state of an economy (recessions or booms) also affects household decision making. Factor models accommodate individual’s different responses to common shocks through heterogeneous factor loadings. Panel data models with correlated cross-sectional units are important due to increasing availability of large panel data sets and increasing interconnectedness of the economies. Despite the immense interest in testing for panel unit roots and cointegration,2 not much attention has been paid to the issues of cross-sectional dependence. Studies using factor models for nonstationary data include Bai and Ng (2004), Bai (2004), Phillips and Sul (2003), and Moon and Perron (2004). Chang (2002) proposed to use a nonlinear IV estimation to construct a new panel unit root test. Hall et al (1999) considered a problem of determining the number of common trends. Baltagi et al. (2004) derived several Lagrange Multiplier tests for the panel data regression model with spatial error correlation. Robertson and Symons (2000), Coakley et al. (2002) and Pesaran (2004) proposed to use common factors to capture the cross-sectional dependence in stationary panel models. All these studies focus on either stationary data or panel unit root studies rather than panel cointegration. This paper makes three contributions. First, it adds to the literature by suggesting a factor model for panel cointegrations. Second, it proposes a continuous-updated fully modified (CUP-FM) estimator. Third, it provides a comparison for the finite sample properties 1 For example, Stock and Watson (2002), Gregory and Head (1999), Forni and Reichlin (1998) and Forni et al. (2000) to name a few. 2 see Baltagi and Kao (2000) for a recent survey of the OLS, two-step fully modified (2S-FM), CUP-FM estimators. The rest of the paper is organized as follows. Section 2 introduces the model. Section 3 presents assumptions. Sections 4 and 5 develop the asymptotic theory for the OLS and fully modified (FM) estimators. Section 6 discusses a feasible FM estimator and suggests a CUP-FM estimator. Section 7 makes some remarks on hypothesis testing. Section 8 presents Monte Carlo results to illustrate the finite sample properties of the OLS and FM estimators. Section 9 summarizes the findings. The appendix contains the proofs of Lemmas and Theorems. R The following notations are used in the paper. We write the integral R1 0 W (s)ds as W when there is no ambiguity over limits. We define Ω1/2 to be any matrix such that ¡ ¢¡ ¢0 © ¡ 0 ¢ª1/2 Ω = Ω1/2 Ω1/2 . We use kAk to denote tr A A , |A| to denote the determinant of p A, ⇒ to denote weak convergence, → to denote convergence in probability, [x] to denote the largest integer ≤ x, I(0) and I(1) to signify a time-series that is integrated of order zero and one, respectively, and BM (Ω) to denote Brownian motion with the covariance matrix Ω. We let M < ∞ be a generic positive number, not depending on T or n. 2 The Model Consider the following fixed effect panel regression: yit = αi + βxit + eit , i = 1, ..., n, t = 1, ..., T, (1) where yit is 1 × 1, β is a 1 × k vector of the slope parameters, αi is the intercept, and eit is the stationary regression error. We assume that xit is a k × 1 integrated processes of order one for all i, where xit = xit−1 + εit . Under these specifications, (1) describes a system of cointegrated regressions, i.e., yit is cointegrated with xit . The initialization of this system is yi0 = xi0 = Op (1) as T → ∞ for all i. The individual constant term αi can be extended into general deterministic time trends such as α0i + α1i t+, ..., +αpi t or other deterministic component. To model the cross-sectional dependence we assume the error term, eit , follows a factor model (e.g., Bai and Ng, 2002, 2004): 0 eit = λi Ft + uit , 2 (2) where Ft is a r × 1 vector of common factors, λi is a r × 1 vector of factor loadings and uit is the idiosyncratic component of eit , which means ³ ´ 0 0 E (eit ejt ) = λi E Ft Ft λj i.e., eit and ejt are correlated due to the common factors Ft . Remark 1 1. We could also allow εit to have a factor structure such that 0 εit = γ i Ft + ηit . Then we can use 4xit to estimate Ft and γ i . Or we can use eit together with 4xit to estimate Ft , λi and γ i . In general, εit can be of the form εit = γ 0i Ft + τ 0i Gt + ηit , where Ft and Gt are zero mean processes, and ηit are usually independent over i and t. 3 Assumptions Our analysis is based on the following assumptions. Assumption 1 As n → ∞, 1 n Pn i=1 0 λi λi −→ Σλ , a r × r positive definite matrix. ¡ 0 P∞ 0 ¢0 Assumption 2 Let wit = Ft , uit , εit . For each i, wit = Πi (L)vit = j=0 Πij vit−j , P∞ a j=0 j kΠij k < ∞, |Πi (1)| 6= 0 for some a > 1, where vit is i.i.d. over t. In addition, Evit = 0, E(vit vit0 ) = Σv > 0, and Ekvit k8 ≤ M < ∞. Assumption 3 Ft and uit are independent; uit are independent across i. Under Assumption 2, a multivariate invariance principle for wit holds, i.e., the partial P r] sum process √1T [T t=1 wit satisfies: [T r] 1 X √ wit ⇒ B (Ωi ) as T → ∞ for all i, T t=1 3 (3) where ⎡ ⎤ BF ⎢ ⎥ Bi = ⎣ Bui ⎦ . Bεi The long-run covariance matrix of {wit } is given by Ωi = ∞ X j=−∞ ³ ´ 0 E wi0 wij 0 = Πi (1)Σv Πi (1) 0 = Σi + Γi + Γi ⎤ ⎡ ΩF i ΩF ui ΩF εi ⎥ ⎢ = ⎣ ΩuF i Ωui Ωuεi ⎦ , ΩεF i Ωεui Ωεi where Γi = ∞ X j=1 and ⎡ ΓF i ΓF ui ΓF εi ³ ´ 0 ⎢ E wi0 wij = ⎣ ΓuF i Γui ΓεF i ⎡ ΣF i ³ ´ 0 ⎢ Σi = E wi0 wi0 = ⎣ ΣuF i ΣεF i Γεui ⎥ Γuεi ⎦ Γεi ΣF ui ΣF εi Σui Σεui are partitioned conformably with wit . We denote ⎤ ⎤ ⎥ Σuεi ⎦ Σεi n 1X Ω = lim Ωi , n→∞ n i=1 n and 1X Γ = lim Γi , n→∞ n i=1 n 1X Σ = lim Σi . n→∞ n i=1 Assumption 4 Ωεi is non-singular, i.e., {xit } are not cointegrated. 4 (4) Define Ωbi = " ΩF i ΩF ui ΩuF i Ωui # , Ωbεi = " ΩF εi Ωuεi # and Ωb.εi = Ωbi − Ωbεi Ω−1 εi Ωεbi . Then, Bi can be rewritten as " Bi = Bbi Bεi # = " 1/2 Bbi = Vbi = " Vbi Wi 1/2 0 where and −1/2 Ωb.εi Ωbεi Ωεi # Ωεi " " BF Bui VF Vui # # #" Vbi Wi # , (5) , , = BM (I) is a standardized Brownian motion. Define the one-sided long-run covariance ∆i = Σi + Γi ∞ ³ ´ X 0 = E wi0 wij j=0 with ∆i = Remark 2 " ∆bi ∆bεi ∆εbi ∆εi # . 1. Assumption 1 is a standard assumption in factor models (e.g., Bai and Ng 2002, 2004) to ensure the factor structure is identifiable. We only consider nonrandom 0 factor loadings for simplicity. Our results still hold when the λi s are random, provided they are independent of the factors and idiosyncratic errors, and E kλi k4 ≤ M. ¡ 0 ¢ 2. Assumption 2 assumes that the random factors, Ft , and idiosyncratic shocks uit , εit are stationary linear processes. Note that Ft and εit are allowed to be correlated. In particular, εit may have a factor structure as in Remark 1. 5 3. Assumption of independence made in Assumption 3 between Ft and uit can be relaxed following Bai and Ng (2002). Nevertheless, independence is not a restricted assumption since cross-sectional correlations in the regression errors eit are taken into account by the common factors. 4 OLS Let us first study the limiting distribution of the OLS estimator for equation (1). The OLS estimator of β is bOLS = β " n T XX i=1 t=1 0 yit (xit − xi ) #" n T XX i=1 t=1 Theorem 1 Under´Assumptions 1 − ³ 4, we have n √ ³b √ nT β OLS − β − nδ nT ⇒ N 0, 6Ω−1 lim 1 n→∞ n ε n T 0 (xit − xi ) (xit − xi ) Pn ³ i=1 0 #−1 (6) . λi ΩF.εi λi Ωεi + Ωu.εi Ωεi ´o ´ Ω−1 , ε as (n, T → ∞) with → 0 where " n # µ µZ ¶ ¶ µZ ¶ 1 X 0 0 0 1/2 1/2 fi dWi Ω−1/2 fi dWi Ω−1/2 λ ΩF εi Ωεi W + ∆F εi + Ωuεi Ωεi W + ∆uεi δ nT = εi εi n i=1 i " n #−1 T 0 1X 1 X (xit − xit ) (xit − xi ) , n i=1 T 2 t=1 fi = Wi − W R Wi and Ωε = lim n1 n→∞ Pn i=1 Ωεi . Remark 3 It is also possible to construct the bias-corrected OLS by using the averages of the long run covariances. Note " n #µ µ ¶ ¶−1 1 X 0 1 1 1 E [δ nT ] ' λ − ΩF εi + ∆F εi − Ωuεi + ∆uεi Ωε n i=1 i 2 2 6 " n µ ¶ #µ ¶−1 ´ 1 ³ 0 1 X 1 0 = − λi ΩF εi + Ωuεi + λi ∆F εi + ∆uεi Ωε n i=1 2 6 à n µ ¶ !µ ¶−1 n n n 1X 1 1X 1X 0 1X 1 0 = − λi ΩF εi + Ωuεi + λ ∆F εi + ∆uεi Ωε n i=1 2 n i=1 n i=1 i n i=1 6 6 It can be shown by a central limit theorem that √ n (δ nT − E [δ nT ]) ⇒ N (0, B) for some B. Therefore, √ nT √ nT = ³ ´ bOLS − β − √nE [δ nT ] β ³ ´ √ √ b β OLS − β − nδ nT + n (δ nT − E [δ nT ]) ⇒ N (0, A) for some A. 5 FM Estimator bF M . The FM estimator Next we examine the limiting distribution of the FM estimator, β was suggested by Phillips and Hansen (1990) in a different context (non-panel data). The FM estimator is constructed by making corrections for endogeneity and serial correlation to bOLS in (6). The endogeneity correction is achieved by modifying the the OLS estimator β variable yit in (1) with the transformation ³ 0 ´ yit+ = yit − λi ΩF εi + Ωuεi Ω−1 εi ∆xit . The serial correlation correction term has the form −1 ∆+ bεi = ∆bεi − Ωbεi Ωεi ∆εi , # " ∆+ F εi . = ∆+ uεi Therefore, the infeasible FM estimator is eF M = β " n à T X X i=1 t=1 !# " n T #−1 ³ 0 ´ XX 0 + yit+ (xit − xi ) − T λi ∆+ (xit − xi ) (xit − xi ) . F εi + ∆uεi 0 i=1 t=1 (7) eF M . Now, we state the limiting distribution of β 7 Theorem 2 Let Assumptions 1 − 4 hold. Then as (n, T → ∞) with Tn → 0 à ( ) ! n ³ ´ ´ X √ ³e 1 0 nT β F M − β ⇒ N 0, 6Ω−1 lim . λi ΩF.εi λi Ωεi + Ωu.εi Ωεi Ω−1 ε ε n→∞ n i=1 Remark 4 The asymptotic distribution in Theorem 2 is reduced to Ãà ! !! à n ´ X √ ³e 1 lim nT β F M − β ⇒ N 0, 6Ω−1 λ2i ΩF.ε + Ωu.ε ε n→∞ n i=1 if the long-run covariances are the same across the cross-sectional unit i and r = 1. 6 Feasible FM In this section we investigate the limiting distribution of the feasible FM. We will show that the limiting distribution of the feasible FM is not affected when λi , Ωεi , Ωεbi , Ωεi , and ∆εbi are estimated. To estimate λi , we use the method of principal components used in Stock and 0 0 Watson (2002). Let λ = (λ1 , λ2 , ..., λn ) and F = (F1 , F2 , ..., FT ) . The method of principal components of λ and F minimizes T n ´2 1 XX³ 0 V (r) = eit − λi Ft b nT i=1 t=1 where b it yit − α b i − βx b (xit − xi ) , = (yit − y i ) − β eit = b b with a consistent estimator β. Concentrating out λ and using the normalization that ¡ 0 ¡ 0¢ ¢ F F/T = Ir , the optimization problem is identical to maximizing tr F ZZ F , where 0 0 Z = (b e1 , eb2 , ..., b en ) is T × n with b ei = (b ei1 , ebi2 , ..., b eiT ) . The estimated factor matrix, denoted √ by Fb, a T × r matrix, is T times eigenvectors corresponding to the r largest eigenvalues of 0 the T × T matrix ZZ , and b λ 0 = ³ 0 Fb Fb 0 Fb Z = T 8 ´−1 0 Fb Z is the corresponding matrix of the estimated factor loadings. It is known that the solution to the above minimization problem is not unique, i.e., λi and Ft are not directly identifiable but they are identifiable up to a transformation H. For our setup, knowing Hλi is as good 0 as knowing λi . For example in (7) using λi ∆+ F εi will give the same information as using 0 0 0 0 0 0 + −1 + ∆F εi = λi H H −1 ∆+ F εi since ∆F εi is also identifiable up to a transformation, i.e., λi H H 0 λi ∆+ F εi . Therefore, when assessing the properties of the estimates we only need to consider bi and λi . the differences in the space spanned by, say, between λ bi , Fbt , Σ bF M , with λ b i , and Ω b i in place of λi , Ft , Σi , and Ωi , Define the feasible FM, β b β FM = where " n à T X X i=1 t=1 !# " n T #−1 ³ 0 ´ XX 0 b∆ b+ b+ ybit+ (xit − xi ) − T λ (xit − xi ) (xit − xi ) , i F εi + ∆uεi 0 i=1 t=1 ´ ³ 0 b −1 ∆xit . b F εi + Ω b uεi Ω ybit+ = yit − b λi Ω εi b + are defined similarly. b + and ∆ and ∆ uεi F εi Assume that Ωi = Ω for all i. Let ³ 0 ´ e+ = e − λ Ω + Ω Ω−1 it F ε uε i ε ∆xit , it X b+ = 1 b+ , ∆ ∆ bεn n i=1 bεi n and n ∆+ bεn 1X + = ∆ . n i=1 bεi 9 Then ´ √ ³b e nT β F M − β FM (à ! à T !) n T X X 0 0 1 X + b+ = √ − eb+ e+ it (xit − xi ) − T ∆bεn it (xit − xi ) − T ∆bεn nT i=1 t=1 t=1 " #−1 n T 0 1 XX (xit − xi ) (xit − xi ) 2 nT i=1 t=1 à T !# " n ³ ´ ¢ 0 1 X X¡ + + b+ √ e − e+ b = it (xit − xi ) − T ∆bεn − ∆bεn nT i=1 t=1 it " #−1 n T 0 1 XX (xit − xi ) (xit − xi ) . nT 2 i=1 t=1 Before we prove Theorem 3 we need the following lemmas. ´ √ ³b+ + − ∆ Lemma 1 Under Assumptions 1-4 n ∆ bεn bεn = op (1). Lemma 1 can be proved similarly by following Phillips and Moon (1999) and Moon and Perron (2004). Lemma 2 Suppose Assumptions 1-4 hold. There exists an H with rank r such that as (n, T → ∞) (i) µ ¶ n °2 1 X° 1 °b ° °λi − Hλi ° = Op n i=1 δ2nT (ii) let ci (i = 1, 2, ..., n) be a sequence of random matrices such that ci = Op (1) and Pn 1 2 i=1 kci k = Op (1) then n ´0 1 X ³b λi − Hλi ci = Op n i=1 n where δ nT n√ √ o = min n, T . µ 1 δ 2nT ¶ Bai and Ng (2002) showed that for a known b eit that the average squared deviations bi and Hλi vanish as n and T both tend to infinity and the rate of convergence is between λ 10 the minimum of n and T . Lemma 2 can be proved similarly by following Bai and Ng (2002) that parameter estimation uncertainty for β has no impact on the null limit distribution of bi . λ Lemma 3 Under Assumptions 1-4 as (n, T → ∞) and T n ¢ 0 1 XX¡ + √ eit − e+ b it (xit − xi ) = op (1) nT i=1 t=1 √ n T → 0. Then we have the following theorem: √ Theorem 3 Under Assumptions 1-4 and (n, T → ∞) and Tn → 0 ´ √ ³ bF M − β eF M = op (1). nT β In the literature, the FM-type estimators usually were computed with a two-step probOLS . Then, one constructs cedure, by assuming an initial consistent estimate of β, say β b(1) . The 2S-FM, denoted b (1) , and loading, λ estimates of the long-run covariance matrix, Ω i (1) (1) (1) b b b β 2S is obtained using Ω and λi : " n à T µ 0(1) ¶!# X X +(1) (1) 0 +(1) +(1) b b b β yb (xit − xi ) − T b λi ∆ +∆ 2S = it i=1 F εi t=1 " n T XX i=1 t=1 (xit − xi ) (xit − xi ) 0 #−1 . uεi (8) In this paper, we propose a CUP-FM estimator. The CUP-FM is constructed by estimating bF M , Ω bi b and λ parameters and long-run covariance matrix and loading recursively. Thus β are estimated repeatedly, until convergence is reached. In the Section 8, we find the CUPFM has a superior small sample properties as compared with the 2S-FM, i.e., CUP-FM has smaller bias than the common 2S-FM estimator. The CUP-FM is defined as " n à T !# ´ ³ 0³ ´ ³ ´ ³ ´´ X X ³ 0 bCUP = bCU P (xit − xi ) − T λ b β bCU P ∆ bCUP + ∆ bCU P b+ β b+ β β ybit+ β i uεi F εi i=1 t=1 " n T XX i=1 t=1 (xit − xi ) (xit − xi ) 0 #−1 (9) . 11 Remark 5 1. In this paper, we assume the number of factors, r, is known. Bai and Ng (2002) showed that the number of factors can be found by minimizing the following: µ ¶ µ ¶ n+T nT IC(k) = log (V (k)) + k log . nT n+T ³ 0 0 bit = Fbt , u bit , 4 xit 2. Once the estimates of wit , w ( (10) 0 bi Fbt . u bit = ebit − λ Ω was estimated by N , were estimated, we used T n X X 0 b= 1 Σ w bit w bit nT i=1 t=1 to estimate Σ, where X b= 1 Ω n i=1 ´0 T l 1X 1X 0 w bit w bit + T t=1 T τ =1 τl T ³ X t=τ +1 0 0 w bit w bit−τ + w bit−τ w bit ´ ) , (11) b i and where τ l is a weight function or a kernel. Using Phillips and Moon (1999), Σ b i can be shown to be consistent for Σi and Ωi . Ω 7 Hypothesis Testing We now consider a linear hypothesis that involves the elements of the coefficient vector β. We show that hypothesis tests constructed using the FM estimator have asymptotic chi-squared distributions. The null hypothesis has the form: H0 : Rβ = r, (12) where r is a m × 1 known vector and R is a known m × k matrix describing the restrictions. bF M is A natural test statistic of the Wald test using β " ( ) #−1 n ³ 0 ´0 ´ ³ ´ X 1 2³ b 1 −1 bΩ b b b −1 lim b b b b b 6Ω λ Ω + Ω Ω Ω R β − r . W = nT Rβ F M − r λ u.εi εi FM i F.εi i εi ε ε n→∞ n 6 i=1 (13) 12 It is clear that W converges in distribution to a chi-squared random variable with k degrees of freedom, χ2k , as (n, T → ∞) under the null hypothesis. Hence, we establish the following theorem: Theorem 4 If Assumptions 1−4 hold, then under the null hypothesis (12), with (n, T → ∞) , W ⇒ χ2k , Remark 6 1. One common application of the theorem 6 is the single-coefficient test: one of the coefficient is zero; β j = β 0 , h i R = 0 0 ··· 1 0 ··· 0 and r = 0. We can construct a t-statistic ´ √ ³b nT β jF M − β 0 tj = sj where " b −1 s2j = 6Ω ε ( (14) # ) n ´ 1 X ³b 0 b b b b u.εi Ω b εi b −1 , lim λi ΩF.εi λi Ωεi + Ω Ω ε n→∞ n i=1 jj the jth diagonal element of " ( ) # n ³ 0 ´ X 1 bΩ b bb b b b −1 b −1 lim λ Ω 6Ω i F.εi λi Ωεi + Ωu.εi Ωεi ε ε n→∞ n i=1 It follows that tj ⇒ N (0, 1) . (15) 2. General nonlinear parameter restriction such as H0 : h(β) = 0, where h(·) is k ∗ × 1 vector of smooth functions such that ∂h 0 ∂β has full rank k ∗ can be conducted in a similar fashion as in Theorem 6. Thus, the Wald test has the following form ³ ´0 ³ ´ bF M Vb −1 h β bF M Wh = nT 2 h β h where Vbh−1 ⎛ ³ 0 ´⎞ ´⎞ ³ b b ∂h β F M ∂h β FM ⎟ −1 ⎜ b ⎠ ⎝ = Vβ ⎝ ⎠ 0 ∂β ∂β ⎛ 13 and It follows that b −1 Vbβ = 6Ω ε ( ) n ´ 1 X ³b 0 b b b b u.εi Ω b εi b −1 . lim λi ΩF.εi λΩεii + Ω Ω ε n→∞ n i=1 (16) Wh ⇒ χ2k∗ as (n, T → ∞) . 8 Monte Carlo Simulations In this section, we conduct Monte Carlo experiments to assess the finite sample properties of OLS and FM estimators. The simulations were performed by a Sun SparcServer 1000 and an Ultra Enterprise 3000. GAUSS 3.2.31 and COINT 2.0 were used to perform the simulations. Random numbers for error terms, (Ft∗ , u∗it , ε∗it ) were generated by the GAUSS procedure RNDNS. At each replication, we generated an n(T + 1000) length of random numbers and then split it into n series so that each series had the same mean and variance. The first 1, 000 observations were discarded for each series. {Ft∗ } , {u∗it } and {ε∗it } were constructed with Ft∗ = 0, u∗i0 = 0 and ε∗i0 = 0. To compare the performance of the OLS and FM estimators we conducted Monte Carlo experiments based on a design which is similar to Kao and Chiang (2000) yit = αi + βxit + eit 0 eit = λi Ft + uit , and xit = xit−1 + εit for i = 1, ..., n, t = 1, ..., T, where ⎛ ⎞ ⎛ ∗ ⎞ ⎛ ⎞⎛ ∗ ⎞ Ft Ft 0 0 0 Ft−1 ⎜ ⎟ ⎜ ∗ ⎟ ⎜ ⎟⎜ ∗ ⎟ ⎝ uit ⎠ = ⎝ uit ⎠ + ⎝ 0 0.3 −0.4 ⎠ ⎝ uit−1 ⎠ εit ε∗it ε∗it−1 θ31 θ32 0.6 14 (17) with ⎛ Ft∗ ⎞ ⎛⎡ 0 ⎤ ⎡ 1 σ 12 σ 13 ⎤⎞ ⎜ ∗ ⎟ iid ⎜⎢ ⎥ ⎢ ⎥⎟ ⎝ uit ⎠ ∼ N ⎝⎣ 0 ⎦ , ⎣ σ 21 1 σ 23 ⎦⎠ . ε∗it 0 σ 31 σ 32 1 For this experiment, we have a single factor (r = 1) and λi are generated from i.i.d. N (µλ , 1). We let µλ = 0.1. We generated αi from a uniform distribution, U[0, 10], and set β = 2. From Theorems 1-3 we know that the asymptotic results depend upon variances and covariances of Ft, uit and εit . Here we set σ 12 = 0. The design in (17) is a good one since the endogeneity of the system is controlled by only four parameters, θ31 , θ32 , σ 31 and σ 32 . We choose θ 31 = 0.8, θ32 = 0.4, σ 31 = −0.8 and θ 32 = 0.4. The estimate of the long-run covariance matrix in (11) was obtained by using the proce- dure KERNEL in COINT 2.0 with a Bartlett window. The lag truncation number was set arbitrarily at five. Results with other kernels, such as Parzen and quadratic spectral kernels, are not reported, because no essential differences were found for most cases. Next, we recorded the results from our Monte Carlo experiments that examined the bOLS in (6), (b) the 2S-FM estimator, finite-sample properties of (a) the OLS estimator, β b2S , in (8), (c) the two-step naive FM estimator, β bb , proposed by Kao and Chiang (2000) β FM bCU P , in (9) and (e) the CUP and Phillips and Moon (1999), (d) the CUP-FM estimator β bd which is similar to the two-step naive FM except the iteration naive FM estimator β FM goes beyond two steps. The naive FM estimators are obtained assuming the cross-sectional independence. The maximum number of the iteration for CUP-FM estimators is set to 20. The results we report are based on 1, 000 replications and are summarized in Tables 1 - 4. All the FM estimators were obtained by using a Bartlett window of lag length five as in (11). the´ Monte and ´standard ³ Table 1´ reports ³ ³ b Carlo ´ means ³ ³ d deviations ´ (in parentheses) of bOLS − β , β b2S − β , β b b b β F M − β , β CU P − β ,and β F M − β for sample sizes T = n bOLS , decrease at a rate of T . For example, = (20, 40, 60) . The biases of the OLS estimator, β with σ λ = 1 and σ F = 1, the bias at T = 20 is − 0.045, at T = 40 is −0.024, and at T = 60 is −0.015. Also, the biases stay the same for different values of σ λ and σ F . TABLE 1 ABOUT HERE 15 While we expected the OLS estimator to be biased, we expected FM estimators to produce better estimates. However, it is noticeable that the 2S-FM estimator still has a downward bias for all values of σ λ and σ F , though the biases are smaller. In general, the 2S-FM estimator presents the same degree of difficulty with bias as does the OLS estimator. This is probably due to the failure of the nonparametric correction procedure. In contrast, the results in Table 1 show that the CUP-FM, is distinctly superior to the OLS and 2S-FM estimators for all cases in terms of the mean biases. Clearly, the CUP-FM outperforms both the OLS and 2S-FM estimators. TABLE 2 ABOUT HERE It is important to know the effects of the variations in panel dimensions on the results, since the actual panel data have a wide variety of cross-section and time-series dimensions. Table 2 considers 16 different combinations for n and T , each ranging from 20 to 120 with σ 31 = −0.8, σ 21 = −0.4, θ31 = 0.8, and θ 21 = 0.4. First, we notice that the cross-section dimension has no significant effect on the biases of all estimators. From this it seems that in practice the T dimension must exceed the n dimension, especially for the OLS and 2S-FM estimators, in order to get a good approximation of the limiting distributions of the estimators. For example, for OLS estimator in Table 2, the reported bias, −0.008, is substantially less for (T = 120, n = 40) than it is for either (T = 40, n = 40) , (the bias is −0.024), or (T = 40, n = 120) , (the bias is −0.022). The results in Table 2 again confirm the superiority of the CUP-FM. TABLE 3 ABOUT HERE Monte Carlo means and standard deviations of the t-statistic, tβ=β 0 , are given in Table 3. Here, the OLS t-statistic is the conventional t-statistic as printed by standard statistical √ packages. With all values of σ λ and σ F with the exception σ λ = 10, the CUP-FM t-statistic is well approximated by a standard N(0,1) suggested from the asymptotic results. The CUPFM t-statistic is much closer to the standard normal density than the OLS t-statistic and the 2S-FM t-statistic. The 2S-FM t-statistic is not well approximated by a standard N(0,1). 16 TABLE 4 ABOUT HERE Table 4 shows that both the OLS t-statistic and the FM t-statistics become more negatively biased as the dimension of cross-section n increases. The heavily negative biases of the 2S-FM t-statistic in Tables 3-4 again indicate the poor performance of the 2S-FM estimator. For the CUP-FM, the biases decrease rapidly and the standard errors converge to 1.0 as T increases. b in (11) may be It is known that when the length of time series is short the estimate Ω sensitive to the length of the bandwidth. In Tables 2 and 4, we first investigate the sensitivity of the FM estimators with respect to the choice of length of the bandwidth. We extend the experiments by changing the lag length from 5 to other values for a Barlett window. Overall, the results (not reported here) show that changing the lag length from 5 to other values does not lead to substantial changes in biases for the FM estimators and their t-statistics. 9 Conclusion A factor approach to panel models with cross-sectional dependence is useful when both the time series and cross-sectional dimensions are large. This approach also provides significant reduction in the number of variables that may cause the cross-sectional dependence in panel data. In this paper, we study the estimation and inference of a panel cointegration model with cross-sectional dependence. The paper contributes to the growing literature on panel data with cross-sectional dependence by (i) discussing limiting distributions for the OLS and FM estimators, (ii) suggesting a CUP-FM estimator and (iii) investigating the finite sample proprieties of the OLS, CUP-FM and 2S-FM estimators. It is found that the 2S-FM and OLS estimators have a non-negligible bias in finite samples, and that the CUP-FM estimator improves over the other two estimators. Acknowledgements We thank Badi Baltagi, Yu-Pin Hu, Giovanni Urga, Kamhon Kan, Chung-Ming Kuan, Hashem Pesaran, Lorenzo Trapani and Yongcheol Shin for helpful comments. We also thank 17 seminar participants at Academia Sinica, National Taiwan University, Syracuse University, Workshop on Recent Developments in the Econometrics of Panel Data in London, March 2004 and the European Meeting of the Econometric Society in Madrid, August 2004 for helpful comments and suggestions. Jushan Bai acknowledges financial support from the NSF (grant SES-0137084). Appendix Let BnT = " n T XX i=1 t=1 Note 0 (xit − xi ) (xit − xi ) # . ´ √ ³b nT β OLS − β h√ P ³ P ´i £ ¤−1 0 1 1 = n n1 ni=1 T1 Tt=1 eit (xit − xi ) B n T 2 nT £√ 1 Pn ¤£ P ¤−1 = n n i=1 ζ 1iT n1 ni=1 ζ 2iT √ = nξ 1nT [ξ 2nT ]−1 , P P P P 0 0 where xi = T1 Tt=1 xit , y i = T1 Tt=1 yit , ζ 1iT = T1 Tt=1 eit (xit − xi ) , ζ 2iT = T12 Tt=1 (xit − xi ) (xit − xi ) , P P ξ 1nT = n1 ni=1 ζ 1iT , and ξ 2nT = n1 ni=1 ζ 2iT . Before going into the next theorem, we need to consider some preliminary results. P Define Ωε = lim n1 ni=1 Ωεi and n→∞ " n # µZ ¶ ¶ µZ ¶ X 0µ 1 0 0 −1/2 fi dWi Ω1/2 + ∆F εi + Ωuεi Ω−1/2 fi dWi Ω1/2 + ∆uεi . θn = λ ΩF.εi Ωεi W W εi εi εi n i=1 i If Assumptions 1 − 4 hold, then Lemma 4 (a) As (n, T → ∞) , (b) As (n, T → ∞) with √ n à n T 1 1 p 1 BnT → Ωε . 2 nT 6 → 0, T n 0 1 1 XX eit (xit − xi ) − θn n T i=1 t=1 ! ⇒N 18 à ! n o 1 1 Xn 0 0, lim λi ΩF.εi λi Ωεi + Ωu.εi Ωεi 6 n→∞ n i=1 Proof. (a) and (b) can be shown easily by following the Theorem 8 in Phillips and Moon (1999). A Proof of Theorem 1 Proof. Recall that ⎡ P ³ ³ ´ ´ ⎤ 0 0 −1/2 R f 1/2 n Wi dWi Ωεi + ∆F εi √ √ 1 i=1 λi ΩF εi Ωεi b ³ ´ ⎦ nT β nn ⎣ OLS − β − 0 −1/2 R f 1/2 +Ωεui Ωεi Wi dWi Ωεi + ∆εui £1 1 ¤−1 B n⎡ T 2 nT ⎧ ³ ³ ´ ´ ⎫⎤ 0 0 −1/2 R f 1/2 ⎨ Wi dWi Ωεi + ∆F εi ⎬ £ 1 Pn ζ 1iT − λi ΩF εi Ωεi ¤−1 √ 1 Pn ⎣ ³ ´ ⎦ = n n i=1 ζ R 2iT i=1 0 −1/2 ⎩ fi dWi Ω1/2 + ∆uεi ⎭ n −Ωuεi Ωεi W εi £√ 1 Pn ∗ ¤ £ 1 Pn ¤−1 = n n i=1 ζ 1iT n i=1 ζ 2iT √ = nξ ∗1nT [ξ 2nT ]−1 , ³ where ζ ∗1iT = ζ 1iT and ´ µ µZ ¶ ¶ µZ ¶ 0 0 −1/2 1/2 −1/2 fi dWi Ω + ∆F εi − Ωuε Ω fi dWi Ω1/2 + ∆uε − λi ΩF εi Ωεi W W ε ε εi 0 n ξ ∗1nT 1X ∗ = ζ . n i=1 1iT First, we note from Lemma 4 (b) that ! à n o √ ∗ 1 1 Xn 0 λi ΩF.εi λi Ωεi + Ωu.εi Ωεi nξ 1nT ⇒ N 0, lim 6 n→∞ n i=1 as (n, T → ∞) and √ ∗ nξ 1nT Hence, n T → 0. Using the Slutsky theorem and (a) from Lemma 4, we obtain à ( ) ! n ³ ´ X 1 0 [ξ 2nT ]−1 ⇒ N 0, 6Ω−1 lim λi ΩF.εi λi Ωεi + Ωu.εi Ωεi Ω−1 . ε ε n→∞ n i=1 ´ √ √ ³b nT β OLS − β − nδ nT ³ n ´o ´ Pn ³ 0 1 −1 ⇒ N 0, 6Ω−1 lim λ Ω λ Ω + Ω Ω Ω , u.εi εi ε ε i F.εi i εi i=1 n n→∞ 19 (18) proving the theorem, where " n # µ µZ ¶ ¶ µZ ¶ 1 X 0 0 0 −1/2 1/2 −1/2 1/2 fi dWi Ω + ∆F εi + Ωuεi Ω fi dWi Ω + ∆uεi δ nT = λ ΩF.εi Ωεi W W εi εi εi n i=1 i ∙ ¸−1 1 1 BnT . n T2 Therefore, we established Theorem 1. B Proof of Theorem 2 Proof. Let Fit+ = Ft − ΩF εi Ω−1 εi εit , and −1 u+ it = uit − Ωuεi Ωεi εit . The FM estimator of β can be rewritten as follows h ³P ³ 0 ´´i 0 T + + + eF M = Pn y (x − x ) − T λ ∆ + ∆ B −1 β it i i=1 hP t=1 ³Pit ³ 0 ´ i F εi 0 uεi ³ 0 nT ´´i n T + + + + −1 = β+ λ F + u (x − x ) − T λ ∆ + ∆ BnT . it i i it i F εi it uεi i=1 t=1 (19) ³ ´ eF M − β by √nT First, we rescale β ´ ´ i£ ¤−1 √ ³e √ 1 Pn 1 PT h³ 0 + 0 0 + + + nT β F M − β = n n i=1 T t=1 λi Fit + uit (xit − xi ) − λi ∆F εi − ∆uεi n1 T12 BnT ¤−1 £√ 1 Pn ∗∗ ¤ £ 1 Pn = n n i=1 ζ 1iT n i=1 ζ 2iT √ ∗∗ = nξ 1nT [ξ 2nT ]−1 , (20) h³ 0 ´ i P P 0 0 T n ∗∗ ∗∗ + + 1 1 where ζ ∗∗ λi Fit+ + u b+ 1iT = T it (xit − xi ) − λi ∆F εi − ∆uεi , and ξ 1nT = n t=1 i=1 ζ 1iT . Modifying the Theorem 11 in Phillips and Moon (1999) and Kao and Chiang (2000) we can show that as (n, T → ∞) with √ n à n T →0 T n ´ 0 1 1 XX³ 0 + 0 + λ F (xit − xi ) − λi ∆F εi n T i=1 t=1 i it 20 ! ⇒N à n 1 1X 0 0, lim λ ΩF.εi λi Ωεi 6 n→∞ n i=1 i ! and √ n à ! à ! T n n ´ X 0 1 1 1 1 XX³ + u b (xit − xi ) − ∆+ ⇒ N 0, lim Ωu.εi Ωεi uεi n T i=1 t=1 it 6 n→∞ n i=1 and combing this with the Assumption 4 that Ft and uit are independent and Lemma 4(a) yields ´ √ ³e nT β F M − β ⇒ N à 0, 6Ω−1 ε ( ) ! n ´ 1 X³ 0 lim λi ΩF.εi λi Ωεi + Ωu.εi Ωεi Ω−1 ε n→∞ n i=1 as required. C Proof of Lemma 3 0 bΩ b F ε is estimating H −10 Ω b F ε . Thus λ b Proof. We note that λi is estimating Hλi , and Ω i F ε is 0 estimating λi ΩF ε , which is the object of interest. For the purpose of notational simplicity, we shall assume H being a r × r identify matrix in our proof below. From ³ 0 ´ b b b b −1 ∆xit eb+ = e − λ Ω + Ω Ω it F ε uε i ε it and e+ it ³ 0 ´ = eit − λi ΩF ε + Ωuε Ω−1 ε ∆xit , hn³ 0 ´ ³ 0 ´ o i −1 −1 bΩ b b b λ + Ω Ω − λ Ω + Ω Ω ∆x uε uε it ε ε i Fε i Fε hn 0 o i 0 −1 −1 −1 b F εΩ b −1 b b = − b λi Ω − λ Ω Ω + Ω Ω − Ω Ω ∆x . F ε uε uε it ε ε ε ε i + eb+ it − eit = − Then, = T n ´ 0 1 1 X X ³ b b −1 −1 √ Ωuε Ωε − Ωuε Ωε ∆xit (xit − xi ) n T i=1 t=1 ³ T n X ´ 1 1X 0 −1 −1 b b Ωuε Ωε − Ωuε Ωε √ ∆xit (xit − xi ) n T i=1 t=1 = op (1)Op (1) = op (1) 21 because and Thus b uε Ω b −1 − Ωuε Ω−1 = op (1) Ω ε ε T n 0 1 1 XX √ ∆xit (xit − xi ) = Op (1). n T i=1 t=1 T n ¢ 0 1 XX¡ + √ eit − e+ b it (xit − xi ) nT i=1 t=1 n T ´ ´ o i ³ 0 0 1 X 1 X hn³ 0 −1 bΩ b b b λi ΩF ε + Ωuε Ω−1 Ω ∆x = √ − λ + Ω uε it (xit − xi ) ε ε i Fε n i=1 T t=1 T n ´ 0 0 1 1 XX³ 0 −1 −1 b b b √ λi ΩF ε Ωε − λi ΩF ε Ωε ∆xit (xit − xi ) = n T i=1 t=1 T n ´ 0 1 1 XX³ −1 −1 b b +√ Ωuε Ωε − Ωuε Ωε ∆xit (xit − xi ) n T i=1 t=1 T n ´ 0 0 1 1 XX³ 0 −1 bΩ b b λi ΩF ε Ω−1 ∆xit (xit − xi ) + op (1). = √ − λ Ω F ε i ε ε n T i=1 t=1 The remainder of the proof needs to show that T n ´ 0 0 1 1 XX³ 0 −1 bΩ b b √ λi ΩF ε Ω−1 − λ Ω ∆xit (xit − xi ) = op (1) . ε i Fε ε n T i=1 t=1 22 b b b −1 We write A for ΩF ε Ω−1 ε and A for ΩF ε Ωε respectively and then T n ´ 0 0 1 1 XX³ 0 −1 bΩ b b √ − λ Ω λi ΩF ε Ω−1 ∆xit (xit − xi ) i Fε ε ε n T i=1 t=1 T n ´ 0 0 1 1 XX³ 0 bA b = √ ∆xit (xit − xi ) λi A − λ i n T i=1 t=1 T n ´ ³ i 0´ 1 1 XXh 0 ³ b + λ0i − b b ∆xit (xit − xi )0 = √ λi A − A λi A n T i=1 t=1 T n ´ 0 1 1 XX 0 ³ b √ = λi A − A ∆xit (xit − xi ) n T i=1 t=1 T n 0 1 1 X X ³ 0 b0 ´ b +√ λi − λi A∆xit (xit − xi ) n T i=1 t=1 = I + II, say. Term I is a row vector. Let Ij be the jth component of I. Let an identity matrix so that j j 0 be the jth column of = (0, ..., 0, 1, 0, ...0) . Left multiply I by j to obtain the jth component, which is scalar and thus is equal to its trace. That is Ij " ³ ´ b A−A à T n 0 1 1 XX 0 √ = tr λi ∆xit (xit − xi ) n T i=1 t=1 i h b p (1) = tr (A − A)O j λi !# = op (1)Op (1) = op (1) Pn PT 0 0 b λ ∆x (x − x ) it it i j λi = Op (1) and A − A = op (1). i i=1 t=1 P 0 b 1 n ∆xit (xit − xi ) . Then ci = Op (1) and Next consider II. Let ci = A i=1 T because √1 1 nT 23 1 n Pn i=1 kci k2 = Op (1), thus by Lemma 2 (ii), we have II ≤ = = = = = since (n, T → ∞) and √ n T ¯ ¯ n ³ ¯ 0´ √ ¯¯ 1 X 0 b ci ¯¯ n¯ λi − λ i ¯n ¯ i=1 µ ¶ √ 1 nOp 2 δ µ nT ¶ √ 1 nOp min[n, T ] µ ¶ √ n Op min {n, T } µ µ√ ¶ ¶ 1 n Op √ + Op n T op (1) → 0. This establishes T n ´ 0 0 1 1 XX³ 0 −1 −1 b b b √ λi ΩF ε Ωε − λi ΩF ε Ωε ∆xit (xit − xi ) = op (1). n T i=1 t=1 and proves Lemma 3. References [1] Andrews, D. W. K., 2003. Cross-section regression with common shocks. forthcoming in Econometrica. [2] Bai, J., 2003. Inferential theory for factor models of large dimensions. Econometrica 71, 135-171. [3] Bai, J., 2004. Estimating cross-section common stochastic trends in nonstationary panel data. Journal of Econometrics 122, 137-183. [4] Bai, J., and Ng, S., 2002. Determining the number of factors in approximate factor models. Econometrica 70, 191-221. [5] Bai, J., and Ng, S., 2004. A panic attack on unit roots and cointegration. Econometrica 72, 1127-1177. 24 [6] Baltagi, B., and Kao, C., 2000. Nonstationary panels, cointegration in panels and dynamic panels: A survey. Advances in Econometrics 15, 7-51. [7] Baltagi, B., Song, S. H., and Koh, W., 2004. Testing panel data regression models with spatial error correlation. Journal of Econometrics 117, 123-150. [8] Chang, Y., 2002. Nonlinear IV unit root tests in panels with cross-sectional dependency. Journal of Econometrics 116, 261-292. [9] Coakley, J., Fuerts, A., and Smith, R. P., 2002. A principal components approach to cross-section dependence in panels. Manuscript, Birckbeck College, University of London. [10] Forni, M. and Reichlin, L., 1998. Let’s get real: a factor-analytic approach to disaggregated business cycle dynamics. Review of Economic Studies 65, 453-473. [11] Forni, M., Hallin, M., Lippi, M., and Reichlin, L., 2000. Reference cycles: The NBER methodology revisited. CEPR Discussion Paper 2400. [12] Gregory, A., and Head, A., 1999. Common and country-specific fluctuations in productivity, investment, and the current account. Journal of Monetary Economics 44, 423-452. [13] Hall, S. G., Lazarova, S., and Urga, G., 1999. A principle component analysis of common stochastic trends in heterogeneous panel data: Some monte carlo evidence. Oxford Bulletin of Economics and Statistics 61, 749-767. [14] Kao, C., and Chiang, M. H., 2000. On the Estimation and Inference of a Cointegrated Regression in Panel Data. Advances in Econometrics 15, 179-222. [15] Moon, H. R., and Perron, B., 2004. Testing for a unit root in panels with dynamic factors. Journal of Econometrics 122, 81-126. [16] Pesaran, H., 2004. Estimation and inference in large heterogenous panels with a multifactor error structure. Manuscript, Trinity College, Cambridge. 25 [17] Phillips, P. C. B., and Hansen, B. E., 1990. Statistical inference in instrumental variables regression with I(1) processes. Review of Economic Studies 57, 99-125. [18] Phillips, P. C. B., and Moon, H., 1999. Linear regression limit theory for nonstationary panel data,” Econometrica 67, 1057-1111. [19] Phillips, P. C. B., and Sul, D., 2003. Dynamic panel estimation and homogeneity testing under cross section dependence. Econometric Journal 6, 217-259. [20] Robertson, D., and Symon, J., 2000. Factor residuals in SUR regressions: Estimating panels allowing for cross sectional correlation. Manuscript, Faculty of Economics and Politics, University of Cambridge. [21] Stock, J. H., and Watson, M. W., 2002. Forecasting Using Principal Components from a Large Number of Predictors. Journal of the American Statistical Association 97 , 1167—1179. 26 Table 1: Means Biases and Standard Deviation √ of OLS and FM Estimato σλ = 1 σ λ = 10 σF = 1 T=20 T=40 T=60 √ σ F = 10 T=20 T=40 T=60 √ σ F = 0.5 T=20 OLS FMa FMb FMc FMd OLS FMa FMb FMc FMd OLS F -0.045 (0.029) -0.024 (0.010) -0.015 (0.006) -0.025 (0.028) -0.008 (0.010) -0.004 (0.005) -0.029 (0.029) -0.011 (0.010) -0.005 (0.005) -0.001 (0.034) -0.002 (0.010) -0.001 (0.005) -0.006 (0.030) -0.005 (0.010) -0.003 (0.005) -0.046 (0.059) -0.024 (0.020) -0.015 (0.011) -0.025 (0.054) -0.009 (0.019) -0.003 (0.010) -0.029 (0.059) -0.012 (0.019) -0.005 (0.010) -0.001 (0.076) -0.003 (0.021) -0.001 (0.011) -0.006 (0.060) -0.005 (0.018) -0.002 (0.010) -0.045 (0.026) -0.024 (0.009) -0.015 (0.005) -0 (0. -0 (0. -0 (0. -0.054 (0.061) -0.028 (0.021) -0.018 (0.011) -0.022 (0.054) -0.007 (0.019) -0.002 (0.011) -0.036 (0.061) -0.015 (0.019) -0.007 (0.011) 0.011 (0.078) 0.001 (0.021) 0.001 (0.011) -0.005 (0.062) -0.007 (0.019) -0.004 (0.010) -0.057 (0.176) -0.030 (0.059) -0.017 (0.032) -0.024 (0.156) -0.009 (0.054) -0.001 (0.029) -0.038 (0.177) -0.017 (0.057) -0.006 (0.030) 0.013 (0.228) -0.001 (0.061) 0.002 (0.031) -0.003 (0.177) -0.009 (0.053) -0.003 (0.029) -0.054 (0.046) -0.028 (0.016) -0.018 (0.009) -0 (0. -0 (0. -0 (0. -0.002 -0.006 -0.044 (0.056) (0.046) (0.024) -0.003 -0.005 -0.023 (0.016) (0.014) (0.009) -0.001 -0.002 -0.015 (0.008) (0.008) (0.005) the naive CUP-FM -0 (0. -0 (0. -0 (0. -0.044 (0.026) T=40 -0.023 (0.009) T=60 -0.015 (0.005) Note: (a) FMa is the -0.025 -0.028 (0.026) (0.026) -0.009 -0.010 (0.009) (0.009) -0.004 -0.005 (0.005) (0.005) 2S-FM, FMb is the -0.003 -0.006 (0.030) (0.028) -0.003 -0.004 (0.009) (0.009) -0.001 -0.003 (0.005) (0.005) naive 2S-FM, FMc -0.045 -0.026 -0.028 (0.045) (0.041) (0.045) -0.023 -0.009 -0.011 (0.016) (0.015) (0.015) -0.015 -0.004 -0.005 (0.009) (0.008) (0.008) is the CUP-FM and FMd is (b) µλ = 0.1, σ 31 = −0.8, σ 21 = −0.4, θ31 = 0.8, and θ21 = 0.4. Table 2: Means Biases and Standard Deviation of OLS and FM Estimators for Different n and T (n,T) OLS FMa FMb FMc FMd (20, 20) -0.045 -0.019 -0.022 -0.001 -0.006 (0.029) (0.028) (0.029) (0.034) (0.030) (20, 40) -0.024 -0.006 -0.009 -0.001 -0.004 (0.014) (0.014) (0.013) (0.014) (0.013) (20, 60) -0.017 -0.004 -0.006 -0.001 -0.003 (0.010) (0.009) (0.009) (0.009) (0.009) (20, 120) -0.008 -0.001 -0.002 -0.000 -0.001 (0.005) (0.004) (0.005) (0.004) (0.004) (40, 20) -0.044 -0.018 -0.021 -0.002 -0.006 (0.021) (0.019) (0.019) (0.023) (0.021) (40, 40) -0.024 -0.007 -0.009 -0.002 -0.004 (0.010) (0.010) (0.010) (0.010) (0.010) (40, 60) -0.015 -0.003 -0.005 -0.001 -0.002 (0.007) (0.007) (0.007) (0.007) (0.007) (40, 120) -0.008 -0.001 -0.002 -0.001 -0.001 (0.003) (0.003) (0.003) (0.003) (0.003) (60, 20) -0.044 -0.018 -0.022 -0.002 -0.007 (0.017) (0.016) (0.016) (0.019) (0.017) (60, 40) -0.022 -0.006 -0.008 -0.002 -0.004 (0.009) (0.008) (0.008) (0.008) (0.008) (60, 60) -0.015 -0.003 -0.005 -0.001 -0.003 (0.006) (0.005) (0.005) (0.005) (0.005) (60, 120) -0.008 -0.001 -0.002 -0.001 -0.001 (0.003) (0.002) (0.002) (0.002) (0.002) (120, 20) -0.044 -0.018 -0.022 -0.002 -0.007 (0.013) (0.011) (0.012) (0.013) (0.012) (120, 40) -0.022 -0.006 -0.008 -0.002 -0.004 (0.006) (0.006) (0.006) (0.006) (0.006) (120, 60) -0.015 -0.003 -0.005 -0.001 -0.003 (0.004) (0.004) (0.004) (0.004) (0.004) (120, 120) -0.008 -0.001 -0.002 -0.001 -0.002 (0.002) (0.002) (0.002) (0.002) (0.002) (a) µλ = 0.1, σ 31 = −0.8, σ 21 = −0.4, θ 31 = 0.8, and θ 21 = 0.4. Table 3: Means Biases and Standard√Deviation of t-statistics σλ = 1 σ λ = 10 σF = 1 T=20 T=40 T=60 √ σ F = 10 T=20 T=40 T=60 √ σ F = 0.5 T=20 OLS FMa FMb FMc FMd OLS FMa FMb FMc FMd OLS F -1.994 (1.205) -2.915 (1.202) -3.465 (1.227) -1.155 (1.267) -0.941 (1.101) -0.709 (1.041) -1.518 (1.484) -1.363 (1.248) -1.158 (1.177) -0.056 (1.283) -0.227 (1.054) -0.195 (0.996) -0.285 (1.341) -0.559 (1.141) -0.574 (1.100) -0.929 (1.149) -1.355 (1.127) -1.552 (1.146) -0.546 (1.059) -0.465 (0.913) -0.308 (0.868) -0.813 (1.495) -0.766 (1.207) -0.568 (1.113) -0.006 (1.205) -0.128 (0.912) -0.074 (0.851) -0.122 (1.254) -0.326 (1.049) -0.261 (1.016) -2.248 (1.219) -3.288 (1.221) -3.926 (1.244) -1 (1. -1 (1. -0 (1. -1.078 (1.147) -1.575 (1.131) -1.809 (1.158) -0.484 (1.063) -0.355 (0.917) -0.155 (0.879) -0.984 (1.501) -0.963 (1.214) -0.776 (1.131) 0.180 (1.220) 0.042 (0.926) 0.111 (0.867) -0.096 (1.271) -0.407 (1.063) -0.390 (1.035) -0.373 (1.119) -0.561 (1.097) -0.588 (1.108) -0.154 (0.987) -0.152 (0.844) -0.041 (0.812) -0.350 (1.508) -0.397 (1.179) -0.247 (1.078) 0.085 (1.194) -0.014 (0.871) 0.049 (0.811) -0.006 (1.223) -0.190 (1.008) -0.111 (0.983) -1.427 (1.163) -2.082 (1.154) -2.424 (1.192) -0 (1. -0 (0. -0 (0. -0.054 -0.176 -2.367 (1.217) (1.273) (1.231) -0.188 -0.385 -3.462 (0.944) (1.087) (1.236) -0.139 -0.331 -4.149 (0.881) (1.038) (1.249) the naive CUP- FM. -1 (1. -1 (1. -0 (1. -2.196 (1.219) T=40 -3.214 (1.226) T=60 -3.839 (1.239) Note: (a) FMa is the -1.319 -1.606 (1.325) (1.488) -1.093 -1.415 (1.057) (1.155) -0.868 -1.217 (1.088) (1.183) 2S-FM, FMb is the -0.137 -0.327 (1.307) (1.362) -0.311 -0.576 (1.104) (1.169) -0.296 -0.602 (1.037) (1.112) naive 2S-FM, FMc -1.203 -0.734 -1.008 (1.164) (1.112) (1.488) -1.752 -0.619 -0.922 (1.148) (0.962) (1.222) -2.037 -0.446 -0.712 (1.169) (0.908) (1.131) is the CUP-FM and FMd is (b) µλ = 0.1, σ 31 = −0.8, σ 21 = −0.4, θ31 = 0.8, and θ21 = 0.4. Table 4: Means Biases and Standard Deviation of t-statistics for Different n and T (n,T) OLS FMa FMb FMc FMd (20, 20) -1.994 -0.738 -1.032 -0.056 -0.286 (1.205) (1.098) (1.291) (1.283) (1.341) (20, 40) -2.051 -0.465 -0.725 -0.105 -0.332 (1.179) (0.999) (1.126) (1.046) (1.114) (20, 60) -2.129 -0.404 -0.684 -0.162 -0.421 (1.221) (0.963) (1.278) (0.983) (1.111) (20, 120) -2.001 -0.213 -0.456 -0.095 -0.327 (1.222) (0.923) (1.083) (0.931) (1.072) (40, 20) -2.759 -1.017 -1.404 -0.103 -0.402 (1.237) (1.116) (1.291) (1.235) (1.307) (40, 40) -2.915 -0.699 -1.075 -0.227 -0.559 (1.202) (1.004) (1.145) (1.054) (1.141) (40, 60) -2.859 -0.486 -0.835 -0.173 -0.493 (1.278) (0.998) (1.171) (1.014) (1.154) (40, 120) -2.829 -0.336 -0.642 -0.181 -0.472 (1.209) (0.892) (1.047) (0.899) (1.037) (60, 20) -3.403 -1.252 -1.740 -0.152 -0.534 (1.215) (1.145) (1.279) (1.289) (1.328) (60, 40) -3.496 -0.807 -1.238 -0.255 -0.635 (1.247) (1.016) (1.165) (1.053) (1.155) (60, 60) -3.465 -0.573 -0.987 -0.195 -0.574 (1.227) (0.974) (1.111) (0.996) (1.100) (60, 120) -3.515 -0.435 -0.819 -0.243 -0.609 (1.197) (0.908) (1.031) (0.913) (1.020) (120, 20) -4.829 -1.758 -2.450 -0.221 -0.760 (1.345) (1.162) (1.327) (1.223) (1.308) (120, 40) -4.862 -1.080 -1.679 -0.307 -0.831 (1.254) (1.022) (1.159) (1.059) (1.143) (120, 60) -4.901 -0.852 -1.419 -0.329 -0.846 (1.239) (0.964) (1.097) (0.978) (1.077) (120, 120) -5.016 -0.622 -1.203 -0.352 -0.908 (1.248) (0.922) (1.059) (0.927) (1.048) (a) µλ = 0.1, σ 31 = −0.8, σ 21 = −0.4, θ 31 = 0.8, and θ 21 = 0.4.
© Copyright 2026 Paperzz