IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 12, DECEMBER 2009 3589 Ergodic Capacity Upper Bound for Dual MIMO Ricean Systems: Simplified Derivation and Asymptotic Tightness Michail Matthaiou, Member, IEEE, Yannis Kopsinis, Member, IEEE, David I. Laurenson, Member, IEEE, and Akbar M. Sayeed, Senior Member, IEEE Abstract—An analytical upper bound on the ergodic capacity of Multiple-Input Multiple-Output (MIMO) systems is deduced with the aid of a simplified approach that relies on a fundamental power normalization. Given their high practical usability, we are particularly interested in dual configurations where both ends deploy two antenna elements. Contrary to the majority of related studies, where only the common case of Rayleigh fading is considered, our analysis is extended to account for Ricean fading where a deterministic Line-of-Sight (LoS) component exists in the communication link and both ends are affected by spatial correlation. In the following, it is shown that the proposed bound is applicable for any arbitrary Signal-to-Noise Ratio (SNR) and rank of the mean channel matrix. In fact, we consider both conventional and optimized MIMO configurations with equal LoS eigenvalues. Moreover, the tightness of the bound is explored where it is demonstrated that as the SNR tends to zero the bound becomes asymptotically tight; at high SNRs, the offset between empirical capacity and the bound is analytically computed which implies that an explicit asymptotic capacity expression can ultimately be obtained. Index Terms—MIMO systems, ergodic capacity, Ricean fading, spatial fading correlation. I. I NTRODUCTION O VER the last years, a considerable amount of research interest has been devoted to Multiple-Input MultipleOutput (MIMO) systems in response to the increasing demand for higher data rates. The pioneering works of Foschini [1] and Telatar [2] demonstrated the dramatic performance enhancement when multiple antenna elements are employed at both ends of a radio link. In order to comprehend the advantages of this promising technology, one of the most interesting topics that needs to be addressed is the derivation of efficient analytical capacity bounds. By doing so, it is anticipated that the design of practical and simulated MIMO systems would be greatly enriched. Please note that throughout the paper our main interest will lie in dual MIMO configurations which are likely to be used in many future practical systems Paper approved by N. Jindal, the Editor for MIMO Techniques of the IEEE Communications Society. Manuscript received August 1, 2008; revised April 4, 2009. M. Matthaiou is with the Institute for Circuit Theory and Signal Processing, Technische Universität München (TUM), Arcistrasse 21, 80333, Munich, Germany (e-mail: [email protected]). Y. Kopsinis and D. I. Laurenson are with the Institute for Digital Communications (IDCOM), Joint Research Institute for Signal and Image Processing, The University of Edinburgh, EH9 3JL, Edinburgh, U.K. (e-mail: {Y.Kopsinis, Dave.Laurenson}@ed.ac.uk). A. M. Sayeed is with the Department of Electrical and Computer Engineering, University of Wisconsin-Madison, Madison, WI 53706 USA (e-mail: [email protected]). Digital Object Identifier 10.1109/TCOMM.2009.12.080393 (e.g. hand-held devices), thanks to their small size and low implementation cost. The majority of related studies documented in literature consider the tractable case of Rayleigh fading where no direct Line-of-Sight (LoS) exists in the radio channel and the multipath richness is sufficiently high. In this case, a plethora of results is available for various scenarios (see [1]– [4] and references therein). In most real-life channels though, the assumption of Rayleigh fading is often violated due to either a specular wavefront or a strong direct component; then, the entries of the channel matrix can be more effectively modeled by the Ricean distribution. Surprisingly, significantly fewer publications focusing on MIMO Ricean channels have been reported. This trend can be attributed to the difficulty in manipulating hypergeometric functions with two matrix arguments of non-central Wishart matrices compared to the one matrix argument of central (zero-mean) Wishart matrices tied to Rayleigh fading. The first analytical bounds on MIMO Ricean capacity can be found in [5]–[7] where the assumption of uncorrelated fading at both ends was adopted. In [8], [9], these results were extended to account for spatial correlation at a single end while the general case of double-sided spatial correlation was addressed in [10]. The common characteristic of the above cited papers though ([5]–[10]), is that they are limited to the case of rank-1 LoS matrices. The derivation of capacity bounds in the general case of arbitrary-rank mean matrices has been separately addressed in [11]–[14]. In [11] and [12], very tight lower and upper capacity bounds for semi-correlated MIMO Ricean channels were respectively proposed. The work in [13] represents so far the more general approach in the area since it derives several lower and upper bounds assuming all different types of spatial correlation. However, the general upper bound is given as an infinite summation of Hayakawa polynomials, which the authors acknowledge as quite involved and computationally inefficient. On the other hand, the upper bound deduced in [14], which relies on the theory of quadratic forms, is far more tractable and hence will serve as our reference bound hereafter. Whilst the authors in [14] focus on the general properties of quadratic forms with respect to MIMO capacity bounds, our primary goal is to provide a simplified bound derivation and thereafter investigate the implications of the model parameters (Ricean 𝐾-factor and spatial correlation) on the bound performance. More specifically, the contributions of the present paper are threefold. Firstly, an alternative derivation of the upper bound in [14] is proposed for the dual MIMO case under c 2009 IEEE 0090-6778/09$25.00 ⃝ Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. 3590 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 12, DECEMBER 2009 consideration. Our analysis relies on a well-established power normalization scheme which simplifies significantly all the mathematical formulations. Secondly, using the same normalization as a starting point, the asymptotic tightness of the upper bound is explored in the low and high Signal-to-Noise Ratio (SNR) regimes. In fact, it is explicitly shown that as the SNR tends to zero the bound becomes asymptotically tight, whereas, at infinitely high SNRs the offset from the true capacity is analytically determined as well; hence, exact capacity expressions can be derived in the corresponding asymptotic cases. Finally, in order to formulate a broad framework, two different classes of LoS MIMO systems are assessed, namely a conventional low-rank (LR) and an optimized high-rank (HR) configuration which yields two equal LoS eigenvalues and is of high practical importance [15]-[17]. The paper is organized as follows: In Section II, some essential definitions related to non-central Wishart matrices and quadratic forms are given. In Section III, the underlying MIMO Ricean channel model is presented along with the statistics of the channel matrix. Section IV presents upper capacity bounds for different categories of MIMO systems. In Section V, we focus on the tightness of the proposed bound while the numerical results are given in Section VI. Finally, Section VII summarizes the key findings. Notation: We use upper and lower case boldfaces to denote matrices and vectors. The symbol ∼ 𝒞𝒩 𝑚,𝑛 (M, Σ) stands for a 𝑚 × 𝑛 complex normally distributed matrix with mean M and covariance Σ. The entries of a 𝑚 × 𝑛 matrix A are given by {A}𝑖,𝑗 where 𝑖 = 1, . . . , 𝑚 and 𝑗 = 1, . . . , 𝑛. The symbols 𝑇 𝐻 −1 (⋅) , (⋅) and (⋅) correspond to transposition, Hermitian transposition and matrix inversion respectively whereas ⊗ is the Kronecker product; we use det(⋅) and ∣ ⋅ ∣ to interchangeably denote the determinant while ∥⋅∥𝐹 returns the Frobenius norm. Finally, etr(⋅) is a shorthand notation for exp(tr(⋅)). II. C OMPLEX N ON -C ENTRAL W ISHART M ATRICES A ND Q UADRATIC F ORMS Definition 1: Let the 𝑚×𝑛 complex matrix X, with 𝑚 ≤ 𝑛, be distributed according to X ∼ 𝒞𝒩 𝑚,𝑛 (M, Σ ⊗ I𝑛 ), where Σ ∈ ℂ𝑚×𝑚 is a positive definite Hermitian matrix. Then, the matrix S = XX𝐻 follows the complex non-central Wishart distribution with 𝑛 degrees of freedom and noncentrality matrix Ω = Σ−1 MM𝐻 , commonly denoted as S ∼ 𝒞𝒲 𝑚 (𝑛, Σ, Ω). Definition 2: Let the 𝑚×𝑛 complex matrix X, with 𝑚 ≤ 𝑛, be distributed according to X ∼ 𝒞𝒩 𝑚,𝑛 (M, Σ ⊗ Ψ), where Σ ∈ ℂ𝑚×𝑚 and Ψ ∈ ℂ𝑛×𝑛 are positive definite Hermitian matrices. Then, the matrix Q = XΛX𝐻 , with Λ ∈ ℂ𝑛×𝑛 , follows a non-central matrix-variate quadratic form commonly denoted as Q ∼ 𝒞𝒬𝑚,𝑛 (Λ, Σ, Ψ, M). Please note that non-central quadratic forms degenerate into non-central Wishart matrices when Ψ = I𝑛 and when either Λ = I𝑛 or Λ is idempotent with rank 𝐿 ≥ 𝑚 [13]. The following theorem returns the 𝜐-th moment of the determinant of 2 × 2 complex quadratic forms. Theorem 1: Let Q ∼ 𝒞𝒬2,2 (I2 , Σ, Ψ, M). Then, the 𝜐-th moment of its determinant ∣Q∣ is given by [ ] 𝜐 𝜐 Γ̃2 (𝜐 + 2) ˜ (1) 𝐸 [∣Q∣ ] = ∣ΣΨ∣ 1 𝐹1 (−𝜐; 2; −Θ) Γ̃2 (2) ∏ where Γ̃𝑚 (𝑛) = 𝜋 𝑚(𝑚−1)/2 𝑚 𝑗=1 Γ(𝑛− 𝑗 + 1) is the complex multivariate gamma function, with Γ(⋅) being the standard gamma function, while 1 𝐹˜1 (𝑎, 𝑏, A) is the hypergeometric function of one matrix argument; the involved matrices are accordingly Θ = Ψ−1 M̄M̄𝐻 and M̄ = Σ−1/2 M. Proof: A detailed proof is given in Appendix A. It should be noted that the theorem is applicable only to 2 × 2 quadratic forms since for matrix of 𝑚 × 𝑛, a ) ( 𝑛sizes subsets needs to finite summation over a collection of 𝑚 take place [14]. A simplified formula can now be obtained for the first-order moment of the determinant. Corollary 1: For 𝜐 = 1, (1) reduces to ( ) 1 1 𝐸 [∣Q∣] = 2∣ΣΨ∣ 1 + tr(Θ) + det (Θ) . (2) 2 2 Proof: The authors in [13] showed that for any square matrix B ∈ ℂ𝑚×𝑚 , its hypergeometric function 1 𝐹˜1 (𝑐; 𝑑; B) can be expressed according to ( ) det 1 𝐹˜1 (𝑐 − 𝑚 + 𝑗; 𝑑 − 𝑚 + 𝑗; 𝑏𝑖 ) 𝑏𝑗−1 𝑖 ˜ 1 𝐹1 (𝑐; 𝑑; B) = 𝑚 ∏ (𝑏𝑗 − 𝑏𝑖 ) 𝑖<𝑗 (3) where 𝑏1 , 𝑏2 , . . . , 𝑏𝑚 is the set of non-zero eigenvalues of B. After taking into account the following properties of the scalar hypergeometric functions 1 1 − 2𝑧 + 𝑧 2 (4) 2 1 2 ˜ (5) 1 𝐹1 (−1, 2; 𝑧) = 1 − 𝑧 2 it is trivial to show that for the dual case under consideration 1 ˜ (2 + 𝜃1 + 𝜃2 + 𝜃1 𝜃2 ) (6) 1 𝐹1 (−1; 2; −Θ) = 2 where 𝜃1 , 𝜃2 are the eigenvalues of Θ. The proof concludes after recalling that the sum and product of the eigenvalues return the trace and the determinant of a matrix. ˜ 1 𝐹1 (−2, 1; 𝑧) = III. MIMO C HANNEL M ODEL As was previously highlighted, we are particularly interested in dual MIMO configurations where both the Tx and Rx are affected by spatial correlation. Under Ricean fading conditions, the channel transfer function matrix H consists of a spatially deterministic specular component HL and a randomly distributed component HW which accounts for the scattered signals. Then, the underlying channel model becomes √ √ 𝐾 1 1/2 HL + R1/2 HW R𝑡 H= (7) 𝐾 +1 𝐾 +1 𝑟 where 𝐾 is the Ricean 𝐾-factor expressing the ratio of powers of the free-space signal and the scattered waves. The receive and transmit correlation matrices are respectively defined as {( { } )𝑇 } . (8) R𝑟 ≜ 𝐸H HH𝐻 and R𝑡 ≜ 𝐸H H𝐻 H Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. MATTHAIOU et al.: ERGODIC CAPACITY UPPER BOUND FOR DUAL MIMO RICEAN SYSTEMS: SIMPLIFIED DERIVATION AND ASYMPTOTIC . . . z zǯ wavefronts in the near-field. Then, the channel matrix becomes full-rank and delivers two equal LoS eigenvalues. For the assumed model, the optimum inter-element spacing is [17] √ 𝜆𝐷 . (11) 𝑠1 = 𝑠2 = 𝑠𝑜𝑝𝑡 ≈ 2 cos2 𝜃 [-s1sinș, 0, s1cosș] s1 ș y [0,0,0)] 3591 x ș [D-s2sinș, 0, s2cosș] D s2 [D, 0, 0] xǯ Fig. 1. A general architecture of a 2 × 2 MIMO system with ULAs at both ends (side view). The coordinates of all elements with regard to the new coordinate system 𝑥′ 𝑦𝑧 ′ are also included. The axis rotation by an angle 𝜃 arranges the array origins to lie on the same axis. The entries of HW are commonly modeled as i.i.d. zeromean, unit variance complex Gaussian random variables and, consequently, the channel matrix is distributed according to (√ ) 𝐾/(𝐾 + 1)HL , (𝐾 + 1)−1 R𝑟 ⊗ R𝑡 . (9) H ∼ 𝒞𝒩 2,2 Throughout the paper, our purpose is to compare the MIMO gains of different configurations and therefore the capacities should be analyzed independently of the average SNR. This is achieved by normalizing the channel matrices so that following condition is fulfilled )] [ ( [ ] 𝐸 ∥H∥2𝐹 = 𝐸 tr HH𝐻 = 4. (10) We recall that the key normalization in (10) has been widely adopted into the capacity characterization of MIMO systems [1], [2], [12], [16], [17]. From a physical viewpoint, the path-loss effects are removed and the system is assumed to have perfect power control. In Fig. 1, a side view of the considered MIMO system is depicted where both ends are equipped with two-element Uniform Linear Arrays (ULAs) and the distance between the first element of each array is 𝐷. The inter-element spacings are 𝑠1 (Tx) and 𝑠2 (Rx). In the following, we will explore two different LoS configurations, namely a LR and an optimized HR configuration. The former represents any conventional architecture with interelement spacings of the order of wavelength 𝜆, and arrays located in each other’s far-field; this means that the LoS signals are basically plane wavefronts whose rays’ phases are inherently correlated to a great extent. These architectures are normally rank-deficient and offer a minimal spatial multiplexing gain since the transmitted LoS signals impinging on the Rx carry almost identical spatial characteristics and therefore their differentiation becomes laborious. The latter configuration belongs to a family of specifically designed full-rank LoS configurations which aim to assign a phase difference of 𝜋/2 between the LoS wavefronts [15]–[17]. This is achieved by appropriately positioning the antenna elements at both ends of the link, so that the LoS signals propagate as spherical We emphasize that for optimized configurations the effects of spatial correlation are rather weak due to the increased antenna spacing. However, in the following analysis the presence of correlation is considered, for the fairness of comparison with conventional configurations. IV. E RGODIC C APACITY U PPER B OUNDS In this section, an alternative simplified derivation for the upper bound of dual MIMO configurations (as given in [14]) is presented based on the power normalization of (10). Assuming that the Rx has perfect CSI while the Tx knows neither the statistics nor the instantaneous CSI, a sensible choice for the Tx is to split the total amount of power equally amongst all data streams. Then, the ergodic MIMO capacity for an SNR per receiver branch, 𝜌, is given by the following well-known formula [1], [2] [ ( ( ))] 𝜌 𝐶 = 𝐸H log2 det I2 + HH𝐻 (bits/s/Hz). (12) 2 A. Double-Sided Correlated Ricean and Rayleigh Fading In the general case of double-sided correlated Ricean fading, where the channel matrix is distributed according to (9), the instantaneous MIMO correlation matrix W̃ = HH𝐻 exhibits the following non-central quadratic distribution ( ) √ W̃ ∼ 𝒞𝒬2,2 I2 , R𝑟 /(𝐾 + 1), R𝑡 , 𝐾/(𝐾 + 1)HL . (13) Theorem 2: The ergodic capacity of a 2 × 2 doublesided correlated MIMO Ricean channel with mean matrix √ 𝐾/(𝐾 + 1)HL , receive correlation matrix (𝐾 + 1)−1 R𝑟 , and transmit correlation matrix R𝑡 , is analytically upper bounded by ( ) 𝛾𝜌2 𝐶 ≤ log2 1 + 2𝜌 + (14) 2 where the parameter 𝛾 and the matrix Θ are respectively ) ( 1 1 (15) 𝛾 = (𝐾 + 1)−2 ∣R𝑟 ∣∣R𝑡 ∣ 1 + tr(Θ) + det (Θ) 2 2 ( )𝐻 𝐻 −1/2 −1/2 Θ = 𝐾R−1 R R H H . (16) L L 𝑡 𝑟 𝑟 Proof: An alternative way to express the ergodic MIMO capacity is through the real positive eigenvalues 𝑤 ˜1 , 𝑤 ˜2 of W̃. Then, (12) can be rewritten as ( [ 𝜌 )( 𝜌 )] (17) ˜1 1 + 𝑤 ˜2 . 𝐶 = 𝐸 log2 1 + 𝑤 2 2 Expanding (17), we can easily get [ ( ( ))] 𝜌2 𝜌 ˜1 + 𝑤 det W̃ 𝐶 = 𝐸 log2 1 + (𝑤 ˜2 ) + . 2 4 Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. (18) 3592 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 12, DECEMBER 2009 Taking into account that log(⋅) is a concave function and making use of the Jensen’s inequality we obtain ( ( )]) 𝜌2 [ 𝐶 ≤ log2 1 + 2𝜌 + 𝐸 det W̃ . (19) 4 The last inequality follows from the power normalization in [ ( )] ˜2 ] = 4. The upper bound (10), i.e. 𝐸 tr W̃ = 𝐸 [𝑤 ˜1 + 𝑤 in (14) follows immediately after introducing Corollary 1 and simplifying. It is worth mentioning that the same upper capacity bound for 2 × 2 MIMO systems was originally given in [14]. More specifically, by setting 𝑛 = 2 in [14, Eq. (36)] and evaluating the first three basic elementary functions and minor determinant representations involved, the same result can be obtained after some algebraic manipulations. In the present case though, a novel derivation is introduced, through the power normalization of (10), which simplifies extensively the overall analysis. Please note that the proposed derivation is tied to dual configurations since for larger MIMO setups we end up with a series of eigenvalue cross-products that are created after expanding (12). By inspection of (14), it can be inferred that the proposed upper bound is rather simple as it just requires the computation of the elementary functions of three different deterministic matrices (R𝑟 , R𝑡 and Θ) only once. In the case of double-sided correlated Rayleigh fading (𝐾 = 0), the channel matrix is distributed according to H ∼ 𝒞𝒩 2,2 (02 , R𝑟 ⊗ R𝑡 ) and the upper bound in (14) reduces to ) ( 𝜌2 𝐶 ≤ log2 1 + 2𝜌 + ∣R𝑟 R𝑡 ∣ (20) 2 which is in perfect line with the bound derived in [4, Eq. (25)]. B. Uncorrelated Ricean and Rayleigh Fading Due to space constraints, the case of single-side correlation is omitted herein since the formulations are based on exactly the same concept as before. We now consider the special case of both ends exhibiting uncorrelated i.i.d. Ricean fading. Under these circumstances, the channel matrix is distributed as (√ ) 𝐾 1 HL , I2 ⊗ I2 . (21) H ∼ 𝒞𝒩 2,2 𝐾 +1 𝐾 +1 Corollary 2: The ergodic capacity of a 2√ × 2 uncorrelated MIMO Ricean channel with mean matrix 𝐾/(𝐾 + 1)HL and receive correlation matrix R𝑟 = (𝐾 + 1)−1 I2 is analytically upper bounded by ( ) 𝛽𝜌2 𝐶 ≤ log2 1 + 2𝜌 + (22) 2 ) ( where 𝛽 = 1 + 2𝐾 + 0.5𝐾 2 det (T) /(𝐾 + 1)2 and T = HL H𝐻 L . Proof: This corollary is a direct consequence of (14) after taking into account that Θ ≡ Ω ≡ 𝐾T for the case of uncorrelated fading at both ends. Furthermore, it holds that tr (𝐾T) = 𝐾tr (T) and det (𝐾T) = 𝐾 2 det (T) and given that the entries of the deterministic LoS component matrix are normally modeled as unit-amplitude complex exponentials [15]–[17], it is trivial to show that tr (T) = 4. The case of i.i.d. Rayleigh fading is obtained directly from (7), by setting 𝐾 = 0 and R𝑟 = R𝑡 = I2 . The upper bound in (22) then reduces to (𝛽 = 1) ( ) 𝜌2 𝐶 ≤ log2 1 + 2𝜌 + (23) 2 which is identical with the results presented in [3, Theorem 2], [4, Eq. (22)] and [12, Eq. (5)]. V. T IGHTNESS OF THE U PPER B OUND In this section, the asymptotic tightness of the proposed upper bound is investigated in low and high-SNR regimes. To this end, we define the absolute error, 𝜖, inserted by an upper bound, 𝑈 , as 𝜖 = 𝑈 − 𝐶. Corollary 3: The upper bound in (14) becomes asymptotically tight as the SNR tends to zero. Proof: We begin with further upper bounding the ergodic capacity (14) recalling that ln(1 + 𝑥) ≤ 𝑥 ( ) 1 𝛾𝜌2 𝐶≤ 2𝜌 + . (24) ln 2 2 Following [18, Eq. (23)], we can lower bound the ergodic capacity as ( )] [ 𝜌 2 𝐶 ≥ 𝐸 log2 1 + ∥H∥𝐹 2 ] ] [ 1 ( 𝜌 )2 [ 𝜌 2 𝐸 ∥H∥𝐹 − 𝐸 ∥H∥4𝐹 .(25) ≥ 2 ln 2 2 ln 2 2 The second line follows from the property ln(1+𝑥) ≥ 𝑥− 12 𝑥2 . We can now subtract (25) from (24) and then the absolute error 𝜖 of the proposed upper bound becomes ( ]) 1 [ 𝜌2 4 𝛾 + 𝐸 ∥H∥𝐹 (26) 𝜖= 2 ln 2 4 which asymptotically tends to zero as 𝜌 → 0. Corollary 4: As the SNR 𝜌 → ∞, the absolute error inserted by the upper bound in (14) tends to ( ( ))] [ . (27) 𝜖 = log2 (2𝛾) − 𝐸 log2 det W̃ Proof: As 𝜌 → ∞, the upper bound 𝑈 in (14), becomes (𝜌) 𝑈 ≈ log2 (2𝛾) + 2 log2 . (28) 2 In (18), the quadratic term becomes significantly larger at high SNR and thus the ergodic capacity may be approximated as [ ( 2 ( ))] 𝜌 det W̃ 𝐶 ≈ 𝐸 log2 4 (𝜌) ( ( ))] [ . (29) + 𝐸 log2 det W̃ = 2 log2 2 Subtracting (29) from (28), yields (27). From the above equation, it is apparent that the bound’s error is given in a non-analytical form; in this light, the crucial issue is to determine the expectation of the logdet function of a complex non-central quadratic matrix, which involves a nonlinear log function. Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. MATTHAIOU et al.: ERGODIC CAPACITY UPPER BOUND FOR DUAL MIMO RICEAN SYSTEMS: SIMPLIFIED DERIVATION AND ASYMPTOTIC . . . where 𝜃1 , 𝜃2 are the eigenvalues of Θ, given in (16), while the digamma functions 𝜓(𝑥) are defined as 𝜓(𝑥) ≜ Γ′ (𝑥) 𝑑 ln Γ(𝑥) = . 𝑑𝑥 Γ(𝑥) (31) The polynomial terms Λ1 (Θ) and Λ2 (Θ) are essentially polynomial functions of the eigenvalues 𝜃1 and 𝜃2 , or Λ1 (Θ) = 𝜃2 ℎ1 (𝜃1 ) − ℎ2 (𝜃1 ) (32) Λ2 (Θ) = ℎ2 (𝜃2 ) − 𝜃1 ℎ1 (𝜃2 ) (33) where ℎ1 (𝑥) = ∞ ∑ 𝒫(𝑘, 𝑥) 𝑘=1 𝑘 , ℎ2 (𝑥) = 𝑥 ∞ ∑ 𝒫(𝑘, 𝑥) 𝑘=1 𝑘+1 (34) with 𝒫(𝑎, 𝑥) being the regularized gamma function [19, Eq. (6.5.1)]. Proof: A detailed proof is given in Appendix B. Please note that the derivation of the formula (30) relies, without loss of generality, on the assumption of two non-zero eigenvalues of the matrix Θ. For the case of rank-1 matrices, a similar analysis can be followed and the interested readers are referred to [13] for a thorough discussion. Evidently, after replacing (30) into (27), we can obtain an analytical formula for the bound’s offset at high SNRs, for the general case of double-sided correlated Ricean fading. More importantly, this result can be further used to deduce exact capacity expressions in the high-SNR regime. When the channel exhibits i.i.d. Ricean fading and both ends are employed with optimally designed arrays, the LoS component yields two equal eigenvalues and the following corollary is introduced. Corollary 5: As the SNR 𝜌 → ∞, the absolute error inserted by the upper bound for the case of i.i.d. Ricean fading and optimized LoS configurations tends to [ 1 𝜓(1) + 𝜓(2) − 2 ln(𝐾 + 1) 𝜖 = log2 (2𝛽) − ln 2 ] ∞ ∑ (2𝑘 + 1)𝛾(𝑘, 𝜔) − 𝜔𝑒−𝜔 𝜔 𝑘−1 + (35) (𝑘 + 1)! 𝑘=1 where 𝜔 = 𝜔1 = 𝜔2 represents any∫ of the two equal 𝑥 eigenvalues of Ω ≡ 𝐾T and 𝛾(𝑎, 𝑥) = 0 𝑡𝑎−1 𝑒−𝑡 𝑑𝑡 is the lower incomplete gamma function. Proof: The proof starts by noting that for i.i.d. Ricean fading Θ should be replaced by Ω ≡ 𝐾T in all manipulations. Further, in the specific case of optimized configurations, the equality of eigenvalues leads to a division by zero in (30). In order to circumvent this singularity, we employ de l’Hôpital’s 35 Upper bound Simulated ergodic capacity 30 Ergodic capacity, (bits/s/Hz) Theorem 3:( Let us assume that√the considered) matrix W̃ ∼ 𝒞𝒬2,2 I2 , R𝑟 /(𝐾 + 1), R𝑡 , 𝐾/(𝐾 + 1)HL . Then the first-order moment of the logarithm of its determinant is [ [ ( ( ))] 1 𝐸 log2 det W̃ = 𝜓(1) + 𝜓(2) − 2 ln(𝐾 + 1) ln 2 ] 1 (Λ1 (Θ) + Λ2 (Θ)) (30) + ln ∣R𝑟 R𝑡 ∣ − 𝜃1 − 𝜃2 3593 25 20 K = 10 dB 15 K = 0 dB 10 5 0 0 10 20 30 40 50 SNR, [dB] Fig. 2. Upper bound and simulated ergodic capacity as a function of the SNR for an optimized LoS configuration (𝛿𝑅 = 0.2 and 𝛿𝑇 = 0.5). rule to get a solution at the limit 𝜔1 → 𝜔2 . In particular, the last term in (30) can be rewritten after some algebra as ] [ 𝑑 (𝜔ℎ1 (𝜔 + 𝜀) − ℎ2 (𝜔 + 𝜀)) − ℎ1 (𝜔). (36) lim 𝜀→0 𝑑𝜀 Taking into account that 𝑑 𝑒−𝑥 𝑥𝑎−1 𝒫(𝑎, 𝑥) = 𝑑𝑥 Γ(𝑎) (37) and after some algebraic manipulations we end up with 𝑑 lim ℎ1 (𝜔 + 𝜀) = 𝜀→0 𝑑𝜀 𝑑 ℎ2 (𝜔 + 𝜀) = 𝜀→0 𝑑𝜀 lim ∞ ∑ 𝑒−𝜔 𝜔 𝑘−1 𝑘=1 ∞ ∑ 𝑘=1 𝑘Γ(𝑘) (38) ℎ2 (𝜔) 𝑒−𝜔 𝜔 𝑘−1 + . (39) (𝑘 + 1)Γ(𝑘) 𝜔 Substituting (38)–(39) into (36), factorizing and simplifying yields (35). VI. N UMERICAL R ESULTS In this section, the theoretical analysis presented in Sections IV and V is validated through a set of Monte-Carlo simulations. Assuming a carrier frequency of 5.9 GHz, 𝐷 = 5.3852 m and 𝜃 = 21.80∘ the optimum inter-element spacings via (11) are 𝑠1 = 𝑠2 = 39.85 cm (7.84𝜆) whereas for the conventional configuration the spacings are taken 𝑠1 = 𝑠2 = 2.54 cm (0.5𝜆). Then, the associated sets of the LoS eigenvalues of −5 T = HL H𝐻 L are (2, 2) and (3.999, 4.08 × 10 ), respectively. Throughout the simulations, we adopt the widely used exponential correlation model thanks to its inherent simplicity. Then, the entries of R𝑟 and R𝑡 in (7) can be modeled as {R𝑟 }𝑖,𝑗 = (𝛿𝑅 )∣𝑖−𝑗∣ and {R𝑡 }𝑖,𝑗 = (𝛿𝑇 )∣𝑖−𝑗∣ , where 𝛿𝑅 , 𝛿𝑇 ∈ [0, 1). After generating 50,000 Monte-Carlo realizations of the channel matrix according to (7) and setting 𝛿𝑅 = 0.2 and 𝛿𝑇 = 0.5, the proposed bound in (14) is firstly evaluated against the SNR for different values of the 𝐾-factor. Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. 3594 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 12, DECEMBER 2009 35 14 Upper bound Simulated ergodic capacity 13 Ergodic capacity, (bits/s/Hz) Ergodic capacity, (bits/s/Hz) 30 25 20 K = 0 dB 15 K = 10 dB 10 5 0 optimized 12 11 10 conventional 9 Upper bound Simulated ergodic capacity 8 0 10 20 30 40 7 50 0 5 10 SNR, [dB] From Fig. 2 and 3, it can been seen that the bound is remarkably tight for the optimized configuration and, likewise, performs satisfactorily for conventional configurations. As anticipated, in the low-SNR regime both bounds converge asymptotically to the empirical values of ergodic capacity. Generally speaking, the bound becomes tighter as 𝐾 increases and SNR decreases which is in agreement with the conclusions of [9], [10], [12]. More importantly, ergodic capacity benefits from the presence of strong non-fading components when both ends are equipped with specifically designed arrays. This is a result of the two orthogonal LoS MIMO subchannels that are established and contradicts the common belief that high 𝐾-factors limit the advantages of MIMO technology [6], [7], [9]. In Fig. 4, the effects of the 𝐾-factor on the performance of the proposed bound are investigated, where it is again apparent that both bounds become tighter with an increasing 𝐾-factor. For Rayleigh-fading conditions though, or 𝐾 ≤ 0 dB, the achieved tightness is degraded and, under these circumstances, it is sensible to use more efficient bounds which are inherently tied to Rayleigh channels (see for instance [3], [4]). Once more, the superiority of the optimized configurations is illustrated as 𝐾 gets larger whereas in the limit, 𝐾 → ∞ dB, conventional configurations degenerate into a single-path link. On the other hand, for 𝐾 ≤ 0 dB the advantages of optimized configurations diminish and in the limit, 𝐾 → −∞ dB, the LoS component vanishes and we end up with a pure i.i.d. Rayleigh channel. In Fig. 5, the relationship between spatial correlation and MIMO capacity is addressed. The effects of the former become less significant (smaller dynamic range) as the 𝐾factor gets higher for both configurations, i.e. high 𝐾-factors provide robustness against spatial correlation. As expected, the optimized setup with large inter-element spacings remains unaffected by the level of correlation; hence, it offers almost the same ergodic capacity regardless of the values of 𝛿𝑅 and 𝛿𝑇 . The conventional configuration, however, suffers from spatial correlation with the ergodic capacity decreasing as 20 Fig. 4. Upper bound and simulate ergodic capacity as a function of the 𝐾-factor (𝛿𝑅 = 0.2, 𝛿𝑇 = 0.5 and 𝜌 = 20 dB). 14 K = 10 dB 13 Ergodic capacity, (bits/s/Hz) Fig. 3. Upper bound and simulated ergodic capacity as a function of the SNR for a conventional LoS configuration (𝛿𝑅 = 0.2 and 𝛿𝑇 = 0.5). 15 K−factor, [dB] K = 0 dB 12 11 K = 0 dB 10 9 Upper bound−opt Simulated ergodic capacity−opt Upper bound−conv Simulated ergodic capacity−conv 8 7 0 0.1 0.2 0.3 0.4 0.5 K = 10 dB 0.6 0.7 0.8 Correlation coefficient Fig. 5. Upper bound and simulated ergodic capacity as a function of the correlation coefficients 𝛿𝑅 = 𝛿𝑇 (𝜌 = 20 dB). correlation gets higher while the tightness of the corresponding bound is relatively improved in the high-correlation regime. This outcome is in agreement with [4], [10], [12]. At the last stage of the evaluation process, we consider the high-SNR deviation between the ergodic capacity and the proposed upper bound using the closed-form formulae in (27), (30) and (35). In Fig. 6, these analytical curves are overlaid with the outputs of a Monte-Carlo simulator with the match being remarkably good. The error associated with optimized configurations is systematically lower than that of conventional ones, as a result of the rank-deficiency of the former. What is more, it appears that the latter error has a much smaller dynamic range revealing that a high 𝐾-factor does not have an extensive impact on its value. On the other hand, the bound for the optimized configuration yields an enhanced tightness as 𝐾 increases and under strong Ricean conditions the corresponding offset is minimized. Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. MATTHAIOU et al.: ERGODIC CAPACITY UPPER BOUND FOR DUAL MIMO RICEAN SYSTEMS: SIMPLIFIED DERIVATION AND ASYMPTOTIC . . . 1.4 Theorem 1] [ ] [ ) ] ( 𝜐 Γ̃2 (𝜐 + 2) 𝐻 𝜐 = det (Ψ) 𝐸 det X̄X̄ Γ̃2 (2) × etr(−Θ)1 𝐹˜1 (𝜐 + 2; 2; Θ) Analytical 1.2 Simulation conventional Absolute error 1 (41) where we have made use of the basic property, det (CD) = det (DC), for the determinant of the product of square matrices. The proof concludes after introducing the well-known Kummer relation for hypergeometric functions of one matrix argument, i.e. 1 𝐹˜1 (𝑎; 𝑏; S) = etr(S)1 𝐹˜1 (𝑏 − 𝑎; 𝑏; −S). 0.8 0.6 optimized 0.4 0.2 0 3595 A PPENDIX B P ROOF OF T HEOREM 3 0 5 10 15 20 K−factor, [dB] Fig. 6. Analytical and simulated absolute error of the upper bound in the high-SNR regime as a function of the 𝐾-factor (𝛿𝑅 = 0.2 and 𝛿𝑇 = 0.5). VII. C ONCLUSION An alternative simplified derivation of a tractable upper bound for the ergodic capacity of dual MIMO Ricean systems has been proposed, with the key concept originating from a widely used power normalization. The proposed bound depends merely on the SNR and the expected value of the determinant of either a non-central quadratic form or a noncentral Wishart matrix and, further, is not confined to the common case of rank-1 LoS components. The tightness of the bound was assessed in the low and high-SNR regions; in the former, the bound becomes asymptotically tight whereas in the latter the bound’s offset tends to a constant value that was analytically determined and validated through MonteCarlo simulations. For the sake of completeness, we explored two different classes of LoS configurations, these were a conventional and an optimized architecture which benefits from the presence of strong deterministic components by offering two equal LoS eigenvalues. It was demonstrated that the bound is remarkably tight for the optimized case and marginally looser for conventional setups. The proof begins with the following key transformation 𝐸 [ln 𝑥] = 𝑑 𝐸 [𝑥𝜐 ] ∣𝜐=0 𝑑𝜐 which holds since, by definition, 𝑥𝜐 = 𝑒𝜐 ln 𝑥 . By combining (42) and (1) and denoting ( ( ))] [ ( ( ))] [ 1 = 𝐸 ln det W̃ 𝜁 = 𝐸 log2 det W̃ (43) ln 2 and [ ] −1 𝜐 Γ̃2 (𝜐 + 2) ˜ 𝜙(𝑣) = ∣(𝐾 + 1) R𝑟 R𝑡 ∣ 1 𝐹1 (−𝜐; 2; −Θ) Γ̃2 (2) we can directly get 1 𝑑 1 𝑑 = 𝜙(𝑣) ln 𝜙(𝑣) 𝜁= ln 2 𝑑𝜐 ln 2 𝑑𝜐 𝜐=0 𝜐=0 We begin by expressing the determinant of the quadratic form Q ∼ 𝒞𝒬2,2 (I2 , Σ, Ψ, M) as [ )𝜐 ] ( 𝜐 𝐸 [det (Q) ] = 𝐸 det XX𝐻 )𝜐 ] [ ( = 𝐸 det Σ1/2 X̄X̄𝐻 Σ1/2 [ )𝜐 ] ( (40) = det (Σ)𝜐 𝐸 det X̄X̄𝐻 where X̄ ∼ 𝒞𝒩 2,2 (M̄, I2 ⊗ Ψ). Using a result from [14] with the aid of the Cauchy-Binet formula, it can be shown that the matrix X̄𝐻 X̄ ∼ 𝒞𝒲 2 (2, Ψ, Θ). The expectation of the determinant in (40) can now be evaluated through [13, (44) where the second equality follows since 𝜙(0) = 1. It is easily seen that the above differentiation consists of three multiplicative terms. Treating each one separately due to the chain rule, the first term results in ln ∣R𝑟 R𝑡 ∣ − 2 ln(𝐾 + 1). The second one, may be rearranged to give )} { ( )} Γ̃2 (𝜐 + 2) 𝑑 { ( 𝑑 ln ln Γ̃2 (𝜐 + 2) = 𝑑𝜐 𝑑𝜐 Γ̃2 (2) 𝜐=0 𝜐=0 { 2 } 𝑑 ∑ = ln (Γ(𝜐 + 2 − 𝑖 + 1)) (45) 𝑑𝜐 𝑖=1 A PPENDIX A P ROOF OF T HEOREM 1 (42) 𝜐=0 which, through (31), yields 𝜓(1) + 𝜓(2). Focusing now on the last term, we get (46) at the top of next page. The proof concludes after introducing a useful result from [13, Appendix II-B] for the nominator in (46) and simplifying the denominator using 1 𝐹˜1 (𝛼; 𝛼; 𝑥) = exp(𝑥). Substituting (45) and (46) into (44) and collecting common terms yields (30). R EFERENCES [1] G. J. Foschini, “Layered space-time architecture for wireless communications in a fading environment when using multiple antennas,” Bell Labs Tech. J., vol. 1, no. 2, pp. 41–59, Autumn 1996. [2] I. E. Telatar, “Capacity of multi-antenna Gaussian channels,” Europ. Trans. Telecommun., vol. 10, no. 6, pp. 585–595, Nov./Dec. 1999. [3] A. Grant, “Rayleigh fading multi-antenna channels,” EURASIP J. Appl. Signal Process., vol. 2002, no. 3, pp. 316–329, Mar. 2002. [4] H. Shin and J. H. Lee, “Capacity of multi-antenna fading channels: spatial fading correlation, double scattering, and keyhole,” IEEE Trans. Inf. Theory, vol. 49, no. 10, pp. 2636–2647, Oct. 2003. Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply. 3596 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 57, NO. 12, DECEMBER 2009 )} 𝑑 { ( ˜ ln 1 𝐹1 (−𝜐; 2; −Θ) 𝑑𝜐 𝜐=0 = = = [5] Y.-H. Kim and A. Lapidoth, “On the log determinant of non-central Wishart matrices,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), June 2003, p. 54. [6] S. Jin and X. Gao, “Tight lower bounds on the ergodic capacity of Ricean fading MIMO channels,” in Proc. IEEE Int. Conf. Commun. (ICC), May 2005, vol. 4, pp. 2412–2416. [7] —–, “Tight upper bound on the ergodic capacity of the Rician fading MIMO channels,” in Proc. IEEE Wireless Commun. Networking Conf. (WCNC), Mar. 2005, vol. 1, pp. 402–407. [8] M. R. McKay and I. B. Collings, “On the capacity of frequencyflat and frequency-selective Rician MIMO channels with single-sided correlation,” IEEE Trans. Wireless Commun., vol. 5, no. 8, pp. 2038– 2043, Aug. 2005. [9] X. W. Cui, Q. T. Zhang, and Z. M. Feng, “Generic procedure for tightly bounding the capacity of MIMO correlated Rician fading channels,” IEEE Trans. Commun., vol. 53, no. 5, pp. 890–898, May 2005. [10] S. Jin, X. Gao, and X. You, “On the ergodic capacity of rank-1 Riceanfading MIMO channels,” IEEE Trans. Inf. Theory, vol. 53, no. 2, pp. 502–517, Feb. 2007. [11] M. R. McKay and I. B. Collings, “Improved general lower bound for spatially-correlated Rician MIMO capacity,” IEEE Commun. Lett., vol. 10, no. 3, pp. 162–164, Mar. 2006. [12] J. Salo, F. Mikas, and P. Vainikainen, “An upper bound on the ergodic mutual information in Rician fading MIMO channels,” IEEE Trans. Wireless Commun., vol. 5, no. 6, pp. 1415–1421, June 2006. )} 𝑑 { ( ˜ ln etr(−Θ)1 𝐹1 (𝜐 + 2; 2; Θ) 𝑑𝜐 𝜐=0 )} 𝑑 { ( ˜ ln 1 𝐹1 (𝜐 + 2; 2; Θ) 𝑑𝜐 𝜐=0 𝑑 ˜ 𝑑𝜐 1 𝐹1 (𝜐 + 2; 2; Θ) ˜ (2; 2; Θ) 𝜐=0 1 𝐹1 (46) [13] M. R. McKay and I. B. Collings, “General capacity bounds for spatially correlated Rician MIMO channels,” IEEE Trans. Inf. Theory, vol. 51, no. 9, pp. 3121–3145, Sep. 2005. [14] M. R. McKay, P. J. Smith, and I. B. Collings, “New properties of complex noncentral quadratic forms and bounds on MIMO mutual information,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), July 2006, pp. 1209–1213. [15] F. Bøhagen, P. Orten, G. E. Øien, and S. de la Kethulle de Ryhove, “Exact capacity expressions for dual-branch Ricean MIMO systems,” IEEE Trans. Commun., vol. 56, no. 12, pp. 2214–2222, Dec. 2008. [16] I. Sarris and A. R. Nix, “Design and performance assessment of high-capacity MIMO architectures in the presence of a line-of-sight component,” IEEE Trans. Veh. Technol., vol. 56, no. 4, pp. 2194–2202, July 2007. [17] M. Matthaiou, D. I. Laurenson, and C.-X. Wang, “Capacity study of vehicle-to-roadside MIMO channels with a line-of-sight component,” in Proc. IEEE Wireless Commun. Networking Conf. (WCNC), Mar. 2008, pp. 775–779. [18] Ö. Oyman, R. Nabar, H. Bölcskei, and A. Paulraj, “Characterizing the statistical properties of mutual information in MIMO channels,” IEEE Trans. Signal Process., vol. 51, no. 11, pp. 2782–2795, Nov. 2003. [19] M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions With Formulas, Graphs, and Mathematical Tables. New York: Dover, 1972. Authorized licensed use limited to: T U MUENCHEN. Downloaded on December 13, 2009 at 09:20 from IEEE Xplore. Restrictions apply.
© Copyright 2026 Paperzz