Here, we first devise the optimal transmit scheme for each BS and then derive a closed-form upper bound on the ergodic sum capacity using matrix permanents. Based on the capacity bound, we develop low-complexity power allocation solutions using convex optimization techniques, followed by a discussion of the beamforming optimality conditions for the BSs.
3.1 Optimal transmit scheme
We here assume that the mobile receiver has perfect instantaneous CSI, whereas the BSs have only statistical CSI, including Ut,i, Ur, Di, and Mi (and thus Ωi) for i = 1, 2, ..., m; this information can be exchanged among the BSs via the wired backbone. Under these assumptions, the ergodic sum capacity of the downlink system is achieved by selecting the transmitted signal vector x to follow a zero-mean proper Gaussian distribution [1]. Let , where
(11)
with . The power constraint (2) can be rewritten as , for i = 1, 2, ..., m, and the ergodic sum capacity is given by
(12)
where . Let , for i = 1, ..., m, with Ui being the eigenvector matrix and Λi the diagonal matrix of the corresponding eigenvalues. The following theorem addresses the optimal transmit direction of each BS.
Theorem 1 The ergodic sum capacity is achieved if the BS transmit signals are all mutually independent (i.e., , for i ≠ j) and the eigenvector matrix of the optimal input covariance for the jointly-correlated channel (4) is given by Ui = Ut,i. The ergodic sum capacity is then expressed as
(13)
Proof: From (4) and (5), the channel matrix H can be expressed as
(14)
where
Defining and substituting (14) into (12) yields
(16)
where
(17)
Note that the optimization condition is met since . Now, define Πl for 1 ≤ l ≤ Nt as diagonal matrices whose diagonal entries are all 1s except for the (l, l)th entry, which is −1. As Πl is a unitary matrix, (17) can be written as
(18)
Note that is given by (15) and has the same distribution as : since D is a diagonal matrix, and the entries of M ⊙ Hiid are independent with symmetric distributions, reversing the sign of some columns does not alter the distribution. Thus, we have
(19)
From Jensen's inequality, it follows that [37–39]
(20)
where the matrix has entries equal to those of except for the off-diagonal entries in the lth row and lth column, which are zero. In particular, its trace is identical to that of . As a result, nulling the off-diagonal entries of any column and the corresponding row of can only increase . Applying the same process Nt times, (17) is maximized with a diagonal , i.e., . As a result, we have , or
(21)
where Λ = diag{Λ1, Λ2, ..., Λm}. As such, (16) can be rewritten as (13).
Theorem 1 reveals that the transmitted signals of all BSs should be mutually independent and that the optimal signaling directions of the i-th BS align with the eigenvectors of the transmit-side correlation matrix of the MIMO channel of the i-th BS. This result extends the prior results in [29, 37, 40, 41] to the more general channel model given by (4).
3.2 Ergodic sum capacity upper bound
After determining the optimal transmit directions of the BSs, the remaining challenge is to determine the eigenvalues of the capacity-achieving input covariance matrix Qii for i = 1, ..., m. This is equivalent to optimally allocating the available transmit power over the optimized transmit eigen-directions determined by Theorem 1.
In the most general case, it is difficult to derive exact closed-form solutions for the power allocation problem. The main obstacle lies in the complexity of evaluating the expectation in (13), which is usually done by stochastic averaging over a large number of random samples. In this section, our approach is to derive a tight upper bound for the expectation in (13), which can serve as an approximation to the ergodic capacity. Based on this, we develop closed-form power allocation solutions, which will be presented in Section 3.3.
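To illustrate the stochastic-averaging baseline that the closed-form bound is designed to avoid, the following is a minimal Monte Carlo sketch. It is illustrative only: an i.i.d. Rayleigh channel with equal-power input stands in for the paper's jointly-correlated model and optimized covariance, and all dimensions are hypothetical.

```python
import numpy as np

def ergodic_capacity_mc(n_r, n_t, snr, num_samples=1000, seed=0):
    """Estimate E[log2 det(I + (snr/n_t) H H^H)] by averaging over random
    channel draws. An i.i.d. CN(0,1) channel and equal-power input are used
    purely for illustration."""
    rng = np.random.default_rng(seed)
    total = 0.0
    for _ in range(num_samples):
        # i.i.d. complex Gaussian entries, unit variance
        H = (rng.standard_normal((n_r, n_t))
             + 1j * rng.standard_normal((n_r, n_t))) / np.sqrt(2)
        G = np.eye(n_r) + (snr / n_t) * H @ H.conj().T
        total += np.log2(np.linalg.det(G).real)  # det of I + PSD is real > 0
    return total / num_samples
```

Each capacity evaluation requires thousands of determinant computations, which is exactly the cost the permanent-based bound of Section 3.2 sidesteps.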
Due to the concavity of the log(·) function, C is upper bounded by
(22)
where
(23)
with , in which λi denotes a vector containing the eigenvalues for and i = 1, ..., m. The upper bound (22) can be rewritten as
(24)
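The direction of this Jensen-type bound can be checked numerically: for any empirical set of channel samples, the average of the log-determinant never exceeds the log of the average determinant. A small self-contained check (i.i.d. Rayleigh samples and illustrative dimensions, not the paper's channel model):

```python
import numpy as np

rng = np.random.default_rng(1)

# Draw determinants det(I + rho * H H^H) for random 2x2 complex Gaussian H
dets = []
for _ in range(2000):
    H = (rng.standard_normal((2, 2))
         + 1j * rng.standard_normal((2, 2))) / np.sqrt(2)
    dets.append(np.linalg.det(np.eye(2) + 1.0 * H @ H.conj().T).real)
dets = np.array(dets)

# Concavity of log: mean of log det never exceeds log of mean det
lhs = np.mean(np.log2(dets))   # estimates the ergodic capacity (13)
rhs = np.log2(np.mean(dets))   # estimates the upper bound (22)
assert lhs <= rhs
```

The gap between the two quantities is what determines the tightness of the bound in Theorem 2.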
The derivation of the expectation in (23) relies heavily on linear-algebraic concepts and the properties of matrix permanents. The permanent of a matrix is defined in a similar fashion to the determinant; the primary difference is that, when taking the expansion over minors, all signs are positive. The permanents of M × N matrices have been investigated in [28, 42]. We introduce the definitions and properties of matrix permanents in Appendix 1.
Building on these definitions, we extend the results of [28] to the case of multiple BSs and derive a closed-form expression for the upper bound on the ergodic sum capacity.
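As an illustration of the permanent's structure, here is a minimal Python sketch of Ryser's inclusion-exclusion formula for a square matrix; the rectangular M × N case treated in [28, 42] is analogous but omitted for brevity.

```python
from itertools import combinations

def permanent(A):
    """Permanent of a square matrix via Ryser's inclusion-exclusion formula.
    Cost is O(2^n * n^2) -- still exponential, but far cheaper than the
    naive n! expansion over minors."""
    n = len(A)
    total = 0.0
    for k in range(1, n + 1):
        for cols in combinations(range(n), k):
            prod = 1.0
            for row in A:
                prod *= sum(row[c] for c in cols)  # row sum over chosen cols
            total += (-1) ** k * prod
    return (-1) ** n * total
```

For example, the permanent of [[1, 2], [3, 4]] is 1·4 + 2·3 = 10, since both products in the expansion enter with a positive sign (the determinant would give 1·4 − 2·3 = −2).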
Theorem 2 The ergodic sum capacity in (13) is upper bounded by
(25)
where.
Proof: We start by letting
(26)
where . The upper bound of mutual information (23) can be rewritten as
(27)
By using the known result [28, Theorem 2], E(λ) can be expressed as
(28)
Substituting (28) into (27) and using (24) completes the proof.
From Theorem 2, we can see that the upper bound on the ergodic sum capacity depends on the average SNR and the eigenmode channel coupling matrices Ωi, for i = 1, 2, ..., m. Low-complexity algorithms for computing the matrix permanent were developed in [28].
3.3 Optimizing the power allocation policies
We now consider the transmitter power allocation optimization problem. Based on the upper bound in Theorem 2, we develop low-complexity power allocation solutions using convex optimization techniques and then propose a simple iterative water-filling algorithm (IWFA) for approaching the optimal power allocation policy.
From (25), the power allocation optimization problem can be formulated as
(29)
(30)
The above problem is a concave maximization problem [28], and its solution can be computed using standard convex optimization algorithms. In the following, we derive necessary and sufficient conditions for the optimal solution using the Karush-Kuhn-Tucker (KKT) conditions.
Theorem 3 The expected mutual information upper bound is concave with respect to λ, and the necessary and sufficient conditions for the optimal power allocation are given by
(31)
(32)
where
(33)
(34)
B = [Γ1Ω1Λ1, ..., ΓmΩmΛm], Bi(j) denotes the block matrix obtained by replacing the ith sub-matrix (i.e., ΩiΛi) of B by Ωi(j)diag(λi(j)), Bi[j] denotes the block matrix obtained by replacing the ith sub-matrix of B by Ωi diag(λi[j]), Ωi(j) denotes the sub-matrix of Ωi obtained by deleting the jth column, λi(j) denotes the vector obtained by deleting the jth element of λi, and λi[j] denotes the vector obtained by replacing the jth element of λi by unity. In addition, (a)+ = max{0, a} and is chosen to satisfy the power constraints in (32).
Proof: See Appendix 2.
Since the right-hand side of (31) is independent of , we propose a simple IWFA to compute the optimal power allocation policy satisfying (31). Simulation results, to be given in Section 4, will demonstrate that the proposed approach works very well and is highly efficient, typically converging after only a few iterations, with the first iteration already achieving near-optimal performance. The proposed algorithm includes the following steps:
Step 1 Initialize , and k = 0.
Step 2 Calculate and , for , and i = 1, ..., m.
Step 3 Calculate , for , and i = 1, ..., m, with the power constraints for i = 1, ..., m.
Step 4 Calculate .
Step 5 If set , and recalculate .
Step 6 Set k := k + 1 and return to Step 2 until the algorithm converges or the iteration number is equal to a predefined value.
In the above, the superscript k indicates the value of the corresponding variable in the kth iteration, so that λk stands for the value of λ in the kth iteration. In Step 1 of the first iteration, λ is initialized to 1, i.e., equal-power allocation. Note, however, that λ could also be initialized differently. For example, the channel statistics are expected to change smoothly from frame to frame, in which case a more appropriate initialization would be the optimal value of λ from the previous frame. In Step 3, the conventional water-filling algorithm is performed with the required variables p(λi(j)) and q(λi(j)) calculated in Step 2. Following the calculation of in Step 4, Step 5 is performed to guarantee convergence [28]. In Step 6, convergence can be determined by checking whether is less than some predefined value for a given precision.
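Step 3 invokes the conventional water-filling algorithm. Purely as an illustration of that inner step, the following is a standard water-filling sketch over generic per-eigenmode gains; the gains here are hypothetical stand-ins for the effective quantities derived from p(λi(j)) and q(λi(j)) in Step 2.

```python
import numpy as np

def water_filling(gains, p_total):
    """Classical water-filling: maximize sum(log(1 + g*p)) subject to
    sum(p) = p_total, p >= 0. Returns the allocation p with
    p_i = (mu - 1/g_i)^+ and the water level mu."""
    g = np.asarray(gains, dtype=float)
    inv = np.sort(1.0 / g)            # inverse gains, strongest modes first
    # Find the number k of active modes: mu = (P + sum of k smallest 1/g) / k
    k = len(g)
    while k > 0:
        mu = (p_total + inv[:k].sum()) / k
        if mu > inv[k - 1]:           # all k candidate modes get positive power
            break
        k -= 1
    p = np.maximum(mu - 1.0 / g, 0.0)
    return p, mu
```

With gains [2, 0.5] and a power budget of 1, the routine pours all power onto the strong mode (the weak mode stays below the water level), matching the threshold behavior discussed in Section 3.4.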
3.4 Optimality of beamforming
Here, we investigate the optimality of beamforming (i.e., rank-one transmission) [29–34] in the context of multi-BS cooperation systems and derive a necessary and sufficient condition for its optimality. For BS i, we assume that the transmit eigenmodes satisfy the following conditions
(35)
where and , for and i = 1, ..., m.
Theorem 4
For multi-BS cooperation systems, the transmit covariance matrices of all the BSs that achieve the sum capacity are of unit-rank (i.e., beamforming is optimal for all the BSs) if and only if the following inequality is fulfilled:
(36)
where and are the first and jth columns of , respectively, with defined as in (15), and dij and mij are the jth columns of Di and Mi, respectively.
Proof: See Appendix 3.
Note that the proof is a nontrivial generalization of the techniques in [33] to jointly-correlated MIMO multi-BS cooperation systems. We can make the following observations.
-
If for are i.i.d., the left-hand side of (36) remains unchanged when j varies from 2 to , and the right-hand side of (36) is maximized for j = 2. This means that if the condition holds for j = 2, it also holds for all other j. Thus, inserting j = 2 into (36) gives the following condition:
(37)
-
If the LOS component D = 0 and , where al and bi,j are the square-roots of the eigenvalues of the receive correlation matrix Φr and the transmit correlation matrix Φt,i, respectively, then the channel degenerates to the Kronecker channel. In this case, it can be proved that (37) reduces to the beamforming optimality condition in [33, Theorem 2].
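The threshold character of such beamforming-optimality conditions can be illustrated on a deterministic two-eigenmode toy channel (not the paper's jointly-correlated model): for eigenmode gains γ1 > γ2, water-filling concentrates all power on the dominant eigenmode, i.e., rank-one transmission is optimal, exactly when the power budget P satisfies P ≤ 1/γ2 − 1/γ1.

```python
import numpy as np

def capacity(p, gains):
    """Sum rate of independent eigenmodes with powers p and gains `gains`."""
    return sum(np.log2(1.0 + g * pi) for g, pi in zip(gains, p))

# Two transmit eigenmodes of a deterministic toy channel, gamma1 > gamma2
gamma = np.array([2.0, 0.5])
threshold = 1.0 / gamma[1] - 1.0 / gamma[0]  # beamforming optimal iff P <= 1.5

P = 1.0                                      # budget below the threshold
bf = capacity([P, 0.0], gamma)               # rank-one (beamforming) allocation
ep = capacity([P / 2, P / 2], gamma)         # equal-power allocation
assert bf > ep                                # beamforming wins below threshold
```

Above the threshold the water level rises past 1/γ2, the weak mode receives power, and rank-one transmission is no longer optimal; conditions such as (36) generalize this scalar threshold to the jointly-correlated multi-BS setting.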