Nonlinear joint transmit-receive processing for coordinated multicell systems: centralized and decentralized
EURASIP Journal on Advances in Signal Processing volume 2015, Article number: 10 (2015)
Abstract
This paper proposes a nonlinear joint transmit-receive (tx-rx) processing scheme for downlink coordinated multicell systems with multistream multiantenna users. The nonlinear joint tx-rx processing is formulated as an optimization problem that maximizes the minimum signal-to-interference-plus-noise ratio (SINR) of the streams, to guarantee fairness among the streams of each user. Nonlinear Tomlinson-Harashima precoding (THP) is applied at the transmitters, and linear receive processing is applied at the receivers, to eliminate inter-user and inter-stream interference. We consider multicell systems under two coordinated modes, centralized and decentralized, corresponding to systems with high- and low-capacity backhaul links, respectively. For the centralized coordinated mode, the transmit and receive processing matrices are jointly determined by the central processing unit based on the global channel state information (CSI) shared by the base stations (BSs). For the decentralized coordinated mode, the transmit and receive processing matrices are computed independently based on the local CSI at each BS. Correspondingly, we propose a centralized and a decentralized algorithm to solve the optimization problem under the two modes. The feasibility and computational complexity of the proposed algorithms are also analyzed. Simulation results show that the proposed nonlinear joint tx-rx processing scheme achieves user fairness by equalizing the bit error rate (BER) among the streams of each user and that it outperforms the existing linear joint tx-rx processing. Moreover, consistent with previous research results, the proposed centralized nonlinear joint tx-rx processing scheme is shown to perform better than the decentralized one.
1 Introduction
Coordinated multicell transmission is a promising technology for reducing intercell interference and increasing user data rates, and it has been considered as one of the potential technologies for LTE-Advanced [1,2]. To fully exploit the advantage of coordinated multicell technology, it is essential to manage the multiuser interference (MUI) within the coordinated area appropriately, as it is directly related to the achievable spectrum efficiency [3]. Precoding is a well-known technique for MUI mitigation in multiuser multiple-input multiple-output (MU-MIMO) systems [4,5]. Joint transmit-receive (tx-rx) processing can further improve the downlink performance of MU-MIMO systems by optimizing the transmit precoding and receive filter matrices jointly. According to how the transmit precoding is performed, joint tx-rx processing can be divided into two types: linear and nonlinear schemes.
The coordinated multicell technology can be implemented in a centralized or a decentralized mode, depending on the backhaul capacity of the system. The centralized coordinated mode can achieve a higher data rate at the cost of high-capacity backhaul links, which enable the base stations (BSs) to share their channel state information (CSI) (defined as local CSI) and data. Hence, the centralized approach is limited to systems with sufficient backhaul capacity [6,7]. The decentralized coordinated mode does not require the BSs to share their local CSI, and the precoding or tx-rx processing is conducted at each BS [8]. This approach places lower requirements on backhaul link capacity, at some loss in data rate compared with the centralized coordinated mode.
In recent years, joint tx-rx processing in coordinated multicell systems has been widely studied under either the centralized [9-19] or the decentralized mode [20-23]. For the design of nonlinear joint tx-rx processing, many different optimization objectives have been considered, such as minimizing the sum mean square error (SMSE) or maximizing the SINR; yet, fairness among the streams of each user has not been addressed for coordinated multicell systems with multistream multiantenna users.
1.1 Prior art
Linear joint tx-rx processing algorithms have been widely studied for coordinated multicell systems under the centralized mode [9-14]. In [9], block diagonalization (BD) precoding was designed to maximize the weighted sum rate of all users. Tx-rx processing optimization with the criterion of minimizing the SMSE was presented in [10-12], and the authors of [13] proposed a weighted SMSE minimization algorithm that uses the channel gain as the weight factor. In [14], energy efficiency was considered in the tx-rx processing design: a new criterion of maximizing the weighted sum energy efficiency was formulated, and the optimization problem was solved by an iterative algorithm. For the decentralized coordinated mode, D. Gesbert and R. Holakouei, et al. recently studied decentralized linear precoding techniques for systems with single-antenna users [20-22]. In [20], a distributed precoding scheme based on the zero-forcing (ZF) criterion (defined as DZF) and several centralized power allocation approaches were proposed. In [21,22], a characterization of the optimal linear precoding strategy was derived. Distributed virtual SINR (DVSINR) precoding approaches, in which each BS balances the ratio between the signal gain at the intended user and the interference caused to other users, were proposed for the particular case of two users in [21] and generalized to multiple users in [22]. The DVSINR scheme was shown to satisfy the optimal precoding characterization and to outperform DZF.
Compared with linear joint tx-rx processing schemes, nonlinear joint tx-rx processing schemes are more complex but can obtain more system gain, and they have gained much attention recently. Most research on nonlinear precoding focuses on Tomlinson-Harashima precoding (THP), as it achieves performance close to that of the optimal dirty paper coding at much lower complexity [5]. For the centralized coordinated mode, the tx-rx processing scheme was designed to minimize the SMSE in [15] and to maximize the SINR in [16]; both problems must be solved by an iterative method, resulting in high computational complexity. Low-complexity schemes with closed-form solutions were derived based on the minimum average bit error rate (BER) in [17], the minimum mean square error (MMSE) in [18], and the ZF criterion in [19]. In [18], the receive processing matrix was first computed from the CSI. Then, the transmit processing matrix and the receive weight coefficient were computed based on the MMSE criterion. In [19], the algorithm decomposed the MU-MIMO channel into parallel independent single-user MIMO (SU-MIMO) channels, and closed-form expressions of the transmit and receive processing matrices were then derived to optimize the performance of each user. The above works on nonlinear tx-rx processing were all developed for the centralized coordinated mode. Relevant works for the decentralized coordinated mode are fewer. A decentralized nonlinear precoding scheme, ZF-THP, was proposed in [23], but it can only be applied to systems with a single user. To the best of our knowledge, for systems with multistream multiantenna users, tx-rx processing solutions under the decentralized coordinated mode have not been addressed in the literature.
Previous work did not consider fairness among the streams of each user in coordinated multicell systems with multistream multiantenna users. It is essential to study fairness for nonlinear schemes, as unfairness is an inherent characteristic of THP and the worst stream determines the overall performance of the user [24].
1.2 Contributions
In this paper, a nonlinear joint tx-rx processing scheme is proposed to improve fairness among the streams of each multiantenna user. The nonlinear joint tx-rx processing is formulated as an optimization problem that maximizes the minimum SINR of the streams. The performance of the proposed scheme is evaluated under both the centralized and the decentralized coordinated mode. Two algorithms for solving the optimization problem are derived.
The main work of this paper can be summarized as follows.

A nonlinear joint tx-rx processing scheme is developed for a coordinated multicell system with multistream multiantenna users under two coordinated modes, the centralized and the decentralized mode.

Two algorithms, a centralized and a decentralized one, are proposed to solve the optimization problem, and both yield closed-form solutions.

The algorithms guarantee fairness among the streams of each user, which not only boosts the performance of each user but also greatly facilitates the modulation/demodulation and coding/decoding procedures.
The remainder of this paper is organized as follows. Section 2 presents the coordinated multicell system model. The proposed nonlinear joint txrx processing scheme is described in detail in Section 3. A performance analysis of the proposed algorithms is developed in Section 4. Simulation results and conclusions are presented in Section 5 and Section 6, respectively.
1.3 Notation
We use uppercase boldface letters to denote matrices and lowercase boldface letters to denote vectors. The operators \( (\cdot)^T \), \( (\cdot)^H \), \( (\cdot)^{\dagger} \), \( E(\cdot) \), and \( \mathrm{Tr}(\cdot) \) stand for the transpose, Hermitian transpose, Moore-Penrose pseudoinverse, expectation, and trace of a matrix, respectively. \( \mathrm{diag}(\cdot) \) and \( \mathrm{blockdiag}(\cdot) \) denote a diagonal and a block diagonal matrix. \( \mathbf{I} \) and \( \mathbf{0} \) are the identity and the all-zero matrix, respectively, with appropriate dimensions. \( \|\cdot\|_F \) represents the Frobenius norm of a matrix. \( [\cdot]_{i:j,k:l} \) denotes the submatrix comprising rows i through j and columns k through l of a matrix.
2 System model
Consider a downlink coordinated multicell system, where N BSs cooperatively serve K users. Each BS is equipped with \( n_t \) antennas and each user with \( n_r \) antennas. All BSs share user data and cooperatively transmit the data to an intended user. Each BS transmits \( L={\displaystyle {\sum}_{k=1}^K{l}_k} \) data streams to the K users, where \( l_k \) is the number of transmitted data streams for user k.
We assume that BSs' transmit power for every user is P. Therefore, the total transmit power of BSs is KP. Denote \( {\mathbf{x}}^k={\left[{\mathbf{x}}_1^{k^T},\cdots, {\mathbf{x}}_N^{k^T}\right]}^{\mathrm{T}} \), where \( {\mathbf{x}}_n^k \) denotes the preprocessed signal transmitted by the nth BS for user k, satisfying \( \mathrm{T}\mathrm{r}\left\{{\mathbf{x}}^k{\mathbf{x}}^{k^H}\right\}=P \). The received signal of the kth user is:
where \( {\mathbf{H}}^k=\left[{\mathbf{H}}_1^k,\cdots, {\mathbf{H}}_N^k\right] \) is the global CSI between the BSs and the kth user and \( {\mathbf{H}}_n^k\in {C}^{n_r\times {n}_t} \) denotes the local CSI between the nth BS and the kth user, whose entries are independent and identically distributed (i.i.d.) complex Gaussian variables with zero mean and unit variance. In Equation 1, the second term on the right-hand side is the MUI, and \( {\mathbf{n}}^k\sim CN\left(\mathbf{0},{\sigma}^2{\mathbf{I}}_{n_r}\right) \) is the additive white Gaussian noise vector.
Each user decodes the desired data by multiplying with the receive processing matrix. The received data of the kth user is given as:
where \( {\mathbf{R}}^k\in {C}^{l_k\times {n}_{\mathrm{r}}} \) denotes the receive processing matrix of the kth user and \( {\tilde{\mathbf{n}}}^k={\mathbf{R}}^k{\mathbf{n}}^k \) is the equivalent received noise vector at the kth user.
Let \( \mathbf{y}={\left({\mathbf{y}}^{1^T},\cdots, {\mathbf{y}}^{K^T}\right)}^T \) represent the received signal of the K users. Equation 2 can be expressed as:
where \( \mathbf{R}=\mathrm{blockdiag}\left({\mathbf{R}}^1,\cdots, {\mathbf{R}}^K\right) \) is an \( L\times K{n}_r \) matrix. \( \mathbf{H}={\left[{\mathbf{H}}^{1^{\mathrm{T}}},\cdots, {\mathbf{H}}^{K^{\mathrm{T}}}\right]}^{\mathrm{T}}\in {C}^{K{n}_r\times {N}_t} \) is the global CSI between the BSs and the K users. \( {\mathbf{H}}_n={\left[{\mathbf{H}}_n^{1^T},\cdots, {\mathbf{H}}_n^{K^T}\right]}^T\in {C}^{K{n}_r\times {n}_t} \) denotes the local CSI between the nth BS and the K users. \( \mathbf{x}={\displaystyle {\sum}_{k=1}^K{\mathbf{x}}^k} \) denotes the transmit signal at the BSs, and \( {\mathbf{x}}_n={\displaystyle {\sum}_{k=1}^K{\mathbf{x}}_n^k} \) is the transmit signal at the nth BS. \( \tilde{\mathbf{n}}={\left({\tilde{\mathbf{n}}}^{1^T},\cdots, {\tilde{\mathbf{n}}}^{K^T}\right)}^T \) collects the received noise at the K users.
Define \( \boldsymbol{\Lambda} \left(k,t\right)={\mathbf{R}}^k{\displaystyle {\sum}_{n=1}^N{\mathbf{H}}_n^k{\mathbf{x}}_n^t}={\mathbf{R}}^k{\mathbf{H}}^k{\mathbf{x}}^t \). The rate of the kth user is given by
Then, the system sum rate can be obtained by \( r={\displaystyle {\sum}_{k=1}^K{r}^k} \).
The coverage of the N coordinated BSs is defined as one coordinated area. We mainly focus on the interference within the coordinated area. Interference from other coordinated areas is ignored in this paper; it can be eliminated by intercell interference coordination technology [25] or interference alignment technology [26].
For the centralized coordinated mode, it is assumed that all BSs exchange their local CSI, and the tx-rx processing matrices are jointly designed at the central processing unit. The system can be seen as a virtual MU-MIMO system with \( {N}_t=N{n}_t \) transmit antennas. In contrast, for the decentralized coordinated mode, the BSs do not share their CSI, and every BS only has knowledge of the local CSI between itself and the K users. Therefore, the tx-rx processing matrices are independently designed at each BS.
3 Nonlinear joint transmitreceive processing algorithm
In this section, we present nonlinear joint tx-rx processing algorithms for a coordinated multicell system under two different coordinated modes. The algorithm structure is shown first. Then, we formulate the optimization problem, which aims at maximizing the minimum SINR of the streams to guarantee fairness among the streams of each user. Finally, the algorithms for the different coordinated modes are proposed.
3.1 Algorithm structure
The structure of the proposed algorithm is shown in Figure 1. In the proposed algorithms, nonlinear preprocessing is applied at transmitters; meanwhile, linear processing is applied at each receiver.
At the nth \( \left(n=1,\dots, N\right) \) transmitter, \( \mathbf{s}={\left({\mathbf{s}}^{1^T},\cdots, {\mathbf{s}}^{K^T}\right)}^T\in {C}^{L\times 1} \) denotes the modulated data vector satisfying \( E\left\{\mathbf{s}{\mathbf{s}}^H\right\}=\mathbf{I} \), where \( {\mathbf{s}}^k \) comprises the \( l_k \) data streams for the kth user. In THP, the feedback matrix \( {\mathbf{B}}_n \) is a unit lower triangular matrix,
where \( {\mathbf{B}}_n^{k,k} \) is a unit lower triangular matrix of size \( {l}_k\times {l}_k \), and \( {\mathbf{u}}_n \) is the output data of THP. The lth data stream of \( {\mathbf{u}}_n \) is therefore interfered with by the first \( \left(l-1\right) \) data streams; in other words, the lth \( \left(l=2,\cdots, L\right) \) element \( {u}_n^l \) of \( {\mathbf{u}}_n \) is a linear combination of \( {s}^j\left(j\le l\right) \). Assume that an M-ary square constellation is employed for \( \mathbf{s} \). To ensure that the real and imaginary parts of \( {u}_n^l \) are constrained to \( \left(-\sqrt{M},\sqrt{M}\right] \), the modulo-\( 2\sqrt{M} \) operation \( {\mathrm{mod}}_{2\sqrt{M}}\left(\cdot \right) \) is introduced. The output data of THP is expressed as:
where \( {\mathbf{d}}_n=2\sqrt{M}\left({\mathbf{z}}_I+j{\mathbf{z}}_Q\right) \), and the entries of \( {\mathbf{z}}_I \) and \( {\mathbf{z}}_Q \) are integers. Define \( {\mathbf{v}}_n=\mathbf{s}+{\mathbf{d}}_n \); then, \( {\mathbf{u}}_n \) is written as:
There is a power enhancement of \( \tau =M/\left(M-1\right) \) due to THP, i.e., \( E\left\{{\mathbf{u}}_n{\mathbf{u}}_n^H\right\}=\tau \mathbf{I} \) [27]. The transmit signal at the nth BS is:
where \( {\mathbf{F}}_n=\left[{\mathbf{F}}_n^1,\cdots, {\mathbf{F}}_n^K\right]\in {C}^{n_t\times L} \) is the transmit processing matrix.
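The THP feedback loop described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `mod_sym`, `thp_precode`, and `power_enhancement`, the example feedback matrix, and the 16-QAM symbols are assumptions made for illustration, and the modulo here folds into the half-open interval [-sqrt(M), sqrt(M)) rather than (-sqrt(M), sqrt(M)], which differs only at the boundary points.

```python
import math

def mod_sym(x, half_width):
    """Fold a real value into [-half_width, half_width)."""
    period = 2.0 * half_width
    return x - period * math.floor((x + half_width) / period)

def thp_precode(s, B, M):
    """Return u such that B u = s + d, with d on the 2*sqrt(M) integer grid."""
    a = math.sqrt(M)
    u = []
    for l, s_l in enumerate(s):
        # Interference from the already precoded streams 0..l-1
        # (strictly lower triangular part of B).
        interference = sum(B[l][j] * u[j] for j in range(l))
        x = complex(s_l) - interference
        # Apply the modulo separately to the real and imaginary parts.
        u.append(complex(mod_sym(x.real, a), mod_sym(x.imag, a)))
    return u

def power_enhancement(M):
    """Precoding power enhancement tau = M / (M - 1) quoted in the text."""
    return M / (M - 1)

# Hypothetical example: 16-QAM symbols and an arbitrary unit lower triangular B.
M = 16
s = [3 + 3j, -1 + 1j, 1 - 3j]
B = [[1, 0, 0],
     [0.8 - 0.3j, 1, 0],
     [-1.2 + 0.5j, 0.6 + 0.9j, 1]]
u = thp_precode(s, B, M)

# v = B u should equal s + d, where d / (2*sqrt(M)) has integer real and imaginary parts.
v = [sum(B[l][j] * u[j] for j in range(len(u))) for l in range(len(u))]
d = [(v_l - s_l) / (2 * math.sqrt(M)) for v_l, s_l in zip(v, s)]
```

Because each output element is folded back into the constellation region, the receiver can undo the offset `d` with the same modulo operation, which is why only a linear receive filter is needed.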
At the receivers, the received signal in Equation 3 can be rewritten as:
where \( \mathbf{F}=\mathrm{diag}\left({\mathbf{F}}_1,\cdots, {\mathbf{F}}_N\right) \) is a block diagonal matrix of size \( {N}_t\times NL \) and \( {\mathbf{C}}_{{\mathbf{B}}^{-1}}={\left[{\left({\mathbf{B}}_1^{-1}\right)}^T,\cdots, {\left({\mathbf{B}}_N^{-1}\right)}^T\right]}^T\in {C}^{NL\times L} \). The user data is finally obtained by the modulo operation and demodulation. The received noise power of the \( l_k \) streams of the kth user is:
3.2 Problem formulation
From Equation 1, it can be seen that the received signal of every user is affected by MUI. To free every user from MUI, the relevant matrices in this algorithm are designed to satisfy the ZF criterion:
where \( \mathbf{W}=\mathrm{diag}\left(\sqrt{P/\left(\tau {l}_1\right)}{\mathbf{I}}_{l_1},\cdots, \sqrt{P/\left(\tau {l}_K\right)}{\mathbf{I}}_{l_K}\right) \). The SINR of the kth user can be obtained as:
To guarantee fairness among the streams of each user, we design the tx-rx processing matrices to maximize the minimum SINR over the streams of each user, which is formulated as follows:
for \( k=1,\cdots, K \), \( n=1,\cdots, N \). Constraint (a) denotes the ZF criterion. Constraint (b) states that \( {\mathbf{B}}_n \) is a unit lower triangular matrix, where \( {\mathbf{S}}_i=\left[{\mathbf{I}}_i,{\mathbf{0}}_{i\times \left(L-i\right)}\right] \) and \( {\mathbf{e}}_i \) is the ith column of \( {\mathbf{I}}_L \). Constraint (c) guarantees the power constraint, with \( {\mathbf{F}}^k={\left[{\mathbf{F}}_1^{k^H},\cdots, {\mathbf{F}}_N^{k^H}\right]}^H \).
3.3 Centralized algorithm
In Equation 13, the relevant matrices are coupled with each other. To solve this problem, we start from the ZF constraint. Every BS is assumed to have the same feedback matrix, denoted as \( \mathbf{B} \), which is determined at the central processing unit based on the global CSI. Constraint (a) in Equation 13 can be rewritten as
where \( {\mathbf{F}}^{\prime }={\left[{\mathbf{F}}_1^T,\cdots, {\mathbf{F}}_N^T\right]}^T \). As \( \mathbf{B} \) is a unit lower triangular matrix, the left side of Equation 14 should satisfy:
which reveals that \( {\mathbf{F}}^k={\left[{\mathbf{F}}_1^{k^T},\cdots, {\mathbf{F}}_N^{k^T}\right]}^T\in {C}^{N_t\times {l}_k}\left(k>1\right) \) lies in the null space of \( {\overset{\smile }{\mathbf{H}}}^k={\left[{\overline{\mathbf{H}}}^{1^{\mathrm{T}}},\cdots, {\overline{\mathbf{H}}}^{{\left(k-1\right)}^{\mathrm{T}}}\right]}^{\mathrm{T}} \), where \( {\overline{\mathbf{H}}}^i={\mathbf{R}}^i{\mathbf{H}}^i \) is the equivalent CSI of the ith user. \( {\mathbf{F}}^k \) can be found by performing a singular value decomposition (SVD) of \( {\overset{\smile }{\mathbf{H}}}^k \):
We assume that \( {\mathbf{F}}^k \) is represented as \( {\mathbf{F}}^k={\overline{\mathbf{F}}}^k{\overline{\overline{\mathbf{F}}}}^k \). Then, \( {\overline{\mathbf{F}}}^k={\mathbf{V}}_0^k\in {C}^{N_t\times \left[{N}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\right]} \) is called the transmit space matrix, and \( {\overline{\overline{\mathbf{F}}}}^k \) is the transmit diversity matrix of size \( \left[{N}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\right]\times {l}_k \).
The above analysis applies to the kth \( \left(k>1\right) \) user. Since the first user is not limited by Equation 15, we use \( {\overline{\mathbf{F}}}^1={\mathbf{I}}_{N_t} \). \( {\overline{\mathbf{F}}}^k \) can be formed as:
With \( {\overline{\mathbf{F}}}^k \) and THP, the proposed algorithm decomposes the MU-MIMO channel into parallel independent SU-MIMO channels [19]. This can be understood as follows: for the kth user, \( {\overline{\mathbf{F}}}^k \) is designed to avoid the interference from users \( \left(k+1,\cdots, K\right) \). Meanwhile, THP is used to eliminate the interference from the first \( \left(k-1\right) \) users. Therefore, user k does not suffer from MUI. For any user \( k\left(k=1,\cdots, K\right) \), \( {\mathbf{B}}^{k,k},{\overline{\overline{\mathbf{F}}}}^k \) and \( {\mathbf{R}}^k \) satisfy:
where \( {\mathbf{B}}^{k,k} \) is a unit lower triangular matrix of size \( {l}_k\times {l}_k \). Therefore, \( {\mathbf{B}}^{k,k},{\overline{\overline{\mathbf{F}}}}^k \) and \( {\mathbf{R}}^k\left(k=1,\cdots, K\right) \) can be designed separately. For the kth user, Equation 13 can be reduced to:
where \( {\mathbf{S}}_i^k=\left[{\mathbf{I}}_i,{\mathbf{0}}_{i\times \left({l}_k-i\right)}\right] \), and \( {\mathbf{e}}_i^k \) is the ith column of \( {\mathbf{I}}_{l_k} \).
The optimal solution of Equation 19 can be obtained from the generalized triangular decomposition of \( {\mathbf{H}}^k{\overline{\mathbf{F}}}^k \) [19,28]:
where \( {\mathbf{Q}}^k\in {C}^{n_r\times S} \) and \( {\mathbf{P}}^k\in {C}^{S\times \left[{N}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\right]} \) have orthonormal columns, and S is the rank of \( {\mathbf{H}}^k{\overline{\mathbf{F}}}^k \). \( {\mathbf{D}}^k\in {C}^{S\times S} \) is a lower triangular matrix, the diagonal elements of which satisfy:
where \( {\lambda}_i^k \) is the ith largest positive singular value of \( {\mathbf{H}}^k{\overline{\mathbf{F}}}^k \), and \( {\overline{\lambda}}^k={\left({\displaystyle {\prod}_{j=1}^{l_k}{\lambda}_j^k}\right)}^{1/{l}_k} \). Define \( {\boldsymbol{\Lambda}}^k={\left({\overline{\lambda}}^k\right)}^{-1}{\mathbf{I}}_{l_k} \). Then \( {\mathbf{R}}^k,{\overline{\overline{\mathbf{F}}}}^k \) and \( {\mathbf{B}}^{k,k} \) are given by:
Based on Equation 10 and Equation 22, the received noise power of every stream of the kth user is \( {\sigma}^2{\left({\overline{\lambda}}^k\right)}^{-2} \). Therefore, the SINR of the kth user is:
It can be seen that every stream of the kth user can achieve equal SINR.
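As a rough numerical illustration of this equalization, the sketch below computes the geometric mean of the \( l_k \) largest singular values and the resulting common per-stream SINR. It assumes, based on the definition of \( \mathbf{W} \) and the noise power stated above, that the per-stream SINR is the signal power \( P/(\tau l_k) \) divided by the noise power \( \sigma^2 (\overline{\lambda}^k)^{-2} \); the function name `equalized_sinr` and the numbers in the example are hypothetical.

```python
import math

def equalized_sinr(singular_values, l_k, P, tau, sigma2):
    """Return (lam_bar, sinr) for l_k streams that all see the geometric-mean gain."""
    # Keep the l_k largest singular values of the effective channel.
    top = sorted(singular_values, reverse=True)[:l_k]
    # Geometric mean of the selected singular values.
    lam_bar = math.prod(top) ** (1.0 / l_k)
    signal_power = P / (tau * l_k)        # squared diagonal entry of W (assumed)
    noise_power = sigma2 / lam_bar ** 2   # sigma^2 * lam_bar^(-2)
    return lam_bar, signal_power / noise_power

# Hypothetical example: two streams, 16-QAM (tau = 16/15), unit noise power.
lam_bar, sinr = equalized_sinr([4.0, 1.0], l_k=2, P=10.0, tau=16 / 15, sigma2=1.0)
```

With singular values 4 and 1, a plain per-eigenmode transmission would give the two streams very different gains, whereas the geometric-mean design gives both streams the same gain of 2.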
Note that computing the transmit space matrix of the kth user, \( {\overline{\mathbf{F}}}^k \), requires the receive processing matrices of the first \( \left(k-1\right) \) users, \( {\mathbf{R}}^t\left(t<k\right) \), and that computing the receive processing matrix of the kth user, \( {\mathbf{R}}^k \), requires the transmit space matrix of the kth user. Therefore, \( {\overline{\mathbf{F}}}^k \) and \( {\mathbf{R}}^k \) are designed step by step: the transmit space matrix and the receive processing matrix of the first user are computed first, then the matrices of the second user are computed using the receive processing matrix of the first user, and so on.
All of the matrices are designed at the central processing unit, and the receive processing matrices are transmitted to each user by downlink channel. The procedure of the proposed centralized algorithm is summarized in Table 1.
3.4 Decentralized algorithm
In this scenario, as the BSs do not exchange their local CSI, each BS independently preprocesses the user data with its own local CSI. The data processed by each BS cannot be obtained by the other BSs. To ensure that the received signal of each user is not affected by MUI, the relevant matrices at each BS should satisfy the ZF criterion. Therefore, Equation 11 is reduced to:
where \( {\mathbf{W}}_n=\mathrm{diag}\left({p}_n^1{\mathbf{I}}_{l_1},\cdots, {p}_n^K{\mathbf{I}}_{l_K}\right) \) satisfies \( {\displaystyle {\sum}_{n=1}^N{\mathbf{W}}_n}=\mathbf{W} \).
The receive processing matrix \( {\mathbf{R}}^k\left(k=1,\cdots, K\right) \) of each user is related to the transmit signals from all N BSs. If \( {\mathbf{R}}^k \) were computed at the BSs, each BS could only determine it independently, as the local CSI of each BS is not exchanged. In general, the \( {\mathbf{R}}^k \) derived at different BSs would take different values, which is not acceptable. Otherwise, each user would have to exchange information frequently with all coordinated BSs, which would greatly increase the system computational complexity. Therefore, we first compute \( {\mathbf{R}}^k\left(k=1,\cdots, K\right) \) at the users. Denote \( {\mathbf{H}}^k=\left[{\mathbf{U}}_1^k\kern0.3em {\mathbf{U}}_0^k\right]{\boldsymbol{\Sigma}}^k{\mathbf{V}}^{k^H} \) as the SVD of \( {\mathbf{H}}^k \), where \( {\mathbf{U}}_1^k\in {C}^{n_r\times {l}_k} \). Then \( {\mathbf{R}}^k \) is obtained as \( {\mathbf{R}}^k={\mathbf{G}}^k{\left({\mathbf{U}}_1^k\right)}^H \), where \( {\mathbf{G}}^k \) is a diagonal matrix that normalizes the received signal and is determined at the BSs. In a frequency division duplex system, user k can only feed back the equivalent local CSI \( {\overline{\mathbf{H}}}_n^k={\left({\mathbf{U}}_1^k\right)}^H{\mathbf{H}}_n^k \) to the nth BS. Therefore, Equation 10 can be rewritten as:
Assume that every BS has equal transmit power \( p=P/N \). Based on the above analysis, Equation 13 is equivalent to the following optimization problem:
for \( k=1,\cdots, K \), \( n=1,\cdots, N \). In (a), \( {\overline{\mathbf{H}}}_n={\left[{\overline{\mathbf{H}}}_n^{1^T},\cdots, {\overline{\mathbf{H}}}_n^{K^T}\right]}^T \). In (b), \( {g}_j\left(j=1,2,\cdots, L\right) \) are the diagonal elements of \( \mathbf{G} \). Constraints (d) and (e) guarantee the power constraint.
In Equation 26, the relevant matrices are again coupled with each other. As before, we start from the ZF constraint and take the nth BS as an example. Constraint (a) in Equation 26 can be rewritten as:
As \( \mathbf{G} \) and \( {\mathbf{W}}_n \) are diagonal matrices and \( {\mathbf{B}}_n \) is a unit lower triangular matrix, the left side of Equation 27 is a lower triangular matrix, i.e.,
which reveals that \( {\mathbf{F}}_n^k\left(k>1\right) \) lies in the null space of \( {\overset{\smile }{\mathbf{H}}}_n^k={\left[{\overline{\mathbf{H}}}_n^{1^T},\cdots, {\overline{\mathbf{H}}}_n^{{\left(k-1\right)}^T}\right]}^T \). It can be found by performing an SVD of \( {\overset{\smile }{\mathbf{H}}}_n^k \):
We assume that \( {\mathbf{F}}_n^k \) is represented as \( {\mathbf{F}}_n^k={\overline{\mathbf{F}}}_n^k{\overline{\overline{\mathbf{F}}}}_n^k \). We define \( {\overline{\mathbf{F}}}_n^k={\mathbf{V}}_{n0}^k\in {C}^{n_t\times \left[{n}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\right]} \), and \( {\overline{\overline{\mathbf{F}}}}_n^k \) is a \( \left[{n}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\right]\times {l}_k \) matrix.
The above analysis applies to the kth \( \left(k>1\right) \) user. Since the first user is not limited by Equation 28, we use \( {\overline{\mathbf{F}}}_n^1={\mathbf{I}}_{n_t} \). \( {\overline{\mathbf{F}}}_n^k \) can be obtained by:
Similarly, with \( {\overline{\mathbf{F}}}_n^k \) and THP, the algorithm decomposes the MU-MIMO channel into parallel independent SU-MIMO channels. Define \( {\overline{\overline{\mathbf{H}}}}_n^k={\overline{\mathbf{H}}}_n^k{\overline{\mathbf{F}}}_n^k \). For any user \( k\left(k=1,\cdots, K\right) \), \( {\mathbf{B}}_n^{k,k},{\overline{\overline{\mathbf{F}}}}_n^k \) and \( {\mathbf{G}}^k \) satisfy:
Therefore, \( {\mathbf{B}}_n^{k,k},{\overline{\overline{\mathbf{F}}}}_n^k \) and \( {\mathbf{G}}^k\left(k=1,\cdots, K\right) \) can be designed separately. For the kth user, Equation 26 can be reduced to:
where \( {\mathbf{S}}_i^k=\left[{\mathbf{I}}_i,{\mathbf{0}}_{i\times \left({l}_k-i\right)}\right] \), and \( {\mathbf{e}}_i^k \) is the ith column of \( {\mathbf{I}}_{l_k} \). Constraint (d) is obtained because \( {\overline{\mathbf{F}}}_n^{k^H}{\overline{\mathbf{F}}}_n^k=\mathbf{I} \).
The optimal solution of Equation 32 is obtained when all \( l_k \) streams attain equal SINR [29]. According to Equation 12 and Equation 25, this is equivalent to the diagonal elements of \( {\mathbf{G}}^k \) taking equal values, expressed as \( {\mathbf{G}}^k={\alpha}^k\mathbf{I} \). Equation 32 is then equivalent to:
where \( {\gamma}_n={\alpha}^k/{p}_n^k \). Constraint (a) in Equation 33 can be rewritten as \( {\overline{\overline{\mathbf{F}}}}_n^k={\gamma}_n^{-1}{\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }}{\mathbf{B}}_n^{k,k} \). Combining this with (c), \( {\gamma}_n^2p/\tau =\mathrm{Tr}\left\{{\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }}{\mathbf{B}}_n^{k,k}{\left({\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }}{\mathbf{B}}_n^{k,k}\right)}^H\right\} \) is obtained. The problem of Equation 33 can be rewritten as:
Note that \( {\mathbf{B}}_n^{k,k}{\mathbf{e}}_i^k \) is the ith column of \( {\mathbf{B}}_n^{k,k} \). The objective of Equation 34 is equivalent to minimizing \( {\left\Vert {\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }}{\mathbf{B}}_n^{k,k}{\mathbf{e}}_i^k\right\Vert}^2 \) for every \( i\left(i=1,\cdots, {l}_k\right) \).
Let \( {\mathbf{L}}_n^i\left(i=1,\cdots, {l}_k\right) \) represent the ith column of \( {\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }} \). \( {\mathbf{Y}}_n^i \) comprises columns \( \left(i+1,\cdots, {l}_k\right) \) of \( {\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }} \), i.e., \( {\mathbf{Y}}_n^i=\left[{\mathbf{L}}_n^{i+1},\cdots, {\mathbf{L}}_n^{l_k}\right] \). Then, we obtain:
By differentiating Equation 35 with respect to \( {\mathbf{b}}_n^i \) and setting the result to zero, \( {\mathbf{b}}_n^i \) is obtained as:
Therefore, the ith column of \( {\mathbf{B}}_n^{k,k} \) is obtained by:
Then, \( {\mathbf{B}}_n^{k,k} \) is obtained by combining all columns \( {\mathbf{B}}_n^{k,k}{\mathbf{e}}_i^k\left(i=1,\cdots, {l}_k\right) \). Therefore, we can derive \( {\gamma}_n^2=\tau {\left\Vert {\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }}{\mathbf{B}}_n^{k,k}\right\Vert}_F^2/p \). By combining (a) and (d) in Equation 33, \( {\mathbf{G}}^k={\alpha}^k\mathbf{I} \) is obtained, where:
Finally, \( {\mathbf{B}}_n \) is determined by \( {\mathbf{B}}_n={\mathbf{W}}_n^{-1}\mathbf{R}{\mathbf{H}}_n{\mathbf{F}}_n \).
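The column-by-column construction of the feedback matrix can be sketched in plain Python as a sequence of small least-squares problems. This is an illustrative reading of the derivation, not the authors' code: it assumes the minimizer of \( \| \mathbf{L}_n^i + \mathbf{Y}_n^i \mathbf{b}_n^i \|^2 \) is found from the normal equations, and the helper names (`solve`, `feedback_column`, `build_columns`, `gamma_squared`) and the example columns are hypothetical.

```python
def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    aug = [list(row) + [rhs] for row, rhs in zip(A, b)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[piv] = aug[piv], aug[c]
        for r in range(n):
            if r != c:
                f = aug[r][c] / aug[c][c]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[c])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def inner(x, y):
    """Hermitian inner product x^H y."""
    return sum(complex(a).conjugate() * b for a, b in zip(x, y))

def feedback_column(L_col, Y_cols):
    """Return b minimizing ||L + Y b||^2 via (Y^H Y) b = -Y^H L."""
    gram = [[inner(ci, cj) for cj in Y_cols] for ci in Y_cols]
    rhs = [-inner(ci, L_col) for ci in Y_cols]
    return solve(gram, rhs)

def build_columns(L_cols):
    """For each i, return L^i + Y^i b^i, i.e. the ith column of (H_dagger B)."""
    out = []
    for i, L_col in enumerate(L_cols):
        Y_cols = L_cols[i + 1:]          # columns i+1..l_k of the pseudoinverse
        col = list(L_col)
        if Y_cols:                       # the last column has no free entries
            b = feedback_column(L_col, Y_cols)
            for coef, y in zip(b, Y_cols):
                col = [c + coef * yv for c, yv in zip(col, y)]
        out.append(col)
    return out

def gamma_squared(columns, tau, p):
    """gamma_n^2 = tau * ||H_dagger B||_F^2 / p."""
    fro2 = sum(abs(x) ** 2 for col in columns for x in col)
    return tau * fro2 / p

# Hypothetical columns of the pseudoinverse of the effective channel (l_k = 3).
L_cols = [[1, 1j, 0], [0.5, 1, 1j], [0, 1, 2]]
cols = build_columns(L_cols)
g2 = gamma_squared(cols, tau=16 / 15, p=1.0)
```

Each resulting column is orthogonal to the later columns it was allowed to combine with, which is the usual optimality condition of a least-squares fit.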
In this algorithm, \( {\mathbf{R}}^k\left(k=1,\cdots, K\right) \) are designed at the users, and the other matrices are designed at the BSs. The design of the matrices is performed independently at every BS. \( {\left\Vert {\overline{\overline{\mathbf{H}}}}_n^{k^{\dagger }}{\mathbf{B}}_n^{k,k}\right\Vert}_F^2\left(n=1,\cdots, N\right) \) should be transmitted to the kth \( \left(k=1,\cdots, K\right) \) user over the downlink channel so that \( {\mathbf{G}}^k \) can be obtained at the kth user. The procedure of the proposed decentralized algorithm is described in Table 2.
3.5 Remark 1 (applicability)
It should be noted that the two proposed algorithms are also suitable for systems in which a single data stream is transmitted to each user. Moreover, both algorithms are applicable to non-coordinated systems. However, the centralized scheme is recommended for non-coordinated systems, as the decentralized scheme is a suboptimal solution in this situation.
4 Performance analysis
From Equation 23 and Equation 38, it can be seen that both proposed algorithms achieve equal SINR for every stream of a user. They thus guarantee balanced performance among the streams of each user, which greatly facilitates the modulation/demodulation and coding/decoding procedures. In this section, we analyze the feasibility and the computational complexity of the two proposed algorithms.
4.1 Feasibility analysis
In the MIMO system, in order to distinguish every transmit stream, the constraint that the number of transmit data streams is no more than the number of transmit and receive antennas should be satisfied. For the centralized coordinated mode and the decentralized coordinated mode, the constraint on the number of transmit data streams is specified as follows:
Lemma 1: For the centralized coordinated mode, the number of transmit data streams is bounded by \( L\le {N}_t \), \( {l}_k\le {n}_r \); for the decentralized coordinated mode, the number of transmit data streams is bounded by \( L\le {n}_t \), \( {l}_k\le {n}_r \).
In the proposed centralized algorithm, the design of the transmit space matrix \( {\overline{\mathbf{F}}}^k\left(k=1,\cdots, K\right) \) requires \( {N}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}>0 \). Furthermore, to guarantee that the optimization problem of Equation 19 has solutions, \( S\ge {l}_k \) is required. As the entries of \( {\mathbf{H}}^k{\overline{\mathbf{F}}}^k \) are zero-mean complex Gaussian variables, the rank of \( {\mathbf{H}}^k{\overline{\mathbf{F}}}^k \) is \( S= \min \left({n}_r,{N}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\right) \) with probability 1. Therefore, \( S\ge {l}_k \) is the necessary condition for carrying out the algorithm. Based on Lemma 1, this necessary condition is satisfied in the centralized coordinated system. Therefore, the proposed centralized algorithm is feasible.
In the proposed decentralized algorithm, \( {n}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}>0 \) is required to guarantee the existence of the transmit space matrix \( {\overline{\mathbf{F}}}_n^k\left(k=1,\cdots, K,n=1,\cdots, N\right) \). Moreover, the solution of the optimization problem of Equation 34 requires \( {n}_t-{\displaystyle {\sum}_{i=1}^{k-1}{l}_i}\ge {l}_k \), which is satisfied in the decentralized coordinated system. Therefore, the proposed decentralized algorithm is feasible.
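The feasibility conditions above amount to simple bookkeeping on the stream counts, which the following sketch makes concrete. The function name `feasible` and the example configuration are assumptions for illustration; `n_tx` stands for \( N_t = N n_t \) in the centralized mode and for \( n_t \) in the decentralized mode.

```python
def feasible(streams, n_r, n_tx):
    """Check l_k <= n_r and n_tx - sum_{i<k} l_i >= l_k for every user k."""
    used = 0  # streams already allocated to users 1..k-1
    for l_k in streams:
        if l_k > n_r or n_tx - used < l_k:
            return False
        used += l_k
    return True

# Hypothetical setup: N = 2 BSs with n_t = 4 antennas each, three users
# with n_r = 2 antennas and l_k = 2 streams each (L = 6).
N, n_t, n_r = 2, 4, 2
streams = [2, 2, 2]
centralized_ok = feasible(streams, n_r, N * n_t)   # L = 6 <= N_t = 8
decentralized_ok = feasible(streams, n_r, n_t)     # L = 6 >  n_t = 4
```

In this example the centralized mode can serve all three users, while a single BS acting on local CSI runs out of null-space dimensions at the third user, in line with the bounds of Lemma 1.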
4.2 Computational complexity
For simplicity, the number of floating-point operations is used to measure the computational complexity of the proposed algorithms.
In the proposed centralized algorithm, the design of the relevant matrices for the kth user includes the following: a one-time multiplication of an \( l_{k-1} \times n_r \) matrix and an \( n_r \times N_t \) matrix, the complexity of which is \( O\left(l_{k-1} n_r N_t\right) \); a one-time computation of the null space of a \( \sum_{i=1}^{k-1} l_i \times N_t \) matrix with \( O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 N_t\right) \) complexity; a one-time multiplication of an \( n_r \times N_t \) matrix and an \( N_t \times \left[N_t - \sum_{i=1}^{k-1} l_i\right] \) matrix, the complexity of which is \( O\left(n_r N_t \left[N_t - \sum_{i=1}^{k-1} l_i\right]\right) \); and a one-time computation of the singular values of an \( n_r \times \left[N_t - \sum_{i=1}^{k-1} l_i\right] \) matrix with \( O\left(n_r^2 \left[N_t - \sum_{i=1}^{k-1} l_i\right]\right) \) complexity. Therefore, the complexity of designing the relevant matrices for the kth user is \( O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 N_t + n_r^2 N_t + n_r N_t^2\right) \).
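The text does not spell out how the four step costs collapse into the per-user total. One way to see it, using only \( l_{k-1} \le n_r \) from Lemma 1 and \( N_t - \sum_{i=1}^{k-1} l_i \le N_t \), is the following worked step:

```latex
\begin{align*}
O\left(l_{k-1} n_r N_t\right) &\subseteq O\left(n_r^2 N_t\right)
  && \text{since } l_{k-1} \le n_r, \\
O\left(n_r N_t \left[N_t - \sum_{i=1}^{k-1} l_i\right]\right) &\subseteq O\left(n_r N_t^2\right)
  && \text{since } N_t - \sum_{i=1}^{k-1} l_i \le N_t, \\
O\left(n_r^2 \left[N_t - \sum_{i=1}^{k-1} l_i\right]\right) &\subseteq O\left(n_r^2 N_t\right).
\end{align*}
```

Adding the null-space term \( O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 N_t\right) \), which is kept unchanged, gives the stated per-user bound.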
In the proposed decentralized algorithm, every BS has the same computational complexity. For any BS, the design of the relevant matrices for the kth user includes the following: a one-time computation of the singular vectors of an \( n_r \times n_t \) matrix with \( O\left(n_r^2 n_t\right) \) complexity; a one-time multiplication of an \( l_k \times n_r \) matrix and an \( n_r \times n_t \) matrix, the complexity of which is \( O\left(l_k n_r n_t\right) \); a one-time computation of the null space of a \( \sum_{i=1}^{k-1} l_i \times n_t \) matrix with \( O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 n_t\right) \) complexity; a one-time multiplication of an \( l_k \times n_t \) matrix and an \( n_t \times \left[n_t - \sum_{i=1}^{k-1} l_i\right] \) matrix, the complexity of which is \( O\left(l_k n_t \left[n_t - \sum_{i=1}^{k-1} l_i\right]\right) \); and \( l_k \) computations of the Moore-Penrose pseudoinverse of a \( \left[n_t - \sum_{i=1}^{k-1} l_i\right] \times \left[l_k - i\right] \) \( (i = 1, \cdots, l_k) \) matrix, the complexity of which is \( O\left(\left[n_t - \sum_{i=1}^{k-1} l_i\right] \sum_{i=1}^{l_k} \left[l_k - i\right]^2\right) \). The complexity of other scalar computations can be ignored. Therefore, the complexity of designing the relevant matrices for the kth user is \( O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 N_t + n_r^2 N_t + N l_k n_t^2 + l_k^3 N_t\right) \).
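The final expression contains \( N \) and \( N_t \) although each step is stated for a single BS. One reading that reproduces it term by term, under the assumptions that \( N_t = N n_t \) and that the per-BS costs are summed over the N BSs, is sketched below; the text itself does not state this step.

```latex
\begin{align*}
N \cdot O\left(n_r^2 n_t + l_k n_r n_t\right) &\subseteq O\left(n_r^2 N_t\right)
  && \text{since } l_k \le n_r, \\
N \cdot O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 n_t\right)
  &= O\left(\left(\sum_{i=1}^{k-1} l_i\right)^2 N_t\right), \\
N \cdot O\left(l_k n_t \left[n_t - \sum_{i=1}^{k-1} l_i\right]\right)
  &\subseteq O\left(N l_k n_t^2\right), \\
N \cdot O\left(\left[n_t - \sum_{i=1}^{k-1} l_i\right] \sum_{i=1}^{l_k} \left[l_k - i\right]^2\right)
  &\subseteq O\left(l_k^3 N_t\right)
  && \text{since } \sum_{i=1}^{l_k} \left(l_k - i\right)^2 \le l_k^3 .
\end{align*}
```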
Assume that the number of data streams is equal for every user, i.e., \( l_1 = \cdots = l_K = l \). Then the complexity of the proposed centralized algorithm is \( O\left(K L^2 N_t + K n_r^2 N_t + K n_r N_t^2\right) \), and that of the decentralized algorithm is \( O\left(K L^2 N_t + K n_r^2 N_t + N L n_t^2 + K l^3 N_t\right) \).
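To get a feel for the relative size of the two expressions, they can be evaluated for a concrete configuration. The sketch below is illustrative only: the function names are hypothetical, it assumes \( L = K l \) and \( N_t = N n_t \), and because big-O notation hides constant factors the returned numbers indicate relative scale rather than actual operation counts.

```python
def centralized_cost(K, l, n_r, N, n_t):
    """Dominant-term estimate K*L^2*N_t + K*n_r^2*N_t + K*n_r*N_t^2."""
    L = K * l      # assumption: L is the total number of data streams
    N_t = N * n_t  # assumption: N_t is the total number of transmit antennas
    return K * L**2 * N_t + K * n_r**2 * N_t + K * n_r * N_t**2


def decentralized_cost(K, l, n_r, N, n_t):
    """Dominant-term estimate K*L^2*N_t + K*n_r^2*N_t + N*L*n_t^2 + K*l^3*N_t."""
    L = K * l
    N_t = N * n_t
    return K * L**2 * N_t + K * n_r**2 * N_t + N * L * n_t**2 + K * l**3 * N_t


if __name__ == "__main__":
    # Figure 4 setup: K = 3 users, l = 2 streams, n_r = 3, N = 3 cells, n_t = 6.
    print(centralized_cost(3, 2, 3, 3, 6))    # 5346
    print(decentralized_cost(3, 2, 3, 3, 6))  # 3510
```

For this setup the two expressions differ only in their last terms: \( K n_r N_t^2 \) for the centralized algorithm against \( N L n_t^2 + K l^3 N_t \) for the decentralized one.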
4.3 Remark 2 (backhaul latency effect)
For the centralized coordinated mode, the tx-rx processing matrices are jointly computed at the central processing unit and then reported to every BS through the backhaul link. The resulting backhaul latency can affect the system performance. We ignore the backhaul latency effect in this paper and leave its study to future work.
5 Numerical results and discussions
This section presents simulation results to evaluate the BER performance of the two proposed algorithms. We compare them with the following algorithms: the interference-free algorithm, the joint transmit-receive processing algorithm proposed in [19], and the centralized BD (CBD) and decentralized BD (DBD). As the traditional BD cannot be directly applied in a decentralized manner, in DBD the receive processing matrix is first derived with the same method used for the receive processing matrix in the proposed decentralized algorithm, and the precoding matrix is then derived based on the ZF criterion. For the system with a single stream transmitted to each user, i.e., \( l_k = 1 \) \( (k = 1, \ldots, K) \), we also compare the proposed decentralized algorithm with DZF [20] and DVSINR [22]. Flat Rayleigh fading channels are considered in the simulations. The elements of the channels are i.i.d. complex Gaussian variables with zero mean and unit variance. A 64QAM modulation scheme is employed. The signal-to-noise ratio (SNR) is defined as \( \mathrm{SNR} = P/(SM\sigma^2) \), where \( M = 4 \) is the signal constellation size and S is the average number of data streams transmitted to each user.
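The channel model and the SNR definition above translate directly into a few lines of code. The sketch below is a hedged illustration, not the authors' simulator: the function names are hypothetical, P is taken to be a transmit power (the section does not define it), and only the Python standard library is used.

```python
import math
import random


def noise_variance(snr_db, power, streams_per_user, m):
    """Solve SNR = P / (S * M * sigma^2) for sigma^2, with the SNR given in dB."""
    snr_linear = 10.0 ** (snr_db / 10.0)
    return power / (streams_per_user * m * snr_linear)


def rayleigh_channel(n_r, n_t, rng):
    """n_r x n_t matrix of i.i.d. zero-mean, unit-variance complex Gaussians."""
    std = math.sqrt(0.5)  # real and imaginary parts each carry half the variance
    return [[complex(rng.gauss(0.0, std), rng.gauss(0.0, std))
             for _ in range(n_t)] for _ in range(n_r)]


if __name__ == "__main__":
    rng = random.Random(0)           # fixed seed, used only for reproducibility
    H = rayleigh_channel(3, 6, rng)  # n_r = 3, n_t = 6 as in Figure 2
    sigma2 = noise_variance(snr_db=10.0, power=1.0, streams_per_user=2, m=4)
    print(len(H), len(H[0]), sigma2)
```

Splitting the unit variance equally between the real and imaginary parts is the usual convention for a circularly symmetric complex Gaussian entry; it is assumed here rather than stated in the text.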
5.1 Balanced BER performance among streams of each user
Figure 2 verifies the balanced performance among the streams of each user in the proposed algorithms. We consider a 3-cell coordinated system with \( n_t = 6 \) transmit antennas and \( K = 3 \) users, each equipped with \( n_r = 3 \) receive antennas. There are \( l_k = 2 \) \( (k = 1, \ldots, K) \) data streams transmitted to each user. The BER performance of the two streams of the first user and of the third user is shown. It can be seen that the two streams of any user achieve approximately equal BER, for both the centralized and the decentralized algorithm. The simulation results are in accordance with the theoretical analysis.
5.2 BER performance comparison of different algorithms
Figure 3 presents the BER performance comparison of the six algorithms. A 2-cell coordinated system with \( n_t = 6 \) transmit antennas and \( K = 3 \) users, each equipped with \( n_r = 5 \) antennas, is considered. The number of data streams transmitted to each user is set to 2, i.e., \( l_k = 2 \) \( (k = 1, \ldots, K) \). Note that in the interference-free algorithm, only a single user is served by the BSs. On the whole, centralized algorithms perform better than decentralized algorithms, at the cost of information exchange among BSs. For the proposed algorithms, the centralized algorithm achieves about a 7-dB gain relative to the decentralized algorithm at \( \mathrm{BER} = 10^{-3} \). Compared with the existing algorithms at \( \mathrm{BER} = 10^{-3} \), the proposed centralized algorithm has an approximately 5-dB gain over the algorithm in [19] and a 10-dB gain over CBD. Also, the proposed decentralized algorithm achieves about a 10-dB gain relative to DBD at \( \mathrm{BER} = 10^{-2} \).
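For readers who prefer linear units, the dB gains quoted above convert to power ratios via \( 10^{x/10} \). A small sketch with a hypothetical helper name:

```python
def db_to_linear(gain_db):
    """Convert a power gain in dB to a linear power ratio."""
    return 10.0 ** (gain_db / 10.0)


for gain_db in (5, 7, 10):
    print(f"{gain_db} dB -> x{db_to_linear(gain_db):.2f}")
# 5 dB -> x3.16
# 7 dB -> x5.01
# 10 dB -> x10.00
```

Read this way, a 7-dB gain at a fixed BER means that the weaker scheme needs roughly five times the SNR, in linear terms, to reach the same error rate.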
In Figure 4, we consider the BER performance of a 3-cell coordinated system with \( n_t = 6 \) transmit antennas and \( K = 3 \) users, each equipped with \( n_r = 3 \) antennas. The number of data streams transmitted to each user is set to 2, i.e., \( l_k = 2 \) \( (k = 1, \ldots, K) \). As noted for Figure 3, in the interference-free algorithm, only a single user is served by the BSs. As can be seen from Figure 4, centralized algorithms have better BER performance than decentralized algorithms. The proposed centralized algorithm has a lower BER than the algorithm in [19] and CBD, and the proposed decentralized algorithm achieves better performance than DBD. Compared with Figure 3, the performance gains among the algorithms differ, as they depend on the system configuration.
In Figure 5, the BER performance of the proposed decentralized algorithm for the system with a single stream transmitted to each user, i.e., \( l_k = 1 \) \( (k = 1, \ldots, K) \), is verified and compared with the existing decentralized algorithms DZF [20] and DVSINR [22]. A 3-cell coordinated system with \( n_t = 6 \) transmit antennas and \( K = 5 \) users, each equipped with \( n_r = 3 \) antennas, is considered. As can be seen from Figure 5, the proposed decentralized algorithm has a lower BER than the other algorithms. At \( \mathrm{BER} = 10^{-3} \), it achieves an approximately 7-dB gain over DZF and a 5-dB gain over DVSINR.
5.3 Effect of the number of receive antennas and the number of users on centralized algorithms
Figure 6 illustrates the effect of the number of receive antennas on the proposed centralized algorithm, the algorithm in [19], and CBD. We consider a 3-cell coordinated system with \( n_t = 6 \) transmit antennas and \( K = 3 \) users. The number of data streams transmitted to each user is set to 2, i.e., \( l_k = 2 \) \( (k = 1, \ldots, K) \). As can be seen from Figure 6, the performance difference between the proposed centralized algorithm and the algorithm in [19] increases with the number of receive antennas. In the proposed centralized algorithm, the receive processing matrix is taken into account in the MU-MIMO channel decomposition. Compared with the algorithm in [19], the decomposed SU-MIMO channels have larger dimensions, which increases the system diversity gain and improves the system performance. With a larger number of receive antennas, the decomposed SU-MIMO channels keep the same dimensions in the proposed centralized algorithm but have smaller dimensions in the algorithm in [19]. Therefore, as the number of receive antennas increases, the proposed centralized algorithm achieves a larger performance gain over the algorithm in [19].
In Figure 7, the effect of the number of users on the proposed centralized algorithm, the algorithm in [19], and CBD is illustrated. We consider a 3-cell coordinated system with \( n_t = 6 \) transmit antennas and \( n_r = 3 \) receive antennas. The number of data streams transmitted to each user is set to 2, i.e., \( l_k = 2 \) \( (k = 1, \ldots, K) \). As can be seen from Figure 7, a larger number of users enlarges the performance differences among the algorithms. In CBD, the tx-rx processing matrices of each user are used to eliminate the interference of all other users. In contrast, in the proposed centralized algorithm and the algorithm in [19], they are used to eliminate the interference of only part of the other users, which leaves more spatial dimensions for the diversity gain.
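The dimension argument can be made concrete with a small count. In the sketch below, `successive_dims` uses the column count \( N_t - \sum_{i=1}^{k-1} l_i \) of the transmit space matrix from Section 4.1. `full_nulling_dims` is an assumption made only for illustration: it supposes that eliminating the interference of all other users costs \( l_i \) dimensions per user, which the text does not state for CBD. All function names are hypothetical, and \( N_t = N n_t \) is assumed.

```python
def successive_dims(streams, total_tx):
    """Transmit dimensions left for user k when only users 1..k-1 are nulled."""
    dims, used = [], 0
    for l_k in streams:
        dims.append(total_tx - used)
        used += l_k
    return dims


def full_nulling_dims(streams, total_tx):
    """Assumed CBD-style count: every other user's l_i dimensions are nulled."""
    total = sum(streams)
    return [total_tx - (total - l_k) for l_k in streams]


if __name__ == "__main__":
    N_t = 3 * 6  # 3 cells with n_t = 6, assuming N_t = N * n_t
    for K in (3, 5):
        streams = [2] * K
        print(K, successive_dims(streams, N_t), full_nulling_dims(streams, N_t))
```

Under these assumptions, the first user keeps 18 dimensions against 14 for K = 3, and 18 against 10 for K = 5, so the gap widens as users are added, in line with the trend described for Figure 7.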
5.4 BER performance of the proposed algorithms in a non-coordinated system
In Figure 8, we illustrate the performance of the proposed algorithms for the non-coordinated system with \( N = 1 \) and compare them with CBD. In this situation, CBD is equivalent to the traditional BD in a single-cell MIMO system. An MU-MIMO system with \( n_t = 8 \) transmit antennas and \( K = 4 \) users, each equipped with \( n_r = 2 \) antennas, is considered. The number of data streams transmitted to each user is set to 2, i.e., \( l_k = 2 \) \( (k = 1, \ldots, K) \). It is shown that the proposed algorithms achieve better BER performance than BD, and that the proposed decentralized algorithm is only a suboptimal scheme for the non-coordinated system, as part of the receive processing matrix is not jointly derived with the transmit processing matrix. In this situation, the proposed centralized algorithm achieves a lower BER than the proposed decentralized algorithm, with an approximately 6-dB gain at \( \mathrm{BER} = 10^{-2} \).
6 Conclusions
Nonlinear joint tx-rx processing for a coordinated multicell system with multi-stream multi-antenna users has been studied. The capacity of the backhaul link determines the coordinated mode among BSs, either centralized or decentralized. The centralized algorithm derives the tx-rx processing matrices jointly at the central processing unit. The decentralized algorithm allows each BS to design its transmit precoding in a decentralized manner, which alleviates the demand on the backhaul capacity. The analysis and simulation results show that the centralized algorithm achieves better performance than the decentralized algorithm, and that the proposed algorithms achieve better performance than the existing joint tx-rx processing algorithms and the decentralized linear precoding schemes.
References
M Sawahashi, Y Kishiyama, A Morimoto, D Nishikawa, M Tano, Coordinated multipoint transmission/reception techniques for LTE-advanced [Coordinated and Distributed MIMO]. IEEE Wireless Commun. 17(3), 26–34 (2010)
D Lee, H Seo, B Clerckx, E Hardouin, D Mazzarese, S Nagata, K Sayana, Coordinated multipoint transmission and reception in LTE-advanced: deployment scenarios and operational challenges. IEEE Commun. Mag. 50(2), 148–155 (2012)
D Gesbert, S Hanly, H Huang, S Shamai Shitz, O Simeone, W Yu, Multicell MIMO cooperative networks: a new look at interference. IEEE J. Sel. Areas Commun. 28(9), 1380–1408 (2010)
K Karakayali, GJ Foschini, RA Valenzuela, R Yates, On the maximum common rate achievable in a coordinated network. Proceedings of the IEEE International Conference Communications (IEEE, Istanbul, 2006), pp. 4333–4338
Z Keke, RC de Lamare, M Haardt, Multi-branch Tomlinson-Harashima precoding design for MU-MIMO systems: theory and algorithms. IEEE Trans Commun. 62(3), 939–951 (2014)
S Jing, D Tse, J Soriaga, J Hou, J Smee, R Padovani, Multicell downlink capacity with coordinated processing. EURASIP J. Wireless Commun. Netw. 2008, 586878 (2008)
S Liyan, Y Chenyang, H Shengqian, The value of channel prediction in CoMP systems with large backhaul latency. IEEE Trans Commun. 61(11), 4577–4590 (2013)
Papadogiannis, E Hardouin, D Gesbert, Decentralising multicell cooperative processing: a novel robust framework. EURASIP J. Wireless Commun. Netw. 2009, 890685 (2009)
R Zhang, Cooperative multicell block diagonalization with per-base-station power constraints. IEEE J. Sel. Areas Commun. 28, 1435–1445 (2010)
S Shi, M Schubert, N Vucic, H Boche, MMSE optimization with per-base-station power constraints for network MIMO systems. Proceedings of the IEEE International Conference Communications (IEEE, Beijing, 2008), pp. 4106–4110
J Zhang, Y Wu, S Zhou, J Wang, Joint linear transmitter and receiver design for the downlink of multiuser MIMO systems. IEEE Commun Lett 9(11), 991–993 (2005)
RC de Lamare, Adaptive and iterative multi-branch MMSE decision feedback detection algorithms for multi-antenna systems. IEEE Trans. Wirel. Commun. 12(10), 5294–5308 (2013)
H Park, SH Park, HB Kong, I Lee, Weighted sum MSE minimization under per-BS power constraint for network MIMO systems. IEEE Commun. Lett 16(3), 360–363 (2012)
S He, Y Huang, L Yang, B Ottersten, Coordinated multicell multiuser precoding for maximizing weighted sum energy efficiency. IEEE Trans. Signal Process. 62(3), 741–751 (2014)
M Wei, C Xiang, Z Ming, W Jing, Joint stream-wise THP transceiver design for the multiuser MIMO downlink. IEICE Trans. Commun. 92(1), 209–218 (2009)
W Hardjawana, B Vucetic, Y Li, Multi-user cooperative base station systems with joint precoding and beamforming. IEEE J Sel Top Signal Process 3(6), 1079–1093 (2009)
S Adão, H Reza, G Atílio, Power allocation strategies for distributed precoded multicell based systems. EURASIP J. Wireless Commun. Netw. 2011, 1 (2011)
Y Sun, M Wu, M Zhao, C Xu, Transceiver designs using nonlinear precoding for multiuser MIMO systems with limited feedback. Proceedings of the IEEE Vehicular Technology Conference (IEEE, Dresden, 2013), pp. 1–5
L Sun, M Lei, Adaptive joint nonlinear transmit-receive processing for multicell MIMO networks. Proceedings of the IEEE Globe Communications Conference (IEEE, Anaheim, 2012), pp. 3766–3771
R Holakouei, A Silva, A Gameiro, Distributed versus centralized zero-forcing precoding for multicell OFDM systems. Proceedings of the IEEE Globe Communications Conference Workshops (IEEE, Houston, 2011), pp. 188–193
R Zakhour, D Gesbert, Distributed multicell-MISO precoding using the layered virtual SINR framework. IEEE Trans. Wireless Commun. 9(8), 2444–2448 (2010)
E Bjornson, R Zakhour, D Gesbert, B Ottersten, Cooperative multicell precoding: rate region characterization and distributed strategies with instantaneous and statistical CSI. IEEE Trans. Signal Processing 58(8), 4298–4310 (2010)
X Zhao, H Xu, X Yang, Performance enhancement for CoMP based on power allocation and a modified ZF-THP. Proceedings of the IEEE Personal Indoor and Mobile Radio Communications (IEEE, Sydney, 2012), pp. 2309–2313
I Krikidis, B Ottersten, Diversity fairness in Tomlinson–Harashima precoded multiuser MIMO through retransmission. IEEE Signal Process Lett. 20(4), 375–378 (2013)
G Boudreau, J Panicker, N Guo, R Chang, N Wang, S Vrzic, Interference coordination and cancellation for 4G networks. IEEE Commun. Mag. 47(4), 74–81 (2009)
J Tang, S Lambotharan, Interference alignment techniques for MIMO multicell interfering broadcast channels. IEEE Trans. Commun. 61(1), 164–175 (2013)
RFH Fischer, Precoding and signal shaping for digital transmission (John Wiley & Sons Ltd, 2002)
Y Jiang, W Hager, J Li, The generalized triangular decomposition. Math. Comput. 77, 1037–1056 (2007)
A Wiesel, E Yonina, CS Shlomo, Linear precoding via conic optimization for fixed MIMO receivers. IEEE Trans. Signal Process. 54(1), 161–176 (2006)
Acknowledgements
This work is supported by the Special Funding for Beijing Common Construction Project and the Beijing Natural Science Foundation (4144079).
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Hu, Z., Feng, C., Zhang, T. et al. Nonlinear joint transmitreceive processing for coordinated multicell systems: centralized and decentralized. EURASIP J. Adv. Signal Process. 2015, 10 (2015). https://doi.org/10.1186/s1363401501932
DOI: https://doi.org/10.1186/s1363401501932