# An improved block diagonal precoding scheme for MIMO multicast channel with two users

- Xiaodong Xu
^{1}Email author, - Zilong Zhang
^{1}and - Baisheng Du
^{1}

**2014**:39

https://doi.org/10.1186/1687-6180-2014-39

© Xu et al.; licensee Springer. 2014

**Received: **30 July 2013

**Accepted: **18 March 2014

**Published: **26 March 2014

## Abstract

Matrix theory plays an important role in precoding methodology for multiple input multiple output (MIMO) systems. In this paper, an improved block diagonal (BD) precoding scheme is proposed for a MIMO multicast channel with two users, where the unitary precoding matrix is constructed in a block-wise form by joint triangularization decomposition. In order to reduce large signal-to-noise ratios (SNRs) spread across different transmitted data streams and users, the combination of joint equi-diagonal triangularization (JET) and joint geometric mean decomposition (JGMD) is applied to submatrix construction in the inner process of this precoding scheme. An elaborate implementation is presented, and the existence condition of JGMD is also investigated for two complex-valued matrices with two columns, where the analytical result reveals the connection with the particular channel realization and essentially determines when to consider JGMD for submatrix construction. In addition, the properties of the diagonal elements generated by joint triangularization decomposition are discussed as well as the computational complexity of the proposed scheme. Simulation results indicate that in general, JGMD is employed with high probability in the hybrid model, and the proposed scheme readily outperforms the JET scheme in terms of bit error rate (BER) performance in the moderate to high SNR regimes.

## Keywords

## 1 Introduction

Precoding techniques have gained extensive research interests in the literature due to their capability of achieving the ergodic capacity and improving link reliability for wireless communication over multiple input multiple output (MIMO) channels (see, e.g., [1] and the references therein). Under the assumption of full channel state information (CSI) at the transmit side as well as at the receive side, matrix theory, including several decomposition approaches, is often used as a basic tool to facilitate the precoder design and performance analysis for different types of communication scenarios. For instance, singular value decomposition (SVD) probably serves as a prevailing way to construct the capacity-achieving precoder for a single-user scenario by diagonalizing the MIMO channel matrix. To avoid introducing rather complicated bit loading strategies at the transmit side, the precoding scheme in [2] using geometric mean decomposition (GMD) [3, 4] and later the unitary channel decomposition (UCD) scheme [5] and the root-mean-square decomposition (RMSD) scheme [6] are successively proposed to achieve optimum bit error rate (BER) and channel throughput simultaneously.

Another attractive precoding scheme is recently introduced in [7] and [8] for MIMO multicast channels. Relying on the so-called joint equi-diagonal triangularization (JET) or, equivalently, the joint unitary triangularization (JUT) with fixed diagonal ratio, the research effort therein focuses primarily on constructing one common unitary precoding matrix for two users by jointly triangularizing their channel matrices. However, as one can see, the JET scheme usually results in subchannels with vastly different signal-to-noise ratios (SNRs) for every user, and the BER performance is substantially dominated by those who have the smallest gain on the diagonal, provided that the same modulation and coding scheme is used for all subchannels. Intuitively, conjecture that if there exists a joint GMD (JGMD) decomposition of two matrices, then each user can obtain multiple identical subchannels with equal SNRs and the problem arising from JET can be eliminated thereafter. However, the existence condition of such a decomposition is available only for two 2×2 real-valued matrices [9]. To the authors’ best knowledge, the existence condition of JGMD is still unknown for more generalized cases.

- 1.
The application of JGMD is considered, and the sufficient and necessary condition of the existence of JGMD is derived for the case of two complex-valued matrices with two columns. The analytical result reveals the connection with two-antenna channel realization so that JGMD can be conditionally used to perform submatrix construction for the proposed precoding scheme with more than two transmit antennas.

- 2.
We reformulate and elaborately implement the BD precoding scheme by using JET as the unique submatrix construction algorithm, namely the pure BD-JET scheme in this paper, which can be seen as a supplemental work to [8]. New interesting insight into the property of diagonal elements is also shown for the upper triangular matrices.

- 3.
Since JET may introduce large SNRs spread across different transmitted data streams and users, the hybrid usage of JET and JGMD is proposed to construct submatrices in the inner process of the precoding scheme. The existence condition of JGMD is used to determine whether JGMD is employed or not. The computational burden of the proposed precoding scheme is briefly discussed and compared with the JET scheme counterpart. Benefiting from the high probability of using JGMD instead of JET, the proposed scheme has the capability to guarantee a satisfactory BER performance for each user.

The remainder of this paper is structured as follows. Section 2 briefly describes the MIMO multicast channel model. In the next section, the existence of JGMD is studied and the proposed scheme is presented based on the combination of JET and JGMD. Section 4 illustrates several numerical examples to validate the advantage of the proposed scheme over the JET scheme. Finally, Section 5 summarizes this paper.

## 2 Physical-layer multicasting system

*M*transmit antennas, and the

*i*th user is equipped with

*N*

_{ i }≥

*M*,

*i*= 1,2, receive antennas. Assume that linear precoding is employed at the base station with a unitary precoding matrix $\mathbf{P}\in {\mathbb{C}}^{M\times M}$, and a common message $\mathbf{s}\in {\mathbb{C}}^{M}$ will be transmitted through the multicast channel. At the receive side, the

*i*th user receives

where ${\mathbf{H}}_{i}\in {\mathbb{C}}^{{N}_{i}\times M}$ is the channel matrix with full column rank for the *i* th user and ${\mathbf{z}}_{i}\in {\mathbb{C}}^{{N}_{i}}$ is the circularly symmetric Gaussian white noise vector at the *i* th user.

**P**by jointly decomposing {

**H**

_{ i }} as follows:

with ${a}_{i}=\sqrt[M]{\prod _{j=1}^{M}{\sigma}_{i,j}}$, where the sequence ${\left\{{\sigma}_{i,j}\right\}}_{j=1}^{M}$ is the singular value of **H**_{
i
}. **U**_{1} and **U**_{2} are unitary matrices of dimensions *N*_{1} × *N*_{1} and *N*_{2} × *N*_{2}, respectively, **D**_{1} and **D**_{2} are generalized upper triangular matrices with equal diagonal elements, and (·)^{
†
}denotes the conjugate transpose.

*i*th user yields a nulling output by multiplying ${\mathbf{U}}_{i}^{\u2020}$ to both sides of (1), i.e.,

*M*parallel scalar subchannels, which can be expressed as

where ${\u0177}_{i,j}$, *s*_{
j
}, and ${\widehat{z}}_{i,j}$ represent the *j* th entry of the vectors ${\widehat{\mathbf{y}}}_{i}$, **s**, and ${\widehat{\mathbf{z}}}_{i}$, respectively, and *d*_{i,jj}denotes the *j* th diagonal entry of the upper triangular matrix **D**_{
i
}. Assume that the same modulation constellation is used in all the subchannels to reduce the system complexity, which is consistent with the HIPERLAN/2 and IEEE 802.11 standards. As a result, the overall BER performance of the JET scheme will be restricted by the subchannels with the smallest gain.

Notice that the JET precoding scheme explicitly adopts a promising unitary triangularization decomposition approach by (2) to jointly triangularize two matrices with one common unitary matrix. However, large spread of the diagonal elements ${\left\{{d}_{i,\mathit{\text{jj}}}\right\}}_{j=1}^{M}$ can be easily observed even for ordinary channel conditions, which will inevitably lead to the BER performance attenuation. Based on this observation, the motivation of this paper is to develop such a joint triangularization decomposition approach to dramatically reduce the spread or, equivalently, enlarge the smallest diagonal elements. Khina et al. [9] investigated this situation, but unfortunately, the sufficient and necessary condition of JGMD is still unknown not only for the case of more than two users but also for the case of more than two transmit antennas, except for the special case of two real-valued 2×2 matrices.

## 3 The proposed precoding scheme

To meet the requirement of high-speed data transmission, practical multicasting systems are usually expected to involve a large number of transmit antennas at the base station. Therefore, the equivalent channel matrices often contain columns larger than two in general. In particular, when the multicasting system encounters MIMO frequency-selective fading channels for all users, the dimensions of the equivalent channel matrices will dramatically increase, which will cause an inefficient use of JET at least for two aspects [12]: (1) the multiplication complexity equals $\mathcal{O}\left({M}^{2}\right)$ while directly using a full-size unitary precoding matrix, which will increase with the increase of *M*, and (2) the requirement of feedback bits from the receiver will also increase with the increase of *M* for encoding the precoding matrix.

In the sequel, we construct the block diagonal precoder for multicast channels with column dimensions larger than two. In this way, the aforementioned drawbacks will be mitigated to a great extent. Moreover, via matrix segmentation, it is possible to employ JGMD for reduced-dimensional submatrices to achieve better performance. But before that, let us first consider the existence of JGMD in the next subsection.

### 3.1 JGMD for complex-valued matrix case

*M*= 2, without loss of generality, denote the unitary matrix

**P**as

where **p**_{
i
}is the *i* th column vector of **P** and *θ* ∈ [ 0,*π* /2), *ψ* ∈ [ 0,2 *π*). Then the following lemma states the existence condition of JGMD for two complex-valued matrices.

#### Lemma 1.

*Let*e= [

*σ*

_{1,1}

*σ*

_{1,2},

*σ*

_{2,1}

*σ*

_{2,2}]

^{ T }

*be the vector whose elements are products of the singular values of*${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$.

*There exists JGMD for*${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$

*if and only if the following equation:*

*has solutions for real-valued vector*x

*, where*$\mathbf{x}\triangleq {\left[\text{cos}2\theta ,\text{sin}2\theta \text{cos}\psi ,\text{sin}2\theta \text{sin}\psi \right]}^{T}$

*,*$\mathbf{c}={\left[\frac{\parallel {\mathbf{H}}_{1}{\parallel}_{\mathit{F}}^{2}}{2},\frac{\parallel {\mathbf{H}}_{2}{\parallel}_{\mathit{F}}^{2}}{2}\right]}^{T}$, ∥·∥

_{ F }

*stands for the Frobenius norm, and*

*with h*_{i,jk}*represented as the entry of the jth row and kth column of*$\phantom{\rule{2.83795pt}{0ex}}{\mathbf{H}}_{i}^{\u2020}{\mathbf{H}}_{i}$.

#### Proof.

**P**, one can obtain the QR decomposition of

**H**

_{ i }

**P**as follows:

**H**

_{ i }

**p**

_{1}=

**Q**

_{ i }[

*r*

_{i,11},0]

^{ T }with

*r*

_{i,11}denoting the first diagonal element of

**R**

_{ i }. Subsequently, the square form of

*r*

_{i,11}is obtained by

Resorting to the statement of Lemma 1 in [13], the right-hand side of (9) can be transformed into the matrix form of **c**+**G** **x**, while the left-hand side of (9) essentially represents the SNR of the first subchannel for the *i* th user. Therefore, if there exists JGMD for matrix sequence ${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$, then both ${r}_{i,11}^{2}$ and ${r}_{i,22}^{2}$ should be equal to *σ*_{i,1}*σ*_{i,2} from the determinant viewpoint, which means that (6) has solutions, and vice versa.

#### Remark 1.

**g**is chosen from the unit-length basis vectors in the null space of

**G**,

*ρ*is an arbitrary real value to meet the norm constraint of ∥

**x**∥ = 1, and (·)

^{ † }denotes pseudo-inverse. Apparently, the sufficient and necessary condition that Lemma 1 tells is equivalent to the following one:

When the solution **x** is obtained, the corresponding angular parameters *θ* and *ψ* can then be uniquely acquired through the definition. In this case, the unitary matrix **P** is obtained finally.

#### Remark 2.

Consider more general cases of *K* > 2 users in one multicasting system with an extension version of $\mathbf{c}\in {\mathbb{R}}^{K},\mathbf{e}\in {\mathbb{R}}^{K}$, and $\mathbf{G}\in {\mathbb{R}}^{K\times 3}$, respectively. It is clear from (6) that no proper solution $\mathbf{x}\in {\mathbb{R}}^{3}$ will exist with probability 1. Therefore, we only focus on precoding for multicasting systems with two users. For a complex-valued matrix with column dimension *M* > 2, the existence condition of JGMD is still an open issue.

### 3.2 BD-JET decomposition with discussions about diagonal elements

With the merit of Lemma 1, we are now dedicated in jointly decomposing two matrices into upper triangular forms with equal elements on each diagonal. Before proceeding with the design procedure, we are willing to reformulate BD-JET carefully with full implementation for reasons of completeness, which is not explicitly described in [8]. The following lemma indicates the principle of this decomposition approach.

#### Lemma 2.

*Full and equal column rank matrices*${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$

*can be jointly triangularized into B blocks as follows:*

*where* Q_{
i
}*is an N*_{
i
}× *M semi-unitary matrix,* P *is an M* × *M block diagonal matrix of the form diag* (*P*_{1},*P*_{2},…,*P*_{
B
})*, where each individual block* P_{
j
}*is a b*_{
j
}× *b*_{
j
}*unitary matrix and*$\sum _{j=1}^{B}{b}_{j}=M$. ${\left\{{\mathbf{T}}_{i}\right\}}_{i=1}^{2}$*are M* × *M upper triangular matrices with equal diagonal ratio for each block.*

#### Proof.

by appropriate rearrangement, where **U**_{i ,r}is an *N*_{
i
}× (*N*_{
i
}- *M*) matrix that contains the last *N*_{
i
}- *M* columns of **U**_{i,JET}, and **U**_{i,1},**D**_{i,1} have different dimensions of *N*_{
i
}× *M* and *M* × *M* from those of **U**_{i,JET} and **D**_{i,JET}, respectively. For clarification, the arguments with ‘JET’ suffix denote that they are generated by the regular-form JET.

**X**

_{ i },

**X**

_{i,1}contains the first

*b*

_{1}columns of

**X**

_{ i }, while

**X**

_{i,r}contains the rest of the columns. For the first block, the alternative-form JET via (13) can be used to manipulate ${\left\{{\mathbf{H}}_{i,1}\right\}}_{i=1}^{2}$ as

From (15), it is reasonable to obtain the exact solution of **P**_{1},**T**_{i,1}, and **Q**_{i,1} by setting **P**_{1} = **V**_{1,JET}, **T**_{i,1}= *α*_{i,1}**D**_{i,1}, and **Q**_{i,1}= **U**_{i,1}, respectively. And the diagonal ratio between **T**_{1,1} and **T**_{2,1} is fixed by $\frac{{\alpha}_{1,1}}{{\alpha}_{2,1}}$.

**Q**

_{i,1}and

**Q**

_{i,r}, the remainder of

**H**

_{ i }satisfies that

which has the same decomposition pattern as that of **H**_{i,1}. Thus, we can continue the above operation recursively until all the *B* blocks have been factorized. Consequently, **Q**_{
i
}, **T**_{
i
}, and **P** consist of the output submatrices from every iteration, and finally, (12) is reached.

#### Remark 3.

Lemma 2 allows for a block diagonal decomposition approach, named as BD-JET, to triangularize two matrices simultaneously. The proof is constructive and results in a detailed decomposition procedure, which is not pointed out in [8]. BD-JET can also be seen as a relaxed application of JET since JET is employed in the inner iteration process of BD-JET. Moreover, if there is only one block instead, i.e., *B* = 1, then BD-JET will be reduced to JET. And if *B* = *M*, as a result, the unitary matrix **P** will degrade to an identity matrix. However, there are at least two evidences that state the differences between BD-JET and JET. The first one is that in JET mode and in BD-JET mode, the unitary matrix **P** may have a different structure, except for the same matrix dimension. BD-JET possesses a block diagonal matrix **P** which contains several unitary submatrices. The second one lies in that in JET mode, the ratio of the diagonal elements of **D**_{
i
}is fixed; nevertheless, in BD-JET mode, the ratio of the diagonal elements of **T**_{
i
}, as mentioned in the next remark, is block-equal and not identical between blocks in general.

Let ${\mathbf{H}}_{i,k}={\mathbf{Q}}_{i,k}{\mathbf{T}}_{i,k}{\mathbf{P}}_{k}^{\u2020}$ denote the BD-JET results of the first $\sum _{j=1}^{k}{b}_{j}$ columns of matrices ${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$. The following corollary describes the property of the *k* th block diagonal elements of **T**_{i,k}, defined as $\left[\begin{array}{c}{t}_{i,k,1},{t}_{i,k,2},\dots ,{t}_{i,k,{b}_{k}}\end{array}\right]$.

#### Corollary 1.

*k*th block diagonal entries of the upper triangular matrices ${\left\{{\mathbf{T}}_{i,k}\right\}}_{i=1}^{2}$ have the following property:

where ${a}_{i,k}=\text{det}\left(\underset{i,k}{\overset{\u2020}{\mathbf{H}}}{\mathbf{H}}_{i,k}\right)$.

#### Proof.

*k*= 1, it is apparent that ${a}_{i,1}=\prod _{j=1}^{{b}_{1}}{t}_{i,1,j}^{2}$. For the case of

*k*> 1, according to ${\mathbf{H}}_{i,k}={\mathbf{Q}}_{i,k}{\mathbf{T}}_{i,k}{\mathbf{P}}_{k}^{\u2020}$, it holds that

In other words, the product of the *k* th block diagonal entries of the upper triangular matrices **T**_{i,k}is characterized by (17).

#### Remark 4.

From Corollary 1, the ratio of the *k* th block elements between the upper triangular matrices {**T**_{
i
}} holds for ${r}_{k}=\sqrt[2{b}_{k}]{\frac{{a}_{1,k}{a}_{2,k-1}}{{a}_{1,k-1}{a}_{2,k}}}$. In fact, *r*_{
k
}may be controlled inherently by determinants of the current block as well as the former one. As for JET, the ratio would be kept as ${r}_{\mathit{\text{JET}}}=\sqrt[2M]{\frac{{a}_{1,B}}{{a}_{2,B}}}$ for the whole diagonal elements. For the case of *B* = 1, *r*_{1} = *r*_{
JET
}holds true intuitively. However, as *B* increases, this relationship is invalid in general except that {**H**_{
i
}} have special matrix structures. In addition, we note that the ratio *r*_{
k
}majorizes *r*_{
JET
}by majorization theory [14], and it is proved by the numerical evaluations in the next section that **T**_{
i
}has relatively smaller gain spread in the diagonal than **D**_{
i
}with high probability, which will lead to an improved smallest gain for the worst subchannels.

*b*

_{ k }columns of

**H**

_{i,k}, we can use the properties of the block matrix determinant to further get the following relationship:

### 3.3 The proposed scheme

Although the BD-JET decomposition has no specific constraint on the size of submatrices, the use of JET may still produce large gain spread across different subchannels in the decomposition procedure. In this subsection, we put forward an improved block diagonal precoding scheme for multicasting system with two users, where the unitary precoding matrix is constructed based on a combination of JET and JGMD. Actually, JGMD will act as the major decomposition approach for every two submatrices, thanks to the high existence probability, which is validated by numerical evaluations in the next section. Algorithm 1 summarizes the details of the proposed scheme.

*Remark 5.* Bearing in mind that the existence of JGMD is questionable for higher dimensions (i.e., for more than two transmit antennas), all the variables ${\left\{{b}_{j}\right\}}_{j=1}^{B}$ should have equal values of two for matrices ${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$ who have even number of columns. For the case of odd number of columns in ${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$, all the variables {*b*_{
j
}} still have values of two, but the last one where ${b}_{B}=M-\sum _{j=1}^{B-1}{b}_{j}$ is guaranteed with $B=\u2308\frac{M-1}{2}\u2309$. And JET is automatically assigned for the joint triangularization decomposition of the last block in the current situation, which is omitted in Algorithm 1 for concise clarification purposes. Besides, in contrast with the proposed scheme, we rename the aforementioned BD-JET scheme including JET as the unique algorithm for submatrix construction as the pure BD-JET scheme hereafter.

#### Remark 6.

*N*

_{1}=

*N*

_{2}=

*N*and each block has equal size, i.e.,

*b*

_{ j }=

*L*,∀

*j*. Then the computational complexity of the proposed scheme can be approximately figured out in terms of the number of flops as

for the same ${\left\{{\mathbf{H}}_{i}\right\}}_{i=1}^{2}$. As can be seen from (21) and (22), the computational complexity of the proposed scheme is somewhat larger than the JET counterpart since typically, *L* < *M*. Nevertheless, due to the bock diagonal decomposition, the resulting block diagonal precoder **P** can reduce the multiplication complexity from $\mathcal{O}\left({M}^{2}\right)$ to $\mathcal{O}\left(\frac{{M}^{2}}{B}\right)$ in general.

## 4 Simulation results

In this section, several numerical examples are performed to demonstrate the efficiency of the proposed scheme. A single-group multicasting system with two users is employed and without loss of generality, the number of receive antennas for each user is set equal to each other, i.e., *N*_{1} = *N*_{2} = *N*. Transmit symbols are commonly chosen from quadrature phase shift keying (QPSK) constellation throughout this paper.

### Example 1

*N*× 2 independent and identically distributed (i.i.d) Rayleigh fading channels, Table 1 establishes the existence probability of JGMD versus

*N*, where it is clear that JGMD exists in most cases, especially when

*N*has large values.

**Probability of the existence of JGMD**

N | |||||||
---|---|---|---|---|---|---|---|

2 | 3 | 4 | 5 | 6 | 7 | 8 | |

Probability | 0.695 | 0.829 | 0.881 | 0.907 | 0.928 | 0.936 | 0.949 |

*N*=

*M*, Table 2 shows the possibility that whether or not the smallest diagonal elements caused by the proposed scheme are greater than the pure BD-JET case. From this table, it appears that the proposed scheme always has high probability to produce subchannels with better SNRs than the pure BD-JET. Since JGMD is essentially unused for the case of

*N*=

*M*≤ 3, which means that the proposed scheme is equivalent to the pure BD-JET one in this case, therefore we do not consider this situation in this table. Assuming

*N*=

*M*, more evidences can be observed in Table 3, where the multicasting system suffers from MIMO frequency-selective channels with

*μ*fading paths.

**Probability of improved smallest gain for MIMO frequency-flat channels**

N=M | |||||
---|---|---|---|---|---|

4 | 5 | 6 | 7 | 8 | |

Probability | 0.861 | 0.861 | 0.862 | 0.862 | 0.863 |

**Probability of improved smallest gain for frequency-selective MIMO channels**

μ | |||||||
---|---|---|---|---|---|---|---|

2 | 3 | 4 | 5 | 6 | 7 | 8 | |

Probability | 0.973 | 0.975 | 0.977 | 0.981 | 0.984 | 0.986 | 0.987 |

### Example 2.

*M*=

*N*= 8,

*B*= 4, and

*b*

_{ j }= 2,∀

*j*. Figure 1 depicts the comparative curves of average BER performance of each user versus SNR. As can be seen from this figure, each user almost has the same BER property, and the proposed scheme achieves superior performance over the JET scheme and the pure BD-JET scheme in moderate to high SNR regimes. Due to the unitary precoding, the proposed scheme is expected to have the same throughput as that of the JET scheme for the multicasting system. However, the smallest gain of the worst subchannels is improved by the proposed scheme at the cost of a small increase in computational complexity. The resulting performance gap implies the effective application of JGMD as the major decomposition approach for the inner process of the proposed scheme.

Notice that the unitary precoder **P** consists of four 2 × 2 unitary submatrices in this case. Therefore, it is clear that the block diagonal structure of **P** can reduce matrix multiplication complexity from $\mathcal{O}\left({M}^{2}\right)$ to $\mathcal{O}\left(\frac{{M}^{2}}{4}\right)$ in comparison with the JET scheme.

### Example 3.

Another favorite multicast scenario is about the block transmission through MIMO frequency-selective fading channels with *μ* distinguishable paths. In this example, the system uses a zero-padded transmission mechanism, which means that the system will transmit *μ* zero vectors after *τ* symbol vectors in every *τ* + *μ* symbol duration. In fact, we can acquire the (*τ* + *μ*) *N* × *τ* *M* equivalent channel matrix {**H**_{
i
}} in block Toeplitz structure in the current situation.

*M*=

*N*= 2,

*μ*= 4 effective paths, and

*τ*= 4 symbol block size. In this case, the equivalent channel matrix actually has a dimension of 16 × 8. Similar behaviors manifest that the proposed scheme has reliable performance better than the JET scheme in moderate to high SNR regimes. The pure BD-JET scheme is also examined and plotted in this figure. It is shown that in contrast to the pure BD-JET scheme, the proposed one can still yield a performance improvement about 1 dB in the high SNR case. The BER performance versus the symbol block size

*τ*is also shown in Figure 3 at SNR = 15 dB. It is clear from this figure that although the bandwidth efficiency increases as

*τ*increases, the BER performance degrades a lot for the proposed scheme as well as for the JET scheme. Similarly, performance properties have also been mentioned for the BD-GMD scheme in [12] for unicast scenarios.

## 5 Conclusions

In this paper, we present an improved block diagonal precoding scheme to construct a unitary precoder for a MIMO multicast channel with two users by using joint triangularization decomposition. An elaborate implementation of the pure BD-JET scheme is provided by means of JET. Although the pure BD-JET scheme achieves relative advantages without an explicit constraint on the size of submatrices, the use of JET may suffer from large SNR spread across different transmit data streams and users. Fully aware of the existence condition of JGMD for two transmit antennas, we introduce the hybrid application model of JGMD and JET to construct submatrices in the inner process of the precoding scheme. The high existence probability of JGMD is proved by numerical results so that in the proposed scheme, JGMD is observed to be the major decomposition algorithm for submatrix construction, as compared with the JET counterpart. Besides the analytical result of the computational complexity, properties of the diagonal elements are also derived to gather useful insights into the proposed scheme. Several numerical examples demonstrate that the proposed scheme can guarantee satisfactory BER performance over the JET scheme in moderate to high SNR regimes, especially when the MIMO multicast system is equipped with large-scale antennas.

## Declarations

### Acknowledgements

This work was supported by the National Natural Science Foundation of China under grant number 61271272, the National Science and Technology Special Projects of China under grant number 2012ZX03001007-002, and the National High Technology Research and Development Program of China (863 Program) under grant number 2012AA01A502. The authors would like to acknowledge Prof. Xuchu Dai for his valuable comments and suggestions.

## Authors’ Affiliations

## References

- Vu M, Paulraj A: MIMO wireless linear precoding.
*IEEE Signal Process. Mag*2007, 24(5):86-105.View ArticleGoogle Scholar - Jiang Y, Hager W, Li J: The geometric mean decomposition.
*Linear Algebra Appl.*2005, 396: 373-384.MathSciNetView ArticleMATHGoogle Scholar - Kosowski P, Smoktunowicz A: On constructing unit triangular matrices with prescribed singular values.
*Computing*2000, 64(3):279-285. 10.1007/s006070050047MathSciNetView ArticleMATHGoogle Scholar - Zhang J-K, Kavcic A, Ma X, Wong KM: Design of unitary precoders for ISI channels.
*Proc. IEEE Intern. Conf. on Acous. Speech and Signal Processing*(Orlando, FL, USA, 13–17 May 2002), pp. 2265–2268Google Scholar - Jiang Y, Hager W, Li J: Uniform channel decomposition for MIMO communications.
*IEEE Trans. Signal Process*2005, 53(11):4283-4294.MathSciNetView ArticleGoogle Scholar - Hwang T, Kwon Y: Root mean square decomposition for EST-based spatial multiplexing systems.
*IEEE Trans. Signal Process*2012, 60(3):1295-1306.MathSciNetView ArticleGoogle Scholar - Khina A, Kochman Y, Erez U: Decomposing the MIMO broadcast channel.
*Proc. 48th Annual Allerton Conf. on Commun., Control and Computing*(Allerton, IL, USA, 29 Sept–1 Oct 2010), pp.102–106Google Scholar - Khina A, Kochman Y, Erez U: Joint unitary triangularization for MIMO networks.
*IEEE Trans. Signal Process*2012, 60(1):326-336.MathSciNetView ArticleGoogle Scholar - Khina A, Hitron A, Erez U: Modulation for MIMO networks with several users.
*Proc. IEEE International Symposium on Information Theory*(St. Petersburg, Russia, 31 July–5 Aug 2011), pp. 2452–2456Google Scholar - Lin S, Ho WL, Liang YC: Block diagonal geometric mean decomposition (BD-GMD) for MIMO broadcast channels.
*IEEE Trans. Wireless Commun*2008, 7(7):2778-2789.View ArticleGoogle Scholar - Foschini GJ, Golden GD, Valenzuela RA, Wolniansky PW: Simplified processing for high spectral efficiency wireless communication employing multiple-element arrays.
*Wireless Personal Commun*1999, 6: 311-335.View ArticleGoogle Scholar - Weng C-C, Vaidyanathan PP: Block diagonal GMD for zero-padded MIMO frequency selective channels.
*IEEE Trans. Signal Process*2011, 59(2):713-727.MathSciNetView ArticleGoogle Scholar - Du B, Jiang Y, Xu X, Dai X: Transmit beamforming for MIMO multicast channels.
*Proc. IEEE International Conference on Commun*(Ottawa, Cannada, 10–15 June 2012), pp. 3800–3805Google Scholar - Palomar DP, Jiang Y: MIMO transceiver design via majorization theory.
*Found. Trends Commun. Inform. Theory*2006, 3: 331-551. 10.1561/0100000018View ArticleMATHGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.