Channel estimation for MIMO multi-relay systems using a tensor approach

Han, Xi; de Almeida, André LF; Yang, Zhen

doi:10.1186/1687-6180-2014-163

Research
Open access
Published: 17 November 2014

Channel estimation for MIMO multi-relay systems using a tensor approach

Xi Han¹,
André LF de Almeida² &
Zhen Yang³

EURASIP Journal on Advances in Signal Processing volume 2014, Article number: 163 (2014) Cite this article

2394 Accesses
26 Citations
Metrics details

Abstract

In this paper, we address the channel estimation problem for multiple-input multiple-output (MIMO) multi-relay systems exploiting measurements collected at the destination only. Assuming that the source, relays, and destination are multiple-antenna devices and considering a three-hop amplify-and-forward (AF)-based training scheme, new channel estimation algorithms capitalizing on a tensor modeling of the end-to-end communication channel are proposed. Our approach provides the destination with the instantaneous knowledge of all the channel matrices involved in the communication. Instead of using separate estimations for each matrix, we are interested in a joint estimation approach. Two receiver algorithms are formulated to solve the joint channel estimation problem. The first one is an iterative method based on a trilinear alternating least squares (TALS) algorithm, while the second one is a closed-form solution based on a Kronecker least squares (KRLS) factorization. A useful lower-bound on the channel training length is derived from an identifiability study. We also show the proposed tensor-based approach is applicable to two-way MIMO relaying systems. Simulation results corroborate the effectiveness of the proposed estimators and provide a comparison with existing methods in terms of channel estimation accuracy and bit error rate (BER).

1 Introduction

Cooperative communications have been considered as a promising concept to improve the link performance in modern wireless communication systems due to spatial diversity gains, enhanced coverage, and increased capacity [1–4]. In this context, relaying has been commonly accepted as a key technique to improve system performance by overcoming channel impairments, such as fading, shadowing, and path loss, in wireless fading channel environments [4–6]. By resorting to relay-assisted cooperation, multiple wireless links between mobile stations and base stations are established to create a virtual multiple-input multiple-output (MIMO) system [7]. In the simplest relay processing strategy, the relay stations amplify and forward the received data towards the base station. In this work, we adopt amplify-and-forward (AF) relaying due to its simplicity of implementation [5]. This strategy is preferable when fixed relay stations have a limited computation capacity as opposed to the base station.

The overall link reliability of cooperative diversity schemes strongly depends on the accuracy of channel state information (CSI) associated with the multiple hops involved in the overall communication. Moreover, the use of common precoding techniques at the source and/or relays generally requires instantaneous CSI knowledge of the different channels to optimize transmission [8, 9]. In practice, however, the CSI is unknown and has to be estimated with the aid of training sequences [10, 11]. For two-hop relaying systems, the associated channel matrices can be estimated in separate LS estimation stages that operate sequentially at the destination [10]. When the communication involves additional hops, such a sequential LS estimation approach still applies by using additional transmission phases. The main problem is that channel estimation errors accumulate across the consecutive stages. In [11], a closed-form solution was proposed for the joint estimation of the channel matrices in a two-hop MIMO relaying system, avoiding error propagation.

A few recent works have developed efficient receiver algorithms based on tensor analysis for channel estimation and/or symbol detection in cooperative systems [12–16]. In [12], a training sequence-based channel estimation algorithm is proposed for two-way relaying systems with multiple antennas at the relays. Recently [14], a channel estimation algorithm based on parallel factor (PARAFAC) model [17, 18] was developed for two-hop MIMO relay systems. The approach allows estimation of the channel matrices associated with both hops by resorting to training sequences. Other few recent works have developed tensor-based receivers for one-way two-hop cooperative systems [13, 15, 16]. In particular, the approach of Ximenes et al. [16] assumes a Khatri-Rao space-time (KRST) coding [19] at the source node, and a semi-blind receiver is proposed by assuming the existence of a direct link between the source and the destination.

The approach of Roemer and Haardt and Rong et al. [12, 14] allows a joint estimation of the channel matrices by resorting to training sequences. With the idea of avoiding the use of training sequences at the users’ and relays’ transmissions, the work [13] proposed a blind receiver for uplink multiuser cooperative diversity systems based on a PARAFAC model for the received signal. However, [13] is limited to a clustered relaying scenario, where relays belonging to the same cluster have the same spatial signature. The common feature of all these works is on the assumption of only two hops (source-to-relays and relays-to-destination). To further extend the coverage area and combat channel impairments such as path-loss and shadowing, it may be advantageous to introduce an additional hop along with an extra communication phase by means of three-hop relaying [5]. We highlight that the interest of the proposed work is on the joint channel estimation problem (i.e., joint channel and symbol estimation is not addressed here). The joint channel estimation problem was addressed in [12] for a two-way relaying system and in [14] for a one-way two-hop system. From a tensor modeling viewpoint, the common feature of both works is on the use of the PARAFAC model. Herein, we focus on a one-way three-hop multi-relay system, while resorting to a PARATUCK2 model to derive the proposed algorithms.

In this work, novel channel estimators are proposed for MIMO multi-relay systems. Assuming that the source, relays, and destination are multiple-antenna devices and considering a three-hop AF-based training scheme, new channel estimation algorithms capitalizing on a multi-linear structure of the end-to-end communication channel are proposed. The proposed approach is based on a PARATUCK2 tensor model [20] of the data collected at the destination only, which allows the channel matrices to be jointly estimated at the destination. Two receiver algorithms are formulated to solve the channel estimation problem. The first one is an iterative channel estimation method based on a trilinear alternating least squares (TALS) algorithm derived from a PARATUCK2 tensor model of the received data, while the second one is a closed-form solution based on a Kronecker least squares (KRLS) factorization. The proposed approach provides an extension of the idea recently proposed in [14] to a more general scenario with two-tier relaying using MIMO AF relays. Identifiability of the channel matrices is also examined in this work, and a useful lower-bound on the channel training length is derived. In contrast to conventional pilot-assisted LS channel estimation, where the channel matrices are estimated separately in consecutive stages, our proposed algorithms make a more efficient use of cooperative diversity by providing a joint estimation of all the channel matrices. As will be clear later, such a joint channel estimation is possible due to the use of the tensor approach to model the end-to-end system.

In comparison with conventional (multi-stage) LS channel estimation [10], the proposed tensor-based estimators have two distinguishing features: i) they avoid accumulation of channel estimation errors since all the channel matrices are estimated simultaneously (either iteratively or in closed-form), and ii) they can operate under less restrictive (and more flexible) conditions on the required number of antennas at the relays and/or destination, as will be clear from our identifiability analysis. Our approach also includes the PARAFAC-based channel estimator of [14] as a particular case. We also show that the proposed tensor modeling approach copes with a two-way MIMO multi-relaying communication system, where the TALS and KRLS channel estimators can be applied.

This paper is organized as follows. In section 2, the system model and working assumptions are described. Section 3 formulates the proposed approach. The data model is recast using tensor analysis, and the two channel estimation algorithms (TALS and KRLS) are derived. Identifiability of the channel matrices is also examined in this section. In section 4, we provide an extension of the proposed tensor-based signal model to a two-way MIMO relaying scenario. Numerical results are presented and discussed in section 5, and the conclusions are drawn in section 6.

Notation: Scalars are denoted by lowercase letters (a,b,…), vectors as lowercase boldface letters (a,b,…), matrices as uppercase boldface letters (A,B,…), and tensors as calligraphic letters $(A, ℬ, \dots)$ . A^T and A^† stand for transpose and pseudo-inverse of A, respectively. To retrieve the element (i,j) of A, we use a(i,j). The i th row of $A \in C^{I \times R}$ is denoted as A_(i,:) while its r th column is denoted by A_(:,r). The operator Di(A) forms a diagonal matrix out of the i th row of A. The Khatri-Rao (columnwise Kronecker) product between $A \in C^{I \times R}$ and $B \in C^{J \times R}$ , i.e., $A ◇ B = [A_{(:, 1)} \otimes B_{(:, 1)}, \dots, A_{(:, R)} \otimes B_{(:, R)}] \in C^{IJ \times R}$ .

2 System model

We consider a three-hop MIMO AF communication system where the source node transmits information to the destination node with the aid of R₁ relays in the first tier and R₂ relays in the second tier. As shown in Figure 1, the source and destination nodes are equipped with N_s≥2 and N_d≥2 antennas, respectively, and half-duplex relays are considered. The q th relay of tier 1, which receives data from the source node, is equipped with I_q antennas, q=1,…,R₁, while the p th relay of tier 2, which receives data from tier 1 relays, is equipped with J_p antennas, p=1,…,R₂. The total number of antennas that transmit in second and third phases are denoted by $N_{1} = I_{1} + \dots + I_{R_{1}}$ and $N_{2} = J_{1} + \dots + J_{R_{2}}$ , respectively.

Some key assumptions are now given: (i) relays are synchronized at the symbol level. More specifically, the timing offset is assumed to be within one symbol period, so that timing information is acquired only through some form of (rough) coarse synchronization; (ii) fading is assumed to be frequency flat, and the data block size is smaller than the channel coherence time so that the channel is considered as time invariant; (iii) the direct links between the source (resp. tier 1 relays) and the destination node are not available^a. This situation is evidenced in the current uplink of IEEE 802.16j.

2.1 Data model

The communication between source and destination is accomplished in three hops. In the first hop, the modulated signal vector $u_{s} (t) \in C^{N_{s} \times 1}$ is transmitted to R₁ relays. The received signal at the q th relay of tier 1 can be written as

y_{sr}^{(q)} (t) = H_{sr}^{(q)} u_{s} (t) + v_{sr}^{(q)} (t)

(1)

where $y_{sr}^{(q)} (t) \in C^{I_{q} \times 1}$ is the received signal vector at the q th relay of tier 1, $H_{sr}^{(q)} \in C^{I_{q} \times N_{s}}$ is the MIMO channel between the source and the q th tier 1 relay, and $v_{sr}^{(q)} (t) \in C^{I_{q} \times 1}$ is an additive noise vector. Noise samples are modeled as independent and identically distributed complex Gaussian random variables with zero mean and unit variance.

In the second hop, the source stops transmission and all the R₁ relays of tier 1 amplify their received signals with diagonal AF matrices $G^{(1)}, \dots, G^{(R_{1})}$ and simultaneously forward the resulting signals to the tier 2 relays. The received signal vector at the p th relay of tier 2 is then given by

y_{rr}^{(p)} (t + 1) = \sum_{q = 1}^{R_{1}} H_{rr}^{(p, q)} G^{(q)} y_{sr}^{(q)} (t) + v_{rr}^{(p)} (t + 1)

(2)

where $H_{rr}^{(p, q)} \in C^{J_{p} \times I_{q}}$ is the MIMO channel linking the R₁ tier 1 relays to R₂ tier 2 relays, while $v_{rr}^{(p)} (t + 1) \in C^{I_{p} \times 1}$ denotes the corresponding noise vector. In the third hop, the source and all tier 1 relays are silent, while the tier 2 relays process the received signal vector with the diagonal AF matrices $J^{(1)}, \dots, J^{(R_{2})}$ and forward their amplified signals to the destination. The received signal vector at the destination is then given by

y_{rd} (t + 2) = \sum_{p = 1}^{R_{2}} H_{rd}^{(p)} J^{(p)} y_{rr}^{(p)} (t + 1) + v_{rd} (t + 2),

(3)

where $H_{rd}^{(p)} \in C^{N_{d} \times J_{p}}$ is the MIMO channel linking the p th tier 1 relay to the destination, and $v_{rd} (t + 2) \in C^{N_{d} \times 1}$ the corresponding additive noise term.

Let us define the multi-relay (block) channel matrices

\begin{array}{lcr} H_{rd} & ≐ & [H_{rd}^{(1)}, \dots, H_{rd}^{(R_{2})}] \in C^{N_{d} \times N_{2}}, \end{array}

(4)

\begin{array}{lcr} H_{rr} & ≐ & [\begin{matrix} H_{rr}^{(1, 1)} & \dots & H_{rr}^{(1, R_{1})} \\ ⋮ & ⋮ & ⋮ \\ H_{rr}^{(R_{2}, 1)} & \dots & H_{rr}^{(R_{2}, R_{1})} \end{matrix}] \in C^{N_{2} \times N_{1}}, \end{array}

(5)

\begin{array}{lcr} H_{sr}^{T} & ≐ & [H_{sr}^{(1) T}, \dots, H_{sr}^{(R_{1}) T}] \in C^{N_{s} \times N_{1}}, \end{array}

(6)

and let $G ≐ bdiag [G^{(1)}, \dots, G^{(R_{1})}] \in C^{N_{1} \times N_{1}}$ and $J ≐ bdiag [J^{(1)}, \dots, J^{(R_{2})}] \in C^{N_{2} \times N_{2}}$ be the two diagonal matrices that collect the AF coefficients of the overall multi-relay system. Using these definitions, and using (1) and (2), we can rewrite (3) as follows:

y_{rd} (t + 2) = H_{rd} J H_{rr} G H_{sr} u_{s} (t) + {\bar{v}}_{rd} (t + 2),

(7)

where ${\bar{v}}_{rd} (t + 2) = {\bar{v}}_{sr} (t) + {\bar{v}}_{rr} (t + 1) + v_{rd} (t + 2)$ is the total noise at the destination, which contains the filtered noise contributions from the multiple relays, with ${\bar{v}}_{sr} (t) = H_{rd} J H_{rr} G v_{sr}$ , ${\bar{v}}_{rr} (t + 1) = H_{rd} J v_{rr} (t + 1)$ , $v_{rr} (t + 1) ≐ {[v_{rr}^{(1) T} (t + 1), \dots, v_{rr}^{(R_{2}) T} (t + 1)]}^{T} \in C^{N_{2} \times 1}$ , $v_{sr} (t) ≐ {[v_{sr}^{(1) T} (t), \dots, v_{sr}^{(R_{1}) T} (t)]}^{T} \in C^{N_{1} \times 1}$ .

Note that, since this work is concerned with channel estimation, the AF matrices G and J cannot be optimized at the transmission (source and relays). Therefore, for simplicity, we have assumed that these matrices are diagonal. The use of non-diagonal AF matrices in the proposed approach is left for a future work. Note also that, once the channels are estimated, the design of full AF matrices can be done, e.g., based on the SVD of the channel matrices, following the idea of [9] or on the mean-square error (MSE) criterion [21]. If simplified AF schemes are used, where only power allocation is done, G and J are diagonal matrices, the coefficients of which can be designed as a function of the mean channel and noise powers [5] or optimized from power allocation strategies, as shown recently in [22].

2.2 Conventional LS estimation method

The simplest approach to estimate the effective channel H_{e
f
f}=H_rdJ H_rrG H_sr (including the amplifying factors) is based on training sequences. If separate estimations of the multi-relay channels H_rd, H_rr, and H_sr are required, for instance, to optimize the source precoding matrix and the relays’ AF matrices, three separate LS estimation stages should operate sequentially at the destination. The method would work similarly to that of Kong and Hua [10]. Denote $S_{0} \in C^{N_{s} \times L_{0}}$ as the training sequence matrix sent by the source node, while $S_{1 d} \in C^{N_{1} \times L_{1}}$ and $S_{2 d} \in C^{N_{2} \times L_{2}}$ are the training sequence matrices sent by the relays at tiers 1 and 2, respectively. Assume that orthogonal training sequences are used in all stages, which implies training sequences of length L₀≥N_s, L₁≥N₁ and L₂≥N₂ at the source, tier 1 and tier 2 relays, respectively. In the first stage, S_2d is transmitted from all tier 2 relays to the destination. The LS estimate of H_rd is obtained as

{\hat{H}}_{rd} = Y_{1} S_{2 d}^{H},

(8)

where $Y_{1} \in C^{N_{d} \times L_{2}}$ is the received signal matrix at the destination during the first training stage. In the second stage, S_1d is transmitted from all tier 1 relays to the destination via AF processing at the tier 1 relays. Defining $Y_{2} \in C^{N_{d} \times L_{1}}$ as the data received from tier 1 relays at the second training stage, an LS estimate of H_rr can be obtained as

{\hat{H}}_{rr} = {({\hat{H}}_{rd} J)}^{†} Y_{2} S_{1 d}^{H} .

(9)

Finally, S₀ is transmitted from the source to the destination via the two tiers of relays. The destination collects the received data in $Y_{3} \in C^{N_{s} \times L_{0}}$ . An estimate of ${\hat{H}}_{sr}$ is then found as

{\hat{H}}_{sr} = {({\hat{H}}_{rd} J {\hat{H}}_{rr} G)}^{†} Y_{3} S_{0}^{H} .

(10)

This method requires 6 transmission phases to provide the destination with all the channel matrices (1 phase for estimating H_rd, 2 phases for estimating H_rr and 3 phases for estimating H_sr). Note that the channel estimation errors accumulate across the consecutive stages, due to the dependency between successive channel estimates. Moreover, this method requires N_d≥N₂≥N₁ for the uniqueness of the LS estimates of ${\hat{H}}_{rr}$ and ${\hat{H}}_{sr}$ . In the following, we adopt a different path to solve this problem by capitalizing on tensor analysis. The idea is to provide the destination with a joint estimate of all the partial channels H_rd, H_rr, and H_sr by exploiting the tensor structure of the end-to-end signal model. The proposed approach allows channel estimation to be performed under less restrictive conditions on the number N_d of receive antennas at the destination compared with the conventional LS estimator, while avoiding error accumulation.

3 Proposed approach

In order to derive the proposed channel estimators, we first recast the formulation of the system model by resorting to multi-way (tensor) analysis. First, let us divide the overall training period into K time blocks. In every time block, the same training sequence matrix $S_{0} \in C^{N_{s} \times L_{0}}$ is transmitted by the source node. In the k th time block, the relays of tiers 1 and 2 use the AF matrices G_k and J_k, respectively, k=1,…,K. Let us define $E \in C^{K \times N_{1}}$ and $F \in C^{K \times N_{2}}$ as channel training matrices such that $D_{k} (E) ≐ G_{k}$ and $D_{k} (F) ≐ J_{k}$ , where $D_{k} (\cdot)$ forms a diagonal matrix out of the k th row of its matrix argument. Otherwise stated, the rows of E (resp. F) hold the AF coefficients of the R₁ (resp. R₂) relays associated with the different time blocks. Then, the signal received at the destination during the k th time block can be written as:

\begin{array}{lcr} y_{k} = H_{rd} D_{k} (F) H_{rr} D_{k} (E) H_{sr} S_{0} + V_{k}, \\ k = 1, \dots, K, \end{array}

(11)

where $V_{k} = H_{rd} D_{k} (F) H_{rr} D_{k} (E) V_{sr, k} + H_{rd} D_{k} (F) V_{rr, k}$ , $v_{sr, k} \in C^{N_{1} \times L_{0}}$ is the noise matrix at the relays during the k th time block, $v_{rr, k} \in C^{N_{2} \times L_{0}}$ is the noise matrix at the second hop relays for the k-th time block, and $V_{rd, k} \in C^{N_{d} \times L_{0}}$ is the noise matrix at the destination for the k th time block.

Regarding the structure of the channel training matrices $E \in C^{K \times N_{1}}$ and $F \in C^{K \times N_{2}}$ , unless otherwise stated, their columns are chosen as length-K random sequences following a uniform distribution between [-1, 1]. These sequences are defined beforehand and known at the destination node. With such a choice, the signals transmitted by the relays across the K time blocks have random phases and are subject to limited power fluctuations. Clearly, this design is not optimal for minimizing the mean square error of the channel estimation. Determining an optimum design for these matrices is a difficult problem and is not pursued in this work. Nevertheless, extensive computer simulations have demonstrated that this choice yields very good results. For convenience, we will come back later to the problem of choosing E and F from a channel identifiability viewpoint. A more elaborated design of these matrices will be then proposed.

Upon reception of the data matrix Y_k, k=1,…,K, an unstructured estimate of the end-to-end channel during the k th time block is first obtained at the destination. Multiplying both sides of (11) with the known training sequence matrix $S_{0}^{H}$ yields

\begin{array}{lcr} {\hat{H}}_{k} = y_{k} S_{0}^{H} \in C^{N_{d} \times N_{s}} \\ = H_{rd} D_{k} (F) H_{rr} D_{k} (E) H_{sr} + V_{k} S_{0}^{H}, \end{array}

(12)

k=1,⋯,K. Let us introduce

{\hat{H}}_{k} = H_{k} + V_{k} S_{0}^{H},

(13)

where

H_{k} = H_{rd} D_{k} (F) H_{rr} D_{k} (E) H_{sr}, k = 1, \dots, K,

(14)

is the matrix-of-interest that represents the effective end-to-end channel, V_k is the total noise matrix, and ${\tilde{H}}_{k}$ is the noisy observation of H_k. We can assemble the set {H₁,⋯,H_K} to form a three-way array, or a third-order tensor, $ℋ \in C^{N_{d} \times N_{s} \times K}$ , whose dimensions are N_d (first dimension), N_s (second dimension), and K (third dimension).

Equation (14) corresponds to a PARATUCK2 model of the (noiseless) tensor [23]. The PARATUCK2 model has first appeared in [20]. A more comprehensive formulation is given in [23], which also details an alternating least squares procedure for estimating its matrix factors. Here, we show that this tensor model can be exploited to derive novel channel estimators for a cooperative MIMO relaying system.

Now, let us define

H_{[1]} ≐ [vec (H_{1}), \dots, vec (H_{K})] \in C^{N_{d} N_{s} \times K}

(15)

where H_[1] is a matrix ‘unfolding’ for the tensor obtained by stacking column-wise its K slices. Define also

W_{k} = D_{k} (F) H_{rr} \underset{k}{D} (E) \in C^{N_{2} \times N_{1}} .

(16)

Substituting (14) into (15), and applying property vec(A C B)=(B^T⊗A)vec(C), we get

\begin{array}{lcr} H_{[1]} & = & (H_{sr}^{T} \otimes H_{rd}) [vec (W_{1}), \dots, vec (W_{K})] \\ = & (H_{sr}^{T} \otimes H_{rd}) diag (vec (H_{rr})) (E^{T} ⊙ F^{T}) \end{array}

(17)

where

E^{T} ⊙ F^{T} = [E_{(1, :)}^{T} \otimes F_{(1, :)}^{T}, \dots, E_{(K, :)}^{T} \otimes F_{(K, :)}^{T}] \in C^{N_{2} N_{1} \times K},

(18)

$E_{(k, :)} \in C^{1 \times N_{1}}$ (resp. $F_{(k, :)} \in C^{1 \times N_{2}}$ ) denote the k th row of E (resp. F), and ⊙ is the Khatri-Rao (columnwise Kronecker) product.

Applying property vec(A diag(x)B)=(B^T⊙A)x, we get from (17) the following expression:

vec (H_{[1]}) = Ω_{1} vec (H_{rr}),

(19)

where

Ω_{1} = [{(E^{T} ⊙ F^{T})}^{T} ⊙ (H_{sr}^{T} \otimes H_{rd})] \in C^{N_{D} N_{s} K \times N_{1} N_{2}} .

(20)

In addition to the matrix unfolding H_[1], it is useful to define two other matrix unfoldings, which collect the information of tensor . Therefore, let us now define

H_{[2]} ≐ [\begin{matrix} H_{1} \\ ⋮ \\ H_{K} \end{matrix}] \in C^{N_{d} K \times N_{s}}, H_{[3]} ≐ [\begin{matrix} H_{1}^{T} \\ ⋮ \\ H_{K}^{T} \end{matrix}] \in C^{N_{s} K \times N_{d}} .

(21)

From (14) and (16), it follows that

\begin{matrix} H_{[2]} = [\begin{matrix} H_{rd} W_{1} \\ ⋮ \\ H_{rd} W_{K} \end{matrix}]_{H}^{sr} = [\begin{matrix} H_{rd} \\ ⋱ \\ H_{rd} \end{matrix}] [\begin{matrix} W_{1} \\ ⋮ \\ W_{K} \end{matrix}] H_{sr} \end{matrix}

(22)

and

\begin{matrix} H_{[3]} = [\begin{matrix} H_{sr}^{T} W_{1}^{T} \\ ⋮ \\ H_{sr}^{T} W_{K}^{T} \end{matrix}] H_{rd}^{T} = [\begin{matrix} H_{sr}^{T} \\ ⋱ \\ H_{sr}^{T} \end{matrix}] [\begin{matrix} W_{1}^{T} \\ ⋮ \\ W_{K}^{T} \end{matrix}] H_{rd}^{T} \end{matrix}

(23)

or, more compactly,

\begin{array}{lcr} H_{[2]} & = & (I_{K} \otimes H_{rd}) Ω_{2} H_{sr}, \end{array}

(24)

\begin{array}{lcr} H_{[3]} & = & (I_{K} \otimes H_{sr}^{T}) Ω_{3} H_{rd}^{T}, \end{array}

(25)

where

Ω_{2} = [\begin{matrix} W_{1} \\ ⋮ \\ W_{K} \end{matrix}] \in C^{N_{2} K \times N_{1}}, Ω_{3} = [\begin{matrix} W_{1}^{T} \\ ⋮ \\ W_{k}^{T} \end{matrix}] \in C^{N_{1} K \times N_{2}} .

(26)

3.1 Identifiability of channel matrices

Identifiability of H_sr, H_rr, and H_rd in the LS sense from H_[1], H_[2], and H_[3] (see Equations (19), (24), and (25)), respectively, requires that $Ω_{1} = [{(E^{T} ⊙ F^{T})}^{T} ⊙ (H_{sr}^{T} \otimes H_{rd})] \in C^{N_{D} N_{s} K \times N_{1} N_{2}}$ , $Z_{[2]} ≐ (I_{K} \otimes H_{rd}) Ω_{2} \in C^{N_{d} K \times N_{1}}$ and $Z_{[3]} ≐ (I_{K} \otimes H_{sr}^{T}) Ω_{3} \in C^{N_{s} K \times N_{2}}$ be full column-rank. These requirements come from the fact that Ω₁, Z_[2], and Z_[3] must be left-invertible, from which the following necessary conditions are obtained:

N_{d} N_{s} K \geq N_{1} N_{2}, N_{d} K \geq N_{1}, N_{s} K \geq N_{2} .

(27)

From the three inequalities and from the fact that we must have K≥2, the lower bound on the number K of time blocks necessary for identifiability is given by

K \geq max (⌈\frac{N_{1} N_{2}}{N_{d} N_{s}}⌉, ⌈\frac{N_{1}}{N_{d}}⌉, ⌈\frac{N_{2}}{N_{s}}⌉, 2),

(28)

where ⌈x⌉ is equal to the smallest integer that is greater than or equal to x.

Note that the identifiability of the channel matrices H_sr, H_rr, and H_rd from the unstructured channel tensor will ensure that the compound channel $H_{c} = H_{rd} H_{rr} H_{sr} \in C^{N_{d} \times N_{s}}$ is strictly unique. Note also that conditions N_dN_sK≥N₁N₂ and N_dK≥N₁ are clearly much less restrictive in terms of the required number N_d of antennas at the destination node, in comparison with the conventional three-step LS estimator that requires N_d≥N₂≥N₁. Otherwise stated, estimation of the partial channels can be done even in situations where the number of receive antennas is much less than the number of relay antennas (provided that K satisfies condition (28)). This situation may arise in scenarios with denser deployments of relay stations, where the total number of relay antennas exceeds those of source and/or destination antennas. As shown by these inequalities, the possibility of affording fewer receive antennas is compensated by an increase on the number K of training blocks, which represents a trade-off.

Condition (28), although necessary, is not sufficient for identifiability. Since $Z_{[2]} ≐ (I_{K} \otimes H_{rd}) Ω_{2} \in C^{N_{d} K \times N_{1}}$ and $Z_{[3]} ≐ (I_{K} \otimes H_{sr}^{T}) Ω_{3} \in C^{N_{s} K \times N_{2}}$ , additionally, must have rank(Ω₂)=N₁ and rank(Ω₃)=N₂, i.e., both $Ω_{2} \in C^{N_{2} K \times N_{1}}$ and $Ω_{3} \in C^{N_{1} k \times N_{2}}$ must be full column-rank. Otherwise, Z_[2] and Z_[3] will be rank-deficient, even if (28) is respected.

Let us assume that the partial channels H_sr, H_rr, and H_rd are full rank matrices, which is a reasonable assumption when the wireless links are assumed to undergo scattering-rich multipath propagation. The following corollaries can then be obtained:

C1
If N₁=N₂, identifiability of the partial channels is guaranteed for N₁=N_s and N₂=N_d;
C2
If N₁=1, identifiability of the partial channels is guaranteed for N₂=N_d and N₂=K;
C3
If N₂=1, identifiability of the partial channels is guaranteed for N₁=N_s and N₁=K.

Remark: For the first corollary, we can note that if N₁≤N_s and N₂≤N_d, then $H_{sr}^{T} \otimes H_{rd}$ is full column-rank, which ensures that $Ω_{1} \in C^{N_{D} N_{s} K \times N_{2}}$ is full column-rank due to its Khatri-Rao product structure [24]. Likewise, $Ω_{2} \in C^{N_{2} K \times N_{1}}$ and $Ω_{3} \in C^{N_{1} k \times N_{2}}$ are also full column-rank in this case, guaranteeing the identifiability of the channel matrices. Regarding the second corollary, it corresponds to a special case of our system model where the first relay tier reduces to a single-antenna relay. In this case, satisfying N₂≤N_d and N₂≤K ensures that Ω₁, Z_[2], and Z_[3] are all full column-rank, so that the three partial channels are identifiable. The same reasoning is valid for the third corollary, which is analogous to the second one.

3.2 Essential uniqueness

Let $\{{\hat{H}}_{sr}, {\hat{H}}_{rr}, {\hat{H}}_{rd}\}$ be an alternative set of matrices yielding the same unstructured channel tensor satisfying the PARATUCK2 model (14). If H_sr, H_rr, and H_rd are full rank and the identifiability conditions (27) are satisfied, then ${\hat{H}}_{sr}$ , ${\hat{H}}_{rr}$ , and ${\hat{H}}_{rd}$ are essentially unique. In this case, we have ${\hat{H}}_{sr} = Δ_{sr} H_{sr}$ , ${\hat{H}}_{rd} = H_{rd} Δ_{rd}$ and ${\hat{H}}_{rr} = Δ_{rr}^{(2)} H_{rr} Δ_{rr}^{(1)}$ , where the following relation holds:

(Δ_{sr} Δ_{rr}^{(1)}) \otimes (Δ_{rd} Δ_{rr}^{(2)}) = I_{N_{1} N_{2}} .

(29)

Note that permutation ambiguity does not exist due to the knowledge of the training matrices E and F. The relation (29) can be obtained by replacing the alternative solutions ${\hat{\bar{H}}}_{sr}$ , ${\hat{H}}_{rd}$ , and ${\hat{H}}_{rr}$ into (14) and then applying some basic manipulations using properties of the Kronecker product. Equation (29) turns into the following relations: $Δ_{rd} Δ_{rr}^{(2)} = α I_{N_{2}}$ and $Δ_{sr} Δ_{rr}^{(1)} = (1 / α) I_{N_{1}}$ , where α is an arbitrary scalar factor. These two relations come from the fact that the Kronecker product between any two diagonal matrices is equal to the identity matrix if and only if these diagonal matrices are (scaled) identity matrices that compensate each other. Consequently, H_sr, H_rr, and H_rd can be recovered in an essentially unique manner up to scaling factors. The scaling ambiguity can be eliminated by normalizing the first column of H_sr or the first row of H_rd to one. Since these ambiguities compensate each other, the compound channel is strictly unique and we have ${\hat{H}}_{c} = {\hat{H}}_{rd} {\hat{H}}_{rr} {\hat{H}}_{sr} = H_{rd} H_{rr} H_{sr} = H_{c}$ .

3.3 Trilinear alternating least squares algorithm

The TALS algorithm is an iterative estimation method that alternates among the LS estimations of the channel matrices H_sr, H_rr, and H_rd by fitting a PARATUCK2 model from the noisy matrices ${\tilde{H}}_{[i]} = H_{[i]} + V_{[i]}$ , i=1,2,3. Note that the noise term V_[i] is constructed in a way analogous to H_[i], i=1,2,3, following Equations (15) and (21), respectively. The AF training matrices E and F are assumed to be known at the destination and are fixed during the estimation process. From (19), (24), and (25), we respectively obtain the following linear optimization problems:

\begin{array}{lcr} \underset{vec (H_{rr})}{argmin} {∥vec ({\tilde{H}}_{[1]}) - Ω_{1} vec (H_{rr})∥}_{F}^{2}, \end{array}

(30)

\begin{array}{lcr} \underset{H_{sr}}{argmin} {∥{\tilde{H}}_{[2]} - (I_{K} \otimes H_{rd}) Ω_{2} H_{sr}∥}_{F}^{2}, \end{array}

(31)

\begin{array}{lcr} \underset{H_{rd}}{argmin} {∥{\tilde{H}}_{[3]} - (I_{K} \otimes H_{sr}^{T}) Ω_{3} H_{rd}^{T}∥}_{F}^{2} . \end{array}

(32)

These LS estimation problems can be solved alternately by estimating one channel matrix at each time, while fixing the other matrices to their values obtained in previous estimation steps. Therefore, each iteration of the algorithm has three estimation steps. The algorithm starts by randomly initializing two out of the three channel matrices and proceeds until convergence. In the following, a summary of the TALS algorithm is provided.

Define $e (n) = vec ({\tilde{H}}_{[1]}) - [{(E^{T} ⊙ F^{T})}^{T} ⊙ ({\hat{H}}_{sr}^{T} (n) \otimes {\hat{H}}_{rd} (n))] {\hat{h}}_{rr} (n)$ . The sum of squared residuals (SSR) at the end of the n th iteration is defined as S S R(n)=e^H(n)e(n). We declare the convergence of the algorithm when |S S R(n)-S S R(n-1)|≤10^-6, meaning that the model reconstruction error does not significantly change between two successive iterations.

Generally, the ALS algorithm is sensitive to the initialization, and convergence to the global minimum can be slow when all the matrix factors of the model are unknown [25]. However, in our case, we have observed that convergence to the global minimum is always achieved (e.g., within 10 to 30 iterations for medium-to-high SNRs) due to the knowledge of the AF training matrices E and F.

3.4 Kronecker least squares algorithm

We now derive a closed-form solution to our channel estimation problem by exploiting the mixed Kronecker/ Khatri-Rao factorization structure of the matrix unfolding H_[1] defined in (17). Starting from (13), the noisy version of (15) is given by:

{\hat{H}}_{[1]} = H_{[1]} + (S_{0}^{H} \otimes I_{N_{d}}) V_{[1]},

(33)

where $V_{[1]} = [vec (V_{1}), \dots, vec (V_{K})] \in C^{N_{d} N_{s} \times K}$ . Let $Z = E^{T} ⊙ F^{T} \in C^{N_{1} N_{2} \times K}$ denote the combined AF training matrix and assume that $Z Z^{H} = I_{N_{1} N_{2}}$ . Multiplying both sides of (33) by Z^H, we have:

{\hat{X}}_{[1]} ≐ {\hat{H}}_{[1]} Z^{H} + (S_{0}^{H} \otimes I_{N_{d}}) V_{[1]} Z^{H}

(34)

where ${\hat{X}}_{[1]} = X_{[1]} + (S_{0}^{H} \otimes I_{N_{d}}) V_{[1]} Z^{H}$ . From (17), we have:

X_{[1]} = (H_{sr}^{T} \otimes H_{rd}) diag (vec (H_{rr})) .

(35)

Our goal is to directly identify the channel matrices from (35). However, let us first address the deterministic design of the AF training matrices E and F such that $Z Z^{H} ≐ (E^{T} ⊙ F^{T}) {(E^{T} ⊙ F^{T})}^{H} = I_{N_{1} N_{2}}$ . Assuming K≥N₁N₂, this condition is satisfied by designing Z, for instance, as a discrete Fourier transform (DFT) matrix. Having fixed the structure of Z, we are left with the problem of factorizing this matrix as the Khatri-Rao product between E^T and F^T. This problem can easily be solved by means of K rank-one matrix factorizations, which admit unique solutions. Note that the k th column of Z can be written as

Z (:, k) = {(E (k, :) \otimes F (k, :))}^{T} \in C^{N_{1} N_{2} \times 1}, k = 1, \dots, K.

Defining a rank-one matrix ${\tilde{Z}}_{k} ≐ unvec (Z (:, k)) \in C^{N_{2} \times N_{1}}$ , it follows that

{\tilde{Z}}_{k} = {(F (k, :))}^{T} E (k, :),

from which E(k,:) and F(k,:) can be determined as the unique right and left singular vectors of ${\tilde{Z}}_{k}$ , k=1,…,K. Note that the proposed design, although not optimized to minimize the mean square error of the channel estimation, ensures that the noise characteristics in (33) will not be changed when ${\hat{H}}_{[1]}$ is post-multiplied by Z^H (i.e., inverse DFT transformation).

Coming back to the channel estimation problem, from (35), let us define $x_{n_{1}, n_{2}} \in C^{N_{s} N_{d} \times 1}$ as the [(n₁-1)N₂+n₂]-th column of $X_{[1]} \in C^{N_{s} N_{d} \times N_{1} N_{2}}$ , n₁=1,…,N₁, n₂=1,…,N₂. Note that

x_{n_{1}, n_{2}} = (H_{sr}^{T} (:, n_{1}) \otimes H_{rd} (:, n_{2})) h_{rr} (n_{2}, n_{1})

(36)

Defining ${\tilde{X}}_{n_{1}, n_{2}} ≐ unvec (x_{n_{1}, n_{2}}) \in C^{N_{d} \times N_{s}}$ as a rank-one matrix obtaining by reshaping, we have

{\tilde{X}}_{n_{1}, n_{2}} = h_{rr} (n_{2}, n_{1}) H_{rd} (:, n_{2}) H_{sr} (n_{1}, :)

(37)

Consider the singular value decomposition (SVD) of ${\tilde{X}}_{n_{1}, n_{2}}$ :

\begin{array}{lcr} {\tilde{X}}_{n_{1}, n_{2}} = U_{n_{1}, n_{2}} Λ_{n_{1}, n_{2}} V_{n_{1}, n_{2}}^{H} \end{array}

(38)

\begin{array}{lcr} n_{1} = 1, \dots, N_{1}, n_{2} = 1, \dots, N_{2} . \end{array}

(39)

From the rank-one property of ${\tilde{X}}_{n_{1}, n_{2}}$ , we have:

\begin{array}{lcr} {\hat{H}}_{rd}^{(n_{2})} (:, n_{2}) = U_{n_{1}, n_{2}} (:, 1), n_{1} = 1, \dots, N_{1}, \end{array}

(40)

\begin{array}{lcr} {\hat{H}}_{sr}^{(n_{1})} (n_{1}, :) = {(V_{n_{1}, n_{2}} (:, 1))}^{T}, n_{2} = 1, \dots, N_{2}, \end{array}

(41)

\begin{array}{lcr} {\hat{h}}_{rr} (n_{2}, n_{1}) = λ_{n_{1}, n_{2}} (1, 1) . \end{array}

(42)

Final estimates of H_rd(:,n₂) and H_sr(:,n₁) can be obtained by averaging over the N₁ and N₂ independent estimates, respectively:

\begin{array}{lcr} {\hat{H}}_{rd} (:, n_{2}) = \frac{1}{N_{1}} \sum_{n_{1} = 1}^{N_{1}} {\hat{H}}_{rd}^{(n_{1})} (:, n_{2}), \end{array}

(43)

\begin{array}{lcr} {\hat{H}}_{sr} (n_{1}, :) = \frac{1}{N_{2}} \sum_{n_{2} = 1}^{N_{2}} {\hat{H}}_{sr}^{(n_{2})} (n_{1}, :), \end{array}

(44)

with

\begin{array}{lcr} {\hat{H}}_{rd} = [{\hat{H}}_{rd} (:, 1), \dots, {\hat{H}}_{rd} (:, N_{2})], \end{array}

(45)

\begin{array}{lcr} {\hat{H}}_{sr} = {[{\hat{H}}_{sr} (:, 1), \dots, {\hat{H}}_{sr} (:, N_{1})]}^{T}, \end{array}

(46)

\begin{array}{lcr} {\hat{H}}_{rr} = [\begin{matrix} λ_{1, 1} (1, 1) & \dots & λ_{N_{1}, 1} (1, 1) \\ ⋮ & ⋮ & ⋮ \\ λ_{1, N_{2}} (1, 1) & \dots & λ_{N_{1}, N_{2}} (1, 1) \end{matrix}] . \end{array}

(47)

Note that the columns of the estimated ${\hat{H}}_{sr}$ and ${\hat{H}}_{rd}$ have unit energy while each entry of ${\hat{H}}_{rr}$ concentrates all the energy of the wireless link connecting the source node to the destination node via a given tier 1-tier 2 relay pair. Such an interpretation is useful for designing transmit and receive spatial filters for system optimization as well as for power allocation purposes.

Discussion: The KRLS algorithm involves the computation of N₁N₂ SVDs to provide rank-one approximations for the matrices ${\hat{X}}_{1, 1}, \dots, {\hat{X}}_{N_{1}, N_{2}}$ , of dimensions N_d×N_s, which are constructed from the N₁N₂ columns ${\hat{X}}_{[1]}$ . The distinguishing feature of the KRLS-based estimator is on the closed-form solution to the problem, as opposed to the TALS algorithm that consists of iterative LS estimation steps, which implies a higher computational complexity. However, note that the KRLS algorithm is only applicable under the condition K≥N₁N₂, which is necessary for Z=E^T⊙F^T to have orthogonal rows, leading to (35). In contrast, the TALS algorithm can operate under a much lower bound on K, as discussed in Section 3.1. This is clearly a trade-off between both estimators in terms of identifiability conditions and computational complexity. As will be shown in the next section, both estimators provide satisfactory performances, and the choice of the best estimator is rather dependent on the design constraints of the system. For instance, we can say that the TALS estimator is preferable if processing power at the receiver is not too limited, as is often the case with base station reception in outdoor micro- or macro-cells. The KRLS solution would be more likely chosen in indoor scenarios, where channel coherence time is long enough to allow for higher values of K.

4 Extension to two-way MIMO relaying systems

In the previous sections, we have focused on a multi-relay cooperative scheme, where transmission is directed in one direction, i.e., from a specific source to a specific destination via two tiers of multiple relays. In this section, we show that the same modeling approach can be extended to a two-way MIMO relaying scenario, where pilot/data transmission takes place in both directions. In the first phase, two sources simultaneously transmit their data to the multiple relays. Note that, in the two-way case, the relays of each tier receive a superposition of $N_{s_{1}} + N_{s_{2}}$ signals coming from sources 1 and 2. In the second and third phases, inter-relay communication takes place. More specifically, in phase two, tier 1 relays transmit signals towards tier 2 relays, while tier 1 relays stay silent. In phase three, the opposite happens. Finally, in the fourth communication phase, all the relays transmit to the two sources, and each one of them receives a superposition of N₁+N₂ signals.

In the first transmission phase, we assume that training symbol matrices $S_{1} \in C^{N_{s_{1}} \times L}$ and $S_{2} \in C^{N_{s_{2}} \times L}$ are transmitted from sources 1 and 2, respectively. We omit the additive noise terms for convenience of presentation. The signal received at the i th relay tier is given by:

X^{(i)} = H_{s_{1} r_{i}} S_{1} + H_{s_{2} r_{i}} S_{2} = H^{(i)} S, i = 1, 2,

(48)

where $H^{(i)} ≐ [H_{s_{1} r_{i}} H_{s_{2} r_{i}}] \in C^{N_{i} \times (N_{s_{1}} + N_{s_{2}})}$ , and $S ≐ {[S_{1}^{T} S_{2}^{T}]}^{T} \in C^{(N_{s_{1}} + N_{2}) \times L}$ . The training sequence S_i chosen by source i, is designed to satisfy the following conditions:

(i)
$S_{i} S_{i}^{H} = I_{N_{i}}$ , i=1,2,
(ii)
$S_{1} S_{2}^{H} = 0_{N_{1} \times N_{2}}$ .

A possible construction satisfying these two conditions is based on the normalized DFT matrix of size $L \times (N_{s_{1}} + N_{s_{2}})$ , with $L \geq N_{s_{1}} + N_{s_{2}}$ . This design allows the sources to eliminate the self-interference generated by their own transmission, when receiving the signal back from the relays.

In the second and third phases, where inter-relay communications happen, the signal received at the relays of tier i from the relays of tier j, (i,j)={(1,2),(2,1)}, can be written as:

\begin{array}{lcr} Z_{k}^{(i)} & = & H_{r_{j} r_{i}} D_{k} (E_{j}) X^{(j)} \\ = & H_{r_{j} r_{i}} D_{k} (E_{j}) H^{(j)} S, \end{array}

(49)

k=1,…,K, where $H_{r_{j} r_{i}} \in C^{N_{i} \times N_{j}}$ is the MIMO channel linking the relays of tier j at transmission to the relays of tier i at reception, (i,j)={(1,2),(2,1)}. Note that channel reciprocity in the inter-relay communications is not a necessary assumption which means that we may have $H_{r_{1} r_{2}} \neq H_{r_{2} r_{1}}$ .

Finally, in the fourth transmission phase, the signals received at sources 1 and 2 are post-multiplied by $S_{2}^{H}$ and $S_{1}^{H}$ , respectively, to accomplish self-interference elimination, yielding

\begin{matrix} Y_{k}^{(1)} = (H_{r_{1} s_{1}} D_{k} (F_{1}) Z_{k}^{(1)}) S_{2}^{H} + (H_{r_{2} s_{1}} D_{k} (F_{2}) Z_{k}^{(2)}) S_{2}^{H} \\ = \underset{tier 2 \to tier 1 relay path}{\underset{⏟}{H_{r_{1} s_{1}} D_{k} (F_{1}) H_{r_{2} r_{1}} D_{k} (E_{2}) H_{s_{2} r_{2}}}} \\ + \underset{tier 1 \to tier 2 relay path}{\underset{⏟}{H_{r_{2} s_{1}} D_{k} (F_{2}) H_{r_{1} r_{2}} D_{k} (E_{1}) H_{s_{2} r_{1}}}} \\ = {\bar{H}}^{(1, 1)} D_{k} ({\bar{F}}_{1, 2}) G_{r r}^{(1)} D_{k} ({\bar{E}}_{2, 1}) {\bar{H}}^{(1, 2)}, k = 1, \dots, K, \end{matrix}

(50)

and

\begin{matrix} Y_{k}^{(2)} = (H_{r_{1} s_{2}} D_{k} (F_{1}) Z_{k}^{(1)}) S_{1}^{H} + (H_{r_{2} s_{2}} D_{k} (F_{2}) Z_{k}^{(2)}) S_{1}^{H} \\ = \underset{tier 1 \to tier 2 relay path}{\underset{⏟}{H_{r_{2} s_{2}} D_{k} (F_{2}) H_{r_{1} r_{2}} D_{k} (E_{1}) H_{s_{1} r_{1}}}} \\ + \underset{tier 2 \to tier 1 relay path}{\underset{⏟}{H_{r_{1} s_{2}} D_{k} (F_{1}) H_{r_{2} r_{1}} D_{k} (E_{2}) H_{s_{1} r_{2}}}} \\ = {\bar{H}}^{(2, 1)} D_{k} ({\bar{F}}_{2, 1}) G_{r r}^{(2)} D_{k} ({\bar{E}}_{1, 2}) {\bar{H}}^{(2, 2)}, k = 1, \dots, K, \end{matrix}

(51)

where

\begin{array}{lcr} {\bar{H}}^{(1, 1)} ≐ [H_{r_{1} s_{1}} H_{r_{2} s_{1}}] \in C^{N_{s_{1}} \times (N_{1} + N_{2})} \end{array}

(52)

\begin{array}{lcr} {\bar{H}}^{(1, 2)} ≐ {[H_{s_{2} r_{2}}^{T} H_{s_{2} r_{1}}^{T}]}^{T} \in C^{(N_{1} + N_{2}) \times N_{s_{2}}} \end{array}

(53)

\begin{array}{lcr} {\bar{H}}^{(2, 1)} ≐ [H_{r_{2} s_{2}} H_{r_{1} s_{2}}] \in C^{N_{s_{2}} \times (N_{1} + N_{2})} \end{array}

(54)

\begin{array}{lcr} {\bar{H}}^{(2, 2)} ≐ {[H_{s_{1} r_{1}}^{T} H_{s_{1} r_{2}}^{T}]}^{T} \in C^{(N_{1} + N_{2}) \times N_{s_{1}}} \end{array}

(55)

\begin{array}{lcr} G_{r r}^{(1)} ≐ blockdiag (H_{r_{2} r_{1}} H_{r_{1} r_{2}}) \in C^{(N_{1} + N_{2}) \times (N_{1} + N_{2})} \end{array}

(56)

\begin{array}{lcr} G_{r r}^{(2)} ≐ blockdiag (H_{r_{1} r_{2}} H_{r_{2} r_{1}}) \in C^{(N_{1} + N_{2}) \times (N_{1} + N_{2})} \end{array}

(57)

\begin{array}{lcr} {\bar{F}}_{i, j} ≐ [F_{i} F_{j}], {\bar{E}}_{i, j} ≐ [E_{i} E_{j}], (i, j) = {(1, 2), (2, 1)} . \end{array}

(58)

Therefore, we can conclude that the signals received at sources 1 and 2 in the considered two-way MIMO relaying scenario (Equations (50) and (51)) follows a PARATUCK2 model. By analogy with the noiseless part of the one-way signal model (14), we have the following correspondences between the factor matrices:

\begin{array}{lcr} (H_{r d}, H_{s r}, H_{r r}) \Leftrightarrow ({\bar{H}}^{(1, 1)}, {\bar{H}}^{(1, 2)}, G_{r r}^{(1)}) \\ (E, F) \Leftrightarrow ({\bar{F}}_{1, 2}, {\bar{E}}_{2, 1}) (source 1) \end{array}

(59)

\begin{array}{lcr} (H_{r d}, H_{s r}, H_{r r}) \Leftrightarrow ({\bar{H}}^{(2, 1)}, {\bar{H}}^{(2, 2)}, G_{r r}^{(2)}) \\ (E, F) \Leftrightarrow ({\bar{F}}_{2, 1}, {\bar{E}}_{1, 2}) (source 2) \end{array}

(60)

Consequently, the tensor-based channel estimation algorithms proposed in the previous section can be equally applied at each source to estimate the channels ${\bar{H}}^{(i, 1)}$ , ${\bar{H}}^{(i, 2)}$ and $G_{r r}^{(i)}$ , i=1,2, from Equations (50) and (51), respectively. If reciprocity is assumed in the two-way relay channels, we have:

\begin{array}{lcr} H_{r_{i} s_{i}} = H_{s_{i} r_{i}}^{T}, i = 1, 2 \end{array}

(61)

\begin{array}{lcr} H_{r_{i} s_{j}} = H_{s_{j} r_{i}}^{T}, (i, j) = {(1, 2), (2, 1)}, \end{array}

(62)

\begin{array}{lcr} H_{r_{i} r_{j}} = H_{r_{j} r_{i}}^{T}, (i, j) = {(1, 2), (2, 1)}, \end{array}

(63)

which in turn implies ${\bar{H}}^{(1, 1)} = {({\bar{H}}^{(2, 2)})}^{T} = H_{s_{1}}$ , ${\bar{H}}^{(1, 2)} = {({\bar{H}}^{(2, 1)})}^{T} = H_{s_{2}}$ , and $G_{r r}^{(1)} = {(G_{r r}^{(2)})}^{T} = G$ . In this particular case, the PARATUCK2 models (50) and (51) become essentially equal, i.e., they depend on the same unknown channel matrices $H_{s_{1}}$ , $H_{s_{2}}$ , and G to be estimated. Note, however, that such a reciprocity is not a necessary assumption of our modeling approach, which can be used in the general case of non-symmetrical two-way MIMO relay channels.

5 Numerical results

We now present computer simulation results for assessing the performance of the proposed channel estimator in selected system configurations. The estimator’s performance is evaluated in terms of the normalized mean square error (NMSE) of the estimated channel matrices. From the estimated channels, the performance in terms of bit error rate (BER) is calculated by assuming a linear receive filter. The BER and NMSE curves are plotted as a function of the overall signal-to-noise ratio (SNR) at the destination. This SNR is given by the ratio between the powers of the useful signal component and the noise component in Equation (11). For each simulated SNR value, the results represent an average over L=5,000 Monte Carlo runs. At each run, the channel coefficients are drawn from a circularly symmetric complex-valued Gaussian distribution with zero-mean and unit variance, while the transmitted symbols are drawn from a BPSK sequence. The SNR level at the tier 1 and tier 2 relays are assumed to be 30 dB above the SNR level at the destination.

For purposes of performance evaluation, the scaling ambiguities affecting the estimates of the channel matrices are removed by assuming the first column of H_sr and first row of H_rd contain all one elements, similarly to [11, 14]. These scaling ambiguities can be determined as follows. First, we find $Δ_{sr} = D_{1} ({\hat{H}}_{sr}^{T})$ and $Δ_{rd} = D_{1} ({\hat{H}}_{rd})$ . Then, applying property (A B)⊗(C D)=(A⊗C)(B⊗D) yields $(Δ_{sr} \otimes Δ_{rd}) (Δ_{rr}^{(1)} \otimes Δ_{rr}^{(2)}) = I_{N_{1} N_{2}}$ , from which we obtain $Δ_{rr}^{(1)} \otimes Δ_{rr}^{(2)} = Δ_{sr}^{- 1} \otimes Δ_{rd}^{- 1}$ . A solution to this relation is then found as $Δ_{rr}^{(1)} = {[D_{1} ({\hat{H}}_{sr}^{T})]}^{- 1}$ and $Δ_{rr}^{(2)} = {[D_{1} ({\hat{H}}_{rd})]}^{- 1}$ .

In Figure 2, we depict the NMSE performance for the compound channel of our proposed estimators in comparison with the conventional LS estimator. The parameters are N_s=2, N₁=4, N₂=4, N_d=6, K=16, L₀=30, and the number of transmitted data symbols is N=1000. We can see that TALS and KRLS have similar performances, which are considerably better than the conventional (three-stage) LS estimator. The worst performance of the LS estimator comes from the error accumulation across successive channel estimation stages, which degrades its overall NMSE performance.

Figure 3 shows the NMSE performance of our proposed estimators in comparison with the two-hop bilinear alternating least squares (BALS) estimator of Rong et al. [14]. This estimator is a special case of the proposed one, where only one tier of relays is used. In this case, model (11) reduces to

\begin{array}{lcr} y_{k} = H_{rd} D_{k} (E) H_{sr} S_{0} + V_{k}, \\ k = 1, \dots, K, \end{array}

(64)

and the channel matrices H_sr and H_rd are estimated by means of a BALS algorithm. The parameter setting is the same as that of Figure 2. It can be seen the proposed estimator operates satisfactorily, being able to effectively estimate the three channel matrices. Figure 3 also indicates the proposed estimator performs close to the BALS estimator operating in a two-hop system. A small performance degradation is observed, which is due to the presence of an additional AF transmission phase of our three-hop system, resulting in a higher overall noise contribution at the destination. Note also that the TALS estimator involves three estimation steps while the BALS one has two estimation steps only.

Figure 4 shows the BER performance of a linear zero forcing (ZF) receiver designed from the estimated channel matrices, which are obtained from the TALS, KRLS, or the conventional LS estimators. The ZF receiver operates on data block collected in the received data matrix $Y \in C^{K N_{d} \times N}$ . The length of the data block is N=100 symbols, and the remaining system parameters are the same as those of the previous experiment. The ZF filter output is given by:

{\hat{S}}_{ZF} = {[\begin{matrix} H_{rd} D_{1} (F) H_{rr} D_{1} (E) H_{sr} \\ ⋮ \\ H_{rd} D_{K} (F) H_{rr} D_{K} (E) H_{sr} \end{matrix}]}^{†} Y .

(65)

This figure shows similar BER performances for TALS and KRLS, which are better than that of the conventional LS algorithm. This result corroborates the effectiveness of our channel estimators when used with linear receiver for symbol detection. In Figure 5, we evaluate the impact of the number of relay antennas on the BER performance of a linear ZF detector using the proposed TALS channel estimator. The fixed system parameters are N_s=2, N_d=6, L₀=30, and K=10. It can be seen that the BER performance is considerably improved as the number of relay antennas is increased, corroborating the expected gains of cooperative diversity. Although not plotted in this figure, the BER curves of the KRLS estimator are similar to those obtained with the TALS one.

Figure 6 depicts the performance of the ZF receiver designed from the perfect CSI for all channel matrices. Two parameter settings are considered, where N_d=2 and 4, respectively. The other system parameters are fixed to N_s=2, N₁=N₂=3, L₀=6, and K=9. First, it can be seen that the BER performances are considerably improved as the number of antennas at the destination is increased, owing to the higher spatial diversity, as expected. From these results, we also find that the TALS and KRLS estimators provide similar results and, more interestingly, their performances are close to that of the perfect CSI case. For instance, for a target BER of 10^-1, the SNR gap with respect to the perfect CSI case is less than 2 dB.

6 Conclusions

We have proposed channel estimation algorithms for MIMO AF multi-relay systems. The proposed estimators are designed to provide the destination (base station) with the instantaneous CSI of all the channels involved in the communication. In contrast to conventional pilot-assisted channel estimation, the proposed algorithms make a more efficient use of cooperative diversity by providing a joint estimation of all the channel matrices thanks to the use of a tensor modeling of the end-to-end system. Such a joint estimation can be accomplished either iteratively (using TALS) or in closed-form (using KRLS). Our numerical results corroborate the effectiveness of the proposed algorithms. The TALS estimator has a higher computational complexity than the KRLS one due to its iterative nature. On the other hand, the minimum condition for operation of KRLS (K≥N₁N₂) is more restrictive than the identifiability conditions of TALS, which implies more training (i.e., higher number of time blocks) to carry out the joint channel estimation. Both algorithms are suitable to the joint channel estimation problem, and a particular choice is mostly dictated by practical system requirements. We have also provided an extension of the proposed approach to two-way MIMO multi-relay system and verified that such an extension results in the same tensor model as the one-way scenario. Consequently, the proposed algorithms can be applied to one- and two-way multi-relay MIMO schemes.

Endnote

^a Since our focus is on the relay channel, direct links are not considered for simplicity. However, the idea proposed in this work can be easily extended to include direct links.

References

Sendonaris A, Erkip E, Aazhang B: User cooperation diversity - part I: system description. IEEE Trans. Commun 2003, 51(11):1927-1938. 10.1109/TCOMM.2003.818096
Article Google Scholar
Sendonaris A, Erkip E, Aazhang B: User cooperation diversity - part II: implementation aspects and performance analysis. IEEE Trans. Commun 2003, 51(11):1939-1948. 10.1109/TCOMM.2003.819238
Article Google Scholar
Laneman JN, Tse DNC, Wornell GW: Cooperative diversity in wireless networks: efficient protocols and outage behavior. IEEE Trans. Inform. Theor 2004, 50(12):3062-3080. 10.1109/TIT.2004.838089
Article MathSciNet MATH Google Scholar
Cao L, Zhang J, Kanno N: Multi-user cooperative communications with relay-coding for uplink IMT-advanced 4G systems. In Proc. IEEE GLOBECOM’09. Honolulu, HI; November 2009:1-6.
Google Scholar
Liu KJR, Sadek AK, Su W, Kwasinski A: Cooperative Communications and Networking. Cambridge University Press, New York, USA; 2009.
MATH Google Scholar
Pabst R, Walke BH, Schultz DC, Herhold P, Yanikomeroglu H, Mukherjee S, Viswanathan H, Lott M, Zirwas W, Dohler M, Aghvami H, Falconer DD, Fettweis GP: Relay-based deployment concepts for wireless and mobile broadband radio. IEEE Comm. Mag 2004, 42(9):80-89. 10.1109/MCOM.2004.1336724
Article Google Scholar
Dohler M, Li Y: Cooperative Communications: Hardware, Channel and PHY. John Wiley & Sons, West Sussex, United Kingdom; 2010.
Book Google Scholar
Rong Y, Tang X, Hua Y: A unified framework for optimizing linear nonregenerative multicarrier MIMO relay communication systems. IEEE Trans. Signal Process 2009, 57(12):4837-4851.
Article MathSciNet Google Scholar
Toding A, Khandaker MRA, Rong Y: Joint source and relay optimization for parallel MIMO relay networks. EURASIP J. Adv. Signal Process 2012, 174: 1-7.
Google Scholar
Kong T, Hua Y: Optimal design of source and relay pilots for mimo relay channel estimation. IEEE Trans. Signal Process 2011, 59(9):4438-4446.
Article MathSciNet Google Scholar
Lioliou P, Viberg M, Coldrey M: Efficient channel estimation techniques for amplify and forward relaying systems. IEEE Trans. Comm 2012, 60(11):3150-3155.
Article Google Scholar
Roemer F, Haardt M: Tensor-based channel estimation and iterative refinements for two-way relaying with multiple antennas and spatial reuse. IEEE Trans. Signal Process 2010, 58(11):5720-5735.
Article MathSciNet Google Scholar
Fernandes CAR, de Almeida ALF, Costa DB: Unified tensor modeling for blind receivers in multiuser uplink cooperative systems. IEEE Signal Process. Lett 2012, 19(5):247-250.
Article Google Scholar
Rong Y, Khandaker MRA, Xiang Y: Channel estimation of dual-hop MIMO relay system via parallel factor analysis. IEEE Trans. Wireless Comm 2012, 11(6):2224-2233.
Article Google Scholar
de Almeida ALF, Fernandes CAR, Benevides da Costa D: Multiuser detection for uplink ds-cdma amplify-and-forward relaying systems. IEEE Signal Process. Lett 2013, 20(7):697-700.
Article Google Scholar
Ximenes LR, Favier G, Almeida ALF, Silva YCB: PARAFAC-PARATUCK semi-blind receivers for two-hop cooperative MIMO relay systems. IEEE Trans. Signal Process 2014, 62(14):3604-3615.
Article MathSciNet Google Scholar
Harshman RA: Foundations of the PARAFAC procedure: model and conditions for an ‘explanatory’ multi-mode factor analysis. UCLA Working Papers Phonetics 1970, 16: 1-84.
Google Scholar
Carroll JD, Chang J-J: Analysis of individual differences in multidimensional scaling via an N-way generalization of “Eckart-Young” decomposition. Psychometrika 1970, 35(3):283-319. 10.1007/BF02310791
Article MATH Google Scholar
Sidiropoulos ND, Budampati RS: Khatri-Rao space-time codes. IEEE Trans. Signal Process 2002, 50(10):2396-2407. 10.1109/TSP.2002.803341
Article MathSciNet Google Scholar
Harshman RA, Lundy ME: Uniqueness proof for a family of models sharing features of Tucker’s three-mode factor analysis and PARAFAC/CANDECOMP. Psychometrika 1996, 61(1):133-154. 10.1007/BF02296963
Article MathSciNet MATH Google Scholar
Chalise BK, Zhang YD, Amin MG: Joint Optimization of Source Beamformer and Relay Coefficients Using MSE Criterion. Proc. of SPIE’12 May 2012.
Google Scholar
Mohammadi M, Ardebilipour M, Mobini Z, Zadeh R-A: Performance analysis and power allocation for multi-hop multi-branch amplify-and-forward cooperative networks over generalized fading channels. EURASIP J. Wireless Commun. Networking 2013, 2013(1):1-13. 10.1186/1687-1499-2013-1
Article Google Scholar
Bro R: Multi-way analysis in the food industry: Models, algorithms and applications. PhD thesis, University of Amsterdam, Amsterdam, 1998
Google Scholar
Sidiropoulos ND, Liu X: Identifiability results for blind beamforming in incoherent multipath with small delay spread. IEEE Trans. Signal Process 2001, 49(1):228-236. 10.1109/78.890366
Article Google Scholar
Smilde A, Bro R, Geladi P: Multi-way Analysis: Applications in the Chemical Sciences. John Wiley & Sons, West Sussex, England; 2004.
Book Google Scholar

Download references

Acknowledgements

André L. F. de Almeida is partially supported by CNPq and CAPES. This work was supported by the China’s Next Generation Internet Project (CNGI Project) (CNGI-12-03-009) and DNSLAB. This work was also supported by National Natural Science Foundation of China (Grant No. 61173017).

Author information

Authors and Affiliations

Information Network Center, Beijing University of Posts and Telecommunications, Beijing, China
Xi Han
Department of Teleinformatics Engineering, Federal University of Ceará, Campus do Pici, B. 725, 60455-970, Fortaleza, Brazil
André LF de Almeida
School of Computer Science, Beijing University of Posts and Telecommunications, 100876, Beijing, China
Zhen Yang

Authors

Xi Han
View author publications
You can also search for this author in PubMed Google Scholar
André LF de Almeida
View author publications
You can also search for this author in PubMed Google Scholar
Zhen Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xi Han.

Additional information

Competing interests

The authors declare that they have no competing interests.

Xi Han, André LF de Almeida and Zhen Yang contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Han, X., de Almeida, A.L. & Yang, Z. Channel estimation for MIMO multi-relay systems using a tensor approach. EURASIP J. Adv. Signal Process. 2014, 163 (2014). https://doi.org/10.1186/1687-6180-2014-163

Download citation

Received: 28 May 2014
Accepted: 24 October 2014
Published: 17 November 2014
DOI: https://doi.org/10.1186/1687-6180-2014-163

Channel estimation for MIMO multi-relay systems using a tensor approach

Abstract

1 Introduction

2 System model

2.1 Data model

2.2 Conventional LS estimation method

3 Proposed approach

3.1 Identifiability of channel matrices

3.2 Essential uniqueness

3.3 Trilinear alternating least squares algorithm

3.4 Kronecker least squares algorithm

4 Extension to two-way MIMO relaying systems

5 Numerical results

6 Conclusions

Endnote

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

About this article

Cite this article

Share this article

Keywords