Bounds on the capacity regions of half-duplex Gaussian MIMO relay channels

Gerdes, Lennart; Riemensberger, Maximilian; Utschick, Wolfgang

doi:10.1186/1687-6180-2013-43

Research
Open access
Published: 06 March 2013

Bounds on the capacity regions of half-duplex Gaussian MIMO relay channels

Lennart Gerdes¹,
Maximilian Riemensberger¹ &
Wolfgang Utschick¹

EURASIP Journal on Advances in Signal Processing volume 2013, Article number: 43 (2013) Cite this article

2663 Accesses
5 Citations
Metrics details

Abstract

This article considers uni- and bidirectional communication in the half-duplex Gaussian multiple-input multiple-output (MIMO) relay channel. Assuming perfect channel state information at all nodes and the use of time division duplex communications protocol to separate transmissions and receptions at all nodes, we propose a dual decomposition approach to efficiently determine upper and lower bounds on the capacity and the capacity region of the half-duplex relay channel and the restricted half-duplex two-way relay channel, respectively. Our approach allows to quantify the fundamental limits of the considered relay networks, and the obtained results may serve as benchmarks when studying different and/or suboptimal relay strategies or the impact of channel estimation errors. Furthermore, we discuss how our dual decomposition approach may be used for designing optimal resource allocation protocols.

1 Introduction

A central aspect of today’s and future wireless network standards is the question of how to provide high-speed and high-quality service to a steadily growing number of mobile users without an increase of available bandwidth. One means to improve throughput, spectral efficiency, and reliability is to equip the communication devices with multiple antennas as it is well-known that multi-antenna systems offer substantial gains over single-antenna systems [1, 2]. Another means to achieve above goals and to extend coverage is the use of relays, which support the communication between source(s) and destination(s) but usually do not have own information to transmit. The concept of relaying goes back as far as 1971 when van der Meulen introduced the relay channel model [3]. In contrast to point-to-point channels, the capacity of the relay channel remains unknown in general, but upper and lower bounds have of course been derived [4].

In this study, we consider the combination of multiple-antenna systems and the concept of relaying. In particular, we determine upper and lower bounds on the capacity and the capacity region of the Gaussian multiple-input multiple-output (MIMO) relay channel and the Gaussian MIMO two-way relay channel with a half-duplex constraint^a. While this topic is interesting and relevant in itself, note also that both the relay channel and the two-way relay channel are elementary building blocks of general multi-hop wireless networks. A fundamental understanding of these two small networks and their performance limits can thus help to determine the limits on the performances of larger communication networks, e.g., by decomposing a larger network into subgraphs whose performances can be more easily specified.

In their pioneering study on the relay channel, Cover and El Gamal derived a capacity upper bound and achievable rates based on a then new cut-set bound (CSB) and two coding schemes that are now referred to as decode-and-forward (DF) and compress-and-forward (CF), respectively. In [5, 6], the cut-set bound and the DF scheme are used to derive bounds on the capacity of the half-duplex relay channel. For Gaussian single-antenna channels, corresponding bounds are presented in [7, 8]. A (generally loose) upper bound to the CSB of the full-duplex Gaussian MIMO relay channel is provided in [9]. Achievable rates for this channel based on point-to-point transmission, the cascaded relay channel, and a suboptimal DF scheme are given, too. In [10], it is shown that the cut-set bound and the maximum achievable DF rate for this MIMO relay channel can be obtained as the solutions of convex optimization problems, which also holds if a half-duplex constraint is imposed and frequency division duplex (FDD) with an average power constraint is considered. For the full-duplex case, the same result was later independently derived in [11] and then extended to the half-duplex relay channel with a time division duplex (TDD) protocol and per protocol phase transmit power constraints imposed on source and relay [12]. A similar study for both the full-duplex and the half-duplex case with TDD is presented in [13]. However, it can be verified that the expressions resulting from those derivations are only upper bounds to the optimal solutions.

The two-way relay channel models the more common and important scenario where two terminals want to exchange information with the aid of a relay. It was introduced in [14], where the authors showed that a significant portion of the loss in spectral efficiency suffered in the one-way relay channel due to the half-duplex constraint can be compensated when bidirectional communication is considered. Most scientific articles have analyzed the half-duplex two-way relay channel in combination with a communication protocol consisting of two phases, a multiple access (MAC) phase and a broadcast (BC) phase [14–18]. In the MAC phase, the terminals transmit their messages to the relay, and subsequently, in the BC phase, the relay broadcasts its message to the terminals. With this protocol, however, all information is sent via the relay since the terminals cannot overhear each other’s transmissions due to the half-duplex constraint. As a result, protocols composed of more than two phases that utilize the direct link between the terminals can yield larger achievable rate regions in general [19–21].

The contributions of this article are as follows. We present a dual decomposition approach that allows to evaluate upper and lower bounds on the capacity and the capacity region of the half-duplex Gaussian MIMO relay channel and the restricted half-duplex Gaussian MIMO two-way relay channel, respectively. To this end, perfect channel state information (CSI) at all nodes and the use of TDD protocols to separate transmissions and receptions at all nodes are assumed. We show how the proposed dual decomposition approach can be applied to efficiently tackle the joint optimization of input signals and time allocation that needs to be solved in order to obtain the desired results. In the dual domain, the problem decomposes into subproblems that are easier to solve and for which standard convex optimization tools can be used. With our optimization approach, it is hence possible to efficiently obtain numerical results that quantify the fundamental limits of uni- and bidirectional communication in the half-duplex Gaussian MIMO relay channel. These results can then serve as benchmarks when studying different and/or suboptimal relay strategies or the impact of channel estimation errors on the performance of the considered relay networks. Moreover, our dual decomposition approach may be used for designing optimal resource allocation protocols, as discussed later in this article.

The approach proposed here is a nontrivial extension of a similar dual decomposition approach presented in [21]. There, we considered bounds on achievable rate regions for the same relay networks, but the transmit powers of all nodes were assumed to be bounded above by some finite value for every protocol phase. In this study, we modify this approach such that it can handle the average transmit power constraints under which the information theoretic capacity bounds (cut-set bound and achievable DF rate) we are interested in here were derived. We remark that the problems we need to solve become considerably more difficult due to the average transmit power constraints, both from a theoretical and practical point of view. This is because we need to introduce more dual variables and because the constraint sets of the subproblems encountered in the dual domain become unbounded. The latter means that several additional mathematical details have to be taken into account in order to ensure correctness of the optimization strategy. What is more, the power constraints considered in [21] can easily be incorporated into the optimization framework presented in this article, which is not the case vice versa. In this sense, the optimization approach presented here is more general than that of [21].

The remainder of this article is organized as follows. Section 2 introduces the system model for the restricted half-duplex Gaussian MIMO two-way relay channel. It should be mentioned here that our analysis focuses on the half-duplex two-way relay channel since it includes the half-duplex relay channel as a special case. In Section 3, we derive an outer bound on the capacity region of the restricted half-duplex Gaussian MIMO two-way relay channel and show how it can numerically be evaluated by means of the aforementioned dual decomposition approach. An inner bound on the capacity region is given by the rate region that can be achieved when the relay uses the decode-and-forward scheme. This achievable rate region and how it can be evaluated is discussed in Section 4. Numerical results for both uni- and bidirectional communication in the half-duplex Gaussian MIMO relay channel are presented in Section 5, and Section 6 concludes the article.

Notation: $R_{+}$ stands for the set of nonnegative real numbers. Matrices are denoted by bold capital letters, vectors by bold lowercase characters. The identity matrix, the zero matrix/vector, and the all-ones vector are specified by I, 0, and 1, respectively, where the dimensions are indicated by subscripts if necessary. A ⁻¹, A ^‡, A ^T, A ^H, and tr(A) denote the inverse, Moore-Penrose pseudoinverse, transpose, conjugate transpose, and trace of a matrix A, while A≽B means that A−B is positive semidefinite. E[·] is the expectation operator and $x \sim N_{C} (μ, C)$ means that x is a circularly symmetric complex Gaussian random vector with mean μ and covariance matrix C. Finally, I(X;Y|Z) denotes the conditional mutual information of random variables X and Y given Z and h(X|Y) is the differential entropy of X given Y.

2 System model

In the one-way relay channel, one source transmits information to one destination with the help of a relay. This simple unidirectional relay network is obviously only a special case of the two-way relay channel, where two terminals exchange information with the aid of the relay. Therefore, our analysis focuses on the half-duplex Gaussian MIMO two-way relay channel. More specifically, we consider the restricted two-way relay channel, i.e., the bidirectional communication is restricted in the sense that the encoders at the two terminals can neither cooperate, nor are they able to use previously decoded information to encode their messages. The most general communication protocol for this channel model is composed of all six phases (network states) where either one or two nodes transmit, as first noted in [22]. Evidently, no information can be conveyed when all nodes are silent or when all nodes transmit at the same time, where the latter is due to the half-duplex constraint imposed on all nodes. The six different phases are illustrated in Figure 1, where nodes 1 and 2 represent the two terminals and R is the relay.

Let N _A and N _B be the number of antennas at node A and node B, let $x_{A}^{(i)} \in C^{N_{A}}$ and $y_{B}^{(i)} \in C^{N_{B}}$ denote the transmit signal of node A and the receive signal of node B during phase i, respectively, and let $H_{AB} \in C^{N_{B} \times N_{A}}$ denote the channel gain matrix between nodes A and B for all i∈{1,…,6}. Then, the phases are characterized as follows:

(1)
Node 1 transmits to node 2 and the relay:
$\begin{align} y_{R}^{(1)} = H_{1 R} x_{1}^{(1)} + n_{R}^{(1)}, n_{R}^{(1)} \sim N_{C} (0, I_{N_{R}}), \\ y_{2}^{(1)} = H_{12} x_{1}^{(1)} + n_{2}^{(1)}, n_{2}^{(1)} \sim N_{C} (0, I_{N_{2}}) . \end{align}$

(2)
Node 2 transmits to node 1 and the relay:
$\begin{align} y_{R}^{(2)} = H_{2 R} x_{2}^{(2)} + n_{R}^{(2)}, n_{R}^{(2)} \sim N_{C} (0, I_{N_{R}}), \\ y_{1}^{(2)} = H_{21} x_{2}^{(2)} + n_{1}^{(2)}, n_{1}^{(2)} \sim N_{C} (0, I_{N_{1}}) . \end{align}$

(3)
Node 1 and node 2 transmit to the relay:
$\begin{align} y_{R}^{(3)} = H_{1 R} x_{1}^{(3)} + H_{2 R} x_{2}^{(3)} + n_{R}^{(3)}, n_{R}^{(3)} \sim N_{C} (0, I_{N_{R}}) . \end{align}$

(4)
The relay transmits to node 1 and node 2:
$\begin{align} y_{1}^{(4)} = H_{R 1} x_{R}^{(4)} + n_{1}^{(4)}, n_{1}^{(4)} \sim N_{C} (0, I_{N_{1}}), \\ y_{2}^{(4)} = H_{R 2} x_{R}^{(4)} + n_{2}^{(4)}, n_{2}^{(4)} \sim N_{C} (0, I_{N_{2}}) . \end{align}$

(5)
The relay and node 2 transmit to node 1:
$\begin{align} y_{1}^{(5)} = H_{R 1} x_{R}^{(5)} + H_{21} x_{2}^{(5)} + n_{1}^{(5)}, n_{1}^{(5)} \sim N_{C} (0, I_{N_{1}}) . \end{align}$

(6)
The relay and node 1 transmit to node 2:
$\begin{align} y_{2}^{(6)} = H_{R 2} x_{R}^{(6)} + H_{12} x_{1}^{(6)} + n_{2}^{(6)}, n_{2}^{(6)} \sim N_{C} (0, I_{N_{2}}) . \end{align}$

Here, we have assumed that the channels are the same for all network states in order to simplify the notation. This is without loss of generality, however, since we anyhow require all channels to be perfectly known at all nodes for the discussions below. Moreover, the additive white Gaussian noise $n_{A}^{(i)}$ received at node A during phase i is assumed to be independent of the noise $n_{B}^{(j)}$ received at another node B for all phases j∈{1,…,6} and independent of $n_{A}^{(j)}$ for all j≠i.

With each node A that transmits in the i th phase a transmit covariance matrix

\begin{align} R_{A}^{(i)} = E [x_{A}^{(i)} x_{A}^{(i), H}] \end{align}

(1)

is associated, and the average transmit power consumed by the node during this phase is given by $p_{A}^{(i)} = tr (R_{A}^{(i)})$ . Furthermore, if the two nodes A and B transmit simultaneously during phase i, we have a joint transmit covariance matrix

\begin{align} R^{(i)} = E [[\begin{matrix} x_{A}^{(i)} \\ x_{B}^{(i)} \end{matrix}] {[\begin{matrix} x_{A}^{(i)} \\ x_{B}^{(i)} \end{matrix}]}^{H}] = [\begin{matrix} R_{A}^{(i)} & R_{AB}^{(i)} \\ R_{AB}^{(i), H} & R_{B}^{(i)} \end{matrix}] \end{align}

(2)

for this phase. By defining the selection matrices

D_{A}^{(i)} = [\begin{matrix} I_{N_{A}} & 0_{N_{A} \times N_{B}} \end{matrix}], D_{B}^{(i)} = [\begin{matrix} 0_{N_{B} \times N_{A}} & I_{N_{B}} \end{matrix}],

(3)

the transmit covariance matrices $R_{A}^{(i)}$ and $R_{B}^{(i)}$ of the two transmitting nodes can be expressed as linear functions of the joint transmit covariance matrix R ⁽ⁱ⁾:

R_{A}^{(i)} = D_{A}^{(i)} R^{(i)} D_{A}^{(i), H}, R_{B}^{(i)} = D_{B}^{(i)} R^{(i)} D_{B}^{(i), H} .

(4)

3 Outer bound on capacity region

In this section, we establish an outer bound on the capacity region of the restricted half-duplex Gaussian MIMO two-way relay channel and, as our main contribution, propose an efficient method to evaluate it. The outer bound region is obtained by applying the cut-set bound, which was originally derived for the one-way relay channel in [4], to the information flow from node 1 to node 2 as well as to the information flow from node 2 to node 1. In particular, we first consider the cut-set outer bound for the general half-duplex two-way relay channel and then show that, for Gaussian channels, it is equivalent to the cut-set outer bound for the restricted half-duplex two-way relay channel. While it is not known whether the cut-set bound is tight in general, there is no known tighter bound for the relay channel. What is more, it is tight for all classes of relay channels for which the capacity is known. These include the physically degraded and the reversely degraded relay channel [4], the semideterministic relay channel [23], and the relay channel with orthogonal components [24].

Theorem 1

Suppose ( R ₁,R ₂ ) is an achievable rate pair for the half-duplex two-way relay channel, where R ₁ is associated with the rate of the information sent from node 1 to node 2 and R ₂ with that of the reverse direction. Then,

\begin{array}{l} (R_{1}, R_{2}) \in C_{OB} = ⋃_{\prod_{i = 1}^{6} p_{X_{1}^{(i)} X_{2}^{(i)} X_{R}^{(i)}}} \{(C_{1}, C_{2}) \in R_{+}^{2} : τ_{i} \geq 0, \forall i \in {1, \dots, 6}, \sum_{i = 1}^{6} τ_{i} = 1, \\ C_{1} \leq τ_{1} I (X_{1}^{(1)}; Y_{R}^{(1)} Y_{2}^{(1)}) + τ_{3} I (X_{1}^{(3)}; Y_{R}^{(3)} | X_{2}^{(3)}) + τ_{6} I (X_{1}^{(6)}; Y_{2}^{(6)} | X_{R}^{(6)}), \\ C_{1} \leq τ_{1} I (X_{1}^{(1)}; Y_{2}^{(1)}) + τ_{4} I (X_{R}^{(4)}; Y_{2}^{(4)}) + τ_{6} I (X_{1}^{(6)} X_{R}^{(6)}; Y_{2}^{(6)}), \\ C_{2} \leq τ_{2} I (X_{2}^{(2)}; Y_{R}^{(2)} Y_{1}^{(2)}) + τ_{3} I (X_{2}^{(3)}; Y_{R}^{(3)} | X_{1}^{(3)}) + τ_{5} I (X_{2}^{(5)}; Y_{1}^{(5)} | X_{R}^{(5)}), \\ C_{2} \leq τ_{2} I (X_{2}^{(2)}; Y_{1}^{(2)}) + τ_{4} I (X_{R}^{(4)}; Y_{1}^{(4)}) + τ_{5} I (X_{2}^{(5)} X_{R}^{(5)}; Y_{1}^{(5)})\}, \end{array}

(5)

where $X_{A}^{(i)}$ and $Y_{B}^{(i)}$ represent the channel input of node A and the channel output of node B during phase i , respectively, and the duration of the ith phase is denoted by τ _i.

Proof

The result directly follows from ([5], Thm. 1) by considering all six network states (TDD phases) and both directions of data transmission. In particular, the rate bounds originate from the four cut-sets depicted in Figure 2; the first two cut-sets (shown in Figures 2a,b) yield the upper bounds on R ₁, whereas the bounds on R ₂ are determined by the third (Figure 2c) and fourth (Figure 2d) cut-sets. After having identified which of the six protocol phases need to be considered for which cut-set, e.g., phases 1, 3, 6 for the first one, straightforward application of ([5], Thm. 1) gives the constraints specified in (5). □

We remark that the order of the phases in the transmission protocol is irrelevant if we only consider this outer bound region $C_{OB}$ ; only the portion of the time τ _i that phase i is used matters. While it is clear that the optimal joint input distribution factors as $\prod_{i = 1}^{6} p_{X_{1}^{(i)} X_{2}^{(i)} X_{R}^{(i)}}$ , the following proposition additionally shows that $p_{X_{1}^{(3)} X_{2}^{(3)}} = p_{X_{1}^{(3)}} p_{X_{2}^{(3)}}$ maximizes $C_{OB}$ for the Gaussian relay channel.

Proposition 2

The input distribution for phase 3 that maximizes $C_{OB}$ for the half-duplex Gaussian two-way relay channel factors as $p_{X_{1}^{(3)} X_{2}^{(3)}} = p_{X_{1}^{(3)}} p_{X_{2}^{(3)}}$ .

Proof

In the third phase, both node 1 and node 2 transmit to the relay so that the input-output characteristic for the Gaussian relay channel is generally specified by $Y_{R}^{(3)} = f_{1} (X_{1}^{(3)}) + f_{2} (X_{2}^{(3)}) + N_{R}^{(3)}$ , where f ₁ and f ₂ are deterministic functions that represent the transformations of the input signals induced by the channel gains, and where $N_{R}^{(3)}$ denotes the Gaussian noise received at the relay, which is independent of the signals $X_{1}^{(3)}$ and $X_{2}^{(3)}$ . There are two mutual information terms associated with phase 3 in (5): $I (X_{1}^{(3)}; Y_{R}^{(3)} | X_{2}^{(3)})$ in the first condition and $I (X_{2}^{(3)}; Y_{R}^{(3)} | X_{1}^{(3)})$ in the third one. Both of them are maximized if $X_{1}^{(3)}$ and $X_{2}^{(3)}$ are independent, as shown by the following chain of inequalities:

\begin{align} I (X_{1}^{(3)}; Y_{R}^{(3)} | X_{2}^{(3)}) & = h (Y_{R}^{(3)} | X_{2}^{(3)}) - h (Y_{R}^{(3)} | X_{1}^{(3)} X_{2}^{(3)}) \\ = h (f_{1} (X_{1}^{(3)}) + N_{R}^{(3)} | X_{2}^{(3)}) - h (N_{R}^{(3)}) \\ \leq h (f_{1} (X_{1}^{(3)}) + N_{R}^{(3)}) - h (N_{R}^{(3)}), \end{align}

with equality if and only if $X_{1}^{(3)}$ and $X_{2}^{(3)}$ are independent ([25], Cor. to Thm. 8.6.1). The same of course holds if the roles of $X_{1}^{(3)}$ and $X_{2}^{(3)}$ are reversed, which proves the proposition. □

Proposition 2 hence implies that $C_{OB}$ is also the cut-set outer bound for the restricted half-duplex Gaussian two-way relay channel, which requires that $p_{X_{1}^{(3)} X_{2}^{(3)}} = p_{X_{1}^{(3)}} p_{X_{2}^{(3)}}$ as the terminals must not cooperate in encoding their messages. Moreover, it can be shown that Gaussian inputs are optimal for each phase ([26], Prop. 2). Since a Gaussian distribution is completely determined by its mean and covariance, the optimal zero mean input for phase i is specified by R ⁽ⁱ⁾, where $R_{12}^{(3)} = 0_{N_{1} \times N_{2}}$ holds for the optimal R ⁽³⁾ as a consequence of Proposition 2. Note also that the cut-set bound was derived under the assumption of average transmit power constraints on every node, i.e., $\sum_{i = 1}^{6} τ_{i} E [x_{A}^{(i), H} x_{A}^{(i)}] = \sum_{i = 1}^{6} τ_{i} tr (R_{A}^{(i)}) \leq P_{A}$ if P _A denotes the available transmit power node A may consume on average.

Now, let us turn to the main subject of this section and the entire article, which is to evaluate the outer bound region $C_{OB}$ for the Gaussian MIMO relay channel. One way of achieving this, and the one we choose here, is to determine its boundary by solving weighted sum rate (WSR) maximization problems over $C_{OB}$ for different weight vectors $w \in R_{+}^{2}$ . In particular, the boundary of $C_{OB}$ can be determined with arbitrary precision by varying the ratio of the weights $\frac{w_{1}}{w_{2}}$ from zero to infinity ^b. For a given weight vector, the weighted sum rate maximization we then need to solve reads as

\begin{align} {max}_{r} w^{T} r s. t. r \in C_{OB} . \end{align}

(6)

We remark that the maximum of problem (6) is well-defined and that a maximizer $r^{⋆} \in C_{OB}$ exists. This is because $C_{OB}$ is closed and bounded (and thus compact) if the transmit powers P ₁, P ₂, and P _R the nodes may consume on average are finite, which we of course assume below. Hence, Weierstrass’ theorem ([27], Thm. 2.3.1) guarantees that problem (6) attains its maximum.

For the purpose of solving such a WSR maximization problem, we take an approach that is similar to that chosen in [21] and which can be summarized as follows. Since the formulation of (6) is not very convenient if we actually want to perform the optimization, we seek a parameterization that is more suitable to the problem. As a first step towards this end, we find a convex parameterization of the outer bound region $C_{OB}$ in Section 3.1. Since the objective function is linear, we obtain a convex optimization problem for which strong duality holds so that it can equivalently be solved in the dual domain. The corresponding dual problem is derived in Section 3.2. We then choose to solve this dual problem by means of the cutting plane algorithm, which is discussed in Section 3.3. Finally, we need to recover the optimal primal solution from the optimal solution to the dual problem. How this so-called primal reconstruction works for the considered weighted sum rate maximization problem is explained in Section 3.4.

Convex parameterization of outer bound region $C_{OB}$

As a first step towards a convex parameterization of the outer bound region $C_{OB}$ , we define six rate-power regions $S_{1}, \dots, S_{6}$ , one for each phase of the transmission protocol. Basically, $S_{i}$ specifies the contribution of protocol phase i to the outer bound region, both in terms of rates and power consumption. For the Gaussian MIMO relay channel with the optimal Gaussian inputs, the mutual information terms specifying the rates boil down to the well-known log-det expressions. Consequently, the six rate-power regions are given by ^c

\begin{align} S_{1} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{2} + N_{R}} + H_{1} R^{(1)} H_{1}^{H}), \\ r_{2} \leq log det (I_{N_{2}} + H_{12} R^{(1)} H_{12}^{H}), \\ p_{1} = tr (R^{(1)}), p_{2} = 0, p_{3} = 0, \\ R^{(1)} ≽ 0\}, \end{align}

(7)

\begin{align} S_{2} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{1} + N_{R}} + H_{2} R^{(2)} H_{2}^{H}), \\ r_{2} \leq log det (I_{N_{1}} + H_{21} R^{(2)} H_{21}^{H}), \\ p_{1} = 0, p_{2} = tr (R^{(2)}), p_{3} = 0, \\ R^{(2)} ≽ 0\}, \end{align}

(8)

\begin{align} S_{3} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{R}} + H_{1 R} R_{1}^{(3)} H_{1 R}^{H}), \\ r_{2} \leq log det (I_{N_{R}} + H_{2 R} R_{2}^{(3)} H_{2 R}^{H}), \\ p_{1} = tr (R_{1}^{(3)}), p_{2} = tr (R_{2}^{(3)}), p_{3} = 0, \\ R_{1}^{(3)} ≽ 0, R_{2}^{(3)} ≽ 0\}, \end{align}

(9)

\begin{align} S_{4} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{2}} + H_{R 2} R^{(4)} H_{R 2}^{H}), \\ r_{2} \leq log det (I_{N_{1}} + H_{R 1} R^{(4)} H_{R 1}^{H}), \\ p_{1} = 0, p_{2} = 0, p_{3} = tr (R^{(4)}), \\ R^{(4)} ≽ 0\}, \end{align}

(10)

\begin{align} S_{5} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{1}} + H_{21} Q^{(5)} H_{21}^{H}), \\ r_{2} \leq log det (I_{N_{1}} + H_{5} R^{(5)} H_{5}^{H}), \\ p_{1} = 0, p_{2} = tr (D_{2}^{(5)} R^{(5)} D_{2}^{(5), H}), \\ p_{3} = tr (D_{R}^{(5)} R^{(5)} D_{R}^{(5), H}), \\ Q^{(5)} ≽ 0, R^{(5)} - D_{2}^{(5), H} Q^{(5)} D_{2}^{(5)} ≽ 0\} \end{align}

(11)

\begin{align} S_{6} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{2}} + H_{12} Q^{(6)} H_{12}^{H}), \\ r_{2} \leq log det (I_{N_{2}} + H_{6} R^{(6)} H_{6}^{H}), \\ p_{1} = tr (D_{1}^{(6)} R^{(6)} D_{1}^{(6), H}), p_{2} = 0, \\ p_{3} = tr (D_{R}^{(6)} R^{(6)} D_{R}^{(6), H}), \\ Q^{(6)} ≽ 0, R^{(6)} - D_{1}^{(6), H} Q^{(6)} D_{1}^{(6)} ≽ 0\}, \end{align}

(12)

with $H_{1} = {[\begin{matrix} H_{1 R}^{H} & H_{12}^{H} \end{matrix}]}^{H}$ , $H_{2} = {[\begin{matrix} H_{2 R}^{H} & H_{21}^{H} \end{matrix}]}^{H}$ , $H_{5} = [\begin{matrix} H_{21} & H_{R 1} \end{matrix}]$ , $H_{6} = [\begin{matrix} H_{12} & H_{R 2} \end{matrix}]$ and $D_{2}^{(5)}$ , $D_{R}^{(5)}$ , $D_{1}^{(6)}$ , $D_{R}^{(6)}$ being appropriate selection matrices as defined in (3). It is straightforward to verify that $S_{1}, \dots, S_{6}$ are convex sets which are parameterized by means of the (joint) transmit covariance matrices R ⁽¹⁾,…,R ⁽⁶⁾, respectively. They are not compact, however, because neither the rates nor the transmit powers are bounded above. In fact, this is the main difference to the problem considered in [21], where the average transmit powers for each phase and thus also the rate regions associated with each phase are bounded. As a result, the derivation of the dual problem and its solution by means of the cutting plane algorithm become considerably more difficult, as discussed in Sections 3.2 and 3.3.

Remark 1

In order to arrive at above formulations for $S_{5}$ and $S_{6}$ , the corresponding constraints on R ₁ have to be reformulated. This is done by introducing the auxiliary variables Q ⁽⁵⁾ and Q ⁽⁶⁾ to relax the equality constraints on the conditional covariance matrices $R_{2 | R}^{(5)} = R_{2}^{(5)} - R_{2 R}^{(5)} R_{R}^{(5), ‡} R_{2 R}^{(5), H}$ and $R_{1 | R}^{(6)} = R_{1}^{(6)} - R_{1 R}^{(6)} R_{R}^{(6), ‡} R_{1 R}^{(6), H}$ , respectively, before applying the (generalized) Schur complement condition. For more details, we refer the reader to [10], where this reformulation was first presented assuming that $R_{R}^{(5), - 1}$ exists, or to [11], where the same result was later independently derived for the more general case when $R_{R}^{(5)}$ need not have full rank.

Suppose that P ₁, P ₂, and P _R denote the finite transmit powers that terminal 1, terminal 2, and the relay may consume on average, respectively, and let $p_{Tx} = {[\begin{matrix} P_{1} & P_{2} & P_{R} \end{matrix}]}^{T}$ . Having defined the six rate-power regions and the vector p _Tx, we can now rewrite problem (6) as follows:

\begin{array}{l} max_{r, τ_{i}, r_{i}, p_{i}} w^{T} r s. t. Ar \leq \sum_{i = 1}^{6} τ_{i} B_{i} r_{i}, \sum_{i = 1}^{6} τ_{i} p_{i} \leq p_{Tx}, \sum_{i = 1}^{6} τ_{i} = 1, \\ τ_{i} \geq 0, (r_{i}, p_{i}) \in S_{i}, \forall i \in {1, \dots, 6} . \end{array}

(13)

Like in [21], each row of $A = {[\begin{matrix} 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 1 \end{matrix}]}^{T}$ selects one of the four rate constraints as defined in the outer bound region $C_{OB}$ (cf. (5)), and the corresponding rows of the matrices B _i∈{0,1}^4×2 specify the structures of these constraints with regard to the sets $S_{i}$ : $B_{1} = B_{6} = {[\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{matrix}]}^{T}$ , $B_{2} = B_{5} = {[\begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]}^{T}$ , $B_{3} = {[\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{matrix}]}^{T}$ , $B_{4} = {[\begin{matrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]}^{T}$ . Furthermore, the fact that the three nodes are subject to average transmit power constraints is reflected in the term $\sum_{i = 1}^{6} τ_{i} p_{i} \leq p_{Tx}$ , where $p_{i} = {[\begin{matrix} p_{1}^{(i)} & p_{2}^{(i)} & p_{R}^{(i)} \end{matrix}]}^{T}$ is the vector of average transmit powers consumed by the three nodes during phase i.

Remark 2

The optimization problem (13) would be convex for fixed τ ₁,…,τ ₆. The reason it is a nonconvex parameterization of (6) if the time shares are optimization variables is that the functions τ _i B _i r _i and τ _i p _i are not jointly concave in τ _i,r _i and jointly convex in τ _i,p _i, respectively.

Consequently, another reformulation step is required, and for this purpose, we define the set

\begin{array}{l} S = \{(y, z) \in R_{+}^{4} \times R_{+}^{3} : y = \sum_{i = 1}^{6} τ_{i} B_{i} r_{i}, z = \sum_{i = 1}^{6} τ_{i} p_{i}, \sum_{i = 1}^{6} τ_{i} = 1, \\ τ_{i} \geq 0, (r_{i}, p_{i}) \in S_{i}, \forall i \in {1, \dots, 6}\} . \end{array}

(14)

Proposition 3. $S$ is a convex set.

Proof. See Appendix 1. □

Using this definition of $S$ , the weighted sum rate maximization problem (6) is equivalently expressed as

\begin{align} {max}_{r, y, z} w^{T} r s. t. Ar \leq y, z \leq p_{Tx}, (y, z) \in S . \end{align}

(15)

Because $S$ is a convex set with nonempty relative interior, (15) is a convex optimization problem for which strong duality holds ([28], Sec. 5.3.2). In particular, the constraints of problem (15) specify a convex set, which means that a convex parameterization of the outer bound region $C_{OB}$ is given by

\begin{align} C_{OB} = \{r \in R_{+}^{2} : Ar \leq y, z \leq p_{Tx}, (y, z) \in S\} . \end{align}

(16)

Derivation of the dual function

Since we have strong duality for problem (15), we can equivalently solve it in the dual domain. In the approach considered here, the constraints Ar≤y and z≤p _Tx are incorporated into the objective function using the Lagrangian multipliers $λ \in R^{4}$ and $μ \in R^{3}$ . This leads to a dual problem where the six phases are decoupled. In particular, it will show that this approach allows to solve (15) without explicitly optimizing the time allocation parameters τ ₁,…τ ₆. The Lagrangian function reads as

\begin{align} L (r, y, z, λ, μ) = w^{T} r - λ^{T} (Ar - y) - μ^{T} (z - p_{Tx}), \end{align}

(17)

and the resulting dual function is given by

\begin{align} Θ (λ, μ) & = sup_{r, (y, z) \in S} L (r, y, z, λ, μ) \\ = \{\begin{array}{l} μ^{T} p_{Tx} + {sup}_{(y, z) \in S} \{λ^{T} y - μ^{T} z\} & if A^{T} λ = w, \\ + \infty & otherwise . \end{array} \end{align}

(18)

Applying the definition of $S$ yields

\begin{align} {sup}_{(y, z) \in S} \{λ^{T} y - μ^{T} z\} = max_{i = 1, \dots, 6} ({sup}_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\}) . \end{align}

(19)

If none of the channel gain matrices between the two terminals or between one of the terminals and the relay is a zero matrix, we have the following proposition ^d.

Proposition 4

For any μ≥0 and any λ≥0 that satisfies A ^T λ=w, the value of the dual function Θ(λ,μ) is finite if and only if the following three conditions hold:

1.
μ ₁>0or μ ₁=0,λ ₁=λ ₂=0,
2.
μ ₂>0or μ ₂=0,λ ₃=λ ₄=0,
3.
μ ₃>0or μ ₃=0,λ ₂=λ ₄=0.

Proof

See Appendix 2. □

The meaning of Proposition 4 is as follows. For the subproblems

\begin{align} sup_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\}, i \in {1, \dots, 6}, \end{align}

(20)

the Lagrangian multipliers μ ₁, μ ₂, and μ ₃ can be understood as prices associated with the powers P ₁, P ₂, and P _R consumed by node 1, node 2, and the relay, respectively. If all prices are positive, each of the subproblems is guaranteed to have a finite optimal solution because the cost of power μ ^T p _i is a linear function of $p_{1}^{(i)}$ , $p_{2}^{(i)}$ , $p_{R}^{(i)}$ , whereas λ ^T B _i r _i increases only logarithmically with the powers. If one of the prices is zero, however, the transmit power of the corresponding node and the associated transmit data rates can be increased to infinity without incurring any costs. Consequently, the subproblems for all phases i∈{1,…,6} in which this node transmits take the value infinity unless all the entries of the r _i’s to which transmissions by the node contribute are weighted with zero.

Remark 3

Note that λ ₃=λ ₄=0 (λ ₁=λ ₂=0) may result in Θ(λ,μ)<∞ only if w ₂=0 (w ₁=0) because otherwise A ^T λ≠w. If w ₂=0 (w ₁=0), however, the WSR maximization over $C_{OB}$ (6) reduces to maximizing the cut-set bound for the one-way relay channel with terminal 1 (terminal 2) being the source and terminal 2 (terminal 1) being the destination. In particular, w ₂=0 yields $λ_{3}^{⋆} = λ_{4}^{⋆} = 0$ , which in turn implies $μ_{2}^{⋆} = 0$ and

\begin{array}{l} max_{i = 1, \dots, 6} max_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} \\ = max_{i = 1, 6} max_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} \end{array}

(21)

That is, only phases 1 and 6 of our 6-phase protocol need to be considered for the optimal solution, which is equivalent to setting τ ₂=τ ₃=τ ₄=τ ₅=0 in (13). The de facto communication protocol for this case is therefore consistent with that used for the half-duplex one-way relay channel if terminal 1 is the source and terminal 2 is the destination [12, 21]. Similarly, w ₁=0 implies $λ_{1}^{⋆} = λ_{2}^{⋆} = 0$ , $μ_{1}^{⋆} = 0$ , and the optimal solution involves only phases 2 and 5. If w>0, on the other hand, we can conclude from Proposition 4 that Θ(λ,μ)<∞ requires μ ₁>0 and μ ₂>0.

Remark 4

For μ ₃=0, it follows from Proposition 4 that Θ(λ,μ)<∞ only if λ ₂=λ ₄=0. But λ ₂=λ ₄=0 means that transmissions by the relay have no effect on the dual function since the corresponding rates are all weighted with zero. This is independent of P _R and the channel gain matrices, and as a result, we have

\begin{array}{l} max_{i = 1, \dots, 6} max_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} \\ = max_{i = 1, 2, 3} max_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} \end{array}

(22)

in this case. Moreover, it is clear that phase 3 contributes nothing to the bidirectional communication if the relay cannot forward the information it previously received. Hence, the optimal solution could only involve phases 1 and 2 if μ ₃=0, meaning that only the direct link between the terminals would be utilized, and $λ = {[\begin{matrix} w_{1} & 0 & w_{2} & 0 \end{matrix}]}^{T}$ would be the optimizer of the dual problem. But for this λ the primal feasibility and complementary slackness conditions of the primal problem (15) would only be satisfied simultaneously if $X_{2}^{(2)} - Y_{1}^{(2)} - Y_{R}^{(2)}$ and $X_{1}^{(1)} - Y_{2}^{(1)} - Y_{R}^{(1)}$ formed Markov chains. This is an academic special case that our system model does not permit. Consequently, μ ₃>0 if λ≥0 and Θ(λ,μ)<∞.

From Proposition 4 and the two subsequent remarks, it follows that for λ≥0 and positive weight vectors w>0 the dual function is equal to

\begin{align} Θ (λ, μ) & = \{\begin{array}{l} μ^{T} p_{Tx} + max_{i = 1, \dots, 6} (max_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\}) & if A^{T} λ = w, μ > 0, \\ + \infty & otherwise . \end{array} \end{align}

(23)

In order to determine an optimal solution to the original weighted sum rate maximization problem (6), we thus have to solve the dual problem

\begin{align} min_{λ, μ} μ^{T} p_{Tx} & + max_{i = 1, \dots, 6} max_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} \\ s.t. λ \geq 0, A^{T} λ = w, μ > 0 . \end{align}

(24)

Remark 5

Because μ>0, the constraint set of this dual problem is not closed so that the existence of a minimizing solution cannot be guaranteed by Weierstrass’ theorem. However, since the maximum of (6) is well-defined and strong duality holds, the minimum of (24) is also well-defined.

Solution by means of cutting plane algorithm

A simple yet efficient algorithm that can be used to solve the dual problem (24) is the cutting plane algorithm ([27], Sec. 6.4), an outer-approximation method where the feasible set of the problem is approximated by a finite number of feasible points and iteratively refined by a set of linear inequalities. In each iteration of the cutting plane algorithm, a linear program, the so-called master program, must be solved and the dual function Θ(λ,μ) must be evaluated. In the ℓ th iteration, the master program reads as

\begin{array}{l} min_{α, λ, μ} α s.t. α \geq μ^{T} (p_{Tx} - p^{(k)}) + λ^{T} v^{(k)}, \forall k \in {1, \dots, ℓ}, \\ λ \geq 0, A^{T} λ = w, μ > 0, \end{array}

(25)

where, for all k∈{1,…,ℓ}, we have $(r^{(k)}, p^{(k)}) \in S_{i}$ for some i∈{1,…,6} and v ^(k)=B _i r ^(k). As can be seen from (23), evaluating the dual function requires to solve six independent convex optimization problems, one over each of the sets $S_{i}$ associated with the six phases of the communication protocol. For this purpose, standard semidefinite program (SDP) solvers like SDPT3 [29] that are capable of dealing with the weighted sum of log-det terms in the objective function can be applied. For a convergence analysis and more details on the cutting plane method, we refer the reader to ([27], Sec. 6.4).

Remark 6

In order for the cutting plane algorithm to work in practice, we replace the constraint μ>0 by μ≥0. This does not change the optimal solution of the dual problem (24) because we know that the optimizer satisfies μ ^⋆>0. However, proper initialization of the cutting plane method then has to be ensured. In particular, if α ⁽¹⁾,λ ⁽¹⁾,μ ⁽¹⁾ are the optimizers of the master program in the first iteration, we actually have to choose several initial points $(r^{(k)}, p^{(k)}) \in S_{i}$ for some i∈{1,…,6} such that α ⁽¹⁾ is finite and μ ⁽¹⁾>0 is guaranteed. Otherwise, the algorithm runs into problems when the dual function is evaluated.

Remark 7

Since μ ^⋆>0 for w>0, it follows from the complimentary slackness condition of the primal problem (15) that z ^⋆=p _Tx, which means that the three nodes use all their available transmit power. This in turn implies r ^⋆>0 whenever w>0, i.e., the tangents to the boundary of $C_{OB}$ at the optimal unidirectional points (C _1,max,0) and (0,C _2,max) are orthogonal to the axes.

Primal reconstruction

As previously mentioned, the proposed dual decomposition approach allows to determine the optimal value of (6) without explicitly optimizing the time shares allocated to the six phases of the communication protocol. On the one hand, the decoupling of the phases considerably simplifies the optimization, but on the other, we want to know the optimal rate vector r ^⋆ and possibly the optimal time shares $τ_{i}^{⋆}$ , e.g., for the purpose of designing resource allocation protocols. To this end, we need to generate the optimal primal solution from the optimal solution to the dual problem, a process that is generally referred to as primal reconstruction or primal recovery. Since we apply the cutting plane algorithm to solve the dual problem, the primal recovery scheme to obtain the optimal rate vector r ^⋆ and the optimal time shares $τ_{i}^{⋆}$ is fairly simple. Assume the cutting plane algorithm has converged to the optimal solution of the dual problem after L iterations, and consider the dual problem of the corresponding master program (25) given by

\begin{align} {max}_{x, u} w^{T} x s.t. Ax \leq \sum_{k = 1}^{L} u_{k} v^{(k)}, \sum_{k = 1}^{L} u_{k} p^{(k)} \leq p_{Tx}, \\ \sum_{k = 1}^{L} u_{k} = 1, u_{k} \geq 0, \forall k \in {1, \dots, L} . \end{align}

(26)

We remark that this problem is an approximation of the primal problem (15) where the set $S$ is replaced by a convex combination of feasible points $\{(v^{(1)}, p^{(1)}), \dots, (v^{(L)}, p^{(L)})\} \subset S$ and where $x \in R^{2}$ and u _k denote the Lagrangian multipliers associated with the constraints A ^T λ=w and α≥μ ^T(p _Tx−p ^(k))+λ ^T v ^(k) of the master program, respectively. Letting $K_{i} = {\{k : (r^{(k)}, p^{(k)}) \in S_{i}, v^{(k)} = B_{i} r^{(k)}\}}^{e}$ , we can rewrite (26) as

\begin{align} {max}_{x, u} w^{T} x s.t. Ax \leq \sum_{i = 1}^{6} \sum_{k \in K_{i}} u_{k} B_{i} r^{(k)}, \sum_{i = 1}^{6} \sum_{k \in K_{i}} u_{k} p^{(k)} \leq p_{Tx}, \\ \sum_{k = 1}^{L} u_{k} = 1, u_{k} \geq 0, \forall k \in {1, \dots, L} . \end{align}

(27)

Furthermore, it can be shown that $\sum_{k \in K_{i}} u_{k} B_{i} r^{(k)} = (\sum_{k \in K_{i}} u_{k}) B_{i} {\tilde{r}}_{i}$ and $\sum_{k \in K_{i}} u_{k} p^{(k)} = (\sum_{k \in K_{i}} u_{k}) {\tilde{p}}_{i}$ for some $({\tilde{r}}_{i}, {\tilde{p}}_{i}) \in S_{i}$ since $S_{i}$ is a convex set for all i∈{1,…,6}. If we insert these expressions in (26) and compare the result to (13), we can conclude that

\begin{align} τ_{i}^{⋆} = \sum_{k \in K_{i}} u_{k}^{⋆}, \forall i \in {1, \dots, 6} . \end{align}

(28)

The optimal time shares $τ_{i}^{⋆}$ are therefore easily obtained from the optimal Lagrangian dual variables $u_{k}^{⋆}$ , k∈{1,…,L}, that correspond to the constraints α≥μ ^T(p _Tx−p ^(k))+λ ^T v ^(k) in the master program. Moreover, it is clear that x ^⋆, which denotes the vector of optimal dual variables corresponding to the equality constraints A ^T λ=w, yields the optimal rate vector r ^⋆.

Remark 8

Since all $S_{i}$ are convex, time sharing within any of the six phases of the communication protocol is not necessary. As a result, there will be no more than one $k \in K_{i}$ with $u_{k}^{⋆} > 0$ for every i∈{1,…,6}.

An achievable rate region using the DF scheme

To obtain an inner bound on the capacity region of the restricted half-duplex Gaussian MIMO two-way relay channel, we consider the rate region that is achievable with the decode-and-forward coding scheme in this section. Like the cut-set bound, the DF coding scheme is due to Cover and El Gamal [4]. Requiring the relay to decode the source message can be a severe constraint so that other relaying strategies like compress-and-forward or amplify-and-forward can achieve higher rates for certain channel conditions. For single-antenna nodes, this is for example illustrated in [8, 26]. Nevertheless, we consider only the DF strategy in this article because the corresponding achievable rate region $R_{DF}$ is very similar in structure to $C_{OB}$ and can thus be evaluated using the same methodology as described in the previous section.

Theorem 5

If the relay uses the decode-and-forward coding scheme, the following rate region is achievable for the restricted half-duplex two-way relay channel:

\begin{array}{l} R_{DF} = ⋃_{\prod_{i = 1}^{6} p_{X_{1}^{(i)} X_{2}^{(i)} X_{R}^{(i)}}} \{(R_{1}, R_{2}) \in R_{+}^{2} : τ_{i} \geq 0, \forall i \in {1, \dots, 6}, \sum_{i = 1}^{6} τ_{i} = 1, \\ R_{1} \leq τ_{1} I (X_{1}^{(1)}; Y_{R}^{(1)}) + τ_{3} I (X_{1}^{(3)}; Y_{R}^{(3)} | X_{2}^{(3)}) + τ_{6} I (X_{1}^{(6)}; Y_{2}^{(6)} | X_{R}^{(6)}), \\ R_{1} \leq τ_{1} I (X_{1}^{(1)}; Y_{2}^{(1)}) + τ_{4} I (X_{R}^{(4)}; Y_{2}^{(4)}) + τ_{6} I (X_{1}^{(6)} X_{R}^{(6)}; Y_{2}^{(6)}), \\ R_{2} \leq τ_{2} I (X_{2}^{(2)}; Y_{R}^{(2)}) + τ_{3} I (X_{2}^{(3)}; Y_{R}^{(3)} | X_{1}^{(3)}) + τ_{5} I (X_{2}^{(5)}; Y_{1}^{(5)} | X_{R}^{(5)}), \\ R_{2} \leq τ_{2} I (X_{2}^{(2)}; Y_{1}^{(2)}) + τ_{4} I (X_{R}^{(4)}; Y_{1}^{(4)}) + τ_{5} I (X_{2}^{(5)} X_{R}^{(5)}; Y_{1}^{(5)}), \\ R_{1} + R_{2} \leq τ_{1} I (X_{1}^{(1)}; Y_{R}^{(1)}) + τ_{2} I (X_{2}^{(2)}; Y_{R}^{(2)}) + τ_{3} I (X_{1}^{(3)} X_{2}^{(3)}; Y_{R}^{(3)}) \\ + τ_{5} I (X_{2}^{(5)}; Y_{1}^{(5)} | X_{R}^{(5)}) + τ_{6} I (X_{1}^{(6)}; Y_{2}^{(6)} | X_{R}^{(6)})\} . \end{array}

(29)

Proof

This result is derived in [22] by adapting the DF coding scheme to the 6-phase communication protocol introduced in Section 2 (with the phases performed in exactly that order) and applying it to both directions of information transfer. A brief outline of the coding scheme that achieves $R_{DF}$ is given in Appendix 3. □

In theory, a different ordering of the phases may increase the achievable rate region $R_{DF}$ . To the best of our knowledge, however, the 6-phase protocol we use is the most general protocol for the half-duplex two-way relay channel that has been considered in the literature so far. In particular, it includes the 2-phase multiple access broadcast protocol (MABC: consisting of phases 3, 4), the 3-phase time division broadcast protocol (TDBC: 1, 2, 4), and the 4-phase hybrid broadcast protocol (HBC: 1, 2, 3, 4) used in [14–20], for example ^f. In addition, it also covers the approach of using time sharing between the one-way relay channels in both directions to exchange information between the terminals, which we termed one-way time sharing (OWTS: 1, 2, 5, 6) in [21].

Like for the outer bound region $C_{OB}$ , the optimal joint input distribution factors as $\prod_{i = 1}^{6} p_{X_{1}^{(i)} X_{2}^{(i)} X_{R}^{(i)}}$ , where $p_{X_{1}^{(3)} X_{2}^{(3)}} = p_{X_{1}^{(3)}} p_{X_{2}^{(3)}}$ must be fulfilled due to the assumption of the restricted half-duplex two-way relay channel, which prohibits the nodes from cooperating in encoding their messages. Furthermore, the optimal input distribution for each phase i∈{1,…,6} can be shown to be Gaussian again.

Note that, as in Section 3, our main objective is again to evaluate the achievable rate region $R_{DF}$ for the Gaussian MIMO relay channel. Clearly, the boundary of the achievable rate region $R_{DF}$ can also be determined by means of solving WSR maximization problems with different weight vectors. As $R_{DF}$ and $C_{OB}$ are very similar in structure, the approach we use to solve one such problem is essentially the same as for the outer bound region. First, we find a convex parameterization for $R_{DF}$ . Subsequently, we solve the problem in the dual domain by means of the cutting plane algorithm, and finally, we perform the primal reconstruction.

For the purpose of deriving a convex parameterization for $R_{DF}$ , let

\begin{align} S_{1}^{'} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{R}} + H_{1 R} R^{(1)} H_{1 R}^{H}), \\ r_{2} \leq log det (I_{N_{2}} + H_{12} R^{(1)} H_{12}^{H}), \\ p_{1} = tr (R^{(1)}), p_{2} = 0, p_{3} = 0, \\ R^{(1)} ≽ 0\}, \end{align}

(30)

\begin{align} S_{2}^{'} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{R}} + H_{2 R} R^{(2)} H_{2 R}^{H}), \\ r_{2} \leq log det (I_{N_{1}} + H_{21} R^{(2)} H_{21}^{H}), \\ p_{1} = 0, p_{2} = tr (R^{(2)}), p_{3} = 0, \\ R^{(2)} ≽ 0\}, \end{align}

(31)

\begin{align} S_{3}^{'} = \{(r, p) \in R_{+}^{2} \times R_{+}^{3} : & r_{1} \leq log det (I_{N_{R}} + H_{1 R} R_{1}^{(3)} H_{1 R}^{H}), \\ r_{2} \leq log det (I_{N_{R}} + H_{2 R} R_{2}^{(3)} H_{2 R}^{H}), \\ r_{1} + r_{2} \leq log det (I_{N_{R}} + H_{1 R} R_{1}^{(3)} H_{1 R}^{H} \\ + H_{2 R} R_{2}^{(3)} H_{2 R}^{H}), \\ p_{1} = tr (R_{1}^{(3)}), p_{2} = tr (R_{2}^{(3)}), p_{3} = 0, \\ R_{1}^{(3)} ≽ 0, R_{2}^{(3)} ≽ 0\}, \end{align}

(32)

and $S_{i}^{'} = S_{i}$ for i∈{4,5,6}. Like $S_{i}$ defined in the previous section, every $S_{i}^{'}$ is a convex set that is parameterized by means of the (joint) transmit covariance matrix R ⁽ⁱ⁾ and that specifies the contribution of phase i to $R_{DF}$ . Having defined these unbounded convex sets $S_{i}^{'}$ , we can now express the weighted sum rate maximization problem that yields a point on the boundary of $R_{DF}$ as follows:

\begin{array}{l} max_{r, τ_{i}, r_{i}, p_{i}} w^{T} r s. t. A^{'} r \leq \sum_{i = 1}^{6} τ_{i} B_{i}^{'} r_{i}, \sum_{i = 1}^{6} τ_{i} p_{i} \leq p_{Tx}, \sum_{i = 1}^{6} τ_{i} = 1, \\ τ_{i} \geq 0, (r_{i}, p_{i}) \in S_{i}^{'}, \forall i \in {1, \dots, 6} . \end{array}

(33)

Observe that the main difference compared to (13) is the additional constraint on the sum rate R ₁+R ₂ in $R_{DF}$ so that $A^{'} = {[\begin{matrix} 1 & 1 & 0 & 0 & 1 \\ 0 & 0 & 1 & 1 & 1 \end{matrix}]}^{T}$ and $B_{1}^{'} = B_{6}^{'} = {[\begin{matrix} 1 & 0 & 0 & 0 & 1 \\ 0 & 1 & 0 & 0 & 0 \end{matrix}]}^{T}$ , $B_{2}^{'} = B_{5}^{'} = {[\begin{matrix} 0 & 0 & 1 & 0 & 1 \\ 0 & 0 & 0 & 1 & 0 \end{matrix}]}^{T}$ , $B_{3}^{'} = {[\begin{matrix} 1 & 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 & 1 \end{matrix}]}^{T}$ , $B_{4}^{'} = {[\begin{matrix} 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \end{matrix}]}^{T}$ . This constraint comes from the third phase of the communication protocol, a multiple access phase where both terminals transmit to the relay. The sum rate constraint in $R_{DF}$ occurs because the relay must decode the messages from node 1 and node 2 when it uses DF.

Since all results from Section 3 apply here accordingly ^g, the remaining steps of the optimization follow along the same lines as for the outer bound region $C_{OB}$ . First, we define the convex set

\begin{array}{l} S^{'} = \{(y, z) \in R_{+}^{5} \times R_{+}^{3} : y = \sum_{i = 1}^{6} τ_{i} B_{i}^{'} r_{i}, z = \sum_{i = 1}^{6} τ_{i} p_{i}, \\ \sum_{i = 1}^{6} τ_{i} = 1, τ_{i} \geq 0, (r_{i}, p_{i}) \in S_{i}^{'}, \forall i \in {1, \dots, 6}\} \end{array}

(34)

and reformulate (33) as

\begin{align} {max}_{r, y, z} w^{T} r s.t. A^{'} r \leq y, z \leq p_{Tx}, (y, z) \in S^{'} . \end{align}

(35)

Then, we use the dual decomposition approach in combination with the cutting plane method to obtain an optimal solution to this convex optimization problem.

Note that, after having obtained the solution, the optimal time shares $τ_{i}^{⋆}$ , i∈{1,…,6}, i.e., the optimal durations of the six protocol phases, tell us which of these phases are part of the optimal transmission protocol for a given weight vector w. In particular, the optimal protocol includes phase i if and only if $τ_{i}^{⋆} > 0$ . Furthermore, our dual decomposition approach cannot only be applied to WSR maximization problems, but to any convex optimization problem for which strong duality holds. As a result, this approach may be used for the design of resource allocation protocols, e.g., by considering utility maximization problems with concave utility functions.

Numerical results

In this section, numerical results yielding bounds on the capacity of the half-duplex Gaussian relay channel as well as numerical results giving bounds on the capacity region of the restricted half-duplex Gaussian two-relay channel are presented. More specifically, we evaluate and compare the outer bound region $C_{OB}$ and the rate region $R_{DF}$ that can be achieved with the relay using the decode-and-forward scheme for different scenarios in the two-way case. For unidirectional communication, these regions reduce to the cut-set bound C _OB and the achievable rate R _DF, which give upper and lower bounds on the capacity of the half-duplex Gaussian one-way relay channel.

As an example scenario, let us consider the line network depicted in Figure 3. This is a simple but commonly used geometry (cf. [20, 26]) where the distance d ₁₂=1 between the terminals is fixed and the relay is positioned on the line connecting the two terminals such that d _1R=|d| and d _2R=|1−d|. Furthermore, it is assumed that each node may consume the same transmit power P ₁=P ₂=P _R=10 on average, which for instance is a reasonable assumption in ad hoc networks. Finally, we assume that the path loss exponent is equal to α=4, which is a typical value for urban macrocell environments or multi-level office buildings (cf. [30], Table 2.2), and that all channel coefficients are perfectly known at all nodes.

Within this framework, two different relay network configurations are considered. In the first one, all nodes have a single antenna and the real-valued scalar channel coefficients are specified by $h_{AB} = d_{AB}^{- α / 2}$ , which of course implies h _AB=h _BA. Note that, due to the assumption of real-valued channels, all rate vectors obtained with the presented optimization framework have to be divided by 2 since the rates are specified by $\frac{1}{2} log (\cdot)$ in this case as opposed to log(·) for complex-valued channels. In the second configuration, all nodes are equipped with two antennas. The channel gain matrices are then assumed to be complex random and independent, where the entries of H _AB are independent and identically distributed complex Gaussian random variables with zero mean and variance $d_{AB}^{- α}$ . In addition, we assume that the channels are reciprocal, i.e., $H_{AB} = H_{BA}^{T}$ .

For both the single- and the multi-antenna scenario, Figure 4 shows the cut-set outer bound C _OB and the achievable DF rate R _DF for the half-duplex one-way relay channel over the distance d=d _1R between terminal 1 and the relay. Here, we have assumed that terminal 1 is the source and that terminal 2 is the destination of the unidirectional communication, which means that only phases 1 and 6 of the 6-phase communication protocol are used. We remark that the results for the multi-antenna case are averaged over 1000 independent channel realizations. For comparison, the best outer bound C _OB,PP and the best achievable DF rate R _DF,PP that can be obtained if the source and the relay are subject to per protocol phase transmit power constraints of the form $tr (R_{A}^{(i)}) \leq P_{A}, i \in {1, 6}$ , are plotted as well. Note that this condition is more restrictive than the average transmit power constraint $\sum_{i = 1, 6} τ_{i} tr (R_{A}^{(i)}) \leq P_{A}$ with τ ₁,τ ₆≥0,τ ₁+τ ₆=1 so that C _OB≥C _OB,PP and R _DF≥R _DF,PP.

It can be observed from Figure 4 that the decode-and-forward strategy achieves capacity if the relay is close enough to the source, which is a well-known fact that has previously been noted for the full-duplex case, e.g., in [26]. We also see that the optimal relay positions lie in the range 0.3≤d≤0.5, with the optimal values of d being almost the same for both power constraints. These observations are to be interpreted with caution, however, as the optimal relay position heavily depends on the path loss coefficient as well as the available transmit powers. Another non-surprising observation is that, although a factor of 2 is due to the fact that we use real-valued channels for the single-antenna configuration, substantial rate gains can be achieved without increasing P ₁ or P _R when multiple antennas are used at each node. More interestingly, the gap between C _OB and C _OB,PP as well as that between R _DF and R _DF,PP vanishes when the relay is moved closer to the destination. This can be explained as follows. The source-relay link, and thus the phase in which the relay listens to the source, increasingly becomes the bottleneck of the information transfer with increasing d. As d approaches d ₁₂=1, the optimal time share $τ_{1}^{⋆}$ of phase 1 also approaches 1. Hence, the relay power and the transmit power constraint imposed on the relay have no effect on the optimal solution. Furthermore, the average transmit power constraint imposed on the source becomes $τ_{1}^{⋆} tr (R_{1}^{(1)}) + τ_{6}^{⋆} tr (R_{1}^{(6)}) \approx tr (R_{1}^{(1)}) \leq P_{1}$ , i.e., it basically amounts to a per phase power constraint for phase 1.

For the bidirectional communication in the half-duplex two-way relay channel, we consider three different relay positions: (a) the relay is exactly in the middle between the two terminals (d=0.5); (b) the relay is placed near terminal 1 (d=0.25); (c) the relay is very close to terminal 1 (d=0.1). For each of these scenarios, Figures 5 (single-antenna) and 6 (multi-antenna, results for one particular random channel realization) show the achievable DF rate regions $R_{DF}$ and the outer bound regions $C_{OB}$ . For comparison, the best achievable DF rate regions $R_{DF,PP}$ and the best outer bound regions $C_{OB,PP}$ that can be obtained with per phase power constraints imposed on all nodes are also illustrated. Like for unidirectional transmission, we observe that $C_{OB} \supset C_{OB,PP}$ and $R_{DF} \supset R_{DF,PP}$ for all scenarios since the average power constraint $\sum_{i = 1}^{6} τ_{i} tr (R_{A}^{(i)}) \leq P_{A}$ with τ _i≥0,∀i∈{1,…,6}, and $\sum_{i = 1}^{6} τ_{i} = 1$ is less restrictive than the per phase power constraint $tr (R_{A}^{(i)}) \leq P_{A}, \forall i \in {1, \dots, 6}$ .

First of all, note that the results shown in Figures 5 and 6 allow to draw the same conclusions as for the one-way case: If the relay is close enough to terminal 1, the decode-and-forward scheme achieves the cut-set bound for the unidirectional communication from terminal 1 to terminal 2, i.e., R _1,max=C _1,max, regardless of whether we consider average or per phase transmit power constraints. Furthermore, the same R _2,max (or C _2,max) is obtained for both types of power constraints when d _2R approaches 1. Beyond that, a noteworthy observation is that the greatest benefit of the less restrictive average transmit power constraints is obtained if we are interested in the sum rate R ₁+R ₂, whereas the performance improvement is less pronounced for asymmetric rate requirements. Finally, observe that the gaps between the boundaries of $C_{OB}$ and $C_{OB,PP}$ are like the gaps between the boundaries of $R_{DF}$ and $R_{DF,PP}$ for all ratios $\frac{R_{1}}{R_{2}}$ and all scenarios considered here.

In order to assess the complexity of determining the achievable rate regions and outer bound regions, Table 1 illustrates the average number of iterations the cutting plane algorithm needs per weighted sum rate maximization problem until it converges for the different scenarios in the multi-antenna case ^h. Here, the parameter ϵ that specifies the absolute accuracy of the optimal value was set to 10⁻². Note that the number of required iterations is very small if we consider the per protocol phase transmit power constraint. Unfortunately, the numbers roughly triple with the average power constraint that yields the information theoretic bounds on the capacity and the capacity region of the half-duplex Gaussian relay channel and the half-duplex Gaussian two-way relay channel, respectively. The main reason for this is that we need more dual variables to formulate the dual problem in the latter case. Since the number of required iterations remains reasonably small, however, these results confirm that the proposed dual decomposition approach indeed allows to efficiently evaluate achievable rate regions and corresponding outer bounds for the considered half-duplex Gaussian relay networks. Assuming knowledge of all channel gain matrices, it is hence possible to numerically evaluate their fundamental limits.

Table 1 Average number of cutting plane iterations needed per weighted sum rate maximization problem ( N ₁ =N ₂ =N _R =2 and ϵ=10 ⁻² )

Full size table

Conclusion

In this article, we presented a generic method that allows to determine the fundamental limits of uni- and bidirectional communication in the half-duplex Gaussian MIMO relay channel. More specifically, we proposed a dual decomposition approach to evaluate upper and lower bounds on the capacity or the capacity region of the considered MIMO relay channels, for which perfect channel state information (CSI) was assumed. To this end, we modified the approach that was previously proposed in [21] such that the average transmit power constraints under which the cut-set outer bound and the achievable decode-and-forward (DF) rates were derived can be handled. It was shown that the joint optimization of input signals and time allocation decomposes into subproblems that are easier to solve in the dual domain, and we gave an example of how to solve the resulting dual problem by means of the cutting plane algorithm. The beauty of the proposed approach lies in the fact that the phases of the respective communication protocol decouple in the dual problem. As a result, evaluating the dual function only requires to solve one convex problem for each phase of the communication protocol, which can be done by applying standard semidefinite program (SDP) tools like SDPT3. It is this property that makes dual decomposition so attractive here, especially since the cutting plane algorithm converges after a reasonably small number of iterations.

Furthermore, we remark that our results may be used for protocol design with DF relays in the future. For the one-way case, we can determine what fraction of time the relay should listen to the source and how long it should transmit. For the two-way case, the benefit of our approach is even greater. By not restricting ourselves to any specific protocol from the outset, we let an optimization problem determine which protocol phases should be used and for what fraction of time they should be active to obtain the best performance. At the same time, the approach allows to evaluate any specific communication protocol. All we need to do is set the time shares of the phases that shall not be part of the considered protocol to zero.

Finally, note that average and per phase transmit power constraints can easily be combined using the framework presented in this article. For this purpose, we simply need to add the per phase transmit power constraints to the definitions of the sets $S_{i}$ and $S_{i}^{'}$ that specify the contributions of the different protocol phases to the outer bound region and the achievable rate region, respectively. Since the sets are then bounded, Proposition 4 becomes obsolete as we do not need a condition on the dual variables μ for the dual function to be finite. The per phase power constraints considered in [21] can therefore easily be incorporated into the optimization framework presented in this article. Since the converse is not true, the optimization approach presented here generalizes that of [21].

Endnotes

^aIn contrast to full-duplex devices, half-duplex nodes cannot transmit and receive simultaneously in the same frequency band, which means that they require orthogonal resources (time, frequency) for transmission and reception. ^bAnother option to determine points on the boundary of the outer bound region would be to solve rate balancing problems over $C_{OB}$ for different ratios of the two rates. ^cNote that R ₁ and R ₂ only denote two entries of the sets $S_{1}, \dots, S_{6}$ . They are not to be confused with R ₁ and R ₂, which specify the rates of the information exchanged by nodes 1 and 2. ^dNote that this assumption is not really a restriction. If the relay is not connected to both terminals, it cannot help the communication between the terminals. And while the direct channel between the terminals may be very weak, e.g., due to high path loss, it is still reasonable to assume it supports rates strictly greater than zero. ^eIf there exists an (r ^(k),p ^(k)) such that $(r^{(k)}, p^{(k)}) \in S_{i}$ and v ^(k)=B _i r ^(k) for more than one i∈{1,…,6}, we assign the index k to only one set $K_{i}$ so that $K_{i} \cap K_{j} = \emptyset$ for i≠j. ^fThe protocol names are due to [19, 20], which are the only two articles among references [14–20] that do not only consider the multiple access broadcast (MABC) protocol. ^gThe reasoning why μ ₃>0 must hold for Θ(λ,μ)<∞ is more complicated in this case since λ ₂=λ ₄=0 does not imply (22). However, the final conclusion remains the same. ^hIn order to obtain the results for the one-way case, we simply let $w = [\begin{matrix} 1 \\ 0 \end{matrix}]$ and considered only phases 1 and 6 in the evaluation of the dual function as explained in Remark 3.

Appendix 1

Proof of Proposition 3

Let $(y, z), (y^{'}, z^{'}) \in S$ and λ∈[0,1]. Moreover, define α _i=λ τ _i and $β_{i} = (1 - λ) τ_{i}^{'}$ . Then,

\begin{align} λ y + (1 - λ) y^{'} & = λ \sum_{i = 1}^{6} τ_{i} B_{i} r_{i} + (1 - λ) \sum_{i = 1}^{6} τ_{i}^{'} B_{i} r_{i}^{'} \\ = \sum_{i = 1}^{6} B_{i} (α_{i} r_{i} + β_{i} r_{i}^{'}) \\ = \sum_{i = 1}^{6} (α_{i} + β_{i}) B_{i} (\frac{α_{i}}{α_{i} + β_{i}} r_{i} + \frac{β_{i}}{α_{i} + β_{i}} r_{i}^{'}) \end{align}

and

\begin{align} λ z + (1 - λ) z^{'} & = λ \sum_{i = 1}^{6} τ_{i} p_{i} + (1 - λ) \sum_{i = 1}^{6} τ_{i}^{'} p_{i}^{'} = \sum_{i = 1}^{6} α_{i} p_{i} + β_{i} p_{i}^{'} \\ = \sum_{i = 1}^{6} (α_{i} + β_{i}) (\frac{α_{i}}{α_{i} + β_{i}} p_{i} + \frac{β_{i}}{α_{i} + β_{i}} p_{i}^{'}) . \end{align}

Since α _i,β _i≥0, $(r_{i}, p_{i}), (r_{i}^{'}, p_{i}^{'}) \in S_{i}$ , and $S_{i}$ is convex, it follows that $\frac{α_{i}}{α_{i} + β_{i}} (r_{i}, p_{i}) + \frac{β_{i}}{α_{i} + β_{i}} (r_{i}^{'}, p_{i}^{'}) \in S_{i}$ , i.e.,

\begin{align} λ y + (1 - λ) y^{'} & = \sum_{i = 1}^{6} (α_{i} + β_{i}) B_{i} {\tilde{r}}_{i}, \\ λ z + (1 - λ) z^{'} & = \sum_{i = 1}^{6} (α_{i} + β_{i}) {\tilde{p}}_{i}, where ({\tilde{r}}_{i}, {\tilde{p}}_{i}) \in S_{i} . \end{align}

Furthermore, 0≤α _i+β _i≤1, ∀i∈{1,…,6}, and $\sum_{i = 1}^{6} α_{i} + β_{i} = \sum_{i = 1}^{6} (λ τ_{i} + (1 - λ) τ_{i}^{'}) = 1$ , which means that $λ (y, z) + (1 - λ) (y^{'}, z^{'}) \in S$ . This proves the proposition. □

Appendix 2

Proof of Proposition 4

For any λ≥0 such that $A^{T} λ = [\begin{matrix} λ_{1} + λ_{2} \\ λ_{3} + λ_{4} \end{matrix}] = [\begin{matrix} w_{1} \\ w_{2} \end{matrix}] = w$ , note that Θ(λ,μ)<∞ is equivalent to ${sup}_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} = {max}_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} < \infty$ for all i∈{1,…,6}. For μ>0, we hence prove the “if” part of the proposition by exemplarily showing that ${sup}_{(r_{i}, p_{i}) \in S_{i}} \{λ^{T} B_{i} r_{i} - μ^{T} p_{i}\} < \infty$ for i=1 as corresponding statements for i=2,…,6 follow along the same lines.

With $B_{1} = {[\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{matrix}]}^{T}$ and only terminal 1 transmitting during phase 1, we have

\begin{array}{l} max_{(r, p) \in S_{1}} \{λ^{T} B_{1} r - μ^{T} p\} \\ = max_{(r, p) \in S_{1}} \{λ_{1} r_{1} + λ_{2} r_{2} - μ_{1} p_{1}\} \leq max_{(r, p) \in S_{1}} \{(λ_{1} + λ_{2}) r_{1} - μ_{1} p_{1}\} \\ = max_{R^{(1)} ≽ 0} (λ_{1} + λ_{2}) log det (I_{N_{2} + N_{R}} + H_{1} R^{(1)} H_{1}^{H}) - μ_{1} tr (R^{(1)})\} \\ = max_{R^{(1)} ≽ 0} (λ_{1} + λ_{2}) log det (I_{N_{1}} + H_{1}^{H} H_{1} R^{(1)}) - μ_{1} tr (R^{(1)})\}, \end{array}

where the inequality is due to the fact that $I (X_{1}^{(1)}; Y_{R}^{(1)} Y_{2}^{(1)}) = I (X_{1}^{(1)}; Y_{2}^{(1)}) + I (X_{1}^{(1)}; Y_{R}^{(1)} | Y_{2}^{(1)}) \geq I (X_{1}^{(1)}; Y_{2}^{(1)})$ , which follows from the nonnegativity of and the chain rule for mutual information ([25], Chap. 2). Now, suppose $H_{1}^{H} H_{1} = V Φ V^{H}$ with $Φ = diag (φ_{1}, \dots, φ_{N_{1}})$ is the eigenvalue decomposition of $H_{1}^{H} H_{1}$ , and let us also express R ⁽¹⁾ by means of its eigenvalue decomposition R ⁽¹⁾=U Σ U ^H. Then, the trace of R ⁽¹⁾ is independent of the modal matrix U and equal to the sum of its nonnegative eigenvalues $σ_{1}, \dots, σ_{N_{1}}$ . Moreover, Hadamard’s inequality ([25], Thm. 17.9.2) can be used to show that, with U=V,

\begin{align} max_{R^{(1)} ≽ 0} \{(λ_{1} + λ_{2}) log det (I_{N_{1}} + H_{1}^{H} H_{1} R^{(1)}) - μ_{1} tr (R^{(1)})\} \\ = max_{σ_{k} \geq 0} \sum_{k = 1}^{N_{1}} (λ_{1} + λ_{2}) log (1 + φ_{k} σ_{k}) - μ_{1} σ_{k} . \end{align}

For μ ₁>0, the right-hand side of above equality has a waterfilling type solution given by

\begin{align} σ_{k}^{⋆} = max \{\frac{λ_{1} + λ_{2}}{μ_{1}} - \frac{1}{φ_{k}}, 0\}, \end{align}

which implies that $0 \leq σ_{k}^{⋆} \leq \frac{λ_{1} + λ_{2}}{μ_{1}} < \infty$ for all k∈{1,…,N ₁}, and consequently,

\begin{align} {max}_{(r, p) \in S_{1}} λ^{T} B_{1} r - μ^{T} p\} & \leq \sum_{k = 1}^{N_{1}} (λ_{1} + λ_{2}) log (1 + φ_{k} σ_{k}^{⋆}) \\ - μ_{1} σ_{k}^{⋆} < ∞. \end{align}

The proofs of the converse and the “if” part of the proposition for μ≯0 are omitted because they directly follow from the necessary and sufficient conditions for Θ(λ,μ)<∞ if μ _k=0, k∈{1,2,3}. □

Appendix 3

Outline of coding scheme that achieves $R_{DF}$

The achievability of $R_{DF}$ is proved in [22] for a discrete memoryless channel (DMC) without feedback. The coding scheme uses random encoding and jointly typical decoding on the n th extension of the DMC (see [25], Sec. 7.5 for a definition), meaning that the data transmission is performed with n channel uses. Furthermore, it is assumed that TDD phase i is used n _i times, where $\frac{n_{i}}{n} \to τ_{i} \in [0, 1]$ as n grows large.

The message $W_{1} \in {1, \dots, 2^{n R_{1}}}$ is to be transmitted from node 1 to node 2, whereas $W_{2} \in {1, \dots, 2^{n R_{2}}}$ denotes the message to be sent from terminal 2 to terminal 1 that is independent of W ₁. Both messages are split into six parts: W ₁=(W ₁₁,…,W ₁₆) and W ₂=(W ₂₁,…,W ₂₆) such that $W_{1 a} \in {1, \dots, 2^{n R_{1 a}}}$ and $W_{2 b} \in {1, \dots, 2^{n R_{2 b}}}$ , a,b∈{1,…,6}. The messages are then conveyed to the other terminal as follows:

Phase 1: Node 1 transmits a codeword $X_{1}^{(1)} (W_{11}, W_{12}, W_{13})$ .

Phase 2: Node 2 transmits a codeword $X_{2}^{(2)} (W_{21}, W_{22}, W_{23})$ .

Phase 3: Node 1 transmits a codeword $X_{1}^{(3)} (W_{14}, W_{15})$ and node 2 sends $X_{2}^{(3)} (W_{24}, W_{25})$ . The two codewords are independent!

After phase 3, the relay reliably decodes the messages (W ₁₁,…,W ₁₅) and (W ₂₁,…,W ₂₅), which requires

\begin{aligned} R_{11} + R_{12} + R_{13} & < τ_{1} I (X_{1}^{(1)}; Y_{R}^{(1)}), \\ R_{21} + R_{22} + R_{23} & < τ_{2} I (X_{2}^{(2)}; Y_{R}^{(2)}), \\ R_{14} + R_{15} & < τ_{3} I (X_{1}^{(3)}; Y_{R}^{(3)} | X_{2}^{(3)}), \\ R_{24} + R_{25} & < τ_{3} I (X_{2}^{(3)}; Y_{R}^{(3)} | X_{1}^{(3)}), \\ R_{14} + R_{15} + R_{24} + R_{25} & < τ_{3} I (X_{1}^{(3)} X_{2}^{(3)}; Y_{R}^{(3)}) . \end{aligned}

Phase 4: The relay transmits a codeword $X_{R}^{(4)} (W_{11}, W_{14}, W_{21}, W_{24})$ .

Phase 5: The relay sends a codeword $X_{R}^{(5)} (W_{22}, W_{25})$ , whereas node 2 transmits $X_{2}^{(5)} (W_{22}, W_{25}, W_{26})$ . Note that the two codewords are not independent, but correlated by design in general!

Phase 6: The relay sends a codeword $X_{R}^{(6)} (W_{12}, W_{15})$ and node 1 transmits $X_{1}^{(6)} (W_{12}, W_{15}, W_{16})$ . Again, note that the two codewords are not independent, but correlated by design in general!

After phase 6, each terminal reliably decodes all parts of the message transmitted by the respective other terminal. Reliable decoding at terminal 1 imposes the conditions

\begin{align} R_{21} + R_{24} & < τ_{4} I (X_{R}^{(4)}; Y_{1}^{(4)}), \\ R_{22} + R_{25} & < τ_{5} I (X_{R}^{(5)}; Y_{1}^{(5)}), \\ R_{26} & < τ_{5} I (X_{2}^{(5)}; Y_{1}^{(5)} | X_{R}^{(5)}), \\ R_{23} & < τ_{2} I (X_{2}^{(2)}; Y_{1}^{(2)}), \end{align}

whereas reliable decoding at terminal 2 requires

\begin{align} R_{11} + R_{14} & < τ_{4} I (X_{R}^{(4)}; Y_{2}^{(4)}), \\ R_{12} + R_{15} & < τ_{6} I (X_{R}^{(6)}; Y_{2}^{(6)}), \\ R_{16} & < τ_{6} I (X_{1}^{(6)}; Y_{2}^{(6)} | X_{R}^{(6)}), \\ R_{13} & < τ_{1} I (X_{1}^{(1)}; Y_{2}^{(1)}) . \end{align}

Noting that $R_{1} = \sum_{a = 1}^{6} R_{1 a}$ , $R_{2} = \sum_{b = 1}^{6} R_{2 b}$ , putting all constraints together, and taking the closure of the resulting achievable rate region yields $R_{DF}$ .

While the achievable rate region $R_{DF}$ was derived for a DMC, we remark that Theorem 5 remains valid for channel models with continuous random variables. This is because the decode-and-forward strategy can be derived by means of weakly typical sequences and since the concept of weak typicality applies to continuous random variables as well (cf. [26], Rem. 28).

References

Foschini GJ, Gans MJ: On limits of wireless communications in a fading environment when using multiple antennas. Wirel Personal Commun 1998, 6: 311-335. 10.1023/A:1008889222784
Article Google Scholar
Telatar IE: Capacity of multi-antenna Gaussian channels. Europ. Trans. Telecommun 1999, 10: 585-595. 10.1002/ett.4460100604
Article Google Scholar
van der Meulen EC: Three-terminal communication channels. Adv. Appl. Probab 1971, 3: 120-154. 10.2307/1426331
Article MathSciNet MATH Google Scholar
Cover TM, EL Gamal A: Capacity theorems for the relay channel. IEEE Trans. Inf. Theory 1979, 25(5):572-584. 10.1109/TIT.1979.1056084
Article MathSciNet MATH Google Scholar
Khojastepour MA, Sabharwal A, Aazhang B: Bounds on achievable rates for general multi-terminal networks with practical constraints. In Information Processing in Sensor Networks, Volume 2634 of Lecture Notes in Computer Science. Edited by: Zhao F, Guibas L. Berlin: Springer; 2003:146-161.
Google Scholar
Khojastepour MA, Sabharwal A, Aazhang B: On the capacity of ‘Cheap’ relay networks. In 37th Annual Conference on Information Sciences and Systems (CISS). Baltimore; 2003.
Google Scholar
Khojastepour MA, Sabharwal A, Aazhang B: On capacity of Gaussian ‘Cheap’ relay channel. In IEEE Global Telecommunications Conference (GLOBECOM). San Francisco; 2003:1776-1780.
Google Scholar
Host-Madsen A, Zhang J: Capacity bounds and power allocation for wireless relay channels. IEEE Trans. Inf. Theory 2005, 51(6):2020-2040. 10.1109/TIT.2005.847703
Article MathSciNet MATH Google Scholar
Wang B, Zhang J, Host-Madsen A: On the capacity of MIMO relay channels. IEEE Trans. Inf. Theory 2005, 51: 29-43.
Article MathSciNet MATH Google Scholar
Ng CTK, Foschini GJ: Transmit signal and bandwidth optimization in multiple-antenna relay channels. IEEE Trans. Commun 2011, 59(11):2987-2992.
Article Google Scholar
Gerdes L, Utschick W: Optimized capacity bounds for the MIMO relay channel. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Prague; 2011:3336-3339.
Google Scholar
Gerdes L, Utschick W: Optimized capacity bounds for the half-duplex Gaussian MIMO relay channel. In International ITG Workshop on Smart Antennas (WSA). Aachen; 2011.
Google Scholar
Simoens S, Muñoz-Medina O, Vidal J, del Coso A: On the Gaussian MIMO relay channel with full channel state information. IEEE Trans. Signal Process 2009, 57(9):3588-3599.
Article MathSciNet Google Scholar
Rankov B, Wittneben A: Spectral efficient signaling for half-duplex relay channels. In 39th Asilomar Conference on Signals, Systems and Computers. Monterey; 2005:1066-1071.
Google Scholar
Hammerstroem I, Kuhn M, Esli C, Zhao J, Wittneben A, Bauch G: MIMO two-way relaying with transmit CSI at the relay. IEEE Workshop on Signal Process. Advances in Wireless Communications (SPAWC) (Helsinki) 2007.
Google Scholar
Oechtering TJ, Schnurr C, Bjelakovic I, Boche H: Broadcast capacity region of two-phase bidirectional relaying. IEEE Trans. Inf. Theory 2008, 54: 454-458.
Article MathSciNet MATH Google Scholar
Schnurr C, Oechtering TJ, Stanczak S: Achievable rates for the restricted half-duplex two-way relay channel. In 41st Asilomar Conference on Signals, Systems and Computers. Monterey; 2007:1468-1472.
Google Scholar
Oechtering TJ, Wyrembelski RF, Boche H: On the optimal transmission for the MIMO bidirectional broadcast channel. In IEEE International Conference on Communications (ICC). Dresden; 2009.
Google Scholar
Kim SJ, Mitran P, Tarokh V: Performance bounds for bidirectional coded cooperation protocols. IEEE Trans. Inf. Theory 2008, 54(11):5235-5241.
Article MathSciNet MATH Google Scholar
Kim SJ, Devroye N, Mitran P, Tarokh V: Comparison of bi-directional relaying protocols. In IEEE Sarnoff Symposium. Princeton; 2008.
Google Scholar
Gerdes L, Riemensberger M, Utschick W: On achievable rate regions for half-duplex Gaussian MIMO relay channels: a decomposition approach. IEEE J. Sel. Areas Commun 2012, 30(8):1319-1330.
Article Google Scholar
Stein M: Towards optimal schemes for the half-duplex two-way relay channel. Submitted to IEEE J. Sel. Areas Commun 2011. http://arxiv.org/abs/1101.3198
Google Scholar
El Gamal A, Aref M: The capacity of the semideterministic relay channel. IEEE Trans. Inf. Theory 1982, 28(3):536. 10.1109/TIT.1982.1056502
Article MATH Google Scholar
El Gamal A, Zahedi S: Capacity of a class of relay channels with orthogonal components. IEEE Trans. Inf. Theory 2005, 51(5):1815-1817. 10.1109/TIT.2005.846438
Article MathSciNet MATH Google Scholar
Cover TM, Thomas JA: Elements of Information Theory. Hoboken: John Wiley & Sons; 2006.
MATH Google Scholar
Kramer G, Gastpar M, Gupta P: Cooperative strategies and capacity theorems for relay networks. IEEE Trans. Inf. Theory 2005, 51(9):3037-3063. 10.1109/TIT.2005.853304
Article MathSciNet MATH Google Scholar
Bazaraa MS, Sherali HD, Shetty CM: Nonlinear Programming. Hoboken: John Wiley & Sons; 2006.
Book MATH Google Scholar
Boyd S, Vandenberghe L: Convex Optimization. New York: Cambridge University Press; 2004.
Book MATH Google Scholar
Toh KC, Todd MJ: RH Tutuncu, On the implementation and usage of SDPT3—a MATLAB software package for semidefinite-quadratic-linear programming, version 4.0. 2010. http://hdl.handle.net/1813/15133
Google Scholar
Goldsmith A: Wireless Communications. New York: Cambridge University Press; 2005.
Book Google Scholar

Download references

Acknowledgements

This study was supported by the Deutsche Forschungsgemeinschaft (DFG) under grant Ut36_11.

Author information

Authors and Affiliations

Associate Institute for Signal Processing, Technische Universität München, München, 80290, Germany
Lennart Gerdes, Maximilian Riemensberger & Wolfgang Utschick

Authors

Lennart Gerdes
View author publications
You can also search for this author in PubMed Google Scholar
Maximilian Riemensberger
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Utschick
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lennart Gerdes.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Gerdes, L., Riemensberger, M. & Utschick, W. Bounds on the capacity regions of half-duplex Gaussian MIMO relay channels. EURASIP J. Adv. Signal Process. 2013, 43 (2013). https://doi.org/10.1186/1687-6180-2013-43

Download citation

Received: 22 June 2012
Accepted: 04 February 2013
Published: 06 March 2013
DOI: https://doi.org/10.1186/1687-6180-2013-43

Bounds on the capacity regions of half-duplex Gaussian MIMO relay channels

Abstract

1 Introduction

2 System model

3 Outer bound on capacity region

Theorem 1

Proof

Proposition 2

Proof

Convex parameterization of outer bound region C OB

Remark 1

Remark 2

Proposition 3. S is a convex set.

Proof. See Appendix 1. □

Derivation of the dual function

Proposition 4

Proof

Remark 3

Remark 4

Remark 5

Solution by means of cutting plane algorithm

Remark 6

Remark 7

Primal reconstruction

Remark 8

An achievable rate region using the DF scheme

Theorem 5

Proof

Numerical results

Conclusion

Endnotes

Appendix 1

Proof of Proposition 3

Appendix 2

Proof of Proposition 4

Appendix 3

Outline of coding scheme that achieves R DF

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Convex parameterization of outer bound region $C_{OB}$

Proposition 3. $S$ is a convex set.

Outline of coding scheme that achieves $R_{DF}$