 Research
 Open Access
 Published:
Bounds on the capacity regions of halfduplex Gaussian MIMO relay channels
EURASIP Journal on Advances in Signal Processing volume 2013, Article number: 43 (2013)
Abstract
This article considers uni and bidirectional communication in the halfduplex Gaussian multipleinput multipleoutput (MIMO) relay channel. Assuming perfect channel state information at all nodes and the use of time division duplex communications protocol to separate transmissions and receptions at all nodes, we propose a dual decomposition approach to efficiently determine upper and lower bounds on the capacity and the capacity region of the halfduplex relay channel and the restricted halfduplex twoway relay channel, respectively. Our approach allows to quantify the fundamental limits of the considered relay networks, and the obtained results may serve as benchmarks when studying different and/or suboptimal relay strategies or the impact of channel estimation errors. Furthermore, we discuss how our dual decomposition approach may be used for designing optimal resource allocation protocols.
1 Introduction
A central aspect of today’s and future wireless network standards is the question of how to provide highspeed and highquality service to a steadily growing number of mobile users without an increase of available bandwidth. One means to improve throughput, spectral efficiency, and reliability is to equip the communication devices with multiple antennas as it is wellknown that multiantenna systems offer substantial gains over singleantenna systems [1, 2]. Another means to achieve above goals and to extend coverage is the use of relays, which support the communication between source(s) and destination(s) but usually do not have own information to transmit. The concept of relaying goes back as far as 1971 when van der Meulen introduced the relay channel model [3]. In contrast to pointtopoint channels, the capacity of the relay channel remains unknown in general, but upper and lower bounds have of course been derived [4].
In this study, we consider the combination of multipleantenna systems and the concept of relaying. In particular, we determine upper and lower bounds on the capacity and the capacity region of the Gaussian multipleinput multipleoutput (MIMO) relay channel and the Gaussian MIMO twoway relay channel with a halfduplex constraint^{a}. While this topic is interesting and relevant in itself, note also that both the relay channel and the twoway relay channel are elementary building blocks of general multihop wireless networks. A fundamental understanding of these two small networks and their performance limits can thus help to determine the limits on the performances of larger communication networks, e.g., by decomposing a larger network into subgraphs whose performances can be more easily specified.
In their pioneering study on the relay channel, Cover and El Gamal derived a capacity upper bound and achievable rates based on a then new cutset bound (CSB) and two coding schemes that are now referred to as decodeandforward (DF) and compressandforward (CF), respectively. In [5, 6], the cutset bound and the DF scheme are used to derive bounds on the capacity of the halfduplex relay channel. For Gaussian singleantenna channels, corresponding bounds are presented in [7, 8]. A (generally loose) upper bound to the CSB of the fullduplex Gaussian MIMO relay channel is provided in [9]. Achievable rates for this channel based on pointtopoint transmission, the cascaded relay channel, and a suboptimal DF scheme are given, too. In [10], it is shown that the cutset bound and the maximum achievable DF rate for this MIMO relay channel can be obtained as the solutions of convex optimization problems, which also holds if a halfduplex constraint is imposed and frequency division duplex (FDD) with an average power constraint is considered. For the fullduplex case, the same result was later independently derived in [11] and then extended to the halfduplex relay channel with a time division duplex (TDD) protocol and per protocol phase transmit power constraints imposed on source and relay [12]. A similar study for both the fullduplex and the halfduplex case with TDD is presented in [13]. However, it can be verified that the expressions resulting from those derivations are only upper bounds to the optimal solutions.
The twoway relay channel models the more common and important scenario where two terminals want to exchange information with the aid of a relay. It was introduced in [14], where the authors showed that a significant portion of the loss in spectral efficiency suffered in the oneway relay channel due to the halfduplex constraint can be compensated when bidirectional communication is considered. Most scientific articles have analyzed the halfduplex twoway relay channel in combination with a communication protocol consisting of two phases, a multiple access (MAC) phase and a broadcast (BC) phase [14–18]. In the MAC phase, the terminals transmit their messages to the relay, and subsequently, in the BC phase, the relay broadcasts its message to the terminals. With this protocol, however, all information is sent via the relay since the terminals cannot overhear each other’s transmissions due to the halfduplex constraint. As a result, protocols composed of more than two phases that utilize the direct link between the terminals can yield larger achievable rate regions in general [19–21].
The contributions of this article are as follows. We present a dual decomposition approach that allows to evaluate upper and lower bounds on the capacity and the capacity region of the halfduplex Gaussian MIMO relay channel and the restricted halfduplex Gaussian MIMO twoway relay channel, respectively. To this end, perfect channel state information (CSI) at all nodes and the use of TDD protocols to separate transmissions and receptions at all nodes are assumed. We show how the proposed dual decomposition approach can be applied to efficiently tackle the joint optimization of input signals and time allocation that needs to be solved in order to obtain the desired results. In the dual domain, the problem decomposes into subproblems that are easier to solve and for which standard convex optimization tools can be used. With our optimization approach, it is hence possible to efficiently obtain numerical results that quantify the fundamental limits of uni and bidirectional communication in the halfduplex Gaussian MIMO relay channel. These results can then serve as benchmarks when studying different and/or suboptimal relay strategies or the impact of channel estimation errors on the performance of the considered relay networks. Moreover, our dual decomposition approach may be used for designing optimal resource allocation protocols, as discussed later in this article.
The approach proposed here is a nontrivial extension of a similar dual decomposition approach presented in [21]. There, we considered bounds on achievable rate regions for the same relay networks, but the transmit powers of all nodes were assumed to be bounded above by some finite value for every protocol phase. In this study, we modify this approach such that it can handle the average transmit power constraints under which the information theoretic capacity bounds (cutset bound and achievable DF rate) we are interested in here were derived. We remark that the problems we need to solve become considerably more difficult due to the average transmit power constraints, both from a theoretical and practical point of view. This is because we need to introduce more dual variables and because the constraint sets of the subproblems encountered in the dual domain become unbounded. The latter means that several additional mathematical details have to be taken into account in order to ensure correctness of the optimization strategy. What is more, the power constraints considered in [21] can easily be incorporated into the optimization framework presented in this article, which is not the case vice versa. In this sense, the optimization approach presented here is more general than that of [21].
The remainder of this article is organized as follows. Section 2 introduces the system model for the restricted halfduplex Gaussian MIMO twoway relay channel. It should be mentioned here that our analysis focuses on the halfduplex twoway relay channel since it includes the halfduplex relay channel as a special case. In Section 3, we derive an outer bound on the capacity region of the restricted halfduplex Gaussian MIMO twoway relay channel and show how it can numerically be evaluated by means of the aforementioned dual decomposition approach. An inner bound on the capacity region is given by the rate region that can be achieved when the relay uses the decodeandforward scheme. This achievable rate region and how it can be evaluated is discussed in Section 4. Numerical results for both uni and bidirectional communication in the halfduplex Gaussian MIMO relay channel are presented in Section 5, and Section 6 concludes the article.
Notation: ${\mathbb{R}}_{+}$ stands for the set of nonnegative real numbers. Matrices are denoted by bold capital letters, vectors by bold lowercase characters. The identity matrix, the zero matrix/vector, and the allones vector are specified by I, 0, and 1, respectively, where the dimensions are indicated by subscripts if necessary. A ^{−1}, A ^{‡}, A ^{T}, A ^{H}, and tr(A) denote the inverse, MoorePenrose pseudoinverse, transpose, conjugate transpose, and trace of a matrix A, while A≽B means that A−B is positive semidefinite. E[·] is the expectation operator and $\mathit{x}\sim {\mathcal{N}}_{\mathbb{C}}(\mathit{\mu},\mathit{C})$ means that x is a circularly symmetric complex Gaussian random vector with mean μ and covariance matrix C. Finally, I(X;YZ) denotes the conditional mutual information of random variables X and Y given Z and h(XY) is the differential entropy of X given Y.
2 System model
In the oneway relay channel, one source transmits information to one destination with the help of a relay. This simple unidirectional relay network is obviously only a special case of the twoway relay channel, where two terminals exchange information with the aid of the relay. Therefore, our analysis focuses on the halfduplex Gaussian MIMO twoway relay channel. More specifically, we consider the restricted twoway relay channel, i.e., the bidirectional communication is restricted in the sense that the encoders at the two terminals can neither cooperate, nor are they able to use previously decoded information to encode their messages. The most general communication protocol for this channel model is composed of all six phases (network states) where either one or two nodes transmit, as first noted in [22]. Evidently, no information can be conveyed when all nodes are silent or when all nodes transmit at the same time, where the latter is due to the halfduplex constraint imposed on all nodes. The six different phases are illustrated in Figure 1, where nodes 1 and 2 represent the two terminals and R is the relay.
Let N _{A} and N _{B} be the number of antennas at node A and node B, let ${\mathit{x}}_{\mathrm{A}}^{(i)}\in {\mathbb{C}}^{{N}_{\mathrm{A}}}$ and ${\mathit{y}}_{\mathrm{B}}^{(i)}\in {\mathbb{C}}^{{N}_{\mathrm{B}}}$ denote the transmit signal of node A and the receive signal of node B during phase i, respectively, and let ${\mathit{H}}_{\text{AB}}\in {\mathbb{C}}^{{N}_{\mathrm{B}}\times {N}_{\mathrm{A}}}$ denote the channel gain matrix between nodes A and B for all i∈{1,…,6}. Then, the phases are characterized as follows:

(1)
Node 1 transmits to node 2 and the relay:
$$\begin{array}{l}{\mathit{y}}_{\mathrm{R}}^{(1)}={\mathit{H}}_{1\mathrm{R}}{\mathit{x}}_{1}^{(1)}+{\mathit{n}}_{\mathrm{R}}^{(1)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{\mathrm{R}}^{(1)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{\mathrm{R}}}),\phantom{\rule{2em}{0ex}}\\ {\mathit{y}}_{2}^{(1)}={\mathit{H}}_{12}{\mathit{x}}_{1}^{(1)}+{\mathit{n}}_{2}^{(1)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{2}^{(1)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{2}}).\phantom{\rule{2em}{0ex}}\end{array}$$

(2)
Node 2 transmits to node 1 and the relay:
$$\begin{array}{l}{\mathit{y}}_{\mathrm{R}}^{(2)}={\mathit{H}}_{2\mathrm{R}}{\mathit{x}}_{2}^{(2)}+{\mathit{n}}_{\mathrm{R}}^{(2)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{\mathrm{R}}^{(2)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{\mathrm{R}}}),\phantom{\rule{2em}{0ex}}\\ {\mathit{y}}_{1}^{(2)}={\mathit{H}}_{21}{\mathit{x}}_{2}^{(2)}+{\mathit{n}}_{1}^{(2)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{1}^{(2)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{1}}).\phantom{\rule{2em}{0ex}}\end{array}$$

(3)
Node 1 and node 2 transmit to the relay:
$$\begin{array}{l}\phantom{\rule{15.0pt}{0ex}}{\mathit{y}}_{\mathrm{R}}^{(3)}={\mathit{H}}_{1\mathrm{R}}{\mathit{x}}_{1}^{(3)}+{\mathit{H}}_{2\mathrm{R}}{\mathit{x}}_{2}^{(3)}+{\mathit{n}}_{\mathrm{R}}^{(3)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{\mathrm{R}}^{(3)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{\mathrm{R}}}).\end{array}$$

(4)
The relay transmits to node 1 and node 2:
$$\begin{array}{l}{\mathit{y}}_{1}^{(4)}={\mathit{H}}_{\mathrm{R}1}{\mathit{x}}_{\mathrm{R}}^{(4)}+{\mathit{n}}_{1}^{(4)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{1}^{(4)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{1}}),\phantom{\rule{2em}{0ex}}\\ {\mathit{y}}_{2}^{(4)}={\mathit{H}}_{\mathrm{R}2}{\mathit{x}}_{\mathrm{R}}^{(4)}+{\mathit{n}}_{2}^{(4)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{2}^{(4)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{2}}).\phantom{\rule{2em}{0ex}}\end{array}$$

(5)
The relay and node 2 transmit to node 1:
$$\begin{array}{l}\phantom{\rule{20.0pt}{0ex}}{\mathit{y}}_{1}^{(5)}={\mathit{H}}_{\mathrm{R}1}{\mathit{x}}_{\mathrm{R}}^{(5)}+{\mathit{H}}_{21}{\mathit{x}}_{2}^{(5)}+{\mathit{n}}_{1}^{(5)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{1}^{(5)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{1}}).\end{array}$$

(6)
The relay and node 1 transmit to node 2:
$$\begin{array}{l}\phantom{\rule{20.0pt}{0ex}}{\mathit{y}}_{2}^{(6)}={\mathit{H}}_{\mathrm{R}2}{\mathit{x}}_{\mathrm{R}}^{(6)}+{\mathit{H}}_{12}{\mathit{x}}_{1}^{(6)}+{\mathit{n}}_{2}^{(6)},\phantom{\rule{1em}{0ex}}{\mathit{n}}_{2}^{(6)}\sim {\mathcal{N}}_{\mathbb{C}}(0,{\mathbf{I}}_{{N}_{2}}).\end{array}$$
Here, we have assumed that the channels are the same for all network states in order to simplify the notation. This is without loss of generality, however, since we anyhow require all channels to be perfectly known at all nodes for the discussions below. Moreover, the additive white Gaussian noise ${\mathit{n}}_{\mathrm{A}}^{(i)}$ received at node A during phase i is assumed to be independent of the noise ${\mathit{n}}_{\mathrm{B}}^{(j)}$ received at another node B for all phases j∈{1,…,6} and independent of ${\mathit{n}}_{\mathrm{A}}^{(j)}$ for all j≠i.
With each node A that transmits in the i th phase a transmit covariance matrix
is associated, and the average transmit power consumed by the node during this phase is given by ${p}_{\mathrm{A}}^{(i)}=\phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{\mathrm{A}}^{(i)}\right)$. Furthermore, if the two nodes A and B transmit simultaneously during phase i, we have a joint transmit covariance matrix
for this phase. By defining the selection matrices
the transmit covariance matrices ${\mathit{R}}_{\mathrm{A}}^{(i)}$ and ${\mathit{R}}_{\mathrm{B}}^{(i)}$ of the two transmitting nodes can be expressed as linear functions of the joint transmit covariance matrix R ^{(i)}:
3 Outer bound on capacity region
In this section, we establish an outer bound on the capacity region of the restricted halfduplex Gaussian MIMO twoway relay channel and, as our main contribution, propose an efficient method to evaluate it. The outer bound region is obtained by applying the cutset bound, which was originally derived for the oneway relay channel in [4], to the information flow from node 1 to node 2 as well as to the information flow from node 2 to node 1. In particular, we first consider the cutset outer bound for the general halfduplex twoway relay channel and then show that, for Gaussian channels, it is equivalent to the cutset outer bound for the restricted halfduplex twoway relay channel. While it is not known whether the cutset bound is tight in general, there is no known tighter bound for the relay channel. What is more, it is tight for all classes of relay channels for which the capacity is known. These include the physically degraded and the reversely degraded relay channel [4], the semideterministic relay channel [23], and the relay channel with orthogonal components [24].
Theorem 1
Suppose ( R _{1},R _{2} ) is an achievable rate pair for the halfduplex twoway relay channel, where R _{1} is associated with the rate of the information sent from node 1 to node 2 and R _{2} with that of the reverse direction. Then,
where ${X}_{\mathrm{A}}^{(i)}$ and ${Y}_{\mathrm{B}}^{(i)}$ represent the channel input of node A and the channel output of node B during phase i , respectively, and the duration of the ith phase is denoted by τ _{ i }.
Proof
The result directly follows from ([5], Thm. 1) by considering all six network states (TDD phases) and both directions of data transmission. In particular, the rate bounds originate from the four cutsets depicted in Figure 2; the first two cutsets (shown in Figures 2a,b) yield the upper bounds on R _{1}, whereas the bounds on R _{2} are determined by the third (Figure 2c) and fourth (Figure 2d) cutsets. After having identified which of the six protocol phases need to be considered for which cutset, e.g., phases 1, 3, 6 for the first one, straightforward application of ([5], Thm. 1) gives the constraints specified in (5). □
We remark that the order of the phases in the transmission protocol is irrelevant if we only consider this outer bound region ${\mathcal{C}}_{\text{OB}}$; only the portion of the time τ _{ i } that phase i is used matters. While it is clear that the optimal joint input distribution factors as $\prod _{i=1}^{6}{p}_{{X}_{1}^{(i)}{X}_{2}^{(i)}{X}_{\mathrm{R}}^{(i)}}$, the following proposition additionally shows that ${p}_{{X}_{1}^{(3)}{X}_{2}^{(3)}}={p}_{{X}_{1}^{(3)}}{p}_{{X}_{2}^{(3)}}$ maximizes ${\mathcal{C}}_{\text{OB}}$ for the Gaussian relay channel.
Proposition 2
The input distribution for phase 3 that maximizes ${\mathcal{C}}_{\text{OB}}$ for the halfduplex Gaussian twoway relay channel factors as ${p}_{{X}_{1}^{(3)}{X}_{2}^{(3)}}={p}_{{X}_{1}^{(3)}}{p}_{{X}_{2}^{(3)}}$.
Proof
In the third phase, both node 1 and node 2 transmit to the relay so that the inputoutput characteristic for the Gaussian relay channel is generally specified by ${Y}_{\mathrm{R}}^{(3)}={f}_{1}({X}_{1}^{(3)})+{f}_{2}({X}_{2}^{(3)})+{N}_{\mathrm{R}}^{(3)}$, where f _{1} and f _{2} are deterministic functions that represent the transformations of the input signals induced by the channel gains, and where ${N}_{\mathrm{R}}^{(3)}$ denotes the Gaussian noise received at the relay, which is independent of the signals ${X}_{1}^{(3)}$ and ${X}_{2}^{(3)}$. There are two mutual information terms associated with phase 3 in (5): $I({X}_{1}^{(3)};{Y}_{\mathrm{R}}^{(3)}{X}_{2}^{(3)})$ in the first condition and $I({X}_{2}^{(3)};{Y}_{\mathrm{R}}^{(3)}{X}_{1}^{(3)})$ in the third one. Both of them are maximized if ${X}_{1}^{(3)}$ and ${X}_{2}^{(3)}$ are independent, as shown by the following chain of inequalities:
with equality if and only if ${X}_{1}^{(3)}$ and ${X}_{2}^{(3)}$ are independent ([25], Cor. to Thm. 8.6.1). The same of course holds if the roles of ${X}_{1}^{(3)}$ and ${X}_{2}^{(3)}$ are reversed, which proves the proposition. □
Proposition 2 hence implies that ${\mathcal{C}}_{\text{OB}}$ is also the cutset outer bound for the restricted halfduplex Gaussian twoway relay channel, which requires that ${p}_{{X}_{1}^{(3)}{X}_{2}^{(3)}}={p}_{{X}_{1}^{(3)}}{p}_{{X}_{2}^{(3)}}$ as the terminals must not cooperate in encoding their messages. Moreover, it can be shown that Gaussian inputs are optimal for each phase ([26], Prop. 2). Since a Gaussian distribution is completely determined by its mean and covariance, the optimal zero mean input for phase i is specified by R ^{(i)}, where ${\mathit{R}}_{12}^{(3)}={0}_{{N}_{1}\times {N}_{2}}$ holds for the optimal R ^{(3)} as a consequence of Proposition 2. Note also that the cutset bound was derived under the assumption of average transmit power constraints on every node, i.e., ${\sum}_{i=1}^{6}{\tau}_{i}E\left[{\mathit{x}}_{\mathrm{A}}^{(i),\mathrm{H}}{\mathit{x}}_{\mathrm{A}}^{(i)}\right]={\sum}_{i=1}^{6}{\tau}_{i}\phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{\mathrm{A}}^{(i)}\right)\le {P}_{\mathrm{A}}$ if P _{A} denotes the available transmit power node A may consume on average.
Now, let us turn to the main subject of this section and the entire article, which is to evaluate the outer bound region ${\mathcal{C}}_{\text{OB}}$ for the Gaussian MIMO relay channel. One way of achieving this, and the one we choose here, is to determine its boundary by solving weighted sum rate (WSR) maximization problems over ${\mathcal{C}}_{\text{OB}}$ for different weight vectors $\mathit{w}\in {\mathbb{R}}_{+}^{2}$. In particular, the boundary of ${\mathcal{C}}_{\text{OB}}$ can be determined with arbitrary precision by varying the ratio of the weights $\frac{{w}_{1}}{{w}_{2}}$ from zero to infinity ^{b}. For a given weight vector, the weighted sum rate maximization we then need to solve reads as
We remark that the maximum of problem (6) is welldefined and that a maximizer ${\mathit{r}}^{\star}\in {\mathcal{C}}_{\text{OB}}$ exists. This is because ${\mathcal{C}}_{\text{OB}}$ is closed and bounded (and thus compact) if the transmit powers P _{1}, P _{2}, and P _{R} the nodes may consume on average are finite, which we of course assume below. Hence, Weierstrass’ theorem ([27], Thm. 2.3.1) guarantees that problem (6) attains its maximum.
For the purpose of solving such a WSR maximization problem, we take an approach that is similar to that chosen in [21] and which can be summarized as follows. Since the formulation of (6) is not very convenient if we actually want to perform the optimization, we seek a parameterization that is more suitable to the problem. As a first step towards this end, we find a convex parameterization of the outer bound region ${\mathcal{C}}_{\text{OB}}$ in Section 3.1. Since the objective function is linear, we obtain a convex optimization problem for which strong duality holds so that it can equivalently be solved in the dual domain. The corresponding dual problem is derived in Section 3.2. We then choose to solve this dual problem by means of the cutting plane algorithm, which is discussed in Section 3.3. Finally, we need to recover the optimal primal solution from the optimal solution to the dual problem. How this socalled primal reconstruction works for the considered weighted sum rate maximization problem is explained in Section 3.4.
Convex parameterization of outer bound region ${\mathcal{C}}_{\text{OB}}$
As a first step towards a convex parameterization of the outer bound region ${\mathcal{C}}_{\text{OB}}$, we define six ratepower regions ${\mathcal{S}}_{1},\dots ,{\mathcal{S}}_{6}$, one for each phase of the transmission protocol. Basically, ${\mathcal{S}}_{i}$ specifies the contribution of protocol phase i to the outer bound region, both in terms of rates and power consumption. For the Gaussian MIMO relay channel with the optimal Gaussian inputs, the mutual information terms specifying the rates boil down to the wellknown logdet expressions. Consequently, the six ratepower regions are given by ^{c}
with ${\mathit{H}}_{1}={\left[\begin{array}{cc}{\mathit{H}}_{1\mathrm{R}}^{\mathrm{H}}& {\mathit{H}}_{12}^{\mathrm{H}}\end{array}\right]}^{\mathrm{H}}$, ${\mathit{H}}_{2}={\left[\begin{array}{cc}{\mathit{H}}_{2\mathrm{R}}^{\mathrm{H}}& {\mathit{H}}_{21}^{\mathrm{H}}\end{array}\right]}^{\mathrm{H}}$, ${\mathit{H}}_{5}=\left[\begin{array}{cc}{\mathit{H}}_{21}& {\mathit{H}}_{\mathrm{R}1}\end{array}\right]$, ${\mathit{H}}_{6}=\left[\begin{array}{cc}{\mathit{H}}_{12}& {\mathit{H}}_{\mathrm{R}2}\end{array}\right]$and ${\mathit{D}}_{2}^{(5)}$, ${\mathit{D}}_{\mathrm{R}}^{(5)}$, ${\mathit{D}}_{1}^{(6)}$, ${\mathit{D}}_{\mathrm{R}}^{(6)}$ being appropriate selection matrices as defined in (3). It is straightforward to verify that ${\mathcal{S}}_{1},\dots ,{\mathcal{S}}_{6}$ are convex sets which are parameterized by means of the (joint) transmit covariance matrices R ^{(1)},…,R ^{(6)}, respectively. They are not compact, however, because neither the rates nor the transmit powers are bounded above. In fact, this is the main difference to the problem considered in [21], where the average transmit powers for each phase and thus also the rate regions associated with each phase are bounded. As a result, the derivation of the dual problem and its solution by means of the cutting plane algorithm become considerably more difficult, as discussed in Sections 3.2 and 3.3.
Remark 1
In order to arrive at above formulations for ${\mathcal{S}}_{5}$ and ${\mathcal{S}}_{6}$, the corresponding constraints on R _{1} have to be reformulated. This is done by introducing the auxiliary variables Q ^{(5)} and Q ^{(6)} to relax the equality constraints on the conditional covariance matrices ${\mathit{R}}_{2\mathrm{R}}^{(5)}={\mathit{R}}_{2}^{(5)}{\mathit{R}}_{2\mathrm{R}}^{(5)}{\mathit{R}}_{\mathrm{R}}^{(5),\u2021}{\mathit{R}}_{2\mathrm{R}}^{(5),\mathrm{H}}$ and ${\mathit{R}}_{1\mathrm{R}}^{(6)}={\mathit{R}}_{1}^{(6)}{\mathit{R}}_{1\mathrm{R}}^{(6)}{\mathit{R}}_{\mathrm{R}}^{(6),\u2021}{\mathit{R}}_{1\mathrm{R}}^{(6),\mathrm{H}}$, respectively, before applying the (generalized) Schur complement condition. For more details, we refer the reader to [10], where this reformulation was first presented assuming that ${\mathit{R}}_{\mathrm{R}}^{(5),1}$ exists, or to [11], where the same result was later independently derived for the more general case when ${\mathit{R}}_{\mathrm{R}}^{(5)}$ need not have full rank.
Suppose that P _{1}, P _{2}, and P _{R} denote the finite transmit powers that terminal 1, terminal 2, and the relay may consume on average, respectively, and let ${\mathit{p}}_{\text{Tx}}={\left[\begin{array}{ccc}{P}_{1}& {P}_{2}& {P}_{\mathrm{R}}\end{array}\right]}^{\mathrm{T}}$. Having defined the six ratepower regions and the vector p _{Tx}, we can now rewrite problem (6) as follows:
Like in [21], each row of $\mathit{A}={\left[\begin{array}{cccc}1& 1& 0& 0\\ 0& 0& 1& 1\end{array}\right]}^{\mathrm{T}}$ selects one of the four rate constraints as defined in the outer bound region ${\mathcal{C}}_{\text{OB}}$ (cf. (5)), and the corresponding rows of the matrices B _{ i }∈{0,1}^{4×2} specify the structures of these constraints with regard to the sets ${\mathcal{S}}_{i}$: ${\mathit{B}}_{1}={\mathit{B}}_{6}={\left[\begin{array}{cccc}1& 0& 0& 0\\ 0& 1& 0& 0\end{array}\right]}^{\mathrm{T}}$, ${\mathit{B}}_{2}={\mathit{B}}_{5}={\left[\begin{array}{cccc}0& 0& 1& 0\\ 0& 0& 0& 1\end{array}\right]}^{\mathrm{T}}$, ${\mathit{B}}_{3}={\left[\begin{array}{cccc}1& 0& 0& 0\\ 0& 0& 1& 0\end{array}\right]}^{\mathrm{T}}$, ${\mathit{B}}_{4}={\left[\begin{array}{cccc}0& 1& 0& 0\\ 0& 0& 0& 1\end{array}\right]}^{\mathrm{T}}$. Furthermore, the fact that the three nodes are subject to average transmit power constraints is reflected in the term ${\sum}_{i=1}^{6}{\tau}_{i}{\mathit{p}}_{i}\le {\mathit{p}}_{\text{Tx}}$, where ${\mathit{p}}_{i}={\left[\begin{array}{ccc}{p}_{1}^{(i)}& {p}_{2}^{(i)}& {p}_{\mathrm{R}}^{(i)}\end{array}\right]}^{\mathrm{T}}$is the vector of average transmit powers consumed by the three nodes during phase i.
Remark 2
The optimization problem (13) would be convex for fixed τ _{1},…,τ _{6}. The reason it is a nonconvex parameterization of (6) if the time shares are optimization variables is that the functions τ _{ i } B _{ i } r _{ i } and τ _{ i } p _{ i } are not jointly concave in τ _{ i },r _{ i } and jointly convex in τ _{ i },p _{ i }, respectively.
Consequently, another reformulation step is required, and for this purpose, we define the set
Proposition 3. $\mathcal{S}$ is a convex set.
Proof. See Appendix 1. □
Using this definition of $\mathcal{S}$, the weighted sum rate maximization problem (6) is equivalently expressed as
Because $\mathcal{S}$ is a convex set with nonempty relative interior, (15) is a convex optimization problem for which strong duality holds ([28], Sec. 5.3.2). In particular, the constraints of problem (15) specify a convex set, which means that a convex parameterization of the outer bound region ${\mathcal{C}}_{\text{OB}}$ is given by
Derivation of the dual function
Since we have strong duality for problem (15), we can equivalently solve it in the dual domain. In the approach considered here, the constraints Ar≤y and z≤p _{Tx} are incorporated into the objective function using the Lagrangian multipliers $\mathit{\lambda}\in {\mathbb{R}}^{4}$ and $\mathit{\mu}\in {\mathbb{R}}^{3}$. This leads to a dual problem where the six phases are decoupled. In particular, it will show that this approach allows to solve (15) without explicitly optimizing the time allocation parameters τ _{1},…τ _{6}. The Lagrangian function reads as
and the resulting dual function is given by
Applying the definition of $\mathcal{S}$ yields
If none of the channel gain matrices between the two terminals or between one of the terminals and the relay is a zero matrix, we have the following proposition ^{d}.
Proposition 4
For any μ≥0 and any λ≥0 that satisfies A ^{T} λ=w, the value of the dual function Θ(λ,μ) is finite if and only if the following three conditions hold:

1.
μ _{1}>0or μ _{1}=0,λ _{1}=λ _{2}=0,

2.
μ _{2}>0or μ _{2}=0,λ _{3}=λ _{4}=0,

3.
μ _{3}>0or μ _{3}=0,λ _{2}=λ _{4}=0.
Proof
See Appendix 2. □
The meaning of Proposition 4 is as follows. For the subproblems
the Lagrangian multipliers μ _{1}, μ _{2}, and μ _{3} can be understood as prices associated with the powers P _{1}, P _{2}, and P _{R} consumed by node 1, node 2, and the relay, respectively. If all prices are positive, each of the subproblems is guaranteed to have a finite optimal solution because the cost of power μ ^{T} p _{ i } is a linear function of ${p}_{1}^{(i)}$, ${p}_{2}^{(i)}$, ${p}_{\mathrm{R}}^{(i)}$, whereas λ ^{T} B _{ i } r _{ i } increases only logarithmically with the powers. If one of the prices is zero, however, the transmit power of the corresponding node and the associated transmit data rates can be increased to infinity without incurring any costs. Consequently, the subproblems for all phases i∈{1,…,6} in which this node transmits take the value infinity unless all the entries of the r _{ i }’s to which transmissions by the node contribute are weighted with zero.
Remark 3
Note that λ _{3}=λ _{4}=0 (λ _{1}=λ _{2}=0) may result in Θ(λ,μ)<∞ only if w _{2}=0 (w _{1}=0) because otherwise A ^{T} λ≠w. If w _{2}=0 (w _{1}=0), however, the WSR maximization over ${\mathcal{C}}_{\text{OB}}$ (6) reduces to maximizing the cutset bound for the oneway relay channel with terminal 1 (terminal 2) being the source and terminal 2 (terminal 1) being the destination. In particular, w _{2}=0 yields ${\lambda}_{3}^{\star}={\lambda}_{4}^{\star}=0$, which in turn implies ${\mu}_{2}^{\star}=0$ and
That is, only phases 1 and 6 of our 6phase protocol need to be considered for the optimal solution, which is equivalent to setting τ _{2}=τ _{3}=τ _{4}=τ _{5}=0 in (13). The de facto communication protocol for this case is therefore consistent with that used for the halfduplex oneway relay channel if terminal 1 is the source and terminal 2 is the destination [12, 21]. Similarly, w _{1}=0 implies ${\lambda}_{1}^{\star}={\lambda}_{2}^{\star}=0$, ${\mu}_{1}^{\star}=0$, and the optimal solution involves only phases 2 and 5. If w>0, on the other hand, we can conclude from Proposition 4 that Θ(λ,μ)<∞ requires μ _{1}>0 and μ _{2}>0.
Remark 4
For μ _{3}=0, it follows from Proposition 4 that Θ(λ,μ)<∞ only if λ _{2}=λ _{4}=0. But λ _{2}=λ _{4}=0 means that transmissions by the relay have no effect on the dual function since the corresponding rates are all weighted with zero. This is independent of P _{R} and the channel gain matrices, and as a result, we have
in this case. Moreover, it is clear that phase 3 contributes nothing to the bidirectional communication if the relay cannot forward the information it previously received. Hence, the optimal solution could only involve phases 1 and 2 if μ _{3}=0, meaning that only the direct link between the terminals would be utilized, and $\mathit{\lambda}={\left[\begin{array}{cccc}{w}_{1}& 0& {w}_{2}& 0\end{array}\right]}^{\mathrm{T}}$would be the optimizer of the dual problem. But for this λ the primal feasibility and complementary slackness conditions of the primal problem (15) would only be satisfied simultaneously if ${X}_{2}^{(2)}{Y}_{1}^{(2)}{Y}_{\mathrm{R}}^{(2)}$ and ${X}_{1}^{(1)}{Y}_{2}^{(1)}{Y}_{\mathrm{R}}^{(1)}$ formed Markov chains. This is an academic special case that our system model does not permit. Consequently, μ _{3}>0 if λ≥0 and Θ(λ,μ)<∞.
From Proposition 4 and the two subsequent remarks, it follows that for λ≥0 and positive weight vectors w>0 the dual function is equal to
In order to determine an optimal solution to the original weighted sum rate maximization problem (6), we thus have to solve the dual problem
Remark 5
Because μ>0, the constraint set of this dual problem is not closed so that the existence of a minimizing solution cannot be guaranteed by Weierstrass’ theorem. However, since the maximum of (6) is welldefined and strong duality holds, the minimum of (24) is also welldefined.
Solution by means of cutting plane algorithm
A simple yet efficient algorithm that can be used to solve the dual problem (24) is the cutting plane algorithm ([27], Sec. 6.4), an outerapproximation method where the feasible set of the problem is approximated by a finite number of feasible points and iteratively refined by a set of linear inequalities. In each iteration of the cutting plane algorithm, a linear program, the socalled master program, must be solved and the dual function Θ(λ,μ) must be evaluated. In the ℓ th iteration, the master program reads as
where, for all k∈{1,…,ℓ}, we have $({\mathit{r}}^{(k)},{\mathit{p}}^{(k)})\in {\mathcal{S}}_{i}$ for some i∈{1,…,6} and v ^{(k)}=B _{ i } r ^{(k)}. As can be seen from (23), evaluating the dual function requires to solve six independent convex optimization problems, one over each of the sets ${\mathcal{S}}_{i}$ associated with the six phases of the communication protocol. For this purpose, standard semidefinite program (SDP) solvers like SDPT3 [29] that are capable of dealing with the weighted sum of logdet terms in the objective function can be applied. For a convergence analysis and more details on the cutting plane method, we refer the reader to ([27], Sec. 6.4).
Remark 6
In order for the cutting plane algorithm to work in practice, we replace the constraint μ>0 by μ≥0. This does not change the optimal solution of the dual problem (24) because we know that the optimizer satisfies μ ^{⋆}>0. However, proper initialization of the cutting plane method then has to be ensured. In particular, if α ^{(1)},λ ^{(1)},μ ^{(1)} are the optimizers of the master program in the first iteration, we actually have to choose several initial points $({\mathit{r}}^{(k)},{\mathit{p}}^{(k)})\in {\mathcal{S}}_{i}$ for some i∈{1,…,6} such that α ^{(1)} is finite and μ ^{(1)}>0 is guaranteed. Otherwise, the algorithm runs into problems when the dual function is evaluated.
Remark 7
Since μ ^{⋆}>0 for w>0, it follows from the complimentary slackness condition of the primal problem (15) that z ^{⋆}=p _{Tx}, which means that the three nodes use all their available transmit power. This in turn implies r ^{⋆}>0 whenever w>0, i.e., the tangents to the boundary of ${\mathcal{C}}_{\text{OB}}$ at the optimal unidirectional points (C _{1,max},0) and (0,C _{2,max}) are orthogonal to the axes.
Primal reconstruction
As previously mentioned, the proposed dual decomposition approach allows to determine the optimal value of (6) without explicitly optimizing the time shares allocated to the six phases of the communication protocol. On the one hand, the decoupling of the phases considerably simplifies the optimization, but on the other, we want to know the optimal rate vector r ^{⋆} and possibly the optimal time shares ${\tau}_{i}^{\star}$, e.g., for the purpose of designing resource allocation protocols. To this end, we need to generate the optimal primal solution from the optimal solution to the dual problem, a process that is generally referred to as primal reconstruction or primal recovery. Since we apply the cutting plane algorithm to solve the dual problem, the primal recovery scheme to obtain the optimal rate vector r ^{⋆} and the optimal time shares ${\tau}_{i}^{\star}$ is fairly simple. Assume the cutting plane algorithm has converged to the optimal solution of the dual problem after L iterations, and consider the dual problem of the corresponding master program (25) given by
We remark that this problem is an approximation of the primal problem (15) where the set $\mathcal{S}$ is replaced by a convex combination of feasible points $\left\{({\mathit{v}}^{(1)},{\mathit{p}}^{(1)}),\dots ,({\mathit{v}}^{(L)},{\mathit{p}}^{(L)})\right\}\subset \mathcal{S}$ and where $\mathit{x}\in {\mathbb{R}}^{2}$ and u _{ k } denote the Lagrangian multipliers associated with the constraints A ^{T} λ=w and α≥μ ^{T}(p _{Tx}−p ^{(k)})+λ ^{T} v ^{(k)} of the master program, respectively. Letting ${\mathcal{K}}_{i}={\left\{k:({\mathit{r}}^{(k)},{\mathit{p}}^{(k)})\in {\mathcal{S}}_{i},{\mathit{v}}^{(k)}={\mathit{B}}_{i}{\mathit{r}}^{(k)}\right\}}^{\mathrm{e}}$, we can rewrite (26) as
Furthermore, it can be shown that ${\sum}_{k\in {\mathcal{K}}_{i}}{u}_{k}{\mathit{B}}_{i}{\mathit{r}}^{(k)}=\left({\sum}_{k\in {\mathcal{K}}_{i}}{u}_{k}\right){\mathit{B}}_{i}{\stackrel{~}{\mathit{r}}}_{i}$ and ${\sum}_{k\in {\mathcal{K}}_{i}}{u}_{k}{\mathit{p}}^{(k)}=\left({\sum}_{k\in {\mathcal{K}}_{i}}{u}_{k}\right){\stackrel{~}{\mathit{p}}}_{i}$ for some $({\stackrel{~}{\mathit{r}}}_{i},{\stackrel{~}{\mathit{p}}}_{i})\in {\mathcal{S}}_{i}$ since ${\mathcal{S}}_{i}$ is a convex set for all i∈{1,…,6}. If we insert these expressions in (26) and compare the result to (13), we can conclude that
The optimal time shares ${\tau}_{i}^{\star}$ are therefore easily obtained from the optimal Lagrangian dual variables ${u}_{k}^{\star}$, k∈{1,…,L}, that correspond to the constraints α≥μ ^{T}(p _{Tx}−p ^{(k)})+λ ^{T} v ^{(k)} in the master program. Moreover, it is clear that x ^{⋆}, which denotes the vector of optimal dual variables corresponding to the equality constraints A ^{T} λ=w, yields the optimal rate vector r ^{⋆}.
Remark 8
Since all ${\mathcal{S}}_{i}$ are convex, time sharing within any of the six phases of the communication protocol is not necessary. As a result, there will be no more than one $k\in {\mathcal{K}}_{i}$ with ${u}_{k}^{\star}>0$ for every i∈{1,…,6}.
An achievable rate region using the DF scheme
To obtain an inner bound on the capacity region of the restricted halfduplex Gaussian MIMO twoway relay channel, we consider the rate region that is achievable with the decodeandforward coding scheme in this section. Like the cutset bound, the DF coding scheme is due to Cover and El Gamal [4]. Requiring the relay to decode the source message can be a severe constraint so that other relaying strategies like compressandforward or amplifyandforward can achieve higher rates for certain channel conditions. For singleantenna nodes, this is for example illustrated in [8, 26]. Nevertheless, we consider only the DF strategy in this article because the corresponding achievable rate region ${\mathcal{R}}_{\text{DF}}$ is very similar in structure to ${\mathcal{C}}_{\text{OB}}$ and can thus be evaluated using the same methodology as described in the previous section.
Theorem 5
If the relay uses the decodeandforward coding scheme, the following rate region is achievable for the restricted halfduplex twoway relay channel:
Proof
This result is derived in [22] by adapting the DF coding scheme to the 6phase communication protocol introduced in Section 2 (with the phases performed in exactly that order) and applying it to both directions of information transfer. A brief outline of the coding scheme that achieves ${\mathcal{R}}_{\text{DF}}$ is given in Appendix 3. □
In theory, a different ordering of the phases may increase the achievable rate region ${\mathcal{R}}_{\text{DF}}$. To the best of our knowledge, however, the 6phase protocol we use is the most general protocol for the halfduplex twoway relay channel that has been considered in the literature so far. In particular, it includes the 2phase multiple access broadcast protocol (MABC: consisting of phases 3, 4), the 3phase time division broadcast protocol (TDBC: 1, 2, 4), and the 4phase hybrid broadcast protocol (HBC: 1, 2, 3, 4) used in [14–20], for example ^{f}. In addition, it also covers the approach of using time sharing between the oneway relay channels in both directions to exchange information between the terminals, which we termed oneway time sharing (OWTS: 1, 2, 5, 6) in [21].
Like for the outer bound region ${\mathcal{C}}_{\text{OB}}$, the optimal joint input distribution factors as $\prod _{i=1}^{6}{p}_{{X}_{1}^{(i)}{X}_{2}^{(i)}{X}_{\mathrm{R}}^{(i)}}$, where ${p}_{{X}_{1}^{(3)}{X}_{2}^{(3)}}={p}_{{X}_{1}^{(3)}}{p}_{{X}_{2}^{(3)}}$ must be fulfilled due to the assumption of the restricted halfduplex twoway relay channel, which prohibits the nodes from cooperating in encoding their messages. Furthermore, the optimal input distribution for each phase i∈{1,…,6} can be shown to be Gaussian again.
Note that, as in Section 3, our main objective is again to evaluate the achievable rate region ${\mathcal{R}}_{\text{DF}}$ for the Gaussian MIMO relay channel. Clearly, the boundary of the achievable rate region ${\mathcal{R}}_{\text{DF}}$ can also be determined by means of solving WSR maximization problems with different weight vectors. As ${\mathcal{R}}_{\text{DF}}$ and ${\mathcal{C}}_{\text{OB}}$ are very similar in structure, the approach we use to solve one such problem is essentially the same as for the outer bound region. First, we find a convex parameterization for ${\mathcal{R}}_{\text{DF}}$. Subsequently, we solve the problem in the dual domain by means of the cutting plane algorithm, and finally, we perform the primal reconstruction.
For the purpose of deriving a convex parameterization for ${\mathcal{R}}_{\text{DF}}$, let
and ${\mathcal{S}}_{i}^{\prime}={\mathcal{S}}_{i}$ for i∈{4,5,6}. Like ${\mathcal{S}}_{i}$ defined in the previous section, every ${\mathcal{S}}_{i}^{\prime}$ is a convex set that is parameterized by means of the (joint) transmit covariance matrix R ^{(i)} and that specifies the contribution of phase i to ${\mathcal{R}}_{\text{DF}}$. Having defined these unbounded convex sets ${\mathcal{S}}_{i}^{\prime}$, we can now express the weighted sum rate maximization problem that yields a point on the boundary of ${\mathcal{R}}_{\text{DF}}$ as follows:
Observe that the main difference compared to (13) is the additional constraint on the sum rate R _{1}+R _{2} in ${\mathcal{R}}_{\text{DF}}$ so that ${\mathit{A}}^{\prime}={\left[\begin{array}{ccccc}1& 1& 0& 0& 1\\ 0& 0& 1& 1& 1\end{array}\right]}^{\mathrm{T}}$and ${\mathit{B}}_{1}^{\prime}={\mathit{B}}_{6}^{\prime}={\left[\begin{array}{ccccc}1& 0& 0& 0& 1\\ 0& 1& 0& 0& 0\end{array}\right]}^{\mathrm{T}}$, ${\mathit{B}}_{2}^{\prime}={\mathit{B}}_{5}^{\prime}={\left[\begin{array}{ccccc}0& 0& 1& 0& 1\\ 0& 0& 0& 1& 0\end{array}\right]}^{\mathrm{T}}$, ${\mathit{B}}_{3}^{\prime}={\left[\begin{array}{ccccc}1& 0& 0& 0& 1\\ 0& 0& 1& 0& 1\end{array}\right]}^{\mathrm{T}}$, ${\mathit{B}}_{4}^{\prime}={\left[\begin{array}{ccccc}0& 1& 0& 0& 0\\ 0& 0& 0& 1& 0\end{array}\right]}^{\mathrm{T}}$. This constraint comes from the third phase of the communication protocol, a multiple access phase where both terminals transmit to the relay. The sum rate constraint in ${\mathcal{R}}_{\text{DF}}$ occurs because the relay must decode the messages from node 1 and node 2 when it uses DF.
Since all results from Section 3 apply here accordingly ^{g}, the remaining steps of the optimization follow along the same lines as for the outer bound region ${\mathcal{C}}_{\text{OB}}$. First, we define the convex set
and reformulate (33) as
Then, we use the dual decomposition approach in combination with the cutting plane method to obtain an optimal solution to this convex optimization problem.
Note that, after having obtained the solution, the optimal time shares ${\tau}_{i}^{\star}$, i∈{1,…,6}, i.e., the optimal durations of the six protocol phases, tell us which of these phases are part of the optimal transmission protocol for a given weight vector w. In particular, the optimal protocol includes phase i if and only if ${\tau}_{i}^{\star}>0$. Furthermore, our dual decomposition approach cannot only be applied to WSR maximization problems, but to any convex optimization problem for which strong duality holds. As a result, this approach may be used for the design of resource allocation protocols, e.g., by considering utility maximization problems with concave utility functions.
Numerical results
In this section, numerical results yielding bounds on the capacity of the halfduplex Gaussian relay channel as well as numerical results giving bounds on the capacity region of the restricted halfduplex Gaussian tworelay channel are presented. More specifically, we evaluate and compare the outer bound region ${\mathcal{C}}_{\text{OB}}$ and the rate region ${\mathcal{R}}_{\text{DF}}$ that can be achieved with the relay using the decodeandforward scheme for different scenarios in the twoway case. For unidirectional communication, these regions reduce to the cutset bound C _{OB} and the achievable rate R _{DF}, which give upper and lower bounds on the capacity of the halfduplex Gaussian oneway relay channel.
As an example scenario, let us consider the line network depicted in Figure 3. This is a simple but commonly used geometry (cf. [20, 26]) where the distance d _{12}=1 between the terminals is fixed and the relay is positioned on the line connecting the two terminals such that d _{1R}=d and d _{2R}=1−d. Furthermore, it is assumed that each node may consume the same transmit power P _{1}=P _{2}=P _{R}=10 on average, which for instance is a reasonable assumption in ad hoc networks. Finally, we assume that the path loss exponent is equal to α=4, which is a typical value for urban macrocell environments or multilevel office buildings (cf. [30], Table 2.2), and that all channel coefficients are perfectly known at all nodes.
Within this framework, two different relay network configurations are considered. In the first one, all nodes have a single antenna and the realvalued scalar channel coefficients are specified by ${h}_{\text{AB}}={d}_{\text{AB}}^{\alpha /2}$, which of course implies h _{AB}=h _{BA}. Note that, due to the assumption of realvalued channels, all rate vectors obtained with the presented optimization framework have to be divided by 2 since the rates are specified by $\frac{1}{2}log(\xb7)$ in this case as opposed to log(·) for complexvalued channels. In the second configuration, all nodes are equipped with two antennas. The channel gain matrices are then assumed to be complex random and independent, where the entries of H _{AB} are independent and identically distributed complex Gaussian random variables with zero mean and variance ${d}_{\text{AB}}^{\alpha}$. In addition, we assume that the channels are reciprocal, i.e., ${\mathit{H}}_{\text{AB}}={\mathit{H}}_{\text{BA}}^{\mathrm{T}}$.
For both the single and the multiantenna scenario, Figure 4 shows the cutset outer bound C _{OB} and the achievable DF rate R _{DF} for the halfduplex oneway relay channel over the distance d=d _{1R} between terminal 1 and the relay. Here, we have assumed that terminal 1 is the source and that terminal 2 is the destination of the unidirectional communication, which means that only phases 1 and 6 of the 6phase communication protocol are used. We remark that the results for the multiantenna case are averaged over 1000 independent channel realizations. For comparison, the best outer bound C _{OB,PP} and the best achievable DF rate R _{DF,PP} that can be obtained if the source and the relay are subject to per protocol phase transmit power constraints of the form $\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{\mathrm{A}}^{(i)}\right)\le {P}_{\mathrm{A}},i\in \{1,6\}$, are plotted as well. Note that this condition is more restrictive than the average transmit power constraint ${\sum}_{i=1,6}{\tau}_{i}\phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{\mathrm{A}}^{(i)}\right)\le {P}_{\mathrm{A}}$ with τ _{1},τ _{6}≥0,τ _{1}+τ _{6}=1 so that C _{OB}≥C _{OB,PP} and R _{DF}≥R _{DF,PP}.
It can be observed from Figure 4 that the decodeandforward strategy achieves capacity if the relay is close enough to the source, which is a wellknown fact that has previously been noted for the fullduplex case, e.g., in [26]. We also see that the optimal relay positions lie in the range 0.3≤d≤0.5, with the optimal values of d being almost the same for both power constraints. These observations are to be interpreted with caution, however, as the optimal relay position heavily depends on the path loss coefficient as well as the available transmit powers. Another nonsurprising observation is that, although a factor of 2 is due to the fact that we use realvalued channels for the singleantenna configuration, substantial rate gains can be achieved without increasing P _{1} or P _{R} when multiple antennas are used at each node. More interestingly, the gap between C _{OB} and C _{OB,PP} as well as that between R _{DF} and R _{DF,PP} vanishes when the relay is moved closer to the destination. This can be explained as follows. The sourcerelay link, and thus the phase in which the relay listens to the source, increasingly becomes the bottleneck of the information transfer with increasing d. As d approaches d _{12}=1, the optimal time share ${\tau}_{1}^{\star}$ of phase 1 also approaches 1. Hence, the relay power and the transmit power constraint imposed on the relay have no effect on the optimal solution. Furthermore, the average transmit power constraint imposed on the source becomes ${\tau}_{1}^{\star}\phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{1}^{(1)}\right)+{\tau}_{6}^{\star}\phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{1}^{(6)}\right)\approx \phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{1}^{(1)}\right)\le {P}_{1}$, i.e., it basically amounts to a per phase power constraint for phase 1.
For the bidirectional communication in the halfduplex twoway relay channel, we consider three different relay positions: (a) the relay is exactly in the middle between the two terminals (d=0.5); (b) the relay is placed near terminal 1 (d=0.25); (c) the relay is very close to terminal 1 (d=0.1). For each of these scenarios, Figures 5 (singleantenna) and 6 (multiantenna, results for one particular random channel realization) show the achievable DF rate regions ${\mathcal{R}}_{\text{DF}}$ and the outer bound regions ${\mathcal{C}}_{\text{OB}}$. For comparison, the best achievable DF rate regions ${\mathcal{R}}_{\text{DF,PP}}$ and the best outer bound regions ${\mathcal{C}}_{\text{OB,PP}}$ that can be obtained with per phase power constraints imposed on all nodes are also illustrated. Like for unidirectional transmission, we observe that ${\mathcal{C}}_{\text{OB}}\supset {\mathcal{C}}_{\text{OB,PP}}$ and ${\mathcal{R}}_{\text{DF}}\supset {\mathcal{R}}_{\text{DF,PP}}$ for all scenarios since the average power constraint ${\sum}_{i=1}^{6}{\tau}_{i}\phantom{\rule{1pt}{0ex}}\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{\mathrm{A}}^{(i)}\right)\le {P}_{\mathrm{A}}$ with τ _{ i }≥0,∀i∈{1,…,6}, and ${\sum}_{i=1}^{6}{\tau}_{i}=1$ is less restrictive than the per phase power constraint $\text{tr}\phantom{\rule{1pt}{0ex}}\left({\mathit{R}}_{\mathrm{A}}^{(i)}\right)\le {P}_{\mathrm{A}},\forall i\in \{1,\dots ,6\}$.
First of all, note that the results shown in Figures 5 and 6 allow to draw the same conclusions as for the oneway case: If the relay is close enough to terminal 1, the decodeandforward scheme achieves the cutset bound for the unidirectional communication from terminal 1 to terminal 2, i.e., R _{1,max}=C _{1,max}, regardless of whether we consider average or per phase transmit power constraints. Furthermore, the same R _{2,max} (or C _{2,max}) is obtained for both types of power constraints when d _{2R} approaches 1. Beyond that, a noteworthy observation is that the greatest benefit of the less restrictive average transmit power constraints is obtained if we are interested in the sum rate R _{1}+R _{2}, whereas the performance improvement is less pronounced for asymmetric rate requirements. Finally, observe that the gaps between the boundaries of ${\mathcal{C}}_{\text{OB}}$ and ${\mathcal{C}}_{\text{OB,PP}}$ are like the gaps between the boundaries of ${\mathcal{R}}_{\text{DF}}$ and ${\mathcal{R}}_{\text{DF,PP}}$ for all ratios $\frac{{R}_{1}}{{R}_{2}}$ and all scenarios considered here.
In order to assess the complexity of determining the achievable rate regions and outer bound regions, Table 1 illustrates the average number of iterations the cutting plane algorithm needs per weighted sum rate maximization problem until it converges for the different scenarios in the multiantenna case ^{h}. Here, the parameter ϵ that specifies the absolute accuracy of the optimal value was set to 10^{−2}. Note that the number of required iterations is very small if we consider the per protocol phase transmit power constraint. Unfortunately, the numbers roughly triple with the average power constraint that yields the information theoretic bounds on the capacity and the capacity region of the halfduplex Gaussian relay channel and the halfduplex Gaussian twoway relay channel, respectively. The main reason for this is that we need more dual variables to formulate the dual problem in the latter case. Since the number of required iterations remains reasonably small, however, these results confirm that the proposed dual decomposition approach indeed allows to efficiently evaluate achievable rate regions and corresponding outer bounds for the considered halfduplex Gaussian relay networks. Assuming knowledge of all channel gain matrices, it is hence possible to numerically evaluate their fundamental limits.
Conclusion
In this article, we presented a generic method that allows to determine the fundamental limits of uni and bidirectional communication in the halfduplex Gaussian MIMO relay channel. More specifically, we proposed a dual decomposition approach to evaluate upper and lower bounds on the capacity or the capacity region of the considered MIMO relay channels, for which perfect channel state information (CSI) was assumed. To this end, we modified the approach that was previously proposed in [21] such that the average transmit power constraints under which the cutset outer bound and the achievable decodeandforward (DF) rates were derived can be handled. It was shown that the joint optimization of input signals and time allocation decomposes into subproblems that are easier to solve in the dual domain, and we gave an example of how to solve the resulting dual problem by means of the cutting plane algorithm. The beauty of the proposed approach lies in the fact that the phases of the respective communication protocol decouple in the dual problem. As a result, evaluating the dual function only requires to solve one convex problem for each phase of the communication protocol, which can be done by applying standard semidefinite program (SDP) tools like SDPT3. It is this property that makes dual decomposition so attractive here, especially since the cutting plane algorithm converges after a reasonably small number of iterations.
Furthermore, we remark that our results may be used for protocol design with DF relays in the future. For the oneway case, we can determine what fraction of time the relay should listen to the source and how long it should transmit. For the twoway case, the benefit of our approach is even greater. By not restricting ourselves to any specific protocol from the outset, we let an optimization problem determine which protocol phases should be used and for what fraction of time they should be active to obtain the best performance. At the same time, the approach allows to evaluate any specific communication protocol. All we need to do is set the time shares of the phases that shall not be part of the considered protocol to zero.
Finally, note that average and per phase transmit power constraints can easily be combined using the framework presented in this article. For this purpose, we simply need to add the per phase transmit power constraints to the definitions of the sets ${\mathcal{S}}_{i}$ and ${\mathcal{S}}_{i}^{\prime}$ that specify the contributions of the different protocol phases to the outer bound region and the achievable rate region, respectively. Since the sets are then bounded, Proposition 4 becomes obsolete as we do not need a condition on the dual variables μ for the dual function to be finite. The per phase power constraints considered in [21] can therefore easily be incorporated into the optimization framework presented in this article. Since the converse is not true, the optimization approach presented here generalizes that of [21].
Endnotes
^{a}In contrast to fullduplex devices, halfduplex nodes cannot transmit and receive simultaneously in the same frequency band, which means that they require orthogonal resources (time, frequency) for transmission and reception. ^{b}Another option to determine points on the boundary of the outer bound region would be to solve rate balancing problems over ${\mathcal{C}}_{\text{OB}}$ for different ratios of the two rates. ^{c}Note that R _{1} and R _{2} only denote two entries of the sets ${\mathcal{S}}_{1},\dots ,{\mathcal{S}}_{6}$. They are not to be confused with R _{1} and R _{2}, which specify the rates of the information exchanged by nodes 1 and 2. ^{d}Note that this assumption is not really a restriction. If the relay is not connected to both terminals, it cannot help the communication between the terminals. And while the direct channel between the terminals may be very weak, e.g., due to high path loss, it is still reasonable to assume it supports rates strictly greater than zero. ^{e}If there exists an (r ^{(k)},p ^{(k)}) such that $({\mathit{r}}^{(k)},{\mathit{p}}^{(k)})\in {\mathcal{S}}_{i}$ and v ^{(k)}=B _{ i } r ^{(k)} for more than one i∈{1,…,6}, we assign the index k to only one set ${\mathcal{K}}_{i}$ so that ${\mathcal{K}}_{i}\cap {\mathcal{K}}_{j}=\varnothing $ for i≠j. ^{f}The protocol names are due to [19, 20], which are the only two articles among references [14–20] that do not only consider the multiple access broadcast (MABC) protocol. ^{g}The reasoning why μ _{3}>0 must hold for Θ(λ,μ)<∞ is more complicated in this case since λ _{2}=λ _{4}=0 does not imply (22). However, the final conclusion remains the same. ^{h}In order to obtain the results for the oneway case, we simply let $\mathit{w}=\left[\begin{array}{c}1\\ 0\end{array}\right]$ and considered only phases 1 and 6 in the evaluation of the dual function as explained in Remark 3.
Appendix 1
Proof of Proposition 3
Let $(\mathit{y},\mathit{z}),({\mathit{y}}^{\prime},{\mathit{z}}^{\prime})\in \mathcal{S}$ and λ∈[0,1]. Moreover, define α _{ i }=λ τ _{ i } and ${\beta}_{i}=(1\lambda ){\tau}_{i}^{\prime}$. Then,
and
Since α _{ i },β _{ i }≥0, $({\mathit{r}}_{i},{\mathit{p}}_{i}),({\mathit{r}}_{i}^{\prime},{\mathit{p}}_{i}^{\prime})\in {\mathcal{S}}_{i}$, and ${\mathcal{S}}_{i}$ is convex, it follows that $\frac{{\alpha}_{i}}{{\alpha}_{i}+{\beta}_{i}}({\mathit{r}}_{i},{\mathit{p}}_{i})+\frac{{\beta}_{i}}{{\alpha}_{i}+{\beta}_{i}}({\mathit{r}}_{i}^{\prime},{\mathit{p}}_{i}^{\prime})\in {\mathcal{S}}_{i}$, i.e.,
Furthermore, 0≤α _{ i }+β _{ i }≤1, ∀i∈{1,…,6}, and ${\sum}_{i=1}^{6}{\alpha}_{i}+{\beta}_{i}={\sum}_{i=1}^{6}\left(\lambda {\tau}_{i}+(1\lambda ){\tau}_{i}^{\prime}\right)=1$, which means that $\lambda (\mathit{y},\mathit{z})+(1\lambda )({\mathit{y}}^{\prime},{\mathit{z}}^{\prime})\in \mathcal{S}$. This proves the proposition. □
Appendix 2
Proof of Proposition 4
For any λ≥0 such that ${\mathit{A}}^{\mathrm{T}}\mathit{\lambda}=\left[\begin{array}{c}{\lambda}_{1}+{\lambda}_{2}\\ {\lambda}_{3}+{\lambda}_{4}\end{array}\right]=\left[\begin{array}{c}{w}_{1}\\ {w}_{2}\end{array}\right]=\mathit{w}$, note that Θ(λ,μ)<∞ is equivalent to ${sup}_{({\mathit{r}}_{i},{\mathit{p}}_{i})\in {\mathcal{S}}_{i}}\left\{{\mathit{\lambda}}^{\mathrm{T}}{\mathit{B}}_{i}{\mathit{r}}_{i}{\mathit{\mu}}^{\mathrm{T}}{\mathit{p}}_{i}\right\}={\text{max}}_{({\mathit{r}}_{i},{\mathit{p}}_{i})\in {\mathcal{S}}_{i}}\left\{{\mathit{\lambda}}^{\mathrm{T}}{\mathit{B}}_{i}{\mathit{r}}_{i}{\mathit{\mu}}^{\mathrm{T}}{\mathit{p}}_{i}\right\}<\infty $ for all i∈{1,…,6}. For μ>0, we hence prove the “if” part of the proposition by exemplarily showing that ${sup}_{({\mathit{r}}_{i},{\mathit{p}}_{i})\in {\mathcal{S}}_{i}}\left\{{\mathit{\lambda}}^{T}{\mathit{B}}_{i}{\mathit{r}}_{i}{\mathit{\mu}}^{T}{\mathit{p}}_{i}\right\}<\infty $ for i=1 as corresponding statements for i=2,…,6 follow along the same lines.
With ${\mathit{B}}_{1}={\left[\begin{array}{cccc}1& 0& 0& 0\\ 0& 1& 0& 0\end{array}\right]}^{\mathrm{T}}$ and only terminal 1 transmitting during phase 1, we have
where the inequality is due to the fact that $I({X}_{1}^{(1)};{Y}_{\mathrm{R}}^{(1)}{Y}_{2}^{(1)})=I({X}_{1}^{(1)};{Y}_{2}^{(1)})+I({X}_{1}^{(1)};{Y}_{\mathrm{R}}^{(1)}{Y}_{2}^{(1)})\ge I({X}_{1}^{(1)};{Y}_{2}^{(1)})$, which follows from the nonnegativity of and the chain rule for mutual information ([25], Chap. 2). Now, suppose ${\mathit{H}}_{1}^{\mathrm{H}}{\mathit{H}}_{1}=\mathit{V}\mathit{\Phi}{\mathit{V}}^{\mathrm{H}}$ with $\mathit{\Phi}=\mathit{\text{diag}}({\phi}_{1},\dots ,{\phi}_{{N}_{1}})$ is the eigenvalue decomposition of ${\mathit{H}}_{1}^{\mathrm{H}}{\mathit{H}}_{1}$, and let us also express R ^{(1)} by means of its eigenvalue decomposition R ^{(1)}=U Σ U ^{H}. Then, the trace of R ^{(1)} is independent of the modal matrix U and equal to the sum of its nonnegative eigenvalues ${\sigma}_{1},\dots ,{\sigma}_{{N}_{1}}$. Moreover, Hadamard’s inequality ([25], Thm. 17.9.2) can be used to show that, with U=V,
For μ _{1}>0, the righthand side of above equality has a waterfilling type solution given by
which implies that $0\le {\sigma}_{k}^{\star}\le \frac{{\lambda}_{1}+{\lambda}_{2}}{{\mu}_{1}}<\infty $ for all k∈{1,…,N _{1}}, and consequently,
The proofs of the converse and the “if” part of the proposition for μ≯0 are omitted because they directly follow from the necessary and sufficient conditions for Θ(λ,μ)<∞ if μ _{ k }=0, k∈{1,2,3}. □
Appendix 3
Outline of coding scheme that achieves ${\mathcal{R}}_{\text{DF}}$
The achievability of ${\mathcal{R}}_{\text{DF}}$ is proved in [22] for a discrete memoryless channel (DMC) without feedback. The coding scheme uses random encoding and jointly typical decoding on the n th extension of the DMC (see [25], Sec. 7.5 for a definition), meaning that the data transmission is performed with n channel uses. Furthermore, it is assumed that TDD phase i is used n _{ i } times, where $\frac{{n}_{i}}{n}\to {\tau}_{i}\in [0,1]$ as n grows large.
The message ${W}_{1}\in \{1,\dots ,{2}^{n{R}_{1}}\}$ is to be transmitted from node 1 to node 2, whereas ${W}_{2}\in \{1,\dots ,{2}^{n{R}_{2}}\}$ denotes the message to be sent from terminal 2 to terminal 1 that is independent of W _{1}. Both messages are split into six parts: W _{1}=(W _{11},…,W _{16}) and W _{2}=(W _{21},…,W _{26}) such that ${W}_{1a}\in \{1,\dots ,{2}^{n{R}_{1a}}\}$ and ${W}_{2b}\in \{1,\dots ,{2}^{n{R}_{2b}}\}$, a,b∈{1,…,6}. The messages are then conveyed to the other terminal as follows:
Phase 1: Node 1 transmits a codeword ${X}_{1}^{(1)}({W}_{11},{W}_{12},{W}_{13})$.
Phase 2: Node 2 transmits a codeword ${X}_{2}^{(2)}({W}_{21},{W}_{22},{W}_{23})$.
Phase 3: Node 1 transmits a codeword ${X}_{1}^{(3)}({W}_{14},{W}_{15})$ and node 2 sends ${X}_{2}^{(3)}({W}_{24},{W}_{25})$. The two codewords are independent!
After phase 3, the relay reliably decodes the messages (W _{11},…,W _{15}) and (W _{21},…,W _{25}), which requires