Skip to main content

How Much Feedback Is Required for TDD Multi-Antenna Broadcast Channels with User Selection?


The enormous gains in a multi-antenna transmitter broadcast channel require the Channel State Information at the Transmitter (CSIT). Although the fundamental question "How much feedback is required for a broadcast channel?" has been treated in the literature to some extent, a more comprehensive treatment is certainly desirable. We study the time-division duplex broadcast channel with initial assumption of channel state information (CSI) neither at the base station (BS) nor at the users' side. We provide two transmission strategies through which the BS and the users get necessary CSI. We derive novel lower and upper bounds for the sum rate reflecting the rate loss compared to a perfect CSIT system. Corresponding approximate sum rate expressions are also developed for both schemes. These expressions fully capture the benefits of the CSIT feedback, allowing multi-user diversity gain and better inter-user interference cancellation, and the cost of exchange of information required. These expressions can be optimized for any set of system parameters to unveil the trade-off between the cost and the gains associated to feedback. Thus they allow to characterize the optimal amount of feedback which maximizes the sum rate of the broadcast channel, a well-accepted metric of system performance at the physical layer.

1. Introduction

1.1. Background and Motivation

In a broadcast channel (BC) having a base station (BS) equipped with transmit antennas and () single antenna users, the dominant term of the sum capacity is [13], where denotes the signal-to-noise ratio of the received signal and is called the multiplexing gain. The BC enjoys another gain, coined as multiuser diversity [4], which is due to the possibility of user selection from a larger () pool of users. It has been shown in [5, 6] that the sum capacity of the Gaussian broadcast channel scales with the number of users as , where is the total number of users in the system whose channel information is available at the BS. Apart from the multiplexing gain and the multiuser diversity benefit, the BC enjoys two other advantages over the single user multiple-input multiple-output (SU MIMO) channel. It allows mobile users to have a single antenna each so user terminals can remain quite inexpensive and simple. The second advantage is that the BC channel matrix is often much better conditioned as compared to that of an SU MIMO link which may suffer from line-of-sight channel conditions and strong spatial correlation [7]. These same advantages typically continue to hold compared to multiuser (MU) MIMO systems in which the total number of receive antennas equals or exceeds .

These promising advantages of broadcast MIMO do not come for free as without channel state information at the transmitter (CSIT) and perfect channel state information at the receiver (CSIR), the dominant term of the sum capacity is only because of the optimality of transmitting to a single user in that case [810]. Thus the CSIT of users is indispensable to achieve the full multiplexing gain [1, 3]. Furthermore, to capture the multiuser diversity benefit of in the sum rate, the BS should know the CSI of all these users, where normally could be much larger than .

With perfect CSIT, [11] shows that zero-forcing (ZF) precoding achieves the full multiplexing gain of and the full multiuser diversity gain of of the broadcast channel if the number of users is asymptotically large, although the optimal transmission strategy for the Gaussian BC is dirty paper coding (DPC) [12]. In [5], the authors introduced a very innovative scheme coined as Orthogonal Random Beam Forming (ORBF), where only a few bits of feedback are required from every user and the sum rate was shown to converge to the optimal DPC sum capacity [6] for a large number of users.

There is an enormous volume of research publications analyzing CSIT acquisition techniques and the associated feedback gains in different scenarios but the fundamental issue, which is usually ignored, is the feedback overhead cost of providing CSI to the BS which leads to reduced sum capacity. Both the gain and the acquisition overhead increase with the amount of feedback but there is an optimal operating point (optimal amount of feedback) that maximizes the net gain. To the best of our knowledge, there is no single contribution that has properly analyzed the net gain of feedback in a general broadcast setting (with ) which can be defined as the gain in downlink (DL) sum rate due to feedback, taking into account the uplink (UL) feedback load. For a given feedback load, the problem boils down to a secondary problem, namely, partitioning of this feedback, either to improve the CSIT quality (better interference cancellation for a given user selection) or to get the multiuser diversity gain (select from a larger pool). We take a step back and address the more fundamental problem of how much feedback should be there to achieve the feedback gain-cost tradeoff. A very simple example showing the importance of this absolute gain would be the ORBF transmission scheme which requires as little feedback as bits plus a scalar from each user in the system, but considering the fact that ORBF requires the presence and the feedback from a large number of users, the absolute gain would become questionable.

The second fundamental aspect which often gets overlooked in the analysis of multiuser systems is the consideration of channel coherence time. Wireless channels have a finite coherence time and when multiuser transmission strategies with multiple rounds of training, feedback and data are devised, there is the possibility that the channel has significantly changed during the preliminary training and feedback intervals and that the channel information attained during these phases has become meaningless.

1.2. Contribution

In this paper, we do not make any assumption of CSI. Hence initially, the BS and the users are ignorant of the channel realization but they can estimate/feed back the CSI as is done in practice. To analyze the cost incurred and the attainable benefit of feedback in a meaningful and tractable fashion, we simplify the problem by selecting a time-division duplex (TDD) broadcast channel with perfect reciprocity. TDD reciprocal channels simplify the CSIT acquisition through UL pilot transmission [13, 14] in contrast to frequency-division duplex (FDD) systems in which the users first estimate the DL channel and then send its quantized version in an UL slot. We restrict the CSIT acquisition to be training based only, thanks to TDD reciprocity [13, 14]. In the sequel, we use the terms training and feedback synonymously due to this restriction [14]. So we have a fixed resource (bandwidth and time slot) available, a BS having transmit antennas and () single antenna users. Now this fixed resource can be used for UL/DL data transmission or training/feedback. We assume that the users have no data to transmit in the UL direction. So the UL is solely reserved for channel training/feedback. But due to the TDD mode of operation, any UL transmission will come at the expense of having reduced DL transmission in the overall time slot, hence the cost of training/feedback gets properly accounted for. In this paper, we propose two transmission schemes. In the first scheme, the users, who feed back, are chosen independently of their channel realizations (whence oblivious users). In the second scheme, the users first learn their channel information and decide to feedback based upon their channel realizations (whence informed users). We derive a novel lower bound for the sum rate, capturing the gains and the costs of the CSIT acquisition, which shows explicitly the rate loss with respect to (w.r.t.) a perfect CSI system. We furthermore introduce a sum rate upper bound, that turns out to be closely related to the lower bound. The simplified expressions obtained for the two schemes allow us to maximize the DL sum rate (the performance metric considered here) achieving the cost-benefit tradeoff of the feedback.

1.3. Related Work

Caire et al. [15] studied the achievable rates for multiuser MIMO DL removing any assumption of CSIR or CSIT for FDD systems. They gave transmission schemes incorporating all the necessary training and feedback stages and compared achievable rates for analog and digital feedback schemes. This work was conducted under the assumptions of extremely large channel coherence lengths (which permits to neglect the training and feedback overhead) and of a number of users () equal to . Later in [16], training and feedback parameters were optimized as a function of channel coherence time and SNR, although the number of users was still restricted to . In [15], the sum rate for a system with training-based CSIT is lower bounded in terms of the sum rate of a related system with perfect CSIT. We provide here a novel similar lower bound in which the channel distribution corresponding to the perfect CSIT is actually that of the estimated channels, thus providing a much tighter lower bound compared to the one in [15], where the usual perfect CSIT channel distribution is that of the true channel. Furthermore, our novel lower bound is much easier to derive than the one in [15].

In [17], the authors analyze the tradeoff between multiuser diversity and the accuracy of quantized channel information at the BS. Under the restriction of a fixed number of feedback bits, they conclude that accurate channel information is more important than having multiuser diversity.

In another recent work [18] treating essentially UL systems, the authors considered chunk size optimization (amount of feedback) taking UL and DL of a SU MIMO channel into account.

References [13, 19] are related to our work as they also treat the TDD broadcast channel without any assumption of CSI. But there are major differences in the scope. They treat the case when the number of users in the system is less than the number of BS antennas and try to exploit the channel hardening effect [20] due to a large number of BS antennas, which eliminates the multiuser diversity gain completely. Moreover, in both of these references, the users are never trained about their effective channels and the data is transmitted on the expected value of the effective channel. In contrast, our analysis is for systems with a larger number of users than BS transmit antennas because this setting is certainly more practical than its opposite counterpart. And in both of our transmission strategies (oblivious and informed users), the users are explicitly trained about their effective channel after precoding. The other major difference is in the achievable sum rate. The sum rate in [13, 19] saturates with DL SNR, giving zero multiplexing gain, even if DL and UL SNRs are of the same order, whereas our schemes achieve full multiplexing gain in this setting. A very recent related reference is [21] which is similar to [19] for the most part. Section VII of [21] gives a scheme similar to our scheme with oblivious users (see Section 3), but their sum rate lower bound, given in Theorem involving four expectation operations, neither brings any insight about the sum rate behavior nor seems amenable to any further analysis.

1.4. Organization

This paper is structured as follows. First the system model is described in Section 2. Then in Section 3, the transmission scheme with oblivious users is detailed and the novel lower bound for the sum rate is derived. Section 4 gives the parallel developments for the scheme with informed users. The tightness of the sum rate lower bound and the accuracy of the approximate expressions are illustrated in Section 5. The behavior of the sum rate for oblivious users strategy under various asymptotic regimes is investigated in Section 6 followed by its counterpart for informed users in Section 7. The results for optimal feedback load (optimal number of users) with finite system parameters are explored in Section 8. Section 9 presents the conclusions and some directions for future research.

Notation 1.4.

denotes statistical expectation. Lowercase letters represent scalars, boldface lowercase letters represent vectors, and boldface uppercase letters denote matrices. Moreover denotes the Hermitian transpose of matrix . The identity matrix of dimensions is denoted by . The logarithm with base is denoted by . The cardinality of a set is expressed as .

2. System Model

The frequency-flat system we consider consists of a BS having transmit antennas and () single-antenna user terminals. In the DL, the signal received by th user can be expressed as


where , are the (complex conjugated) channel vectors of users through , with ( denotes the -dimensional complex space), denotes the -dimensional signal transmitted by the BS, and are independent complex Gaussian additive noise terms with zero mean and unit variances. We denote the concatenation of the channels by , so is the forward channel matrix. The channel input from the BS must satisfy an (average) transmit power constraint of , that is, . In this setting, the transmit power is equal to the true signal-to-noise ratio at each user due to normalized noise variances.

The channel is assumed to follow a block-fading model having a coherence length of symbol intervals without channel variation, with independent fading from one block to the next [22]. The entries of the forward channel matrix are independent and identically distributed (i.i.d.) complex Gaussian with zero mean and unit variance. Due to the no CSI assumption, initially all the users and the BS are oblivious of the channel realizations in each block.

For the power constraint at the user terminals, we mainly treat the case of peak power constraint. The peak power per user per channel use is bounded by . The sum rate bounds for average power constrained users are provided in the appendix. The noise in the UL at the BS is also assumed to be spatiotemporally white complex Gaussian with unit variance.

3. Transmission Scheme with Oblivious Users

In this scheme, the users who feed back are unaware of their channel state. So they might be selected in a round-robin or any other fashion, independently of their channel realizations. For the block fading channel with coherence length of symbol intervals, we divide this interval in three phases (see Figure 1): uplink training, downlink training, and coherent data transmission with imperfect CSI. The first phase is the uplink training phase in which a certain number of users train the BS about their forward channels and the BS makes an estimate of the associated forward channel matrix. Based upon this channel information, the BS does the scheduling and chooses the transmit precoding which in general could be simple linear ZF, some nonlinear strategy like vector perturbation or the optimal DPC. The second phase is the downlink training phase, where the BS transmits pilots so that the scheduled users estimate their corresponding effective channels. When this second phase ends, both sides of the broadcast channel have necessary CSI, albeit imperfect. Hence in the third data phase, the BS transmits simultaneously to the selected users who can decode the data coherently.

Figure 1
figure 1

Transmission Phases for Oblivious Users.

Remark 3.

When transmission is switched from UL to DL or vice versa, a guard interval must be inserted in practice. We do not take this guard interval into account as it does not affect our absolute feedback gain analysis.

Below we provide a detailed analysis of the three transmission phases and the necessary BS processing steps.

3.1. Uplink Training Phase

In a TDD system with perfect channel reciprocity, CSIT can be provided to the BS just by transmitting pilots from the users. The BS estimates the users' uplink channels and these are then also the forward channels due to the perfect reciprocity assumption. Suppose (superscript obl stands for oblivious users) of the users transmit pilots, then the length of this uplink training interval is where   should assume a real value sothat is an integer ( can be used when we want a given number of users to transmit for more time and improve their channel estimates at the BS, leads to significantly degraded channel estimates). Assuming orthogonal codes (which is optimal here) of length , the users can transmit simultaneously with transmit energy per user equal to . As each antenna at the BS receives the transmitted code from a particular user through the channel coefficient which links this antenna to that user, the energy received for each channel coefficient (of CSIT) would be . Assuming the BS employs MMSE channel estimation, the resulting channel coefficient estimation errors are i.i.d. complex Gaussian with zero mean and mean-square error (MSE)


For the th user, with channel , the channel estimate is denoted as and the corresponding estimation error is according to . The also have i.i.d. complex Gaussian entries with zero mean and variance (due to the orthogonality property of MMSE estimation). Note that due to training code orthogonality, the channel estimation quality improves with the number of users .

The training length is basically the price of obtaining CSIT at the BS through feedback which reduces the effective channel coherence time to . Hence, the CSIT acquisition from a very large number of users may be very suboptimal.

3.2. BS Transmission Strategy: ZF Precoding with Semiorthogonal User Selection

It is known that DPC allows to achieve the full capacity region of the MIMO broadcast channel [12] but this scheme is complex and its implementation is quite tedious. ZF linear precoding with user selection has been shown to behave quite optimally at high SNR achieving full multiplexing gain in the sum rate [23]. Furthermore in [11], the authors showed that ZF preceded by semiorthogonal user selection (SUS) achieves both the multiplexing gain and the multiuser diversity gain. SUS has been modified in [24] to work with imperfect CSIT in a robust manner. Due to its simplicity, analytical tractability and attractive performance, we choose SUS and ZF precoding as the BS transmission strategy.

We adopt the SUS algorithm of [11] where user orthogonality is imposed at each selection stage. Suppose denotes the set of selected users having cardinality and denotes the BS estimate of the channel matrix of the selected users. In ZF precoding, the unit-norm beamforming vector for the th selected user (denoted as ) is chosen to be orthogonal to the channel vectors of all other selected users, that is, . If is the pseudoinverse of , that is


then the precoding matrix can be obtained from by normalizing all of its columns. For ZF with perfect CSIT, each user receives only the beam directed to it and no multiuser interference is experienced. For the imperfect CSIT case, there is some residual interference. If represents the vector of information symbols ( intended for the th user), the transmitted signal becomes and the signal received by the th selected user (1) can be expressed as follows:


3.3. Downlink Training Phase

It was remarked in [25] that only one symbol interval is sufficient to let the selected users learn their effective scalar channels . In a very recent reference [26], the authors show that this minimal training becomes optimal with joint pilot and data processing. As this DL training length has no relation with the number of users present in the system or the number of BS antennas (), we assume that the selected users are able to estimate their effective scalar channels perfectly even though we ignore the overhead of this phase. This simplifies the analysis without influencing the underlying cost-benefit tradeoff of the feedback.

3.4. Coherent Data Phase

We adopt uniform power allocation. So the th user input signal is i.i.d. Gaussian, , where is the power allocated to th user data stream. The BS is bound to satisfy an average power constraint of but it does not transmit during the entire coherence block due to the initial UL training phase of length . Hence, for the rest of the coherence block, the BS is able to transmit an average per symbol power of instead of . So the power allocated to each of the selected users becomes


3.5. Sum Rate Lower Bound

We are interested in getting an expression for the achievable sum rate of this broadcast channel which captures the gain and the cost associated with feedback. The received signal from (4) can be further written as


This is obtained by exploiting the fact that for due to beamforming () and by splitting the effective channel into two parts, one of which, , is perfectly known at the BS. The above equation can be rewritten as


Now we transition from the above exact signal model to the following degraded model:


In this degraded model, we have relegated, as in [27, 28], the signal part into the interference. Considering this interference or noise as independent from the signal leads to a first reduction in the capacity of system (8) w.r.t. the true system (7). Furthermore, for a noise term with given variance we get a further capacity reduction by taking the worst case noise distribution, namely, a Gaussian distribution. Now, for the system model (8) with Gaussian noise, the optimal input is Gaussian: . So the (instantaneous) capacity expression for system (8) is that of an AWGN channel. Note that the channel that we need to average over in order to get the ergodic capacity is now , or in other words average over (on which may in principle also still depend). So, the capacity of model (8) is


which is a lower bound for the capacity of (7). Now, to compute , the variance of each interference coefficient can be computed based upon the fact that the BS performs MMSE estimation which makes the estimation error (with variance per channel entry) independent of any function of the channel estimates [29], of which beamforming vectors are one particular instance:


Furthermore, by introducing and , we get and the SINR lower bound becomes


For a system with perfect CSIT (), would be the SINR and hence its coefficient in the above expression represents the SINR loss factor w.r.t. a system with perfect CSIT. So during the data phase, the lower bound for the per symbol sum rate can be written as


Due to the average power constraint and the reduction of the transmission time to , the transmit power during the data phase gets boosted by a factor . Putting now the value of , we get


If the same system had perfect CSI (), the sum rate obtained through and beamforming with equal power allocation would be [3, 11]


So the lower bound of the sum rate from (13) can be written in terms of the sum rate of a perfect CSI system as


where , the reduced transmission power due to imperfect CSIT, is given by


An important subtlety however is that in (14), the expectation is over the , the true channels, whereas in (13) the are a per-coefficient variance normalized version of the , the channel estimates. With the assumptions taken here, of i.i.d. channel coefficients and noise elements, this does not make any difference here. However, this issue could potentially make a big difference in the case of spatially correlated channels and/or receiver noise at the BS. In that case also, (13) would allow for a more straightforward analysis of SUS, which is also based on the ! By taking now into account the loss of coherence interval due to feedback (training) interval of length , the per symbol average sum rate lower bound for this oblivious scheme becomes


The biggest virtue of this lower bound is that it gives the achievable sum rate of this scheme in terms of the sum rate of a perfect CSI system (employing SUS and ZF precoding) with loss appearing as an SNR reduction factor and as a reduced multiplexing gain due to the feedback interval.

At this point, we introduce a large user regime approximation for . As the and are perfectly known at the BS (perfect CSIT but with a different channel distribution), we can invoke Theorem from [11] for a large number of users. This allows to approximate by in the above expression. (The origin of this approximation can easily be traced as follows. The effective channel strength can be written as the product of the channel norm squared and the inner product . This inner product has value close to one as the selected users are close to orthogonal. The channel norm squared , a chi-square distributed random variable with varying degrees of freedom depending upon the SUS selection stage, shows a growth with (see, e.g., in [5]) using results from order statistics.) Theoretically this scaling kicks in only when is sufficiently large but we show in Section 5 that this starts to hold very well for being a reasonable multiple of . Using this approximation, the fact that for large userregime and putting the value of from (2), the sum rate lower bound (17) becomes the following approximation(Though at low SNR, strictly speaking is optimal, for large the sum rate becomes essentially insensitive to since for small , . Hence can be maintained at all SNR. At this point we wish to add that perhaps the main characteristic of user power optimization, namely that it leads to varying , is captured by our simplified approach with uniform power loading over a set of selected users of varying size):


Due to the approximation made at this final step, this sum rate expression is not necessarily a lower bound but, as we will see, it closely follows both the lower bound and the true sum rate of the system.

3.6. Sum Rate Upper Bound

Consider the signal model in (7), in which certain channel realizations appear. We get for the mutual information between input and output for user


where denotes entropy. For a given distribution (including the optimal one) of the inputs and of the channel estimation errors, the output with variance has an entropy that is upper bounded by that of a Gaussian distribution. Hence


The conditional distribution of given (and ) is Gaussian with (zero mean and) variance . We have


Hence, we get


where the inequality follows from Jensen's inequality (. Hence, a simple application of Jensen's inequality leads to a lower bound for the upper bound which coincides with the rate lower bound we had before in (9). Note that one cannot say that upper and lower bounds coincide with the case of constant because the lower bound assumes Gaussian signal and interference. Nevertheless, one can sense that upper and lower bound are fairly close. In particular, they behave similarly as far as the rate degradations due to imperfect CSIT as considered here are concerned. This implies in particular that the lower bound should also be a good rate approximation.

4. Transmission Scheme with Informed Users

Similar to the previous scheme with "oblivious users," this scheme comprises transmission phases through which both the BS and the users get necessary CSI. We call this the scheme with "informed users" as the users who feed back are no longer randomly selected. These users are selected based upon their channel realizations in a manner to be described shortly. This scheme divides the coherence length of symbol intervals in four phases (see Figure 2): initial downlink training, uplink training, downlink training, and coherent data transmission.

Figure 2
figure 2

Transmission Phases for Informed Users.

In the first phase the BS transmits DL pilots based upon which all the users estimate their corresponding channel vectors. As the BS has antennas, this training interval length is lower bounded by , , and is independent of the number of users . The channel estimation quality will depend on the transmit energy spent by the BS, where is the BS power during . The contribution of to the BS average transmit power constraint is also through the product . On the other hand, as leads to reduced data transmission time, the optimal choice is the minimal . As can be sufficiently high to allow good channel estimates, we do not take into account the estimation error during this phase but only subtract from the coherence length. Also, we will neglect the effect of on the BS power constraint. Another point is that the channel estimation quality in phase 1) will only affect the selection of informed users, but not the quality of CSIT acquisition in phase 2).

Once the users have acquired the information regarding their respective channels, there could be plenty of criteria to prioritize users depending upon their channel realizations but we restrict ourselves to the simple scheme where best users with largest channel norm are selected for feedback. Hence the BS receives the CSIT from the of the users that have the largest channel norms for the current channel coherence block.

The next three transmission phases are exactly similar as those for the transmission scheme with oblivious users.

Important Remark

In this transmission scheme involving informed users, we select the strongest users (having largest channel norms) who train the BS about their channels. Strictly speaking, this is impractical as how can the users know about being the strongest or not with only information about their own channels. But the underlying idea is to evaluate how much feedback load (how many users) should be there to maximize the DL sum rate if good users feed back. Then, in practice, those users can be made to feed back, on the average, by intelligent selection of a threshold with which users compare their channel strength locally as detailed in [30] and decide to feed back or not, and by designing a proper UL channel access protocol. This threshold will be a function of the total number of users, their channel statistics, the number of BS antennas, and the optimal number of users who should feed back. In practice also then, users should use pseudorandom instead of orthogonal codes in phase , with the resulting nonorthogonality influencing the ensuing analysis.

4.1. Sum Rate Lower Bound

We will be quite brief here as the treatment resembles a lot the one for the oblivious users. If every user is constrained with a peak per symbol power constraint of and users transmit pilots in the UL direction, the feedback length would be where . The MSE of CSIT per channel coefficient at the BS is given by


For this scheme with informed users, we have an initial step of DL training so the length of the data phase reduces to . Thus the DL sum rate for the informed scheme with peak power constrained users is given by


One striking difference from the oblivious users case is that the channel strength factor due to multiuser diversity of now becomes where is the total number of users in the system. This difference arises due to the fact that in the informed users case, the optimization eventually involves all users.

5. Accuracy of the Approximate Sum Rate Expressions

To obtain the approximate sum rate expressions, we derived the novel sum rate lower bound and then used the large user regime approximation in the final step. To see how closely these sum rate expressions capture the true sum rate behavior of the two schemes, it is sufficient to show the accuracy for one of the rate expressions because the same approximation is made for both.

We choose the approximate sum rate expression for oblivious peak power constrained users and we compare it with the true sum rate. We use the version of (18) for the data transmission segment


which is the part where the approximation occurs. For fixed values of ,and, we absorb the constant factor of in the BS power constraint both in numerator and denominator of SINR and leaving the constant multiplying factor of outside of the logarithm. So in this form it captures the behavior of the sum rate expression independent of the coherence length . The corresponding saturation level is


To obtain the true sum rate for various system parameter settings, we use Monte-Carlo simulations in which all the steps in the transmission strategy, for example, feedback, SUS scheduling, and ZF beam formation are replicated and then the SINR at each user is evaluated.

Figure 3 shows plots of the sum rate versus DL power constraint . The uplink power constraint for each user has been fixed to 10 dB and the BS is equipped with 4 antennas. The curves for the true sum rate, the lower bound from (17) (but without the factors ) and the approximation (25) of the LB are plotted when , , , and users feed back their channel information to the BS. The SR LB and its approximation almost coincide. The approximate expression captures very closely the true sum rate behavior for any SNR, even at saturation. This saturation of the sum rate is caused by the imperfect CSIT based upon which ZF beamforming vectors are computed (shown for quantized FB in [3] and for analog feedback in [25]). Note that squaring the number of users from 10 to 100 leads to roughly a doubling of the sum rate at high SNR and hence the multiuser diversity aspect is indeed operating.

Figure 3
figure 3

Sum Rate versus DL power for various numbers of users feeding back.

Figure 4 shows the plot of the sum rate with varying numbers of users feeding back and for DL power constraint levels of , , and  dB. These curves further illustrate the tightness of the lower bound (17) and the approximate expression (25) in capturing the multiuser diversity benefit.

Figure 4
figure 4

Sum Rate versus number of users feeding back, for various DL powers .

6. Asymptotic Analysis with Oblivious Users

In this section, we analyze how the sum rate of oblivious users behaves in different asymptotic regimes. Although this analysis is asymptotic, it gives valuable insight about the optimal amount of feedback and its utilization.

6.1. Noise Limited Regime

For the noise limited regime, the power available to the BS is very limited, that is, . In this regime, the noise in each user's received signal fully dominates the interference coming from the beams of other selected users


Using the approximation of for very small , the above sum rate becomes


Plugging in the value of shows that the above expression is an increasing function of . Hence all the users should feed back provided that and then the BS chooses the strongest user for transmission with full power. This asymptote shows that, at low SNR, the multiplexing gain is lost but the multiuser diversity gives logarithmic instead of double logarithmic gain.

6.2. Interference Limited Regime

The interference power due to imperfect CSIT at the BS scales up with the increase in DL power. So when the DL gets very large (), the interference completely dominates the noise and the sum rate saturates to


which reduces to


This equation shows that the sum rate saturation level can be increased by improving the CSIT quality, that is, by reducing . The SNR switching point at which saturation starts is   the point at which the interference equals the channel noise power and after which the interference starts dominating.

6.3. Asymptotically Large Number of Users

For the oblivious scheme with peak power constrained users, the sum rate expression is completely independent of the number of users present in the system and only depends upon the users who actually feed back.

7. Asymptotic Analysis with Informed Users

In this section, we analyze how the sum rate of the scheme with informed users behaves in different asymptotic regimes.

7.1. Noise Limited Regime

In the noise limited regime, the noise completely dominates the interference at each active user, so


Again by using the approximation of for very small , the above sum rate becomes


7.2. Interference Limited Regime

In this case the SR,


saturates at


Again, this saturation level can be increased by refining the CSIT quality.

7.3. Asymptotically Large Number of Users

For the informed scheme, for any power constraint imposed on user terminals, the effective signal strength increases with as only strong users feed back to the BS and hence get scheduled. For this reason, the sum rate shows unbounded growth with the number of users present in the system.

8. Feedback Load Optimization

As Section 5 showed that the approximate sum rate expressions match closely the true sum rate, they can be used to find the optimal number of users to train the BS about their channel information. There are two parameters, namely, the number of users who feed back and the factor. A careful observation of the final sum rate expressions (18) and (24) reveals that for any fixed value of the product and the number of users who feedback, can always be selected to be without any loss of optimality of the sum rate. The optimization of may affect other system parameters such as the total UL energy spent for training but we focus here on the DL sum rate maximization. Hence, with , the amount of feedback load appears as the number of users who feedback.

8.1. Optimal Number of Users versus DL Power

We formulate the problem here for the scheme with oblivious users. The sum rate for this scheme was developed to be


For high SNR (), and also assuming , (35) becomes


where denotes the sum rate for the case of perfect CSIT. We get for the derivative (denoting as temporarily and ignoring the weak dependence of on )


In (37), for large , and , we can ignore the second term in the square brackets compared to the first term , and we can ignore the factor in front of the second term (as can be verified a posteriori from the resulting solution for ). We now distinguish two cases. In the first case we assume high downlink SNR () and finite uplink SNR (). Solving (37) for now yields the optimal


Hence in this case and . In the second case, with finite ratio . Solving for now yields


In this case and .

We also investigate the optimization w.r.t. by numerical optimization. First we check how the optimal number of users (feeding back) scales with DL power . We plot the curves for the optimal number of users versus in Figure 5 and plot corresponding sum rates achieved by using that optimal number of users for each value of in Figure 6. The parameters are symbol intervals, users in the system, per user peak power constraint of  dB, and the BS is equipped with antennas. It is evident that the gains with optimal feedback are undeniable as the sum rate with only feedback from users is much less than the sum rate with the optimal number of users. The saturation of the sum rate due to imperfect CSIT as depicted in Figure 6 and analyzed in (30) and (34) has previously been investigated in [3, 25].

Figure 5
figure 5

Optimal Number of Users versus DL Power .

Figure 6
figure 6

Sum rate with optimal number of users versus DL power .

The behavior in Figure 5 of the curves of optimal number of users feeding back for the two schemes versus is not very straightforward. At high SNR (interference limited regime, see (30) and (34)), both schemes require very good quality CSIT and due to peak power constrained users, it translates to obtaining feedback from each user for longer intervals which comes out to be a lot of users transmitting feedback (users have orthogonal codes and hence can be separated). The numerical optimization seemingly giving a constant at high SNR does not agree closely to (38), where should behave as , which indicates nevertheless a slow variation with , but at high values of , and , the sensitivity of the sum rate to is fairly low.

At low SNR both curves show very different behavior. At low SNR, the system is noise limited and the multiuser diversity factor is very important. Hence the users with very strong channels should be scheduled (see (28) and (32)). In the informed users scheme, only the very strong users feed back so it requires feedback from a small number of users. As SNR increases and interference, and hence CSIT quality start to become more important, more users start feeding back (which is the way to improve CSIT quality for peak-power constrained users). The scheme with oblivious users requires feedback from a large number of users initially to enjoy multiuser diversity. Since that consumes a lot of coherence time in feedback, the number of users who feed back decreases initially as SNR improves, and starts increasing again only at medium to high SNR, to provide higher quality CSIT.

Although the optimal number of users feeding back in the two schemes differs significantly in the lower to medium SNR range, the corresponding sum rates are very comparable, with just a tiny edge for the informed user strategy. In Figure 6, we have also plotted the sum rate curves when users feed back (this number is close to optimal at high DL SNR for both schemes). The resulting sum rate curves are very close to the ones with optimized numbers of users. This indicates that for a fixed channel coherence length , a fixed value of users feeding back (normally much larger than ) can basically achieve the cost-benefit tradeoff of feedback. In other words, the sum rate is not very sensitive to the number of users who feed back, for the whole SNR range.

8.2. Optimal Number of Users versus Channel Coherence Time

We now analyze how the optimal number of users varies with the channel coherence time. We plot two figures, one showing the optimal number of users versus coherence interval in Figure 7 and the other showing the sum rate corresponding to the optimal number of users versus coherence interval in Figure 8. Here the BS has antennas, its power constraint is  dB and there are users in the system with each user restricted to a peak power constraint of  dB.

Figure 7
figure 7

Optimal number of users versus coherence length.

Figure 8
figure 8

Sum rate with optimal number of users versus coherence length.

The curves of the optimal number of users vs. channel coherence time show eventually a linear behavior in as predicted by (38) though the actual behavior appears to be intermediate between (38) and (39). For smaller values of the coherence interval, a small number of users is optimal so that not much of the coherence interval gets consumed in feedback. For very large values of the coherence interval, feedback from a large number of users is optimal so as to select the good users with good quality CSIT. Thus the number of users to feed back scales up with the increase in channel coherence time. The optimal number is always larger for the scheme with oblivious users than in the case of informed users. This behavior can be anticipated from Figure 5 which shows that from low to medium DL SNR values, the optimal number of users in the oblivious scheme is larger than that in the informed scheme.

Sum rate curves for optimal users are plotted at  dB in Figure 8, so the informed user scheme performs better as can be guessed from Figure 6. Sum rate curves have also been plotted for a fixed number of users () feeding back but contrary to the sum rate versus SNR curves where a single suitable number of users feeding back capture the gain of optimal feedback; here it is not possible to find one such number of users capturing the sum rate gains. So the sum rate as a function of is relatively sensitive to the number of users who feed back.

9. Concluding Remarks

We studied the problem of determining the optimal amount of feedback/training for the sum rate maximization of the broadcast channel with no initial assumption of CSI. We introduced two transmission strategies for providing the CSIT to the BS and derived a novel tight lower bound which clearly shows the rate loss w.r.t. a perfect CSI system. The corresponding simplified sum rate expressions, incorporating the gains of the feedback and the cost of exchange of information, allow us to determine the optimal amount of feedback for any set of system parameters. Moreover, the asymptotic analysis carried out for both schemes gives us insight into the amount and the split of the optimal feedback between obtaining multiuser diversity and accurate channel information for better interuser interference cancellation. Apart from the system and channel parameters, the optimal split is also a function of the regime of operation of the system. The noise limited regime demands the use of feedback to benefit from multiuser diversity whereas the interference limited regime requires the use of feedback resources to get fine quality CSIT because the MSE of CSIT is the principal factor to determine the saturation level of the sum rate versus SNR. In between these two regimes, the feedback split depends upon the contribution of multiuser diversity gain, the importance of CSIT quality in the sum rate and the fraction of the coherence interval used for feedback.

The analysis of net rate gain due to feedback for a TDD system with data transmission in both directions and for an FDD system is very interesting direction for future research. The practical implementation of the informed users scheme requires the design of a SNR threshold and a channel access protocol. The design of such an effective threshold metric as well as the analysis of the effect of using pseudorandom training codes could also be interesting research topics.


Average Power Constrained Users

We treated the case when the users in the system are peak power constrained. For average power constrained users, the feedback behavior will change as the CSIT MSE changes. We keep the discussion to a minimum as we believe this power constraint to be unrealistic and impractical. If there are users in the system having channel coherence length of and each is constrained to an average power per channel use, the total UL energy available in each coherence block is . Now if users feed back, each one of these can transmit an energy of . Here the use of orthogonal codes is not necessary because, due to more flexible power constraint, the users can transmit their available power in short intervals. Hence with this energy transmitted for every channel coefficient, the CSIT MSE at the BS will be


Although the users feeding back will be able to transmit pilots with larger energy (if ), they will be transmitting only occasionally, the probability of which will reduce with more users in the system, and hence the long term average power constraint will be satisfied. Such power constraint for the transmission in the UL direction was employed in [31]. The sum rate expressions for the two schemes when the users are average power constrained can be obtained by plugging in the MSE of CSIT from (A.1):


Then these sum rates can be optimized w.r.t. the amount of feedback (the number of users).


  1. Caire G, Shamai (Shitz) S: On the achievable throughput of a multiantenna Gaussian broadcast channel. IEEE Transactions on Information Theory 2003, 49(7):1691-1706. 10.1109/TIT.2003.813523

    Article  MathSciNet  MATH  Google Scholar 

  2. Jindal N, Goldsmith A: Dirty-paper coding versus TDMA for MIMO broadcast channels. IEEE Transactions on Information Theory 2005, 51(5):1783-1794. 10.1109/TIT.2005.846425

    Article  MathSciNet  MATH  Google Scholar 

  3. Jindal N: MIMO broadcast channels with finite-rate feedback. IEEE Transactions on Information Theory 2006, 52(11):5045-5060.

    Article  MathSciNet  MATH  Google Scholar 

  4. Knopp R, Humblet PA: Information capacity and power control in single-cell multiuser communications. Proceedings of the IEEE International Conference on Communications, June 1995, Seattle, Wash, USA 331-335.

    Chapter  Google Scholar 

  5. Sharif M, Hassibi B: On the capacity of MIMO broadcast channels with partial side information. IEEE Transactions on Information Theory 2005, 51(2):506-522. 10.1109/TIT.2004.840897

    Article  MathSciNet  MATH  Google Scholar 

  6. Sharif M, Hassibi B: A comparison of time-sharing, DPC, and beamforming for MIMO broadcast channels with many users. IEEE Transactions on Communications 2007, 55(1):11-15.

    Article  Google Scholar 

  7. Gesbert D, Kountouris M, Heath JRW, Chae CB, Salzer T: From single user to multiuser communications: shifting the MIMO paradigm. IEEE Signal Processing Magazine 2007, 24(5):36-46.

    Article  Google Scholar 

  8. Cover TM, Thomas JA: Elements of Information Theory. John Wiley & Sons, New York, NY, USA; 1991.

    Book  MATH  Google Scholar 

  9. Cover T: Broadcast channels. IEEE Transactions on Information Theory 1972, 18(1):2-14. 10.1109/TIT.1972.1054727

    Article  MathSciNet  MATH  Google Scholar 

  10. Tse D, Viswanath P: Fundamentals of Wireless Communications. Cambridge University Press, Cambridge, UK; 2005.

    Book  MATH  Google Scholar 

  11. Yoo T, Goldsmith A: On the optimality of multiantenna broadcast scheduling using zero-forcing beamforming. IEEE Journal on Selected Areas in Communications 2006, 24(3):528-541.

    Article  Google Scholar 

  12. Weingarten H, Steinberg Y, Shamai S: The capacity region of the Gaussian multiple-input multiple-output broadcast channel. IEEE Transactions on Information Theory 2006, 52(9):3936-3964.

    Article  MathSciNet  MATH  Google Scholar 

  13. Marzetta TL: How much training is required for multiuser MIMO? Proceedings of the 40th Asilomar Conference on Signals, Systems, and Computers (ACSSC '06), November 2006, Pacific Grove, Calif, USA 359-363.

    Google Scholar 

  14. Marzetta TL, Hochwald BM: Fast transfer of channel state information in wireless systems. IEEE Transactions on Signal Processing 2006, 54(4):1268-1278.

    Article  Google Scholar 

  15. Caire G, Jindal N, Kobayashi M, Ravindran N: Multiuser MIMO downlink made practical: achievable rates with simple channel state estimation and feedback schemes. submitted to IEEE Transactions on Information Theory

  16. Kobayashi M, Caire G, Jindal N, Ravindran N: How much training and feedback are needed in MIMO broadcast channels? Proceedings of the IEEE International Symposium on Information Theory (ISIT '08), July 2008 2663-2667.

    Google Scholar 

  17. Ravindran N, Jindal N: Multi-user diversity vs. accurate channel feedback for MIMO broadcast channels. Proceedings of the IEEE International Conference on Communications (ICC '08), May 2008, Beijing, China 3684-3688.

    Google Scholar 

  18. Jorswieck E, Sezgin A, Ottersten B, Paulraj A: Feedback reduction in uplink MIMO OFDM systems by chunk optimization. EURASIP Journal on Advances in Signal Processing 2008, 2008:-14.

    Google Scholar 

  19. Jose J, Ashikhmin A, Whiting P, Vishwanath S: Scheduling and pre-conditioning in multi-user MIMO TDD systems. Proceedings of the IEEE International Conference on Communications (ICC '08), May 2008, Beijing, China 4100-4105.

    Google Scholar 

  20. Hochwald BM, Marzetta TL, Tarokh V: Multiple-antenna channel hardening and its implications for rate feedback and scheduling. IEEE Transactions on Information Theory 2004, 50(9):1893-1909. 10.1109/TIT.2004.833345

    Article  MathSciNet  MATH  Google Scholar 

  21. Jose J, Ashikhmin A, Whiting P, Vishwanath S: Scheduling and precoding in multi-user multiple antenna time division duplex systems. preprint, 2008,

    Google Scholar 

  22. Marzetta TL, Hochwald BM: Capacity of a mobile multiple-antenna communication link in Rayleigh flat fading. IEEE Transactions on Information Theory 1999, 45(1):139-157. 10.1109/18.746779

    Article  MathSciNet  MATH  Google Scholar 

  23. Jindal N: High SNR analysis of MIMO broadcast channels. Proceedings of the IEEE International Symposium on Information Theory (ISIT '05), September 2005, Adelaide, Australia 2310-2314.

    Google Scholar 

  24. Sjöbergh O, Jorswieck EA, Larsson EG: Greedy user selection for zero-forcing and MMSE multiuser beamforming with channel estimation errors. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '08), March-April 2008, Las Vegas, Nev, USA 3137-3140.

    Google Scholar 

  25. Salim U, Slock D: Broadcast channel: degrees of freedom with no CSIR. Proceedings of the 46th Annual Allerton Conference on Communication, Control, and Computing, September 2008 119-125.

    Chapter  Google Scholar 

  26. Jindal N, Lozano A, Marzetta TL: What is the value of joint processing of pilots and data in block-fading channels? Proceedings of the IEEE International Symposium on Information Theory (ISIT '09), July 2009 2189-2193.

    Google Scholar 

  27. Médard M: The effect upon channel capacity in wireless communications of perfect and imperfect knowledge of the channel. IEEE Transactions on Information Theory 2000, 46(3):933-946. 10.1109/18.841172

    Article  MATH  Google Scholar 

  28. Hassibi B, Hochwald BM: How much training is needed in multiple-antenna wireless links? IEEE Transactions on Information Theory 2003, 49(4):951-963. 10.1109/TIT.2003.809594

    Article  MATH  Google Scholar 

  29. Kay SM: Fundamentals of Statistical Signal Processing—Estimation Theory. Prentice-Hall, Englewood Cliffs, NJ, USA; 1993.

    MATH  Google Scholar 

  30. Gesbert D, Alouini M-S: How much feedback is multi-user diversity really worth? Proceedings of the IEEE International Conference on Communications, June 2004, Paris, France 234-238.

    Google Scholar 

  31. Murugesan S, Uysal-Biyikoglu E, Schniter P: Optimization of training and scheduling in the non-coherent SIMO multiple access channel. IEEE Journal on Selected Areas in Communications 2007, 25(7):1446-1456.

    Article  Google Scholar 

Download references


EURECOM's research is partially supported by its industrial members: BMW Group, Cisco, Monaco Telecom, ORANGE, SFR, SAP, STEricssson, Swisscom, Symantec, and Thales. The research reported in this paper has also been partially supported by the European FP7 project WHERE and NoE NEWCOM++ and by the French ANR project APOGEE.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Umer Salim.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Salim, U., Slock, D. How Much Feedback Is Required for TDD Multi-Antenna Broadcast Channels with User Selection?. EURASIP J. Adv. Signal Process. 2010, 278952 (2010).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: