- Research Article
- Open Access
Optimization of Hierarchical Modulation for Use of Scalable Media
© Y. Liu and C. Heneghan. 2010
- Received: 2 August 2009
- Accepted: 13 January 2010
- Published: 7 April 2010
This paper studies the Hierarchical Modulation, a transmission strategy of the approaching scalable multimedia over frequency-selective fading channel for improving the perceptible quality. An optimization strategy for Hierarchical Modulation and convolutional encoding, which can achieve the target bit error rates with minimum global signal-to-noise ratio in a single-user scenario, is suggested. This strategy allows applications to make a free choice of relationship between Higher Priority (HP) and Lower Priority (LP) stream delivery. The similar optimization can be used in multiuser scenario. An image transport task and a transport task of an H.264/MPEG4 AVC video embedding both QVGA and VGA resolutions are simulated as the implementation example of this optimization strategy, and demonstrate savings in SNR and improvement in Peak Signal-to-Noise Ratio (PSNR) for the particular examples shown.
- Fading Channel
- Additive White Gaussian Noise
- Convolutional Code
- Scalable Video
- Space Ratio
Recent developments in media source coding have evolved from consideration not only of compression efficiency in terms of rate-distortion curves, but also on methods for providing easy-to-use scalability features. Scalability refers to the ability of the media delivery system to easily provide a range of spatial, temporal, and quality profiles in response to changing system conditions or user demands. For example, a person viewing a sports event on a mobile phone may be content to view a QCIF ( pixels) resolution level at 25 fps, whereas a person with access to an HDTV may wish for a 50 fps, 720 p ( pixels) version of the same media. Such demands can be met using scalable video and audio coding, where lower resolution or lower quality signals can be reconstructed from partial bit streams. This allows simpler delivery of digital media, as networks and terminals can autonomously adapt to issues such as network heterogeneity and error-prone environments (e.g., wireless fading channels) . Scalability allows the removal of parts of the bitstream, while achieving a rate-distortion (R-D) performance with the remaining partial bitstream (at any supported spatial, temporal, or SNR resolution), that is, comparable to a "single-layer" approach , that is, nonscalable H.264/MPEG-4 AVC coding (at that particular resolution) .
However, in order to take maximum advantage of scalable coding, we need to ensure that scalability is treated at a system level, so that all layers of the communication stack can make intelligent decisions about how to use scalability. For example, in real-time audio-visual traffic, consecutive packets carry data of different importance for the user perceived quality. Header information is of vital importance, whereas texture information (in video coding) can tolerate some errors. So, although data may be lost due to congestion or poor wireless channel conditions, the class of data lost will have the largest impact on user experience . Nevertheless, many current media transmission systems assume all data from higher layers is equal in importance, and rely upon the higher layers to provide the additional redundancy which can help protect more important information. However, it can be agreed that scalable media codecs often have the inherent property that some data is more important than others, and exploiting that knowledge may enhance overall system performance.
One strategy that could be employed is to use time-slicing of data with different priorities; however in , Cover proved that if a sender wants to send information simultaneously to several receivers, given specific channel conditions, superimposing high-rate information on low-rate information may achieve higher bandwidth efficiency than the time-sharing strategy.
Scalable coding interacts naturally with hierarchical modulation. Since the packets encoded by scalable codecs can be divided into different classes of priority, a simple scheme would create two classes such as "base information'' and "refinement information'' according to their contribution to the quality/temporal/spatial resolution of the media. The packets belonging to the base level can be allocated to the base bits of the hierarchial constellation, meanwhile the refinement packets can be assigned to the refinement bits of the constellation. The user who is able to decode the base bits of the hierarchical constellation can achieve the lower resolution. Furthermore, if a user is able to decode both the base bits and the refinement bits, a higher resolution is achieved. The enhancement layer cannot reconstruct a higher resolution alone. It has to reuse the information of the lower resolution embedded in the base layer. In order to provide two different resolutions using a nonscalable codec, the media must be encoded twice and the media packets for different resolutions cannot reuse the information from each other. Since the base layer packets encoded by a scalable codec can be reused by the enhancement layer packets, the scalable codec is more efficient than the nonscalable codec in providing multiresolution media simultaneously. In this case, the source packets contributing to the low resolution are allocated to the base bits of the hierarchical constellation and the packets which only contribute to the high resolution are carried by the refinement bits. The users close to the station are able to get all packets decoded and receive a high resolution program. Due to reduced radio signal attenuation, the users far away from the station will probably not be able to decode the refinement bits, but they can still decode the packets for low resolution with acceptable quality.
Note however that the flexibility introduced by hierarchical modulation does not come without a price. In , Jiang and Wilford illustrated that a penalty of slightly reduced SNR in base layer bits is introduced by hierarchical modulation. This penalty has equal impact on both scalable and nonscalable codec in a hierarchical system.
However, as we shall see in Section 2, hierarchical modulation also imposes a second performance penalty, namely, for a given choice of hierarchial constellation, and fixed target bit error rates of the two streams, the system will almost certainly be operating at a higher overall SNR than is needed to satisfy the target BERs.
In this paper, we will show how the constellation can be dynamically adapted at the physically layer in order to remove this performance penalty. This adaptation can be done at a session level, or even with finer granularity (e.g., at a one-second interval) in response to the changing dynamics of the transmitted bit-streams.
The paper is presented as follow. In Section 2 we discuss the basic analytical tools for calculating bit error rates in a sample hierarchical system. A simulation of the single-user scenario, a simulation of the multiuser scenario and their results are described in Sections 3 and 4, separately. Section 5 concludes this paper.
As introduced above, hierarchical modulation is a physical layer modulation technique in which the received signal constellations can be treated in two (or more) parts, by first making coarse decisions about the constellation location, followed by a refined decision on the exact location. Figure 1 shows a 16-QAM constellation diagram to illustrate hierarchical modulation. The data carried by this constellation is broken into two classes: a low priority (LP) and high priority (HP) class. The bits from the HP stream are used to select the quadrant of the constellation point, and the LP stream is used to choose the exact constellation point. The notations and represent the intra- and interconstellation group distances, respectively. The ratio is an important parameter, as it defines the achievable error rates of the system in the presence of noise. When is equal to 1, the constellation reverts to a standard 16-QAM constellation. When is larger than 1, the HP stream is more heavily protected against noise than the LP stream. This is compatible with the typical definition of constellation ratio in DVB-T/DVB-H standard .
Before assigning the HP and LP streams to Hierarchical Modulation constellation points, we can decrease the bit error probability of the streams by using standard coding techniques such as convolutional coding. A high-rate code is suitable for the LP bit stream because of its lower bit error rate demand. Using different rates of code in the HP and LP bit streams is helpful in achieving arbitrary target bit error rates in the Physical Layer.
Exact (in ) BER expressions for uniform -QAM over an additive white Gaussian noise (AWGN) channel have been developed in [17, 18] based on signal-space concepts and a recursive algorithm, respectively. Exact expressions for the BER of 16-QAM and 64-QAM in nonfading and frequency flat fading channels were derived in . The exact and generic (in ) expression for the BER of uniform square QAM in the presence of AWGN channel was obtained in .
For uncoded hierarchical constellation scenarios, an approximate BER expression is described in [9, 10] for 4/16-QAM, 4/64-QAM and in  for multicast -PSK. Reference  obtains exact and generic expressions in for the BER of the -QAM (square and rectangular) constellations over additive white Gaussian noise (AWGN) and fading channels. Over the AWGN channel, these expressions can be described by a weighted sum of complementary error functions.
In the analysis and simulations which follow, we assume two bit streams, separately fed into convolutional encoders with code rates and , which are then gray-coded and modulated onto a 16-QAM constellation. After the encoding and modulation, the two streams are converged into one symbol sequence and transmitted through an AWGN channel. In the receiver the symbols contaminated by noise are demodulated using a Maximum-Likelihood-Sequence-Estimation technique (Viterbi).
In order to determine the performance of this hierarchical modulated scheme, we carry out an analysis of the error probability for the uncoded case. An exact bit error probability expression has been derived in . In this section, the expression will be further developed into a simpler form. This will allow us to minimize the overall SNR which satisfies the target BERs. For the sake of clarity, we will start the analysis from the original step.
As described in , the 16-QAM constellation is equivalent to two 4 PAM signals on quadrature carriers. Since the signals in the phase-quadrature components can be perfectly separated at the demodulator, the probability of error for QAM can be easily determined from the probability of error for PAM. Therefore, the probability of a bit error for the -ary QAM is
where and are the error probabilities of the -ary PAMs with one-half the average power in each quadrature signal of the equivalent QAM system. It should be emphasized here that the error probability discussed here is bit error, which is different from the symbol error in .
The error probability for the bits contained in the HP stream is
where is the received symbol contaminated by white Gaussian noise with zero-mean and variance , and is the transmitted symbol (i.e., ). We assume that each symbol is equiprobable. Given this AWGN channel, the error probability can be given generically as
The average bit energy is
Using (1)–(5), we obtain the error probability for the HP bit of 4-PAM as
From the same argument, we can determine the error probability for the LP bit of the 4-PAM constellation as
Assume that the distances between the corresponding signal points in the Imaginary component and the Quadrature component are same:
2.1. Optimization of Hierarchical Modulation for AWGN Channel
From (9) we can derive the Signal-to-Noise Ratio (SNR) for low priority bits and high priority bits as a function of space ratio and the target bit error rate for high priority bits and low priority bits:
The overall SNR required by the transmission of both high priority bits and low priority bits is the bigger one of the SNR described by (10). Thus, given target bit error rates for high priority bits and low priority bits,
the optimization of the hierarchical modulation can be described by the following equation:
Since the function in (9) does not have an expression with finite number of coefficients, it is difficult to get an exact expression for (10). There are several approximations proposed in [23–26]. However, all these approximations are suitable for a specific range of the independent variable. For example, when the independent variable is smaller and far away from ( ), an approximation of function is derived from the Maclaurin series:
The objective of the optimization is to find out an optimum number given , which leads to a minimum overall SNR. Thus the above approximation of function is not suitable. In this section, we first analyze the property of (10) by aid of the BER versus SNR curve. Then, a realistic method is used to calculate the tabulation of the overall SNR versus the space ratio and the target BER for high priority bits and low priority bits.
2.2. Optimization of Hierarchical Modulation for Flat Rayleigh Fading Channel
Since an OFDM system is employed in the simulation, the multipath Rayleigh fading channel is converted to a flat Rayleigh fading channel for a specific subcarrier, given that the cyclic prefix length is longer than the number of taps used by the multipath fading channel. In this section, the bit error probability of high priority bits and low priority bits over flat Rayleigh fading channel are deployed and the optimization of the hierarchical modulation over flat Rayleigh fading channel is explained.
In the simulation of this paper we employed a frequency-selective fading channel. That is, we simulated an indoor small scale multiple reflective paths radio environment and there is no line-of-sight component. There is relatively slow motion between the transmitter and the receiver. The mathematical model of the multipath radio channel is expressed by (14):
In one subcarrier of the OFDM symbol, the multipath Rayleigh Fading channel is converted to a single path channel:
or in normalized continuous version,
The system channel model is described by
in which is the received signal, is the transmitted signal, is the complex flat Rayleigh fading component and n is Additive White Gaussian Noise (AWGN) with mean 0 and variance . When the received signal is equalized in the receiver, the flat Rayleigh fading component is estimated by the receiver and used to divide (19). The following equation is derived from (19):
Equation (20) indicates that by taking into account the flat Rayleigh Fading component the generic error probability over AWGN channel as described by (3) becomes
in which is a Rayleigh distribution random variable and is chi-square random distributed with two degrees of freedom, if the variance of and is 1, which is an assumption without loss of generality. Thus, the following equation is used to calculate the generic error probability over flat Rayleigh fading channel:
The PDF for chi-square distributed random variable with two degrees of freedom is described by
By introduction of (24) to (22) the generic bit error probability over flat Rayleigh fading channel is derived as
From (25), (1), (4), (5), and (6), we can derive the bit error probability of high priority bits and low priority bits for Hierarchical Modulation over flat Rayleigh fading channel:
From (26) we can derive the Signal-to-Noise Ratio (SNR) for low priority bits and high priority bits as a function of space ratio and the target bit error rate for high priority bits and low priority bits over flat Rayleigh fading channel:
The optimization of the hierarchical modulation over flat Rayleigh fading channel can be described by the following equation:
2.3. Analysis of Packet Error Rate over AWGN Channel
The previous analysis is based on bit error rate. In practice, higher layers may be packet-oriented, so that package error rate is the more important parameter. We can make a simple mapping from BER to expected PER, under some simple assumptions. Assuming that the probability of decoding one bit wrongly ( ) is a stationary uncorrelated process, we can consider the decoded bit stream as a Poisson process. This yields the relationship between BER and PER:
2.4. Impact of Coding on Performance
Analytical results to date have been based on uncoded bit error rates. In practice, the performance of coded hierarchical modulation systems is of more practical interest. The effect of coding will shift the BER curve to the left by the coding gain.
As a proof-of-concept of the use of the Optimum Hierarchical Modulation scheme for single user in scalable video delivery, we send a still image through an AWGN single carrier channel.
The convolutional code of rate 1/2 and 16-QAM Hierarchical Modulation is employed for transmission. The data bits with higher priority and lower priority are convolutional coded and padded with parity bits. The coded bits with high priority are used to select the base bits in 16-QAM constellation and the coded bits with low priority are used to select the refinement bits in the constellation. The average distances of the base bits and the refinement bits can be tuned in order to give an optimized overall image quality (all save the under the same quality).
We employ a specific example of a scalable still image encoder. The pixels image is processed by a progressive encoder called the Embedded Zero-tree Wavelet-Spatial Orientation Tree (EZW-SOT) . An embedded code represents a sequence of binary decisions that distinguish an image from the all gray image. The embedded coding possesses the property that all the bits are ordered in importance in the bit stream. The importance of the bits can be determined by the precision, magnitude, scale, and spatial location of the wavelet coefficients. For example, there are several real numbers described by 4 digits— . The digit is the most significant digit of each number and the is the least significant digit. Thus, the numbers can be stored by a new order in significance, say, Using embedded coding, a decoder can stop decoding at any position and an optimized quality of the same image will be achieved. A discrete wavelet transform provides a multiresolution presentation of the image. The wavelet coefficients can be embedded coded according to their significance. The zero-tree coding provides a binary map, which can indicate the positions of the significant wavelet coefficients.
In the single-user hierarchical modulation scenario, the two or more data channels mapped to the base bits and refinement bits of the constellation points are used to carry the data belonging to different priority levels of one service aiming at one user. As an alternative to the single user case, in the two users case, the two users are assumed to receive the data carried by the hierarchical constellation points and collect the part useful to them. In our simulation, we transmitted an H.264 scalable coded video trailer in which two different resolution sizes are embedded, a VGA ( pixels) size and a QVGA ( pixels) size. The video packets used for decoding the VGA and the QVGA versions are carried by the two different data channels of the hierarchical constellation. All the data packets are encoded using a convolutional code with a rate of before being mapped to the constellations. Assuming the video signal is transmitted in an indoor wireless environment, one user is close to the transmitter and has good average , the other is relatively far from the transmitter and relatively bad average . The user in a good receiving condition is able to decode most of the data packets and is able to watch the VGA version of the trailer. The user in bad receiving condition cannot obtain enough data packets for decoding a VGA trailer due to the wireless loss, but can decode a QVGA size video with acceptable quality.
To evaluate the overall quality performance received by the two users, we calculated the average PSNR performance of the VGA and QVGA versions. In this calculation we assumed that the two users' perceptive quality are equally important. According to the definition of PSNR,
the Mean Squared Error (MSE) is described by
Thus, the average MSE of the VGA and the QVGA version of the video trailer is described by
From (31) and (32) the average PSNR of the VGA and QVGA version of the video is derived to
This paper proposed an optimized hierarchical modulation strategy directed by cross-layer transport priority information for both single user scenario and two users scenario. In the one user scenario, the hierarchical modulation combined with a convolutional code is designed to achieve the objective bit error rates of two data channels with different priority level by an overall minimum signal to noise ratio. In the two-user scenario, the hierarchical modulation strategy, the video codec scalability in spatial domain is considered and an optimized strategy is proposed for the best perceivable quality of the scalable video transmission. The optimization strategy can be implemented in the timescale of ms to ms. The simulation results show 1.5 dB gain in in the single user scenario and 6 dB gain in PSNR (perceivable quality of the scalable video) in two user scenario. Hierarchical modulation is proved to be a promising candidate for the transmission system for scalable digital media.
The authors would like to thank Krishna Sankar because the inspirations about how to calculate the generic bit error probability over flat Rayleigh fading channel is from his web site http://www.dsplog.com/.
- Ohm J-R: Advances in scalable video coding. Proceedings of the IEEE 2005, 93(1):42-56.View ArticleGoogle Scholar
- Lallet A, Dolbear C, Hughes J, Hobson P: Review of scalable video strategies for distributed video applications. Distributed Imaging IEE European Workshop, 1999, London, UK 1999: 2/1-2/7.Google Scholar
- Schwarz H, Marpe D: D2.2:documentation of the Open-Loop SNR-Scalable Video Coder. M-PIPE, September 2005, http://cordis.europa.eu/ist/kct/mpipe_synopsis.htm.Google Scholar
- van der Schaar M, Krishnamachari S, Choi S, Xu X: Adaptive cross-layer protection strategies for robust scalable video transmission over 802.11 WLANs. IEEE Journal on Selected Areas in Communications 2003, 21(10):1752-1763. 10.1109/JSAC.2003.815231View ArticleGoogle Scholar
- Cover TM: Broadcast channels. IEEE Transactions on Information Theory 1972, 18: 2-14. 10.1109/TIT.1972.1054727MathSciNetView ArticleMATHGoogle Scholar
- Wei L-F: Coded modulation with unequal error protection. IEEE Transactions on Communications 1993, 41(10):1439-1449. 10.1109/26.237878View ArticleMATHGoogle Scholar
- Sundberg CEW, Wong WC, Steele R: Logarithmic PCM weighted QAM transmission over gaussian and Rayleigh fading channels. IEE Proceedings 1987, 134(6):557-570. 10.1049/ip-f-1.1987.0094Google Scholar
- Ramchandran K, Ortega A, Uz KM, Vetterli M: Multiresolution broadcast for digital HDTV using joint source/channel coding. IEEE Journal on Selected Areas in Communications 1993, 11(1):6-23. 10.1109/49.210540View ArticleGoogle Scholar
- Morimoto M, Harada H, Okada M, Komaki S: A study on power assignment of hierarchical modulation schemes for digital broadcasting. IEICE Transactions on Communications 1994, E77-B(12):1439-1449.Google Scholar
- Morimoto M, Okada M, Komaki S: A hierarchical image transmission system in a fading channel. Proceedings of the 4th IEEE International Conference on Universal Personal Communications (ICUPC '95), November 1995, Tokyo, Japan 769-772.View ArticleGoogle Scholar
- Pursley MB, Shea JM: Nonuniform phase-shift-key modulation for multimedia multicast transmission in mobile wireless networks. IEEE Journal on Selected Areas in Communications 1999, 17(5):774-783. 10.1109/49.768194View ArticleGoogle Scholar
- Pursley MB, Shea JM: Adaptive nonuniform phase-shift-key modulation for multimedia traffic in wireless networks. IEEE Journal on Selected Areas in Communications 2000, 18(8):1394-1407. 10.1109/49.864005View ArticleGoogle Scholar
- DVB-T standard: ETS 300 744, digital broadcasting systems for television, sound and data services; framing structure, channel coding and modulation for digital terrestrial television The European Telecommunications Standards Institute 2004, 1.5.1(EN300):744.Google Scholar
- Nokes C, Mitchell J: Protential benefits of hierarchical modes of the DVB-T specification. Proceedings of the IEE Colloquium on Digital Television—Where Is It and Where Is It Going?, 1999, London, UK 10/1-10/6.Google Scholar
- Marsden P: Some thoughts on the use of hierarchical modulation in dvb-t. BBC Research and Development Whiter Paper WHP April 2002., (028):Google Scholar
- Jiang H, Wilford PA: A hierarchical modulation for upgrading digital broadcast systems. IEEE Transactions on Broadcasting 2005, 51(2):223-229. 10.1109/TBC.2005.847619View ArticleGoogle Scholar
- Lu J, Letaief KB, Chuang JC-I, Liou ML: M-PSK and M-QAM BER computation using signal-space concepts. IEEE Transactions on Communications 1999, 47(2):181-184. 10.1109/26.752121View ArticleGoogle Scholar
- Yang L-L, Hanzo L: A recursive algorithm for the error probability evaluation of M-QAM. IEEE Communications Letters 2000, 4(10):304-306. 10.1109/4234.880816View ArticleGoogle Scholar
- Fitz MP, Seymour JP: On the bit error probability of QAM modulation. International Journal of Wireless Information Networks 1994, 1(2):131-139. 10.1007/BF02106515View ArticleGoogle Scholar
- Yoon D, Cho K, Lee J: Bit error probability of M-ary quadrature amplitude modulation. Proceedings of IEEE Vehicular Technology Conference (VTC '00), September 2000, Boston, Mass, USA 5: 2422-2427.Google Scholar
- Vitthaladevuni PK, Alouini M-S: BER computation of 4/M-QAM hierarchical constellations. IEEE Transactions on Communications 2001, 47(3):228-239.Google Scholar
- Proakis JG: Digital Communications. McGraw-Hill, New York, NY, USA; 2001.MATHGoogle Scholar
- Rappaport S: Computing approximations for the generalized Q function and its complement. IEEE Transactions on Information Theory 1971, 17(4):497-498. 10.1109/TIT.1971.1054651View ArticleMATHGoogle Scholar
- Borjesson P, Sundberg CE: Simple approximations of the error function Q(x) for communications applications. IEEE Transactions on Communications 1979, 27(3):639-643. 10.1109/TCOM.1979.1094433View ArticleGoogle Scholar
- Dyer SA, Dyer JS: Approximations to error functions. IEEE Instrumentation and Measurement Magazine 2007, 10(6):45-48.MathSciNetView ArticleGoogle Scholar
- Van Halen P: Accurate analytical approximations for error function and its integral. Electronics Letters 1989, 25(9):561-563. 10.1049/el:19890383View ArticleGoogle Scholar
- Shapiro J: Embedded image coding using zerotrees of wavelet coefficients. IEEE Transactions on Signal Processing 1993, 41(12):3445-3462. 10.1109/78.258085View ArticleMATHGoogle Scholar
- Medbo J: Channel models for hiperlan/2 in different indoor scenarios. ETSI BRAN 3ERI085B, March 1998Google Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.