- Research Article
- Open access
Scalable Video Coding with Interlayer Signal Decorrelation Techniques
EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 054342 (2007)
Abstract
Scalability is one of the essential requirements in the compression of visual data for present-day multimedia communications and storage. The basic building block for providing spatial scalability in the scalable video coding (SVC) standard is the well-known Laplacian pyramid (LP). An LP achieves a multiscale representation of the video as a base-layer signal at lower resolution together with several enhancement-layer signals at successively higher resolutions. In this paper, we propose to improve the coding performance of the enhancement layers through efficient interlayer decorrelation techniques. We first show that, with nonbiorthogonal upsampling and downsampling filters, the base layer and the enhancement layers are correlated. We then investigate two structures to reduce this correlation. The first structure updates the base-layer signal by subtracting from it the low-frequency component of the enhancement-layer signal. The second structure modifies the prediction so that the low-frequency component in the new enhancement layer is reduced. The second structure is integrated into the JSVM 4.0 codec with suitable modifications to the prediction modes. Experimental results on standard test sequences demonstrate coding gains of up to 1 dB for I pictures and up to 0.7 dB for both I and P pictures.
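The correlation claim in the abstract can be illustrated with a minimal sketch, assuming a 1-D signal, a single pyramid level, and toy two-tap filters chosen here for illustration (they are not the JSVM filters, and all function names are hypothetical). It builds a one-level Laplacian pyramid twice: once with an upsampler that is biorthogonal to the downsampler, and once with a linear-interpolation upsampler that is not. In both cases the signal is recovered exactly, but only in the nonbiorthogonal case does the enhancement layer retain a low-frequency component.

```python
# Toy one-level Laplacian pyramid on a 1-D signal (illustrative assumption,
# not the filters or structures used in the paper or in JSVM).

def downsample(x):
    """Average each pair of samples (2-tap lowpass, then decimate by 2)."""
    return [0.5 * (x[2 * n] + x[2 * n + 1]) for n in range(len(x) // 2)]

def upsample_repeat(c):
    """Sample repetition: downsample(upsample_repeat(c)) == c (biorthogonal pair)."""
    return [v for v in c for _ in range(2)]

def upsample_linear(c):
    """Linear interpolation: NOT biorthogonal to `downsample`."""
    out = []
    for n, v in enumerate(c):
        nxt = c[min(n + 1, len(c) - 1)]  # clamp at the right edge
        out += [v, 0.5 * (v + nxt)]
    return out

def analyze(x, up):
    """Return (base layer, enhancement layer) for the upsampler `up`."""
    base = downsample(x)
    prediction = up(base)
    enhancement = [a - b for a, b in zip(x, prediction)]
    return base, enhancement

def synthesize(base, enhancement, up):
    """Invert `analyze`: prediction from the base layer plus the residual."""
    return [p + e for p, e in zip(up(base), enhancement)]

def lowpass_energy(enhancement):
    """Energy of the low-frequency component left in the enhancement layer."""
    return sum(v * v for v in downsample(enhancement))

# Arbitrary even-length test signal.
x = [float((3 * n) % 7) for n in range(16)]

base_r, enh_r = analyze(x, upsample_repeat)
base_l, enh_l = analyze(x, upsample_linear)

rec_r = synthesize(base_r, enh_r, upsample_repeat)
rec_l = synthesize(base_l, enh_l, upsample_linear)

energy_r = lowpass_energy(enh_r)  # vanishes for the biorthogonal pair
energy_l = lowpass_energy(enh_l)  # nonzero: residual overlaps the base layer

print("low-frequency energy, biorthogonal pair:   ", energy_r)
print("low-frequency energy, nonbiorthogonal pair:", energy_l)
```

The nonzero low-frequency energy in the second case is the kind of interlayer redundancy that the two structures described above aim to reduce, either by updating the base layer or by modifying the prediction.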
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Yang, W., Rath, G. & Guillemot, C. Scalable Video Coding with Interlayer Signal Decorrelation Techniques. EURASIP J. Adv. Signal Process. 2007, 054342 (2007). https://doi.org/10.1155/2007/54342
DOI: https://doi.org/10.1155/2007/54342