Scalable Video Coding with Interlayer Signal Decorrelation Techniques

Abstract

Scalability is one of the essential requirements in the compression of visual data for present-day multimedia communications and storage. The basic building block for spatial scalability in the scalable video coding (SVC) standard is the well-known Laplacian pyramid (LP). An LP represents the video at multiple scales, as a base-layer signal at lower resolution together with several enhancement-layer signals at successively higher resolutions. In this paper, we propose to improve the coding performance of the enhancement layers through efficient interlayer decorrelation techniques. We first show that, with nonbiorthogonal upsampling and downsampling filters, the base layer and the enhancement layers are correlated. We then investigate two structures to reduce this correlation. The first structure updates the base-layer signal by subtracting from it the low-frequency component of the enhancement-layer signal. The second structure modifies the prediction so that the low-frequency component in the new enhancement layer is diminished. The second structure is integrated into the JSVM 4.0 codec with suitable modifications to the prediction modes. Experimental results with some standard test sequences demonstrate coding gains of up to 1 dB for I pictures and up to 0.7 dB for both I and P pictures.
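
The correlation claim can be illustrated with a small one-dimensional, single-level Laplacian pyramid. The following is a minimal sketch under stated assumptions, not the paper's implementation and not the JSVM filters: the filter taps, the circular boundary handling, and the helper names (`analysis_matrix`, `synthesis_matrix`, `laplacian_pyramid`, `leaked_lowpass_energy`) are all hypothetical choices made for illustration. It writes downsampling and upsampling as matrices D and U, forms the enhancement layer d = x - U D x, and measures how much of d survives the downsampling filter. Since D d = (I - D U) D x, this leakage vanishes when D U = I (a biorthogonal pair) and is generally nonzero otherwise.

```python
import numpy as np

def analysis_matrix(h, n, offset):
    """Matrix form of: filter with taps h, then keep every second sample.
    Circular boundary; `offset` is the index of the centre tap (assumption)."""
    D = np.zeros((n // 2, n))
    for k in range(n // 2):
        for i, tap in enumerate(h):
            D[k, (2 * k + i - offset) % n] += tap
    return D

def synthesis_matrix(g, n, offset):
    """Matrix form of: insert zeros between samples, then filter with taps g."""
    U = np.zeros((n, n // 2))
    for k in range(n // 2):
        for i, tap in enumerate(g):
            U[(2 * k + i - offset) % n, k] += tap
    return U

def laplacian_pyramid(x, D, U):
    """One LP level: base layer, and enhancement layer as prediction residual."""
    base = D @ x
    enhancement = x - U @ base
    return base, enhancement

def leaked_lowpass_energy(x, D, U):
    """Energy of the enhancement layer that passes through the downsampler."""
    _, enhancement = laplacian_pyramid(x, D, U)
    return float(np.sum((D @ enhancement) ** 2))

n = 16
rng = np.random.default_rng(0)
x = rng.standard_normal(n)  # stand-in for one row of a picture

# Biorthogonal example: Haar averaging paired with sample repetition.
D_haar = analysis_matrix([0.5, 0.5], n, offset=0)
U_haar = synthesis_matrix([1.0, 1.0], n, offset=0)

# Nonbiorthogonal example: [1, 2, 1]/4 smoothing paired with linear interpolation.
D_lin = analysis_matrix([0.25, 0.5, 0.25], n, offset=1)
U_lin = synthesis_matrix([0.5, 1.0, 0.5], n, offset=1)

for name, D, U in [("haar", D_haar, U_haar), ("linear", D_lin, U_lin)]:
    base, enhancement = laplacian_pyramid(x, D, U)
    # The LP reconstructs x exactly for any filter pair, by construction.
    assert np.allclose(U @ base + enhancement, x)
    is_biorthogonal = np.allclose(D @ U, np.eye(n // 2))
    print(name, "D U = I:", is_biorthogonal,
          "leaked low-pass energy:", leaked_lowpass_energy(x, D, U))
```

In this sketch the Haar pair leaves essentially no low-frequency content in the enhancement layer, while the smoothing and linear-interpolation pair does. That residual low-frequency content is the kind of interlayer redundancy the two structures described above aim to reduce.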

Author information

Correspondence to Wenxian Yang.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://doi.org/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Yang, W., Rath, G. & Guillemot, C. Scalable Video Coding with Interlayer Signal Decorrelation Techniques. EURASIP J. Adv. Signal Process. 2007, 054342 (2007) doi:10.1155/2007/54342

Keywords

  • Pyramid
  • Quantum Information
  • Lower Resolution
  • Base Layer
  • Test Sequence