Skip to main content

Scalable Video Coding with Interlayer Signal Decorrelation Techniques

Abstract

Scalability is one of the essential requirements in the compression of visual data for present-day multimedia communications and storage. The basic building block for providing the spatial scalability in the scalable video coding (SVC) standard is the well-known Laplacian pyramid (LP). An LP achieves the multiscale representation of the video as a base-layer signal at lower resolution together with several enhancement-layer signals at successive higher resolutions. In this paper, we propose to improve the coding performance of the enhancement layers through efficient interlayer decorrelation techniques. We first show that, with nonbiorthogonal upsampling and downsampling filters, the base layer and the enhancement layers are correlated. We investigate two structures to reduce this correlation. The first structure updates the base-layer signal by subtracting from it the low-frequency component of the enhancement layer signal. The second structure modifies the prediction in order that the low-frequency component in the new enhancement layer is diminished. The second structure is integrated in the JSVM 4.0 codec with suitable modifications in the prediction modes. Experimental results with some standard test sequences demonstrate coding gains up to 1 dB for I pictures and up to 0.7 dB for both I and P pictures.

References

  1. 1.

    JVT : Joint scalable video model JSVM-4. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), October 2005, Nice, France

    Google Scholar 

  2. 2.

    Burt PJ, Adelson EH: The Lapacian pyramid as a compact image code. IEEE Transactions on Communications 1983,31(4):532-540. 10.1109/TCOM.1983.1095851

    Article  Google Scholar 

  3. 3.

    Do MN, Vetterli M: Framing pyramids. IEEE Transactions on Signal Processing 2003,51(9):2329-2342. 10.1109/TSP.2003.815389

    MathSciNet  Article  Google Scholar 

  4. 4.

    Flierl M, Vandergheynst P: An improved pyramid for spatially scalable video coding. Proceedings of IEEE International Conference on Image Processing (ICIP '05), September 2005, Genova, Italy 2: 878–881.

    Google Scholar 

  5. 5.

    Santa-Cruz D, Reichel J, Ziliani F: Opening the Laplacian pyramid for video coding. Proceedings of IEEE International Conference on Image Processing (ICIP '05), September 2005, Genova, Italy 3: 672–675.

    Google Scholar 

  6. 6.

    Segall A: Study of upsampling/down-sampling for spatial scalability. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), October 2005, Nice, France

    Google Scholar 

  7. 7.

    Segall A: Upsampling and down-sampling for spatial scalability. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), January 2006, Bangkok, Thailand

    Google Scholar 

  8. 8.

    Kim CK, Suh DY, Park GH: Directional filtering for upsampling according to direction information of the spatially lower layer. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), January 2006, Bangkok, Thailand

    Google Scholar 

  9. 9.

    Goyal VK, Kovačević J, Kelner JA: Quantized frame expansions with erasures. Applied and Computational Harmonic Analysis 2001,10(3):203-233. 10.1006/acha.2000.0340

    MathSciNet  Article  Google Scholar 

  10. 10.

    Daubechies I: Ten Lectures on Wavelets. SIAM, Philadelphia, Pa, USA; 1992.

    Google Scholar 

  11. 11.

    Flierl M, Vandergheynst P: Inter-resolution transform for spatially scalable video coding. Proceedings of Picture Coding Symposium (PCS '04), December 2004, San Francisco, Calif, USA 243–247.

    Google Scholar 

  12. 12.

    Vaidyanathan PP: Multirate Systems and Filter Banks. Prentice-Hall, Englewood Cliffs, NJ, USA; 1993.

    Google Scholar 

  13. 13.

    Cover TM, Thomas JA: Elements of Information Theory. Wiley-Interscience, New York, NY, USA; 1991.

    Google Scholar 

  14. 14.

    Strang G: Linear Algebra and Its Applications. 3rd edition. Brooks Cole Publishers, Florence, Ky, USA; 1988.

    Google Scholar 

  15. 15.

    Rath G, Guillemot C: Compressing the Laplacian pyramid. Proceedings of the 8th IEEE Workshop on Multimedia Signal Processing (MMSP '06), October 2006, Victoria, BC, Canada 75–79.

    Google Scholar 

Download references

Author information

Affiliations

Authors

Corresponding author

Correspondence to Wenxian Yang.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://doi.org/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Yang, W., Rath, G. & Guillemot, C. Scalable Video Coding with Interlayer Signal Decorrelation Techniques. EURASIP J. Adv. Signal Process. 2007, 054342 (2007). https://doi.org/10.1155/2007/54342

Download citation

Keywords

  • Pyramid
  • Quantum Information
  • Lower Resolution
  • Base Layer
  • Test Sequence