Skip to main content

Advertisement

You are viewing the new article page. Let us know what you think. Return to old version

A Multiple-Window Video Embedding Transcoder Based on H.264/AVC Standard

Abstract

This paper proposes a low-complexity multiple-window video embedding transcoder (MW-VET) based on H.264/AVC standard for various applications that require video embedding services including picture-in-picture (PIP), multichannel mosaic, screen-split, pay-per-view, channel browsing, commercials and logo insertion, and other visual information embedding services. The MW-VET embeds multiple foreground pictures at macroblock-aligned positions. It improves the transcoding speed with three block level adaptive techniques including slice group based transcoding (SGT), reduced frame memory transcoder (RFMT), and syntax level bypassing (SLB). The SGT utilizes prediction from the slice-aligned data partitions in the original bitstreams such that the transcoder simply merges the bitstreams by parsing. When the prediction comes from the newly covered area without slice-group data partitions, the pixels at the affected macroblocks are transcoded with the RFMT based on the concept of partial reencoding to minimize the number of refined blocks. The RFMT employs motion vector remapping (MVR) and intra mode switching (IMS) to handle intercoded blocks and intracoded blocks, respectively. The pixels outside the macroblocks that are affected by newly covered reference frame are transcoded by the SLB. Experimental results show that, as compared to the cascaded pixel domain transcoder (CPDT) with the highest complexity, our MW-VET can significantly reduce the processing complexity by 25 times and retain the rate-distortion performance close to the CPDT. At certain bit rates, the MW-VET can achieve up to 1.5 dB quality improvement in peak signal-to-noise-ratio (PSNR).

References

  1. 1.

    ITU-T Rec. H.264 , ISO/IEC 14496-10 (MPEG4-AVC) : Advanced Video Coding for Generic Audiovisual Services. v1, May 2003; v2, January 2004; v3, September 2004; v4, July 2005

  2. 2.

    Wenger S: H.264/AVC over IP. IEEE Transactions on Circuits and Systems for Video Technology 2003,13(7):645-656. 10.1109/TCSVT.2003.814966

  3. 3.

    Nava MD, Del-Toso C: A short overview of the VDSL system requirements. IEEE Communications Magazine 2002,40(12):82-90. 10.1109/MCOM.2002.1106164

  4. 4.

    Naimpally S, Johnson L, Darby T, Meyer R, Phillips L, Vantrease J: Integrated digital IDTV receiver with features. IEEE Transactions on Consumer Electronics 1988,34(3):410-419. 10.1109/30.20135

  5. 5.

    Gillies D, Schweer R, Zibold H: VLSI realisations for picture in picture and flicker free television display. IEEE Transactions on Consumer Electronics 1988,34(1):253-261. 10.1109/30.75390

  6. 6.

    Burkert M, Frieling F, Langenkamp U, Libal U, Mende M, Scheffler G: IC set for a picture-in-picture system with on-chip memory. IEEE Transactions on Consumer Electronics 1990,36(1):23-31. 10.1109/30.46605

  7. 7.

    Mancini CA, Markhauser CP: Microprocessor controlled picture in picture system. IEEE Transactions on Consumer Electronics 1990,36(3):375-379. 10.1109/30.103147

  8. 8.

    Honzawa M, Koyama M, Hibino T, Miyashita H, Shiine Y: New picture in picture LSI enhanced functionality for high picture quality. IEEE Transactions on Consumer Electronics 1990,36(3):387-394. 10.1109/30.103149

  9. 9.

    Johnson LD, Pratt JN, Greene DC: Low cost picture-in-picture for color TV receivers. IEEE Transactions on Consumer Electronics 1990,36(3):380-386. 10.1109/30.103148

  10. 10.

    Tsuchida S, Yoshida C: Multi-picture system for high resolution wide aspect ratio screen. IEEE Transactions on Consumer Electronics 1991,37(3):313-319. 10.1109/30.85531

  11. 11.

    Perkins GW, Hathaway RC, Lai SW, et al.: A low cost, monolithic, color picture-in-picture device. IEEE Transactions on Consumer Electronics 1994,40(3):306-316. 10.1109/30.320810

  12. 12.

    Rick A, Herfet T, Prange SJ: Digital color decoder for PIP-applications. IEEE Transactions on Consumer Electronics 1996,42(3):716-720. 10.1109/30.536177

  13. 13.

    Brett M, Wendel D: High performance picture-in-picture (PIP) IC using embedded DRAM technology. IEEE Transactions on Consumer Electronics 1999,45(3):698-705. 10.1109/30.793574

  14. 14.

    Schu M, Scheffler G, Tuschen C, Stolze A: System on silicon-IC for motion compensated scan rate conversion, picture-in-picture processing, split screen applications and display processing. IEEE Transactions on Consumer Electronics 1999,45(3):842-850. 10.1109/30.793620

  15. 15.

    Schu M, Wendel D, Tuschen C, Hahn M, Langenkamp U: System-on-silicon solution for high quality consumer video processing—the next generation. IEEE Transactions on Consumer Electronics 2001,47(3):412-419. 10.1109/30.964128

  16. 16.

    Hentschel C, Bril RJ, Chen Y, Braspenning R, Lan T-H: Video quality-of-service for consumer terminals—a novel system for programmable components. IEEE Transactions on Consumer Electronics 2003,49(4):1367-1377. 10.1109/TCE.2003.1261242

  17. 17.

    Ahmad I, Wei X, Sun Y, Zhang Y-Q: Video transcoding: an overview of various techniques and research issues. IEEE Transactions on Multimedia 2005,7(5):793-804.

  18. 18.

    Chang S-F, Messerschmitt DG: Compositing motion-compensated video within the network. Proceedings of the 4th IEEE ComSoc International Workshop on Multimedia Communications (MULTIMEDIA '92), April 1992, Monterey, Calif, USA 40–56.

  19. 19.

    Chang S-F, Messerschmitt DG: Manipulation and compositing of MC-DCT compressed video. IEEE Journal on Selected Areas in Communications 1995,13(1, part 2):1-11. 10.1109/49.363151

  20. 20.

    Noguchi Y, Messerschmitt DG, Chang S-F: MPEG video compositing in the compressed domain. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '96), May 1996, Atlanta, Ga, USA 2: 596–599.

  21. 21.

    Yu B, Nahrstedt K: Internet-based interactive HDTV. Multimedia Systems 2004,9(5):477-489. 10.1007/s00530-003-0121-4

  22. 22.

    Tan Y-P, Sun H: Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard. IEEE Transactions on Consumer Electronics 2004,50(3):887-894. 10.1109/TCE.2004.1341696

  23. 23.

    Li C-H, Wang C-N, Chiang T: A fast downsizing video transcoding based on H.264/AVC standard. Proceedings of the 5th IEEE Pacific Rim Conference on Multimedia (PCM '04), November-December 2004, Tokyo, Japan 215–223.

  24. 24.

    Shen H, Sun X, Wu F, Li H, Li S: A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), July 2006, Toronto, Ontario, Canada 1: 2017–2020.

  25. 25.

    Merhav N, Bhaskaran V: A fast algorithm for DCT-domain inverse motion compensation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96), May 1996, Atlanta, Ga, USA 4: 2307–2310.

  26. 26.

    Song J, Yeo B-L: A fast algorithm for DCT-domain inverse motion compensation based on shared information in a macroblock. IEEE Transactions on Circuits and Systems for Video Technology 2000,10(5):767-775. 10.1109/76.856453

  27. 27.

    Liu S, Bovik AC: Local bandwidth constrained fast inverse motion compensation for DCT-domain video transcoding. IEEE Transactions on Circuits and Systems for Video Technology 2002,12(5):309-319. 10.1109/TCSVT.2002.1003470

  28. 28.

    Shen H, Sun X, Wu F, Li H, Li S: A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), July 2006, Toronto, Ontario, Canada 2017–2020.

  29. 29.

    Bialkowski J, Barkouwsky M, Leschka F, Kaup A: Low-complexity transcoding of inter coded video frames from H.264 to H.263. Proceedings of IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 837–840.

  30. 30.

    Hur JH, Lee YL: H.264 to MPEG-4 transcoding using block type information. Proceedings of IEEE International Conference Region 10 (TENCON '05), November 2005, Melbourne, Australia 1–6.

  31. 31.

    Tan Y-P, Sun H: Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard. IEEE Transactions on Consumer Electronics 2004,50(3):887-894. 10.1109/TCE.2004.1341696

  32. 32.

    Zhang P, Lu Y, Huang Q, Gao W: Mode mapping method for H.264/AVC spatial downscaling transcoding. Proceedings of International Conference on Image Processing (ICIP '04), October 2004, Singapore 4: 2781–62784.

  33. 33.

    Shin I-H, Lee Y-L, Park H-W: Motion estimation for frame-rate reduction in H.264 transcoding. Proceedings of the 2nd IEEE Workshop on Software Technologies for Future Embedded and Ubiquitous Systems (WSTFES '04), May 2004, Vienna, Austria 4: 63–67.

  34. 34.

    Lefol D, Bull D: Mode refinement algorithm for H.264 inter frame requantization. Proceedings of IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 845–848.

  35. 35.

    Zhang J, Perkis A, Georganas ND: H.264/AVC and transcoding for multimedia adaptation. Proceedings of the 6th COST 276 Workshop, May 2004, Thessaloniki, Greece

  36. 36.

    Xiu X, Zhuo L, Shen L: A H.264 bit rate transcoding scheme based on PID controller. Proceedings of IEEE International Symposium on Communications and Information Technologies (ISCIT '05), October 2005, Beijing, China 2: 1074–1077.

  37. 37.

    Lefol D, Bull D, Canagarajah N: Performance evaluation of transcoding algorithms for H.264. IEEE Transactions on Consumer Electronics 2006,52(1):215-222.

  38. 38.

    Li C-H, Wang C-N, Chiang T: A low complexity picture-in-picture transcoder for video-on-demand. Proceedings of IEEE International Conference on Wireless Networks, Communications and Mobile Computing (WirelessCom '05), June 2005, Maui, Hawaii, USA 2: 1382–1387.

  39. 39.

    Li C-H, Lin H, Wang C-N, Chiang T: A fast H.264-based picture-in-picture (PIP) transcoder. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '04), June 2004, Taipei, Taiwan 3: 1691–1694.

  40. 40.

    Levi A, Stark H: Restoration from phase and magnitude by generalized projections. In Image Recovery Theory and Application. Academic Press, Orlando, Fla, USA; 1987:277-319.

  41. 41.

    Wang S-H, Peng W-H, He Y, et al.: A software-hardware co-implementation of MPEG-4 advanced video coding (AVC) decoder with block level pipelining. The Journal of VLSI Signal Processing 2005,41(1):93-110. 10.1007/s11265-005-6253-3

  42. 42.

    Chen C, Wu P-H, Chen H: Transform-domain intra prediction for H.264. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '05), May 2005, Kobe, Japan 2: 1497–1500.

Download references

Author information

Correspondence to Chih-Hung Li.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://doi.org/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Keywords

  • Information Technology
  • Reference Frame
  • Quality Improvement
  • Visual Information
  • Quantum Information