Skip to main content
  • Research Article
  • Open access
  • Published:

A Multiple-Window Video Embedding Transcoder Based on H.264/AVC Standard

Abstract

This paper proposes a low-complexity multiple-window video embedding transcoder (MW-VET) based on H.264/AVC standard for various applications that require video embedding services including picture-in-picture (PIP), multichannel mosaic, screen-split, pay-per-view, channel browsing, commercials and logo insertion, and other visual information embedding services. The MW-VET embeds multiple foreground pictures at macroblock-aligned positions. It improves the transcoding speed with three block level adaptive techniques including slice group based transcoding (SGT), reduced frame memory transcoder (RFMT), and syntax level bypassing (SLB). The SGT utilizes prediction from the slice-aligned data partitions in the original bitstreams such that the transcoder simply merges the bitstreams by parsing. When the prediction comes from the newly covered area without slice-group data partitions, the pixels at the affected macroblocks are transcoded with the RFMT based on the concept of partial reencoding to minimize the number of refined blocks. The RFMT employs motion vector remapping (MVR) and intra mode switching (IMS) to handle intercoded blocks and intracoded blocks, respectively. The pixels outside the macroblocks that are affected by newly covered reference frame are transcoded by the SLB. Experimental results show that, as compared to the cascaded pixel domain transcoder (CPDT) with the highest complexity, our MW-VET can significantly reduce the processing complexity by 25 times and retain the rate-distortion performance close to the CPDT. At certain bit rates, the MW-VET can achieve up to 1.5 dB quality improvement in peak signal-to-noise-ratio (PSNR).

References

  1. ITU-T Rec. H.264 , ISO/IEC 14496-10 (MPEG4-AVC) : Advanced Video Coding for Generic Audiovisual Services. v1, May 2003; v2, January 2004; v3, September 2004; v4, July 2005

  2. Wenger S: H.264/AVC over IP. IEEE Transactions on Circuits and Systems for Video Technology 2003,13(7):645-656. 10.1109/TCSVT.2003.814966

    Article  Google Scholar 

  3. Nava MD, Del-Toso C: A short overview of the VDSL system requirements. IEEE Communications Magazine 2002,40(12):82-90. 10.1109/MCOM.2002.1106164

    Article  Google Scholar 

  4. Naimpally S, Johnson L, Darby T, Meyer R, Phillips L, Vantrease J: Integrated digital IDTV receiver with features. IEEE Transactions on Consumer Electronics 1988,34(3):410-419. 10.1109/30.20135

    Article  Google Scholar 

  5. Gillies D, Schweer R, Zibold H: VLSI realisations for picture in picture and flicker free television display. IEEE Transactions on Consumer Electronics 1988,34(1):253-261. 10.1109/30.75390

    Article  Google Scholar 

  6. Burkert M, Frieling F, Langenkamp U, Libal U, Mende M, Scheffler G: IC set for a picture-in-picture system with on-chip memory. IEEE Transactions on Consumer Electronics 1990,36(1):23-31. 10.1109/30.46605

    Article  Google Scholar 

  7. Mancini CA, Markhauser CP: Microprocessor controlled picture in picture system. IEEE Transactions on Consumer Electronics 1990,36(3):375-379. 10.1109/30.103147

    Article  Google Scholar 

  8. Honzawa M, Koyama M, Hibino T, Miyashita H, Shiine Y: New picture in picture LSI enhanced functionality for high picture quality. IEEE Transactions on Consumer Electronics 1990,36(3):387-394. 10.1109/30.103149

    Article  Google Scholar 

  9. Johnson LD, Pratt JN, Greene DC: Low cost picture-in-picture for color TV receivers. IEEE Transactions on Consumer Electronics 1990,36(3):380-386. 10.1109/30.103148

    Article  Google Scholar 

  10. Tsuchida S, Yoshida C: Multi-picture system for high resolution wide aspect ratio screen. IEEE Transactions on Consumer Electronics 1991,37(3):313-319. 10.1109/30.85531

    Article  Google Scholar 

  11. Perkins GW, Hathaway RC, Lai SW, et al.: A low cost, monolithic, color picture-in-picture device. IEEE Transactions on Consumer Electronics 1994,40(3):306-316. 10.1109/30.320810

    Article  Google Scholar 

  12. Rick A, Herfet T, Prange SJ: Digital color decoder for PIP-applications. IEEE Transactions on Consumer Electronics 1996,42(3):716-720. 10.1109/30.536177

    Article  Google Scholar 

  13. Brett M, Wendel D: High performance picture-in-picture (PIP) IC using embedded DRAM technology. IEEE Transactions on Consumer Electronics 1999,45(3):698-705. 10.1109/30.793574

    Article  Google Scholar 

  14. Schu M, Scheffler G, Tuschen C, Stolze A: System on silicon-IC for motion compensated scan rate conversion, picture-in-picture processing, split screen applications and display processing. IEEE Transactions on Consumer Electronics 1999,45(3):842-850. 10.1109/30.793620

    Article  Google Scholar 

  15. Schu M, Wendel D, Tuschen C, Hahn M, Langenkamp U: System-on-silicon solution for high quality consumer video processing—the next generation. IEEE Transactions on Consumer Electronics 2001,47(3):412-419. 10.1109/30.964128

    Article  Google Scholar 

  16. Hentschel C, Bril RJ, Chen Y, Braspenning R, Lan T-H: Video quality-of-service for consumer terminals—a novel system for programmable components. IEEE Transactions on Consumer Electronics 2003,49(4):1367-1377. 10.1109/TCE.2003.1261242

    Article  Google Scholar 

  17. Ahmad I, Wei X, Sun Y, Zhang Y-Q: Video transcoding: an overview of various techniques and research issues. IEEE Transactions on Multimedia 2005,7(5):793-804.

    Article  Google Scholar 

  18. Chang S-F, Messerschmitt DG: Compositing motion-compensated video within the network. Proceedings of the 4th IEEE ComSoc International Workshop on Multimedia Communications (MULTIMEDIA '92), April 1992, Monterey, Calif, USA 40–56.

    Chapter  Google Scholar 

  19. Chang S-F, Messerschmitt DG: Manipulation and compositing of MC-DCT compressed video. IEEE Journal on Selected Areas in Communications 1995,13(1, part 2):1-11. 10.1109/49.363151

    Article  Google Scholar 

  20. Noguchi Y, Messerschmitt DG, Chang S-F: MPEG video compositing in the compressed domain. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '96), May 1996, Atlanta, Ga, USA 2: 596–599.

    Google Scholar 

  21. Yu B, Nahrstedt K: Internet-based interactive HDTV. Multimedia Systems 2004,9(5):477-489. 10.1007/s00530-003-0121-4

    Article  Google Scholar 

  22. Tan Y-P, Sun H: Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard. IEEE Transactions on Consumer Electronics 2004,50(3):887-894. 10.1109/TCE.2004.1341696

    Article  Google Scholar 

  23. Li C-H, Wang C-N, Chiang T: A fast downsizing video transcoding based on H.264/AVC standard. Proceedings of the 5th IEEE Pacific Rim Conference on Multimedia (PCM '04), November-December 2004, Tokyo, Japan 215–223.

    Google Scholar 

  24. Shen H, Sun X, Wu F, Li H, Li S: A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), July 2006, Toronto, Ontario, Canada 1: 2017–2020.

    Google Scholar 

  25. Merhav N, Bhaskaran V: A fast algorithm for DCT-domain inverse motion compensation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96), May 1996, Atlanta, Ga, USA 4: 2307–2310.

    Google Scholar 

  26. Song J, Yeo B-L: A fast algorithm for DCT-domain inverse motion compensation based on shared information in a macroblock. IEEE Transactions on Circuits and Systems for Video Technology 2000,10(5):767-775. 10.1109/76.856453

    Article  Google Scholar 

  27. Liu S, Bovik AC: Local bandwidth constrained fast inverse motion compensation for DCT-domain video transcoding. IEEE Transactions on Circuits and Systems for Video Technology 2002,12(5):309-319. 10.1109/TCSVT.2002.1003470

    Article  Google Scholar 

  28. Shen H, Sun X, Wu F, Li H, Li S: A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), July 2006, Toronto, Ontario, Canada 2017–2020.

    Google Scholar 

  29. Bialkowski J, Barkouwsky M, Leschka F, Kaup A: Low-complexity transcoding of inter coded video frames from H.264 to H.263. Proceedings of IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 837–840.

    Google Scholar 

  30. Hur JH, Lee YL: H.264 to MPEG-4 transcoding using block type information. Proceedings of IEEE International Conference Region 10 (TENCON '05), November 2005, Melbourne, Australia 1–6.

    Google Scholar 

  31. Tan Y-P, Sun H: Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard. IEEE Transactions on Consumer Electronics 2004,50(3):887-894. 10.1109/TCE.2004.1341696

    Article  Google Scholar 

  32. Zhang P, Lu Y, Huang Q, Gao W: Mode mapping method for H.264/AVC spatial downscaling transcoding. Proceedings of International Conference on Image Processing (ICIP '04), October 2004, Singapore 4: 2781–62784.

    Google Scholar 

  33. Shin I-H, Lee Y-L, Park H-W: Motion estimation for frame-rate reduction in H.264 transcoding. Proceedings of the 2nd IEEE Workshop on Software Technologies for Future Embedded and Ubiquitous Systems (WSTFES '04), May 2004, Vienna, Austria 4: 63–67.

    Article  Google Scholar 

  34. Lefol D, Bull D: Mode refinement algorithm for H.264 inter frame requantization. Proceedings of IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 845–848.

    Google Scholar 

  35. Zhang J, Perkis A, Georganas ND: H.264/AVC and transcoding for multimedia adaptation. Proceedings of the 6th COST 276 Workshop, May 2004, Thessaloniki, Greece

    Google Scholar 

  36. Xiu X, Zhuo L, Shen L: A H.264 bit rate transcoding scheme based on PID controller. Proceedings of IEEE International Symposium on Communications and Information Technologies (ISCIT '05), October 2005, Beijing, China 2: 1074–1077.

    Google Scholar 

  37. Lefol D, Bull D, Canagarajah N: Performance evaluation of transcoding algorithms for H.264. IEEE Transactions on Consumer Electronics 2006,52(1):215-222.

    Article  Google Scholar 

  38. Li C-H, Wang C-N, Chiang T: A low complexity picture-in-picture transcoder for video-on-demand. Proceedings of IEEE International Conference on Wireless Networks, Communications and Mobile Computing (WirelessCom '05), June 2005, Maui, Hawaii, USA 2: 1382–1387.

    Google Scholar 

  39. Li C-H, Lin H, Wang C-N, Chiang T: A fast H.264-based picture-in-picture (PIP) transcoder. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '04), June 2004, Taipei, Taiwan 3: 1691–1694.

    Google Scholar 

  40. Levi A, Stark H: Restoration from phase and magnitude by generalized projections. In Image Recovery Theory and Application. Academic Press, Orlando, Fla, USA; 1987:277-319.

    Google Scholar 

  41. Wang S-H, Peng W-H, He Y, et al.: A software-hardware co-implementation of MPEG-4 advanced video coding (AVC) decoder with block level pipelining. The Journal of VLSI Signal Processing 2005,41(1):93-110. 10.1007/s11265-005-6253-3

    Article  Google Scholar 

  42. Chen C, Wu P-H, Chen H: Transform-domain intra prediction for H.264. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '05), May 2005, Kobe, Japan 2: 1497–1500.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chih-Hung Li.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://doi.org/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Li, CH., Wang, CN. & Chiang, T. A Multiple-Window Video Embedding Transcoder Based on H.264/AVC Standard. EURASIP J. Adv. Signal Process. 2007, 013790 (2007). https://doi.org/10.1155/2007/13790

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1155/2007/13790

Keywords