- Research Article
- Open access
- Published:
A Multiple-Window Video Embedding Transcoder Based on H.264/AVC Standard
EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 013790 (2007)
Abstract
This paper proposes a low-complexity multiple-window video embedding transcoder (MW-VET) based on H.264/AVC standard for various applications that require video embedding services including picture-in-picture (PIP), multichannel mosaic, screen-split, pay-per-view, channel browsing, commercials and logo insertion, and other visual information embedding services. The MW-VET embeds multiple foreground pictures at macroblock-aligned positions. It improves the transcoding speed with three block level adaptive techniques including slice group based transcoding (SGT), reduced frame memory transcoder (RFMT), and syntax level bypassing (SLB). The SGT utilizes prediction from the slice-aligned data partitions in the original bitstreams such that the transcoder simply merges the bitstreams by parsing. When the prediction comes from the newly covered area without slice-group data partitions, the pixels at the affected macroblocks are transcoded with the RFMT based on the concept of partial reencoding to minimize the number of refined blocks. The RFMT employs motion vector remapping (MVR) and intra mode switching (IMS) to handle intercoded blocks and intracoded blocks, respectively. The pixels outside the macroblocks that are affected by newly covered reference frame are transcoded by the SLB. Experimental results show that, as compared to the cascaded pixel domain transcoder (CPDT) with the highest complexity, our MW-VET can significantly reduce the processing complexity by 25 times and retain the rate-distortion performance close to the CPDT. At certain bit rates, the MW-VET can achieve up to 1.5 dB quality improvement in peak signal-to-noise-ratio (PSNR).
References
ITU-T Rec. H.264 , ISO/IEC 14496-10 (MPEG4-AVC) : Advanced Video Coding for Generic Audiovisual Services. v1, May 2003; v2, January 2004; v3, September 2004; v4, July 2005
Wenger S: H.264/AVC over IP. IEEE Transactions on Circuits and Systems for Video Technology 2003,13(7):645-656. 10.1109/TCSVT.2003.814966
Nava MD, Del-Toso C: A short overview of the VDSL system requirements. IEEE Communications Magazine 2002,40(12):82-90. 10.1109/MCOM.2002.1106164
Naimpally S, Johnson L, Darby T, Meyer R, Phillips L, Vantrease J: Integrated digital IDTV receiver with features. IEEE Transactions on Consumer Electronics 1988,34(3):410-419. 10.1109/30.20135
Gillies D, Schweer R, Zibold H: VLSI realisations for picture in picture and flicker free television display. IEEE Transactions on Consumer Electronics 1988,34(1):253-261. 10.1109/30.75390
Burkert M, Frieling F, Langenkamp U, Libal U, Mende M, Scheffler G: IC set for a picture-in-picture system with on-chip memory. IEEE Transactions on Consumer Electronics 1990,36(1):23-31. 10.1109/30.46605
Mancini CA, Markhauser CP: Microprocessor controlled picture in picture system. IEEE Transactions on Consumer Electronics 1990,36(3):375-379. 10.1109/30.103147
Honzawa M, Koyama M, Hibino T, Miyashita H, Shiine Y: New picture in picture LSI enhanced functionality for high picture quality. IEEE Transactions on Consumer Electronics 1990,36(3):387-394. 10.1109/30.103149
Johnson LD, Pratt JN, Greene DC: Low cost picture-in-picture for color TV receivers. IEEE Transactions on Consumer Electronics 1990,36(3):380-386. 10.1109/30.103148
Tsuchida S, Yoshida C: Multi-picture system for high resolution wide aspect ratio screen. IEEE Transactions on Consumer Electronics 1991,37(3):313-319. 10.1109/30.85531
Perkins GW, Hathaway RC, Lai SW, et al.: A low cost, monolithic, color picture-in-picture device. IEEE Transactions on Consumer Electronics 1994,40(3):306-316. 10.1109/30.320810
Rick A, Herfet T, Prange SJ: Digital color decoder for PIP-applications. IEEE Transactions on Consumer Electronics 1996,42(3):716-720. 10.1109/30.536177
Brett M, Wendel D: High performance picture-in-picture (PIP) IC using embedded DRAM technology. IEEE Transactions on Consumer Electronics 1999,45(3):698-705. 10.1109/30.793574
Schu M, Scheffler G, Tuschen C, Stolze A: System on silicon-IC for motion compensated scan rate conversion, picture-in-picture processing, split screen applications and display processing. IEEE Transactions on Consumer Electronics 1999,45(3):842-850. 10.1109/30.793620
Schu M, Wendel D, Tuschen C, Hahn M, Langenkamp U: System-on-silicon solution for high quality consumer video processing—the next generation. IEEE Transactions on Consumer Electronics 2001,47(3):412-419. 10.1109/30.964128
Hentschel C, Bril RJ, Chen Y, Braspenning R, Lan T-H: Video quality-of-service for consumer terminals—a novel system for programmable components. IEEE Transactions on Consumer Electronics 2003,49(4):1367-1377. 10.1109/TCE.2003.1261242
Ahmad I, Wei X, Sun Y, Zhang Y-Q: Video transcoding: an overview of various techniques and research issues. IEEE Transactions on Multimedia 2005,7(5):793-804.
Chang S-F, Messerschmitt DG: Compositing motion-compensated video within the network. Proceedings of the 4th IEEE ComSoc International Workshop on Multimedia Communications (MULTIMEDIA '92), April 1992, Monterey, Calif, USA 40–56.
Chang S-F, Messerschmitt DG: Manipulation and compositing of MC-DCT compressed video. IEEE Journal on Selected Areas in Communications 1995,13(1, part 2):1-11. 10.1109/49.363151
Noguchi Y, Messerschmitt DG, Chang S-F: MPEG video compositing in the compressed domain. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '96), May 1996, Atlanta, Ga, USA 2: 596–599.
Yu B, Nahrstedt K: Internet-based interactive HDTV. Multimedia Systems 2004,9(5):477-489. 10.1007/s00530-003-0121-4
Tan Y-P, Sun H: Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard. IEEE Transactions on Consumer Electronics 2004,50(3):887-894. 10.1109/TCE.2004.1341696
Li C-H, Wang C-N, Chiang T: A fast downsizing video transcoding based on H.264/AVC standard. Proceedings of the 5th IEEE Pacific Rim Conference on Multimedia (PCM '04), November-December 2004, Tokyo, Japan 215–223.
Shen H, Sun X, Wu F, Li H, Li S: A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), July 2006, Toronto, Ontario, Canada 1: 2017–2020.
Merhav N, Bhaskaran V: A fast algorithm for DCT-domain inverse motion compensation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96), May 1996, Atlanta, Ga, USA 4: 2307–2310.
Song J, Yeo B-L: A fast algorithm for DCT-domain inverse motion compensation based on shared information in a macroblock. IEEE Transactions on Circuits and Systems for Video Technology 2000,10(5):767-775. 10.1109/76.856453
Liu S, Bovik AC: Local bandwidth constrained fast inverse motion compensation for DCT-domain video transcoding. IEEE Transactions on Circuits and Systems for Video Technology 2002,12(5):309-319. 10.1109/TCSVT.2002.1003470
Shen H, Sun X, Wu F, Li H, Li S: A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), July 2006, Toronto, Ontario, Canada 2017–2020.
Bialkowski J, Barkouwsky M, Leschka F, Kaup A: Low-complexity transcoding of inter coded video frames from H.264 to H.263. Proceedings of IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 837–840.
Hur JH, Lee YL: H.264 to MPEG-4 transcoding using block type information. Proceedings of IEEE International Conference Region 10 (TENCON '05), November 2005, Melbourne, Australia 1–6.
Tan Y-P, Sun H: Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard. IEEE Transactions on Consumer Electronics 2004,50(3):887-894. 10.1109/TCE.2004.1341696
Zhang P, Lu Y, Huang Q, Gao W: Mode mapping method for H.264/AVC spatial downscaling transcoding. Proceedings of International Conference on Image Processing (ICIP '04), October 2004, Singapore 4: 2781–62784.
Shin I-H, Lee Y-L, Park H-W: Motion estimation for frame-rate reduction in H.264 transcoding. Proceedings of the 2nd IEEE Workshop on Software Technologies for Future Embedded and Ubiquitous Systems (WSTFES '04), May 2004, Vienna, Austria 4: 63–67.
Lefol D, Bull D: Mode refinement algorithm for H.264 inter frame requantization. Proceedings of IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 845–848.
Zhang J, Perkis A, Georganas ND: H.264/AVC and transcoding for multimedia adaptation. Proceedings of the 6th COST 276 Workshop, May 2004, Thessaloniki, Greece
Xiu X, Zhuo L, Shen L: A H.264 bit rate transcoding scheme based on PID controller. Proceedings of IEEE International Symposium on Communications and Information Technologies (ISCIT '05), October 2005, Beijing, China 2: 1074–1077.
Lefol D, Bull D, Canagarajah N: Performance evaluation of transcoding algorithms for H.264. IEEE Transactions on Consumer Electronics 2006,52(1):215-222.
Li C-H, Wang C-N, Chiang T: A low complexity picture-in-picture transcoder for video-on-demand. Proceedings of IEEE International Conference on Wireless Networks, Communications and Mobile Computing (WirelessCom '05), June 2005, Maui, Hawaii, USA 2: 1382–1387.
Li C-H, Lin H, Wang C-N, Chiang T: A fast H.264-based picture-in-picture (PIP) transcoder. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '04), June 2004, Taipei, Taiwan 3: 1691–1694.
Levi A, Stark H: Restoration from phase and magnitude by generalized projections. In Image Recovery Theory and Application. Academic Press, Orlando, Fla, USA; 1987:277-319.
Wang S-H, Peng W-H, He Y, et al.: A software-hardware co-implementation of MPEG-4 advanced video coding (AVC) decoder with block level pipelining. The Journal of VLSI Signal Processing 2005,41(1):93-110. 10.1007/s11265-005-6253-3
Chen C, Wu P-H, Chen H: Transform-domain intra prediction for H.264. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '05), May 2005, Kobe, Japan 2: 1497–1500.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://doi.org/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Li, CH., Wang, CN. & Chiang, T. A Multiple-Window Video Embedding Transcoder Based on H.264/AVC Standard. EURASIP J. Adv. Signal Process. 2007, 013790 (2007). https://doi.org/10.1155/2007/13790
Received:
Accepted:
Published:
DOI: https://doi.org/10.1155/2007/13790