Summarizing Audiovisual Contents of a Video Program

Gong, Yihong

doi:10.1155/S1110865703211082

Research Article
Published: 25 February 2003

Summarizing Audiovisual Contents of a Video Program

Yihong Gong¹

EURASIP Journal on Advances in Signal Processing volume 2003, Article number: 102838 (2003) Cite this article

1571 Accesses
17 Citations
3 Altmetric
Metrics details

Abstract

In this paper, we focus on video programs that are intended to disseminate information and knowledge such as news, documentaries, seminars, etc, and present an audiovisual summarization system that summarizes the audio and visual contents of the given video separately, and then integrating the two summaries with a partial alignment. The audio summary is created by selecting spoken sentences that best present the main content of the audio speech while the visual summary is created by eliminating duplicates/redundancies and preserving visually rich contents in the image stream. The alignment operation aims to synchronize each spoken sentence in the audio summary with its corresponding speaker′s face and to preserve the rich content in the visual summary. A Bipartite Graph-based audiovisual alignment algorithm is developed to efficiently find the best alignment solution that satisfies these alignment requirements. With the proposed system, we strive to produce a video summary that: (1) provides a natural visual and audio content overview, and (2) maximizes the coverage for both audio and visual contents of the original video without having to sacrifice either of them.

Author information

Authors and Affiliations

NEC Laboratories America, Inc., 10080 North Wolfe Road, SW3-350, Cupertino, CA, 95014, USA
Yihong Gong

Authors

Yihong Gong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yihong Gong.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gong, Y. Summarizing Audiovisual Contents of a Video Program. EURASIP J. Adv. Signal Process. 2003, 102838 (2003). https://doi.org/10.1155/S1110865703211082

Download citation

Received: 19 March 2002
Revised: 22 October 2002
Published: 25 February 2003
DOI: https://doi.org/10.1155/S1110865703211082

Summarizing Audiovisual Contents of a Video Program

Abstract

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords