A System for the Semantic Multimodal Analysis of News Audio-Visual Content

Mezaris, Vasileios; Gidaros, Spyros; Papadopoulos, GeorgiosTh; Kasper, Walter; Steffen, Jörg; Ordelman, Roeland; Huijbregts, Marijn; de Jong, Franciska; Kompatsiaris, Ioannis; Strintzis, MichaelG

doi:10.1155/2010/645052

Research Article
Open access
Published: 11 April 2010

A System for the Semantic Multimodal Analysis of News Audio-Visual Content

Vasileios Mezaris¹,
Spyros Gidaros¹,
GeorgiosTh Papadopoulos^1,2,
Walter Kasper³,
Jörg Steffen³,
Roeland Ordelman⁴,
Marijn Huijbregts^4,5,
Franciska de Jong⁴,
Ioannis Kompatsiaris¹ &
…
MichaelG Strintzis^1,2

EURASIP Journal on Advances in Signal Processing volume 2010, Article number: 645052 (2010) Cite this article

1689 Accesses
5 Citations
6 Altmetric
Metrics details

Abstract

News-related content is nowadays among the most popular types of content for users in everyday applications. Although the generation and distribution of news content has become commonplace, due to the availability of inexpensive media capturing devices and the development of media sharing services targeting both professional and user-generated news content, the automatic analysis and annotation that is required for supporting intelligent search and delivery of this content remains an open issue. In this paper, a complete architecture for knowledge-assisted multimodal analysis of news-related multimedia content is presented, along with its constituent components. The proposed analysis architecture employs state-of-the-art methods for the analysis of each individual modality (visual, audio, text) separately and proposes a novel fusion technique based on the particular characteristics of news-related content for the combination of the individual modality analysis results. Experimental results on news broadcast video illustrate the usefulness of the proposed techniques in the automatic generation of semantic annotations.

Publisher note

To access the full article, please see PDF.

Author information

Authors and Affiliations

Centre for Research and Technology Hellas, Informatics and Telematics Institute, 6th Km Charilaou-Thermi Road, P.O. BOX 60361, 57001, Thermi, Greece
Vasileios Mezaris, Spyros Gidaros, GeorgiosTh Papadopoulos, Ioannis Kompatsiaris & MichaelG Strintzis
Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, 54006, Thessaloniki, Greece
GeorgiosTh Papadopoulos & MichaelG Strintzis
Language Technology Lab, DFKI GmbH, Stuhlsatzenhausweg 3, 66123, Saarbrucken, Germany
Walter Kasper & Jörg Steffen
Department of Computer Science/Human Media Interaction, University of Twente, 7500 AE, Enschede, The Netherlands
Roeland Ordelman, Marijn Huijbregts & Franciska de Jong
Centre for Language and Speech Technology, Radboud University Nijmegen, 6525 HT, Nijmegen, The Netherlands
Marijn Huijbregts

Authors

Vasileios Mezaris
View author publications
You can also search for this author in PubMed Google Scholar
Spyros Gidaros
View author publications
You can also search for this author in PubMed Google Scholar
GeorgiosTh Papadopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Walter Kasper
View author publications
You can also search for this author in PubMed Google Scholar
Jörg Steffen
View author publications
You can also search for this author in PubMed Google Scholar
Roeland Ordelman
View author publications
You can also search for this author in PubMed Google Scholar
Marijn Huijbregts
View author publications
You can also search for this author in PubMed Google Scholar
Franciska de Jong
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis Kompatsiaris
View author publications
You can also search for this author in PubMed Google Scholar
MichaelG Strintzis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vasileios Mezaris.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Mezaris, V., Gidaros, S., Papadopoulos, G. et al. A System for the Semantic Multimodal Analysis of News Audio-Visual Content. EURASIP J. Adv. Signal Process. 2010, 645052 (2010). https://doi.org/10.1155/2010/645052

Download citation

Received: 24 July 2009
Revised: 09 December 2009
Accepted: 21 February 2010
Published: 11 April 2010
DOI: https://doi.org/10.1155/2010/645052

A System for the Semantic Multimodal Analysis of News Audio-Visual Content

Abstract

Publisher note

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords